
Here are some microbenchmarks:

In [63]: timeit dateutil.parser.parse('2013-05-11T21:23:58.970460+07:00')
10000 loops, best of 3: 89.5 µs per loop

In [64]: timeit arrow.get('2013-05-11T21:23:58.970460+07:00')
10000 loops, best of 3: 62.1 µs per loop

In [65]: timeit numpy.datetime64('2013-05-11T21:23:58.970460+07:00')
1000000 loops, best of 3: 714 ns per loop

In [66]: timeit iso8601.parse_date('2013-05-11T21:23:58.970460+07:00')
10000 loops, best of 3: 23.9 µs per loop
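
If you want to reproduce this outside IPython, something along these lines should work; a rough sketch using the stdlib timeit module, assuming all four packages are importable (numbers will of course vary by machine and library version):

    import timeit

    stamp = '2013-05-11T21:23:58.970460+07:00'
    setup = "import dateutil.parser, arrow, numpy, iso8601; stamp = %r" % stamp
    for name, stmt in [('dateutil', 'dateutil.parser.parse(stamp)'),
                       ('arrow',    'arrow.get(stamp)'),
                       ('numpy',    'numpy.datetime64(stamp)'),
                       ('iso8601',  'iso8601.parse_date(stamp)')]:
        n = 10000
        total = timeit.timeit(stmt, setup=setup, number=n)
        # report per-call time in microseconds, like the IPython output above
        print('%-8s %.1f us per call' % (name, total / n * 1e6))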

> Other parts that are always hot include split() and string concatenation. Java compilers can substitute StringBuffers when they see naive string concatenation, but in Python there's no easy way to build a string in a complex loop and you end up putting string fragments into a list and then finally join()ing them. Madness!

The Python solution you describe is the same as in Java. If you have `String a = b + c + d;` then the compiler may optimize this using a StringBuffer, as you say [1]. In Python it's also pretty cheap to do `a = b + c + d` to concatenate strings (or `''.join([b, c, d])`; run a little microbenchmark to see which works best). But if it's in a "complex loop", as you put it, Java will certainly not do this: you have to build a buffer with StringBuilder and then call toString(), which is basically the same process except it's spelled `builder.toString()` instead of `''.join(builder)`.
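
For the loop case, the Python side of that equivalence looks roughly like this (a sketch; which variant actually wins depends on the interpreter, string sizes, and iteration count, so measure it):

    # Append fragments to a list, then join them: the Python analogue of
    # StringBuilder.append() followed by toString().
    def build_with_join(parts):
        buf = []
        for p in parts:
            buf.append(p)
        return ''.join(buf)

    # The naive version. CPython special-cases += on str in many situations,
    # so benchmark both rather than assuming one is always faster.
    def build_with_concat(parts):
        s = ''
        for p in parts:
            s += p
        return s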

Unless, of course, you have some interesting insights into JVM internals about string concatenation optimizations.

[1] http://docs.oracle.com/javase/specs/jls/se8/html/jls-15.html...




Of course you have a different machine, but the OP was getting 2.5 µs per parse in .NET versus your 89.5 µs in Python. I wouldn't have expected such a difference. No wonder it's a hot path.


Well, that's dateutil (installed from pip) and not datetime (stdlib). As part of log ingestion I would, of course, convert to UTC and drop the timezone distinctions, since Python slows down a lot when it has to worry about timezones. Working in a single set of units with no DST issues is much nicer/quicker.
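
For illustration, a minimal normalization step could look like this (a Python 3 sketch, assuming dateutil does the parsing; swap in whichever parser benchmarks best for you):

    import dateutil.parser
    from datetime import timezone

    def to_naive_utc(stamp):
        # Parse, shift to UTC, then drop the tzinfo so everything downstream
        # works in one set of units with no DST concerns.
        dt = dateutil.parser.parse(stamp)
        return dt.astimezone(timezone.utc).replace(tzinfo=None)

    print(to_naive_utc('2013-05-11T21:23:58.970460+07:00'))
    # 2013-05-11 14:23:58.970460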

Anyway, if you're installing packages from pip, you may as well just install iso8601 and get the best performance, possibly beating .NET (who knows? As you said, I have a different machine than the OP).


The numpy version seems to be about 30 times faster than the iso8601 version; note the result there is in nanoseconds, not microseconds like the others.


Yeah, but the OP is using PyPy and I don't know whether numpy fully works on PyPy. I think I read that it does, but I haven't tried it.


dateutil spends most of its time inferring the format; it's not really designed as a performance component, it's designed as an "if it looks like a date, we'll give you a datetime" style component.
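
For example, all of these go through the same guessing machinery (quick sketch with dateutil's default settings):

    import dateutil.parser

    for s in ('2013-05-11T21:23:58.970460+07:00',
              'May 11, 2013 9:23 PM',
              '11/05/2013'):
        # dateutil figures out each format on the fly; a strict ISO 8601 parser
        # (or datetime.strptime with a fixed format string) skips that work,
        # which is a big part of why it's so much faster.
        print(dateutil.parser.parse(s))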


Is there a particular reason that all of the loop counts are 10k except numpy's, which is 1M?


The timeit magic runs snippets for a variable number of iterations based on how long each snippet takes.


Because it can do so many more iterations in a similar amount of time, it just does them.


timeit does a trial run to decide how many loops to do; since the numpy version was much faster, it ran more times.
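
Since Python 3.6 that calibration step is exposed directly as timeit.Timer.autorange(), which keeps increasing the loop count until the total run time is long enough to measure reliably. A quick sketch:

    import timeit

    fast = timeit.Timer('x = 1 + 1')
    slow = timeit.Timer('sum(range(10000))')

    # Each call returns (number_of_loops, total_seconds); the fast snippet
    # gets a much larger loop count than the slow one.
    print(fast.autorange())
    print(slow.autorange())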



