sawyl: (Default)
[personal profile] sawyl
Winnowing through logs in search of useful information, I've had a bit of a brainwave: instead of running a linear search through a file looking and ripping out the sections I want with sed pattern matches, I can take advantage of the ordered nature of the file and use a binary search. This should let me massively reduce the number of calls — log2(N) versus N — to the computationally expensive regular expression engine and should let me extract the interesting sections reasonably quickly without having to resort to threading (although I reserve the right to go parallel if impatience demands it).

ETA: In the process of coding up, I noticed another neat hack. If, in the case where I'm winnowing through a specific log of date-stamped entries, I replace my custom parser with dateutil.parser, not only does the cost drop by 10-20 times but the code becomes sufficiently general to work on any file where the first part of the line contains a log entry, e.g. both syslog and LoadLeveler. Excelsior!
This account has disabled anonymous posting.
If you don't have an account you can create one now.
HTML doesn't work in the subject.
More info about formatting

Profile

sawyl: (Default)
sawyl

August 2018

S M T W T F S
   123 4
5 6 7 8910 11
12131415161718
192021222324 25
262728293031 

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags
Page generated Feb. 5th, 2026 02:57 pm
Powered by Dreamwidth Studios