Recently in Tacit Knowledge Category

Cleaning Squid Logs

| | Comments (0) | TrackBacks (0)

Two quick unix commands that will clean a Squid log file for import into R. Very useful when you need to fine tune your cache strategy.

URL Regex Pattern

| | Comments (0) | TrackBacks (0)

Found in the Nutch source. A regex pattern used for matching URL strings in text. Used with Perl5Compiler in Jakarta ORO.