Two quick Unix commands that clean a Squid log file for import into R. Very useful when you need to fine-tune your cache strategy.
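The commands themselves are not reproduced in this teaser, so here is a rough stand-in in Python rather than the original Unix one-liners. It assumes Squid's default native access.log layout (ten whitespace-separated fields); the column names and the `squid_log_to_csv` helper are my own labels, not anything from Squid or R.

```python
import csv
import io

# Assumed column names for Squid's native access.log format (10 fields).
FIELDS = ["time", "elapsed_ms", "client", "result_code", "bytes",
          "method", "url", "ident", "hierarchy", "content_type"]

def squid_log_to_csv(lines):
    """Convert native-format Squid log lines to CSV text, skipping malformed lines."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(FIELDS)
    for line in lines:
        parts = line.split()  # collapses the padding Squid puts between fields
        if len(parts) != len(FIELDS):
            continue  # drop lines that do not match the assumed layout
        writer.writerow(parts)
    return out.getvalue()

# Made-up sample data for illustration only.
sample = [
    "1160000000.123    45 192.168.0.10 TCP_HIT/200 5120 GET http://example.com/ - NONE/- text/html",
    "garbage line",
]
csv_text = squid_log_to_csv(sample)
```

Written to a file, the result should load in R with `read.csv`, since the first row carries the column names.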
October 2006 Archives
Interesting presentation given at C4, Actors: A Unifying Design Pattern for Scalable Concurrency.
Recently Jnan Dash of Foldera made a great comment regarding persistence. He said there are two ways to park your car. The first is to disassemble it, store it on a shelf, and reassemble it when you need it again. The second is to just park it somewhere fully assembled.
At the Office 2.0 conference, the need for more standards around data exchange and single sign-on between applications was a common topic. I believe there is a need to discuss account provisioning as well.
There has been a lot of talk around S3 for storage and backup. I find EC2 much more interesting as a foundation for a perpetual computing cluster.
At PARC tonight, Norvig gave a quick and interesting presentation on some of the AI concepts being employed at Google. My only takeaway is that Google is powered by the "law of large numbers".
I'll be at the Office 2.0 conference next week (Oct 11-12, 2006). Feel free to drop me a note if you're attending.
Found in the Nutch source. A regex pattern used for matching URL strings in text. Used with Perl5Compiler in Jakarta ORO.
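The Nutch pattern itself is not shown in this teaser, and I won't try to reproduce it from memory. As a rough idea of what matching URL strings in free text looks like, here is a minimal sketch using Python's `re` module instead of Jakarta ORO. The pattern and the `find_urls` helper are illustrative assumptions, far simpler than what a crawler would actually use.

```python
import re

# Illustrative pattern only; this is NOT the pattern from the Nutch source.
# It matches a scheme followed by any run of characters that are not
# whitespace, angle brackets, or double quotes.
URL_PATTERN = re.compile(r'\b(?:https?|ftp)://[^\s<>"]+', re.IGNORECASE)

def find_urls(text):
    """Return URL-like substrings found in text."""
    # Strip trailing punctuation that usually belongs to the sentence, not the URL.
    return [m.group(0).rstrip(".,;:!?)") for m in URL_PATTERN.finditer(text)]

urls = find_urls("See http://lucene.apache.org/nutch/ and ftp://example.org/file.txt.")
```

Stripping trailing punctuation is a judgment call: it keeps sentence-ending periods out of the match, at the cost of truncating the rare URL that really does end in one.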