October 2006 Archives

Cleaning Squid Logs

| | Comments (0) | TrackBacks (0)

Two quick unix commands that will clean a Squid log file for import into R. Very useful when you need to fine tune your cache strategy.

Actors Everywhere

| | Comments (0) | TrackBacks (0)

Park Your Data

| | Comments (0) | TrackBacks (0)

Recently Jnan Dash of Foldera made a great comment regarding persistence. He said there are two ways to park your car. The first is to disassemble it and store it on a shelf, then reassemble it when you need it again. Or just put it somewhere fully assembled.

At the Office 2.0 conference, the need for more standards around data exchange and single sign on between applications was a common topic. I believe there is a need to discuss account provisioning as well.

There has been alot of talk around S3 for storage and backup. I find EC2 much more interesting as a foundation for a perpetual computing cluster.

Powered By...

| | Comments (0) | TrackBacks (0)

At PARC tonight, Norvig gave a quick and interesting presentation on some of the AI concepts being employed at Google. My only takeaway is that Google is powered by the "law of large numbers".

Office 2.0

| | Comments (0) | TrackBacks (0)

I'll be at the Office 2.0 conference next week (Oct 11-12, 2006). Feel free to drop me a note if attending.

URL Regex Pattern

| | Comments (0) | TrackBacks (0)

Found in the Nutch source. A regex pattern used for matching URL strings in text. Used with Perl5Compiler in Jakarta ORO.