Recently in Cascading Category
I was approached by a fellow at Strata who asked me why Cascading wasn't free or open-source. I replied that it is, "it is open-source and totally free". He then asked me why I don't accept patches, clearly a misconception that's been floating around (see the end). Of course I replied, "because your patch will suck". Then I explained why.
Cascalog is one of their secret weapons, a Clojure-based query language for Hadoop that makes it simple for them to analyze their data in new ways. Inspired by the venerable Datalog, and built on top of Cascading, it allows you to write queries in Clojure and define even complex operations in simple code. Unlike alternatives like Pig or Hive, it's written within a general-purpose language, so there's no need for separate user-defined functions, but it's still a highly-structured way of defining queries.
I've just pushed Cascading 1.2 up. It has a number of performance improvements everyone will benefit from out of the box.
Here is a repost of an answer I gave on Quora: "How fast is Cascading compared to Pig or Hive?"
A pre-release of the upcoming Cascading 1.1 version is now available. Read more about its new features here in this announcement.