<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
    <channel>
        <title>manAmplified</title>
        <link>http://www.manamplified.org/</link>
        <description></description>
        <language>en</language>
        <copyright>Copyright 2012</copyright>
        <lastBuildDate>Wed, 02 Nov 2011 16:09:49 -0800</lastBuildDate>
        <generator>http://www.sixapart.com/movabletype/</generator>
        <docs>http://www.rssboard.org/rss-specification</docs>
        
        <item>
            <title>Growing A Company</title>
            <description><![CDATA[<p>This blog isn't dead yet, just spending my energy growing my <a href="http://www.concurrentinc.com/">company</a>.</p>

<p>If interested in helping, we are <a href="http://gigaom.com/cloud/concurrent-raises-900k-to-make-hadoop-easier/">seed funded</a> and <a href="http://www.concurrentinc.com/careers/">hiring</a>.</p>]]></description>
            <link>http://www.manamplified.org/archives/2011/11/growing-a-company.html</link>
            <guid>http://www.manamplified.org/archives/2011/11/growing-a-company.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">Cascading</category>
            
                <category domain="http://www.sixapart.com/ns/types#category">Thoughts</category>
            
            
            <pubDate>Wed, 02 Nov 2011 16:09:49 -0800</pubDate>
        </item>
        
        <item>
            <title>Common MapReduce Patterns</title>
            <description><![CDATA[<p>A 40-minute video of me butchering common patterns in MapReduce at Buzzwords this year.</p>

<p><iframe src="http://player.vimeo.com/video/25969055?title=0&amp;byline=0&amp;portrait=0" width="400" height="300" frameborder="0"></iframe></p>

<p><a href="http://vimeo.com/25969055">Chris Wensel COMMON MAPREDUCE PATTERNS</a> from <a href="http://vimeo.com/user7393055">ntc GmbH</a> on <a href="http://vimeo.com">Vimeo</a>.</p>]]></description>
            <link>http://www.manamplified.org/archives/2011/07/common-mapreduce-patterns.html</link>
            <guid>http://www.manamplified.org/archives/2011/07/common-mapreduce-patterns.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">Cascading</category>
            
                <category domain="http://www.sixapart.com/ns/types#category">Technical</category>
            
            
            <pubDate>Sat, 16 Jul 2011 10:05:29 -0800</pubDate>
        </item>
        
        <item>
            <title>Your Patch Sucks</title>
            <description><![CDATA[<p>I was approached by a fellow at Strata who asked me why <a href="http://www.cascading.org/">Cascading</a> wasn't free or open-source. I replied, "it is open-source and totally free". He then asked me why I don't accept patches, clearly a misconception that's been floating around (see the end). Of course I replied, "because your patch will suck". Then I explained why.</p>]]></description>
            <link>http://www.manamplified.org/archives/2011/02/your-patch-sucks.html</link>
            <guid>http://www.manamplified.org/archives/2011/02/your-patch-sucks.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">Cascading</category>
            
            
            <pubDate>Mon, 07 Feb 2011 13:54:09 -0800</pubDate>
        </item>
        
        <item>
            <title>When is Hadoop Justified</title>
            <description><![CDATA[<p>At the SF Hadoop User Group last night, a question was posed as to what factors justify the use of an Apache Hadoop cluster vs. traditional approaches.</p>

<p>The answer isn't black and white but can be broken down into three intertwined heuristics.</p>

<p>Hadoop is more likely justified,</p>

<ul>
<li>the larger the corpus of data needed to satisfy the business problem (big-data).</li>
<li>the more complex the processes and algorithms required to satisfy the business problem.</li>
<li>and, the more distinct business problems need concurrent or overlapping access to a corpus of data (multi-tenancy).</li>
</ul>

<p>Thus Hadoop isn't strictly about huge data-sets, but also about absorbing complexity while maintaining scale. </p>

<p>FlightCaster, a <a href="http://www.cascading.org/">Cascading</a> user, doesn't have huge amounts of data, but they do have a very hard business problem, and Hadoop for them is completely justified.</p>

<p>Facebook, on the other hand, has huge data, but their tool of choice, Hive, doesn't encourage solutions to complex problems by virtue of being syntax- and SQL-based. Hive was initially used to extract small data from the cluster for use by other systems or custom Hadoop jobs.</p>]]></description>
            <link>http://www.manamplified.org/archives/2011/01/when-is-hadoop-justified.html</link>
            <guid>http://www.manamplified.org/archives/2011/01/when-is-hadoop-justified.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">Technical</category>
            
                <category domain="http://www.sixapart.com/ns/types#category">Tools</category>
            
            
            <pubDate>Thu, 13 Jan 2011 08:46:07 -0800</pubDate>
        </item>
        
        <item>
            <title>ReadWriteWeb on BackType and Cascalog</title>
            <description><![CDATA[<p>From <a href="http://www.readwriteweb.com/hack/2011/01/secrets-of-backtypes-data-engineers.php">Secrets of BackType's Data Engineers</a>:</p>

<blockquote><a href="https://github.com/nathanmarz/cascalog">Cascalog</a> is one of their secret weapons, a <a href="http://clojure.org/">Clojure</a>-based query language for Hadoop that makes it simple for them to analyze their data in new ways. Inspired by the venerable <a href="http://en.wikipedia.org/wiki/Datalog">Datalog</a>, and built on top of <a href="http://www.cascading.org/">Cascading</a>, it allows you to write queries in Clojure and define even complex operations in simple code. Unlike alternatives like Pig or Hive, it's written within a general-purpose language, so there's no need for separate user-defined functions, but it's still a highly-structured way of defining queries.</blockquote>

<p>It's worthy of note that Cascalog is a distant child of <a href="https://github.com/clj-sys/cascading-clojure">cascading-clojure</a>, created and used by <a href="http://flightcaster.com/">FlightCaster</a>. FlightCaster <a href="http://blog.flightcaster.com/flightcaster-acquired">was acquired this week</a>.</p>]]></description>
            <link>http://www.manamplified.org/archives/2011/01/readwriteweb-on-backtype-and-c.html</link>
            <guid>http://www.manamplified.org/archives/2011/01/readwriteweb-on-backtype-and-c.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">Cascading</category>
            
                <category domain="http://www.sixapart.com/ns/types#category">Tools</category>
            
            
            <pubDate>Wed, 12 Jan 2011 15:50:17 -0800</pubDate>
        </item>
        
        <item>
            <title>Cascading 1.2</title>
            <description><![CDATA[<p>I've just <a href="http://www.cascading.org/2010/12/cascading-12-now-available.html">pushed Cascading 1.2 up</a>. It has a number of performance improvements everyone will benefit from out of the box.</p>]]></description>
            <link>http://www.manamplified.org/archives/2010/12/cascading-12.html</link>
            <guid>http://www.manamplified.org/archives/2010/12/cascading-12.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">Cascading</category>
            
            
            <pubDate>Wed, 01 Dec 2010 16:53:48 -0800</pubDate>
        </item>
        
        <item>
            <title>Repost of &quot;How fast is Cascading compared to Pig or Hive?&quot;</title>
            <description><![CDATA[<p>Here is a repost of an answer I gave on Quora: <a href="http://www.quora.com/How-fast-is-Cascading-compared-to-Pig-or-Hive">"How fast is Cascading compared to Pig or Hive?"</a></p>]]></description>
            <link>http://www.manamplified.org/archives/2010/11/repost-of-how-fast-is-cascadin.html</link>
            <guid>http://www.manamplified.org/archives/2010/11/repost-of-how-fast-is-cascadin.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">Cascading</category>
            
            
            <pubDate>Wed, 10 Nov 2010 10:48:07 -0800</pubDate>
        </item>
        
        <item>
            <title>O&apos;Reilly Strata Conference 2011</title>
            <description><![CDATA[<p>The new <a href="http://strataconf.com/strata2011">Strata Conference</a> has just been announced with a Call for Proposals ending Sept 28. This new conference is on the 'business of data' and is the sister conference to Velocity. I'm excited to be a <a href="http://strataconf.com/strata2011/public/content/about#committee">committee member</a>.</p>]]></description>
            <link>http://www.manamplified.org/archives/2010/08/oreilly-strata-conference-2011.html</link>
            <guid>http://www.manamplified.org/archives/2010/08/oreilly-strata-conference-2011.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">Cascading</category>
            
                <category domain="http://www.sixapart.com/ns/types#category">Technical</category>
            
            
            <pubDate>Wed, 25 Aug 2010 14:01:22 -0800</pubDate>
        </item>
        
        <item>
            <title>BigDataCamp 2010</title>
            <description><![CDATA[<p>I'll be at (<a href="http://www.concurrentinc.com/news-events/entry/bigdatacamp_2010/">and a sponsor of</a>) <a href="http://bigdatacamp.org">BigDataCamp</a> the night before the Hadoop Summit. Sign up if you haven't.</p>]]></description>
            <link>http://www.manamplified.org/archives/2010/06/bigdatacamp-2010.html</link>
            <guid>http://www.manamplified.org/archives/2010/06/bigdatacamp-2010.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">Cascading</category>
            
            
            <pubDate>Mon, 14 Jun 2010 14:13:50 -0800</pubDate>
        </item>
        
        <item>
            <title>Cascading User Group</title>
            <description><![CDATA[<p>The first ever <a href="http://upcoming.yahoo.com/event/4421260/">Cascading User Group</a> will be this Thursday, September 24th, at <a href="http://rapleaf.com/">RapLeaf</a>. </p>

<p>There will be discussions on the future of Cascading, the work done by the <a href="http://flightcaster.com/">FlightCaster</a> folk integrating Cascading with <a href="http://clojure.org/">Clojure</a>, and various tips and techniques.</p>

<p>Hope to see you there.</p>]]></description>
            <link>http://www.manamplified.org/archives/2009/09/cascading-user-group.html</link>
            <guid>http://www.manamplified.org/archives/2009/09/cascading-user-group.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">Cascading</category>
            
            
            <pubDate>Sat, 19 Sep 2009 12:41:01 -0800</pubDate>
        </item>
        
        <item>
            <title>Cascading WIP 1.1</title>
            <description><![CDATA[<p>A pre-release of the upcoming Cascading 1.1 version is now available. Read more about its new features <a href="http://www.cascading.org/2009/06/cascading-wip-11.html">here in this announcement</a>.</p>]]></description>
            <link>http://www.manamplified.org/archives/2009/06/cascading-wip-11.html</link>
            <guid>http://www.manamplified.org/archives/2009/06/cascading-wip-11.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">Cascading</category>
            
            
            <pubDate>Sun, 14 Jun 2009 15:07:51 -0800</pubDate>
        </item>
        
        <item>
            <title>Open Source Bridge + Cascading</title>
            <description><![CDATA[<p>Quick reminder I'll be in Portland the bulk of next week (June 17-19) at Open Source Bridge <a href="http://opensourcebridge.org/sessions/111">presenting on Cascading and Hadoop</a>. Feel free to <a href="http://chris.wensel.net/">ping me</a> if you're around.</p>]]></description>
            <link>http://www.manamplified.org/archives/2009/06/open-source-bridge-cascading.html</link>
            <guid>http://www.manamplified.org/archives/2009/06/open-source-bridge-cascading.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">Cascading</category>
            
            
            <pubDate>Sat, 13 Jun 2009 12:29:09 -0800</pubDate>
        </item>
        
        <item>
            <title>ScaleCamp</title>
            <description><![CDATA[<p><a href="http://scalecamp.eventbrite.com/">ScaleCamp</a>, the night before the <a href="http://hadoopsummit09.eventbrite.com/">Hadoop Summit 2009</a>.</p>]]></description>
            <link>http://www.manamplified.org/archives/2009/05/scalecamp.html</link>
            <guid>http://www.manamplified.org/archives/2009/05/scalecamp.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">Cascading</category>
            
            
            <pubDate>Wed, 13 May 2009 21:30:04 -0800</pubDate>
        </item>
        
        <item>
            <title>Upcoming Presentations</title>
            <description>Thought I would share the dates and locations of my four presentations over the next few weeks.</description>
            <link>http://www.manamplified.org/archives/2009/05/upcoming-presentations.html</link>
            <guid>http://www.manamplified.org/archives/2009/05/upcoming-presentations.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">Cascading</category>
            
            
            <pubDate>Mon, 11 May 2009 10:15:28 -0800</pubDate>
        </item>
        
        <item>
            <title>The Whys and Hows of Hadoop</title>
            <description><![CDATA[<p>I've been either directly or indirectly involved in a fair number of Hadoop applications and deployments over the last year or so. In that time I've roughly characterized Hadoop usage into two "why's" and two "how's". </p>]]></description>
            <link>http://www.manamplified.org/archives/2009/05/the-whys-and-hows-of-hadoop.html</link>
            <guid>http://www.manamplified.org/archives/2009/05/the-whys-and-hows-of-hadoop.html</guid>
            
                <category domain="http://www.sixapart.com/ns/types#category">Cascading</category>
            
            
            <pubDate>Sun, 03 May 2009 10:02:00 -0800</pubDate>
        </item>
        
    </channel>
</rss>
