Apache Crunch: A Java Library for Easier MapReduce Programming
Posted by Josh Wills on Dec 27, 2012

Apache Crunch (incubating) is a Java library for creating MapReduce pipelines, based on Google's FlumeJava library. Like other high-level tools for creating MapReduce jobs, Crunch provides a library of patterns for common tasks such as joining data, performing aggregations, and sorting records. Unlike those other tools...

Community: Java Tools