This project has retired. For details please refer to its Attic page.
Overview (Apache Crunch 0.4.0-incubating API)

Apache Crunch 0.4.0-incubating API

Core
org.apache.crunch Client-facing API and core abstractions.
org.apache.crunch.fn Commonly used functions for manipulating collections.
org.apache.crunch.impl.mem In-memory Pipeline implementation for rapid prototyping and testing.
org.apache.crunch.impl.mr A Pipeline implementation that runs on Hadoop MapReduce.
org.apache.crunch.io Data input and output for Pipelines.
org.apache.crunch.test Utilities for testing Crunch-based applications.
org.apache.crunch.types Common functionality for business object serialization.
org.apache.crunch.types.avro Business object serialization using Apache Avro.
org.apache.crunch.types.writable Business object serialization using Hadoop's Writables framework.
org.apache.crunch.util An assorted set of utilities.

 
Extension Library
org.apache.crunch.lib Joining, sorting, aggregating, and other commonly used functionality.
org.apache.crunch.lib.join Inner and outer joins on collections.

 
User-contributed Functionality
org.apache.crunch.contrib User contributions that may be interesting for special applications.
org.apache.crunch.contrib.bloomfilter Support for creating Bloom Filters.
org.apache.crunch.contrib.io.jdbc Support for reading data from RDBMS using JDBC

 
Examples
org.apache.crunch.examples Example applications demonstrating various aspects of Crunch.

 



Copyright © 2012 The Apache Software Foundation. All Rights Reserved.