Package org.apache.crunch.lib

Joining, sorting, aggregating, and other commonly used functionality.

Class Summary
Aggregate                           Methods for performing various types of aggregations over PCollection instances.
Aggregate.PairValueComparator<K,V>
Aggregate.TopKCombineFn<K,V>
Aggregate.TopKFn<K,V>
Cartesian                           Utilities for Cartesian products of two PTable or PCollection instances.
Cogroup
Join                                Utilities for joining multiple PTable instances based on a common key.
PTables                             Methods for performing common operations on PTables.
Sample
Sample.SamplerFn<S>
SecondarySort                       Utilities for performing a secondary sort on a PTable<K, Pair<V1, V2>> collection.
Set                                 Utilities for performing set operations (difference, intersection, etc.) on PCollection instances.
Sort                                Utilities for sorting PCollection instances.
Sort.ColumnOrder                    To sort by column 2 ascending then column 1 descending, use sortPairs(coll, by(2, ASCENDING), by(1, DESCENDING)). Column numbering is 1-based.
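
The column-ordering rule described for Sort.ColumnOrder (1-based columns, each with its own direction, applied in the order given) can be illustrated with a minimal sketch in plain Java. This is not Crunch code and does not reflect how the library implements sorting: the names ColumnOrderSketch and Row, and the local Order, ColumnOrder, by and sortPairs definitions, are hypothetical stand-ins that only mirror the shape of the call shown above, operating on an in-memory List rather than a PCollection.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Hypothetical, stdlib-only illustration; none of these types come from Crunch.
class ColumnOrderSketch {
    enum Order { ASCENDING, DESCENDING }

    // Stand-in for a two-column record (column 1 = first, column 2 = second).
    static final class Row {
        final String first;
        final int second;
        Row(String first, int second) { this.first = first; this.second = second; }
        @Override public String toString() { return "(" + first + ", " + second + ")"; }
    }

    // Stand-in for a column order: a 1-based column index plus a direction.
    static final class ColumnOrder {
        final int column;
        final Order order;
        private ColumnOrder(int column, Order order) { this.column = column; this.order = order; }
        static ColumnOrder by(int column, Order order) { return new ColumnOrder(column, order); }
    }

    // Chains one comparator per requested column, in the order the columns are listed.
    static Comparator<Row> comparatorFor(ColumnOrder... orders) {
        Comparator<Row> result = (a, b) -> 0;
        for (ColumnOrder co : orders) {
            Comparator<Row> c;
            switch (co.column) {
                case 1: c = Comparator.comparing(r -> r.first); break;
                case 2: c = Comparator.comparingInt(r -> r.second); break;
                default: throw new IllegalArgumentException("column must be 1 or 2 (1-based)");
            }
            if (co.order == Order.DESCENDING) {
                c = c.reversed();
            }
            result = result.thenComparing(c);
        }
        return result;
    }

    // Returns a sorted copy; the input list is left untouched.
    static List<Row> sortPairs(List<Row> rows, ColumnOrder... orders) {
        List<Row> copy = new ArrayList<>(rows);
        copy.sort(comparatorFor(orders));
        return copy;
    }

    public static void main(String[] args) {
        List<Row> rows = Arrays.asList(
            new Row("a", 2), new Row("b", 1), new Row("c", 2), new Row("d", 1));
        // Column 2 ascending, then column 1 descending.
        System.out.println(sortPairs(rows,
            ColumnOrder.by(2, Order.ASCENDING),
            ColumnOrder.by(1, Order.DESCENDING)));
        // prints [(d, 1), (b, 1), (c, 2), (a, 2)]
    }
}
```

Rows are grouped by the second column first (1 before 2), and ties within each group are broken by the first column in reverse alphabetical order, which is why (d, 1) precedes (b, 1).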
 

Enum Summary
Sort.Order  
 



Copyright © 2012 The Apache Software Foundation. All Rights Reserved.