Methods for performing various types of aggregations over
Utilities for Cartesian products of two
Functions for computing the distinct elements of a
Utilities for joining multiple
Static functions for working with legacy Mappers and Reducers that live under the org.apache.hadoop.mapred.* package as part of Crunch pipelines.
Static functions for working with legacy Mappers and Reducers that live under the org.apache.hadoop.mapreduce.* package as part of Crunch pipelines.
Methods for performing common operations on PTables.
Output type for storing the results of a Quantiles computation
Methods for performing random sampling in a distributed fashion, either by accepting each record in a
Utilities for performing a secondary sort on a
Utilities for performing set operations (difference, intersection, etc) on
Utilities for controlling how the data in a
Utilities for sorting
To sort by column 2 ascending then column 1 descending, you would use:
Tools for creating top lists of items in PTables and PCollections
For signaling the order in which a sort should be done.
Copyright © 2017 The Apache Software Foundation. All rights reserved.