Interface | Description |
---|---|
Aggregator<T> |
Aggregate a sequence of values into a possibly smaller sequence of the same type.
|
CombineFn.Aggregator<T> | Deprecated
Use
Aggregator |
CombineFn.AggregatorFactory<T> | Deprecated
Use
PGroupedTable.combineValues(Aggregator) which doesn't require a factory. |
Emitter<T> |
Interface for writing outputs from a
DoFn . |
PCollection<S> |
A representation of an immutable, distributed collection of elements that is
the fundamental target of computations in Crunch.
|
PGroupedTable<K,V> |
The Crunch representation of a grouped
PTable . |
Pipeline |
Manages the state of a pipeline execution.
|
PObject<T> |
A
PObject represents a singleton object value that results from a distributed
computation. |
PTable<K,V> |
A sub-interface of
PCollection that represents an immutable,
distributed multi-map of keys and values. |
Source<T> |
A
Source represents an input data set that is an input to one or more
MapReduce jobs. |
SourceTarget<T> |
An interface for classes that implement both the
Source and the
Target interfaces. |
TableSource<K,V> |
The interface
Source implementations that return a PTable . |
TableSourceTarget<K,V> |
An interface for classes that implement both the
TableSource and the
Target interfaces. |
Target |
A
Target represents the output destination of a Crunch PCollection
in the context of a Crunch job. |
Tuple |
A fixed-size collection of Objects, used in Crunch for representing joins
between
PCollection s. |
Class | Description |
---|---|
CombineFn<S,T> | |
CombineFn.AggregatorCombineFn<K,V> | Deprecated
Use the
Aggregators.toCombineFn(org.apache.crunch.Aggregator) adapter |
CombineFn.FirstNAggregator<V> | Deprecated |
CombineFn.LastNAggregator<V> | Deprecated |
CombineFn.MaxBigInts | Deprecated |
CombineFn.MaxDoubles | Deprecated |
CombineFn.MaxFloats | Deprecated |
CombineFn.MaxInts | Deprecated |
CombineFn.MaxLongs | Deprecated |
CombineFn.MaxNAggregator<V extends Comparable<V>> | Deprecated |
CombineFn.MinBigInts | Deprecated |
CombineFn.MinDoubles | Deprecated |
CombineFn.MinFloats | Deprecated |
CombineFn.MinInts | Deprecated |
CombineFn.MinLongs | Deprecated |
CombineFn.MinNAggregator<V extends Comparable<V>> | Deprecated |
CombineFn.PairAggregator<V1,V2> | Deprecated |
CombineFn.QuadAggregator<A,B,C,D> | Deprecated |
CombineFn.SimpleAggregator<T> | Deprecated |
CombineFn.StringConcatAggregator | Deprecated |
CombineFn.SumBigInts | Deprecated |
CombineFn.SumDoubles | Deprecated |
CombineFn.SumFloats | Deprecated |
CombineFn.SumInts | Deprecated |
CombineFn.SumLongs | Deprecated |
CombineFn.TripAggregator<A,B,C> | Deprecated |
CombineFn.TupleNAggregator | Deprecated |
DoFn<S,T> |
Base class for all data processing functions in Crunch.
|
FilterFn<T> |
A
DoFn for the common case of filtering the members of a
PCollection based on a boolean condition. |
FilterFn.AndFn<S> | Deprecated |
FilterFn.NotFn<S> | Deprecated |
FilterFn.OrFn<S> | Deprecated |
GroupingOptions |
Options that can be passed to a
groupByKey operation in order to
exercise finer control over how the partitioning, grouping, and sorting of
keys is performed. |
GroupingOptions.Builder |
Builder class for creating
GroupingOptions instances. |
MapFn<S,T> |
A
DoFn for the common case of emitting exactly one value for each
input record. |
Pair<K,V> |
A convenience class for two-element
Tuple s. |
ParallelDoOptions |
Container class that includes optional information about a
parallelDo operation
applied to a PCollection . |
ParallelDoOptions.Builder | |
PipelineResult |
Container for the results of a call to
run or done on the
Pipeline interface that includes details and statistics about the component
stages of the data pipeline. |
PipelineResult.StageResult | |
Tuple3<V1,V2,V3> |
A convenience class for three-element
Tuple s. |
Tuple4<V1,V2,V3,V4> |
A convenience class for four-element
Tuple s. |
TupleN |
A
Tuple instance for an arbitrary number of values. |
Enum | Description |
---|---|
Target.WriteMode |
An enum to represent different options the client may specify
for handling the case where the output path, table, etc.
|
Exception | Description |
---|---|
CrunchRuntimeException |
A
RuntimeException implementation that includes some additional options
for the Crunch execution engine to track reporting status. |
Copyright © 2013 The Apache Software Foundation. All Rights Reserved.