PCollection instances.
PObject encapsulating a Map made up of the keys and values in this
PTable.
SourceTarget types.
InputFormat for Avro data files.
OutputFormat for Avro data files.
RecordReader for Avro data files.
AvroTypeFamily for convenient static importing.
InputFormat for text files.
PTable.
PTable.
PTable or PCollection
instances.
DoFn is
associated with.
DoFn is
associated with.
DoFn is
associated with.
PTable arguments.
DoFn implementation that converts an Iterable of
values into a single value.
CombineFn that delegates all of the actual work to an
Aggregator instance.
CombineFn.
comm utility.
DoFn, or takes the output of a DoFn and writes it to the
output key/values.
File.
Path.
PTable that contains the unique elements of this collection mapped to a count
of their occurrences.
PTable instance that contains the counts of each unique
element of this PCollection.
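The counting entries above describe mapping each unique element to its number of occurrences. As a rough illustration of that semantics only, here is a minimal in-memory sketch in plain Java. It does not use the Crunch API; the class name `CountSketch` and the method `count` are hypothetical names chosen for this example.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of Crunch: an in-memory analogue of
// "unique elements mapped to a count of their occurrences".
public class CountSketch {

    // Returns a map from each distinct element to how often it appears.
    // LinkedHashMap keeps first-seen order so the result is easy to inspect.
    static <T> Map<T, Long> count(Iterable<T> elements) {
        Map<T, Long> counts = new LinkedHashMap<>();
        for (T element : elements) {
            // merge() inserts 1 for a new key, otherwise adds 1 to the old value.
            counts.merge(element, 1L, Long::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        System.out.println(count(java.util.Arrays.asList("a", "b", "a")));
    }
}
```

In a distributed pipeline the same result would be computed per key across many workers; the sketch only shows the shape of the output.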
PTables (using the same
strategy as Pig's CROSS operator).
PTables (using the same
strategy as Pig's CROSS operator).
PCollections (using the
same strategy as Pig's CROSS operator).
PCollections (using the
same strategy as Pig's CROSS operator).
Tool interface that creates a Pipeline
instance and provides methods for working with the Pipeline from inside of
the Tool's run method.
run.
DoFn.
MapFn into a key-value pair that is
used to convert from a PCollection<V> to a PTable<K, V>.
PCollection.
PCollection.
DoFn for the common case of filtering the members of a
PCollection based on a boolean condition.
Source types.
PTables.
Configuration instance associated with this pipeline.
SourceTarget that is able to read/write data using the serialization format
specified by this PType.
Pair.
PTypeFamily that this PType belongs to.
File below the temporary directory.
PGroupedTable value.
PType of the key.
PCollection.
Path below the temporary directory.
Pipeline associated with this PCollection.
PTableType of this PTable.
PType of this PCollection.
Path.
PCollection in
bytes.
Source.
TupleFactory for a given Tuple implementation.
PType for this source.
PType.
PTypeFamily of this PCollection.
PObject.
PType of the value.
GroupingOptions to control how the grouping is executed.
groupByKey operation in order to
exercise finer control over how the partitioning, grouping, and sorting of
keys is performed.
GroupingOptions instances.
PTables.
PTable instances based on a common
lastKey.
PTables.
DoFn for performing joins.
PCollection made up of the keys in this PTable.
PTables.
PCollection.
DoFn for the common case of emitting exactly one value for each
input record.
PTables.
PObject of the maximum element of this instance.
PObject of the minimum element of this instance.
Tuples.
PCollection and
returns a new PCollection that is the output of this processing.
PCollection and
returns a new PCollection that is the output of this processing.
parallelDo instance, but returns a
PTable instance instead of a PCollection.
parallelDo instance, but returns a
PTable instance instead of a PCollection.
PTable.
PType instance for PGroupedTable instances.
run or done on the
Pipeline interface that includes details and statistics about the component
stages of the data pipeline.
PObject represents a singleton object value that results from a distributed
computation.
PCollection.
PCollection that represents an immutable,
distributed multi-map of keys and values.
PType specifically for PTable objects.
PType defines a mapping between a data type that is used in a Crunch pipeline and a
serialization and storage format that is used to read/write data from/to HDFS.
PType instances that have the same
serialization/storage backing format.
PTypes from different
PTypeFamily implementations.
Source into a PCollection that is
available to jobs run using this Pipeline instance.
TableSource instances that map to
PTables.
SourceTarget instance can be read
into the local client.
PTables.
Serialization used by jobs configured with AvroJob.
PCollection
will cause it to change in size.
PTable<K, Pair<V1, V2>> collection.
FileNamingScheme that uses an incrementing sequence number in
order to generate unique file names.
PCollection instances.
Configuration to use with this pipeline.
TaskInputOutputContext to this
DoFn instance.
PCollection instances.
PCollection using the natural ordering of its elements.
PCollection using the natural ordering of its elements in
the order specified.
PTable using the natural ordering of its keys.
PTable using the natural ordering of its keys in the
order specified.
PCollection instance that contains all of the elements of
this instance in sorted order.
sortPairs(coll, by(2, ASCENDING), by(1, DESCENDING))
Column numbering is 1-based.
PTable instance and then apply a
DoFn to the resulting sorted data to yield an output PCollection<T>.
PTable instance and then apply a
DoFn to the resulting sorted data to yield an output PTable<U, V>.
PCollection of Pairs using the specified column
ordering.
PCollection of Tuple4s using the specified column
ordering.
PCollection of Tuple3s using the specified column
ordering.
PCollection of TupleNs using the specified column
ordering.
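The `sortPairs(coll, by(2, ASCENDING), by(1, DESCENDING))` entry above sorts by the second column ascending and breaks ties by the first column descending, with columns numbered from 1. The plain-Java sketch below mimics that ordering on an in-memory list. It is an illustration only and does not call Crunch; the `SortPairsSketch` class, its nested `Pair` type, and the method name are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Hypothetical illustration, not the Crunch API: sort pairs by column 2
// ascending, then by column 1 descending (columns are 1-based).
public class SortPairsSketch {

    // Minimal stand-in for a two-column tuple.
    static final class Pair {
        final String first;   // column 1
        final int second;     // column 2
        Pair(String first, int second) {
            this.first = first;
            this.second = second;
        }
    }

    static List<Pair> sortBySecondAscThenFirstDesc(List<Pair> input) {
        List<Pair> sorted = new ArrayList<>(input); // leave the input untouched
        sorted.sort(
            Comparator.comparingInt((Pair p) -> p.second)            // by(2, ASCENDING)
                .thenComparing((Pair p) -> p.first,
                               Comparator.reverseOrder()));          // by(1, DESCENDING)
        return sorted;
    }

    public static void main(String[] args) {
        List<Pair> out = sortBySecondAscThenFirstDesc(Arrays.asList(
            new Pair("a", 2), new Pair("b", 1), new Pair("c", 2), new Pair("d", 1)));
        for (Pair p : out) {
            System.out.println(p.first + " " + p.second);
        }
    }
}
```

The chained comparator reads in the same order as the `by(...)` arguments, which is why the column ordering is listed most-significant first.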
Source represents an input data set that is an input to one or more
MapReduce jobs.
Source and the
Target interfaces.
Source implementations that return a PTable.
Target represents the output destination of a Crunch job.
TemporaryPath.
Target types.
PCollections.
Tuples.
Tuples.
Tuple instance for an arbitrary number of values.
Tuple interface.
PCollection instance that acts as the union of this
PCollection and the input PCollections.
PTable instance that acts as the union of this
PTable and the input PTables.
PCollection made up of the values in this PTable.
WritableTypeFamily for convenient static importing.
Writable-based implementation of the
PTypeFamily interface.
PCollection to the given Target,
using the storage format specified by the target.
PTable to the given Target.
out.