PCollection instances.
SourceTarget types.
SourceTarget types.
Closeable when AutoClosingIterator.hasNext() returns false.
InputFormat for Avro data files.
OutputFormat for Avro data files.
RecordReader for Avro data files.
AvroTypeFamily for convenient static importing.
InputFormat for text files.
PTable.
PTable.
PTable or PCollection instances.
DoFn is associated with.
DoFn is associated with.
DoFn is associated with.
PTable arguments.
DoFn implementation that converts an Iterable of values into a single value.
CombineFn that delegates all of the actual work to an Aggregator instance.
CombineFn.
comm utility.
DoFn, or takes the output of a DoFn and writes it to the output key/values.
File.
Path.
PTable that contains the unique elements of this collection mapped to a count of their occurrences.
PTable instance that contains the counts of each unique element of this PCollection.
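The counting behaviour described in the two entries above (unique elements mapped to a count of their occurrences) can be illustrated with a minimal in-memory sketch. This is plain Java over an ordinary Iterable, not the Crunch API; the class name CountSketch and the method count are hypothetical names chosen for illustration, and nothing here reflects how Crunch executes the operation on a cluster.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.Map;

public class CountSketch {
    // Maps each distinct element to the number of times it occurs.
    // A LinkedHashMap keeps first-seen order so the output is easy to read.
    public static <T> Map<T, Long> count(Iterable<T> elements) {
        Map<T, Long> counts = new LinkedHashMap<>();
        for (T element : elements) {
            counts.merge(element, 1L, Long::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        // prints {a=3, b=1, c=1}
        System.out.println(count(Arrays.asList("a", "b", "a", "c", "a")));
    }
}
```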
PTables (using the same strategy as Pig's CROSS operator).
PTables (using the same strategy as Pig's CROSS operator).
PCollections (using the same strategy as Pig's CROSS operator).
PCollections (using the same strategy as Pig's CROSS operator).
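As a rough illustration of what a cross operation produces, the sketch below pairs every element of one list with every element of another. It only shows the shape of the result; it says nothing about the distributed strategy that Crunch or Pig uses to compute it. The names CrossSketch and cross are hypothetical and are not part of the Crunch API.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;

public class CrossSketch {
    // Returns every (left, right) combination, left-major order.
    public static <A, B> List<Map.Entry<A, B>> cross(List<A> left, List<B> right) {
        List<Map.Entry<A, B>> out = new ArrayList<>(left.size() * right.size());
        for (A a : left) {
            for (B b : right) {
                out.add(new SimpleEntry<>(a, b));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // prints [x=1, x=2, y=1, y=2]
        System.out.println(cross(Arrays.asList("x", "y"), Arrays.asList(1, 2)));
    }
}
```

The result always has left.size() * right.size() entries, which is why cross operations are expensive on large inputs.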
Tool interface that creates a Pipeline instance and provides methods for working with the Pipeline from inside of the Tool's run method.
run.
DoFn.
MapFn into a key-value pair that is used to convert from a PCollection<V> to a PTable<K, V>.
PCollection.
PCollection.
DoFn for the common case of filtering the members of a PCollection based on a boolean condition.
Source types.
Source types.
PTables.
Configuration instance associated with this pipeline.
SourceTarget that is able to read/write data using the serialization format specified by this PType.
Pair.
PTypeFamily that this PType belongs to.
File below the temporary directory.
PGroupedTable value.
PType of the key.
PCollection.
Path below the temporary directory.
Pipeline associated with this PCollection.
InputSplit is complete.
PTableType of this PTable.
PType of this PCollection.
Path.
PCollection in bytes.
Source.
TupleFactory for a given Tuple implementation.
PType for this source.
PType.
PTypeFamily of this PCollection.
PType of the value.
GroupingOptions to control how the grouping is executed.
groupByKey operation in order to exercise finer control over how the partitioning, grouping, and sorting of keys is performed.
GroupingOptions instances.
DoFn is associated with.
DoFn is associated with.
DoFn is associated with.
DoFn is associated with.
DoFn is associated with.
Emitter instance that writes emitted records to a backing List.
PTables.
Emitter implementation that links the output of one DoFn to the input of another DoFn.
DoNode instances in a job and builds a String that identifies the stages of the pipeline that belong to this job.
PTable instances based on a common lastKey.
PTables.
DoFn for performing joins.
PCollection made up of the keys in this PTable.
PTables.
DoFn for the common case of emitting exactly one value for each input record.
PTables.
PCollection made up of only the maximum element of this instance.
PCollection made up of only the minimum element of this instance.
DoNode instance, so we know how to use it within the context of a particular MR job.
Tuples.
PCollection and returns a new PCollection that is the output of this processing.
PCollection and returns a new PCollection that is the output of this processing.
parallelDo instance, but returns a PTable instance instead of a PCollection.
parallelDo instance, but returns a PTable instance instead of a PCollection.
PTable.
PType instance for PGroupedTable instances.
run or done on the Pipeline interface that includes details and statistics about the component stages of the data pipeline.
PCollection.
PCollection that represents an immutable, distributed multi-map of keys and values.
PType specifically for PTable objects.
PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS.
PType instances that have the same serialization/storage backing format.
PTypes from different PTypeFamily implementations.
Source into a PCollection that is available to jobs run using this Pipeline instance.
TableSource instances that map to PTables.
SourceTarget instance can be read into the local client.
PTables.
Serialization used by jobs configured with AvroJob.
ReflectDataFactory class to work with Scala classes.
PCollection will cause it to change in size.
PCollection instances.
Configuration to use with this pipeline.
Configuration instance to be used during unit tests.
TaskInputOutputContext to this DoFn instance.
PCollection instances.
PCollection using the natural ordering of its elements.
PCollection using the natural ordering of its elements in the order specified.
PTable using the natural ordering of its keys.
PTable using the natural ordering of its keys in the order specified.
PCollection instance that contains all of the elements of this instance in sorted order.
sortPairs(coll, by(2, ASCENDING), by(1, DESCENDING))
Column numbering is 1-based.
PCollection of Pairs using the specified column ordering.
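The column ordering in the sortPairs(coll, by(2, ASCENDING), by(1, DESCENDING)) example can be read as: sort by the second column ascending, then break ties by the first column descending. The sketch below mimics that reading in plain Java over in-memory rows. It is not the Crunch implementation; ColumnSortSketch, ColumnOrder, sortRows and this local by helper are hypothetical stand-ins, and only the 1-based column numbering and the ASCENDING/DESCENDING names are taken from the entry above.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

public class ColumnSortSketch {
    public enum Order { ASCENDING, DESCENDING }

    // Hypothetical stand-in for a column ordering: 1-based column index plus direction.
    public static final class ColumnOrder {
        final int column;
        final Order order;

        ColumnOrder(int column, Order order) {
            this.column = column;
            this.order = order;
        }
    }

    public static ColumnOrder by(int column, Order order) {
        if (column < 1) {
            throw new IllegalArgumentException("Column numbering is 1-based");
        }
        return new ColumnOrder(column, order);
    }

    // Sorts a copy of the rows; earlier orderings take priority over later ones.
    public static List<String[]> sortRows(List<String[]> rows, ColumnOrder... orders) {
        Comparator<String[]> combined = (a, b) -> 0;
        for (ColumnOrder o : orders) {
            final int index = o.column - 1; // convert 1-based column to array index
            Comparator<String[]> byColumn = (a, b) -> a[index].compareTo(b[index]);
            if (o.order == Order.DESCENDING) {
                byColumn = byColumn.reversed();
            }
            combined = combined.thenComparing(byColumn);
        }
        List<String[]> sorted = new ArrayList<>(rows);
        sorted.sort(combined);
        return sorted;
    }

    public static void main(String[] args) {
        List<String[]> rows = Arrays.asList(
            new String[] {"b", "1"},
            new String[] {"a", "2"},
            new String[] {"c", "1"});
        // prints [c, 1] then [b, 1] then [a, 2], one row per line
        for (String[] row : sortRows(rows, by(2, Order.ASCENDING), by(1, Order.DESCENDING))) {
            System.out.println(Arrays.toString(row));
        }
    }
}
```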
PCollection of Tuple4s using the specified column ordering.
PCollection of Tuple3s using the specified column ordering.
PCollection of TupleNs using the specified column ordering.
Source represents an input data set that is an input to one or more MapReduce jobs.
Source and the Target interfaces.
Source implementations that return a PTable.
Target represents the output destination of a Crunch job.
TemporaryPath.
Target types.
Target types.
PCollections.
Tuples.
Tuples.
Tuple instance for an arbitrary number of values.
Tuple interface.
PCollection instance that acts as the union of this PCollection and the input PCollections.
PTable instance that acts as the union of this PTable and the input PTables.
PCollection made up of the values in this PTable.
WritableTypeFamily for convenient static importing.
Writable-based implementation of the PTypeFamily interface.
PCollection to the given Target, using the storage format specified by the target.
PTable to the given Target.
out.