Package | Description |
---|---|
org.apache.crunch | Client-facing API and core abstractions. |
org.apache.crunch.contrib.text | |
org.apache.crunch.fn | Commonly used functions for manipulating collections. |
org.apache.crunch.impl.dist | |
org.apache.crunch.impl.dist.collect | |
org.apache.crunch.impl.mem | In-memory Pipeline implementation for rapid prototyping and testing. |
org.apache.crunch.impl.spark | |
org.apache.crunch.impl.spark.collect | |
org.apache.crunch.impl.spark.fn | |
org.apache.crunch.impl.spark.serde | |
org.apache.crunch.io | Data input and output for Pipelines. |
org.apache.crunch.lambda | Alternative Crunch API using Java 8 features to allow construction of pipelines using lambda functions and method references. |
org.apache.crunch.lib | Joining, sorting, aggregating, and other commonly used functionality. |
org.apache.crunch.lib.join | Inner and outer joins on collections. |
org.apache.crunch.lib.sort | |
org.apache.crunch.types | Common functionality for business object serialization. |
org.apache.crunch.types.avro | Business object serialization using Apache Avro. |
org.apache.crunch.types.orc | |
org.apache.crunch.types.writable | Business object serialization using Hadoop's Writables framework. |

Class | Description |
---|---|
Converter | Converts the input key/value from a MapReduce task into the input to a DoFn, or takes the output of a DoFn and writes it to the output key/values. |
PGroupedTableType | The PType instance for PGroupedTable instances. |
PTableType | An extension of PType specifically for PTable objects. |
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |
PTypeFamily | An abstract factory for creating PType instances that have the same serialization/storage backing format. |
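
The relationship between PType and PTypeFamily described in the table above can be illustrated with a small, self-contained sketch. It does not use the Crunch API: `SketchType`, `SketchTypeFamily`, `BinaryFamily`, `TextFamily`, and `PTypeSketch` are hypothetical names invented for this illustration, and the real interfaces carry more responsibilities than shown here. The sketch only models two ideas: a type pairs an in-memory value with a stored form, and a family is a factory whose types all share one backing format.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.function.Function;

// Hypothetical stand-in for the PType idea (NOT the Crunch interface):
// a named backing format plus functions to and from the stored form.
final class SketchType<T> {
    final String format;
    final Function<T, byte[]> toStored;
    final Function<byte[], T> fromStored;

    SketchType(String format, Function<T, byte[]> toStored, Function<byte[], T> fromStored) {
        this.format = format;
        this.toStored = toStored;
        this.fromStored = fromStored;
    }
}

// Hypothetical stand-in for the PTypeFamily idea: an abstract factory whose
// products all share one backing format.
interface SketchTypeFamily {
    SketchType<String> strings();
    SketchType<Long> longs();
}

// Family whose types store values in a fixed-width binary form.
final class BinaryFamily implements SketchTypeFamily {
    public SketchType<String> strings() {
        return new SketchType<>("binary",
            s -> s.getBytes(StandardCharsets.UTF_8),
            b -> new String(b, StandardCharsets.UTF_8));
    }

    public SketchType<Long> longs() {
        return new SketchType<>("binary",
            v -> ByteBuffer.allocate(Long.BYTES).putLong(v).array(),
            b -> ByteBuffer.wrap(b).getLong());
    }
}

// Family whose types store values as human-readable text.
final class TextFamily implements SketchTypeFamily {
    public SketchType<String> strings() {
        return new SketchType<>("text",
            s -> s.getBytes(StandardCharsets.UTF_8),
            b -> new String(b, StandardCharsets.UTF_8));
    }

    public SketchType<Long> longs() {
        return new SketchType<>("text",
            v -> Long.toString(v).getBytes(StandardCharsets.UTF_8),
            b -> Long.parseLong(new String(b, StandardCharsets.UTF_8)));
    }
}

public class PTypeSketch {
    // Writes a value to its stored form and reads it back.
    static <T> T roundTrip(SketchType<T> type, T value) {
        return type.fromStored.apply(type.toStored.apply(value));
    }

    public static void main(String[] args) {
        SketchTypeFamily[] families = { new BinaryFamily(), new TextFamily() };
        for (SketchTypeFamily family : families) {
            SketchType<Long> longs = family.longs();
            int storedSize = longs.toStored.apply(42L).length;
            System.out.println(longs.format + ": " + storedSize
                + " bytes, round trip = " + roundTrip(longs, 42L));
        }
    }
}
```

Code written against `SketchTypeFamily` never names a concrete format, which is the point of the abstract-factory arrangement: swapping `BinaryFamily` for `TextFamily` changes how every value is stored without touching the calling code.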

Class | Description |
---|---|
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |
PTypeFamily | An abstract factory for creating PType instances that have the same serialization/storage backing format. |

Class | Description |
---|---|
PTableType | An extension of PType specifically for PTable objects. |
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |

Class | Description |
---|---|
PTableType | An extension of PType specifically for PTable objects. |
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |

Class | Description |
---|---|
PGroupedTableType | The PType instance for PGroupedTable instances. |
PTableType | An extension of PType specifically for PTable objects. |
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |
PTypeFamily | An abstract factory for creating PType instances that have the same serialization/storage backing format. |

Class | Description |
---|---|
PTableType | An extension of PType specifically for PTable objects. |
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |

Class | Description |
---|---|
PGroupedTableType | The PType instance for PGroupedTable instances. |
PTableType | An extension of PType specifically for PTable objects. |
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |

Class | Description |
---|---|
PTableType | An extension of PType specifically for PTable objects. |
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |

Class | Description |
---|---|
Converter | Converts the input key/value from a MapReduce task into the input to a DoFn, or takes the output of a DoFn and writes it to the output key/values. |
PGroupedTableType | The PType instance for PGroupedTable instances. |

Class | Description |
---|---|
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |

Class | Description |
---|---|
PTableType | An extension of PType specifically for PTable objects. |
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |

Class | Description |
---|---|
PTableType | An extension of PType specifically for PTable objects. |
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |
PTypeFamily | An abstract factory for creating PType instances that have the same serialization/storage backing format. |

Class | Description |
---|---|
PGroupedTableType | The PType instance for PGroupedTable instances. |
PTableType | An extension of PType specifically for PTable objects. |
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |

Class | Description |
---|---|
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |
PTypeFamily | An abstract factory for creating PType instances that have the same serialization/storage backing format. |

Class | Description |
---|---|
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |
TupleFactory | |

Class | Description |
---|---|
Converter | Converts the input key/value from a MapReduce task into the input to a DoFn, or takes the output of a DoFn and writes it to the output key/values. |
DeepCopier | Performs deep copies of values. |
NoOpDeepCopier | A DeepCopier that does nothing and returns the input value without copying anything. |
PGroupedTableType | The PType instance for PGroupedTable instances. |
PTableType | An extension of PType specifically for PTable objects. |
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |
PTypeFamily | An abstract factory for creating PType instances that have the same serialization/storage backing format. |
TupleFactory | |

Class | Description |
---|---|
Converter | Converts the input key/value from a MapReduce task into the input to a DoFn, or takes the output of a DoFn and writes it to the output key/values. |
DeepCopier | Performs deep copies of values. |
PTableType | An extension of PType specifically for PTable objects. |
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |
PTypeFamily | An abstract factory for creating PType instances that have the same serialization/storage backing format. |

Class | Description |
---|---|
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |
TupleFactory | |

Class | Description |
---|---|
Converter | Converts the input key/value from a MapReduce task into the input to a DoFn, or takes the output of a DoFn and writes it to the output key/values. |
DeepCopier | Performs deep copies of values. |
PTableType | An extension of PType specifically for PTable objects. |
PType | A PType defines a mapping between a data type that is used in a Crunch pipeline and a serialization and storage format that is used to read/write data from/to HDFS. |
PTypeFamily | An abstract factory for creating PType instances that have the same serialization/storage backing format. |

Copyright © 2016 The Apache Software Foundation. All rights reserved.