|
|||||||||
PREV NEXT | FRAMES NO FRAMES |
Packages that use org.apache.crunch | |
---|---|
org.apache.crunch | Client-facing API and core abstractions. |
org.apache.crunch.contrib.bloomfilter | Support for creating Bloom Filters. |
org.apache.crunch.contrib.io.jdbc | Support for reading data from RDBMS using JDBC |
org.apache.crunch.examples | Example applications demonstrating various aspects of Crunch. |
org.apache.crunch.fn | Commonly used functions for manipulating collections. |
org.apache.crunch.impl.mem | In-memory Pipeline implementation for rapid prototyping and testing. |
org.apache.crunch.impl.mr | A Pipeline implementation that runs on Hadoop MapReduce. |
org.apache.crunch.io | Data input and output for Pipelines. |
org.apache.crunch.lib | Joining, sorting, aggregating, and other commonly used functionality. |
org.apache.crunch.lib.join | Inner and outer joins on collections. |
org.apache.crunch.types | Common functionality for business object serialization. |
org.apache.crunch.types.avro | Business object serialization using Apache Avro. |
org.apache.crunch.types.writable | Business object serialization using Hadoop's Writables framework. |
org.apache.crunch.util | An assorted set of utilities. |
Classes in org.apache.crunch used by org.apache.crunch | |
---|---|
CombineFn
A special DoFn implementation that converts an Iterable of
values into a single value. |
|
CombineFn.Aggregator
|
|
CombineFn.AggregatorFactory
Interface for constructing new aggregator instances. |
|
CombineFn.SimpleAggregator
Base class for aggregators that do not require any initialization. |
|
DoFn
Base class for all data processing functions in Crunch. |
|
Emitter
Interface for writing outputs from a DoFn . |
|
FilterFn
A DoFn for the common case of filtering the members of a
PCollection based on a boolean condition. |
|
GroupingOptions
Options that can be passed to a groupByKey operation in order to
exercise finer control over how the partitioning, grouping, and sorting of
keys is performed. |
|
GroupingOptions.Builder
Builder class for creating GroupingOptions instances. |
|
MapFn
A DoFn for the common case of emitting exactly one value for each
input record. |
|
Pair
A convenience class for two-element Tuple s. |
|
PCollection
A representation of an immutable, distributed collection of elements that is the fundamental target of computations in Crunch. |
|
PGroupedTable
The Crunch representation of a grouped PTable . |
|
Pipeline
Manages the state of a pipeline execution. |
|
PipelineResult
Container for the results of a call to run or done on the
Pipeline interface that includes details and statistics about the component
stages of the data pipeline. |
|
PipelineResult.StageResult
|
|
PObject
A PObject represents a singleton object value that results from a distributed
computation. |
|
PTable
A sub-interface of PCollection that represents an immutable,
distributed multi-map of keys and values. |
|
Source
A Source represents an input data set that is an input to one or more
MapReduce jobs. |
|
SourceTarget
An interface for classes that implement both the Source and the
Target interfaces. |
|
TableSource
The interface Source implementations that return a PTable . |
|
Target
A Target represents the output destination of a Crunch job. |
|
Tuple
A fixed-size collection of Objects, used in Crunch for representing joins between PCollection s. |
|
Tuple3
A convenience class for three-element Tuple s. |
|
Tuple4
A convenience class for four-element Tuple s. |
|
TupleN
A Tuple instance for an arbitrary number of values. |
Classes in org.apache.crunch used by org.apache.crunch.contrib.bloomfilter | |
---|---|
DoFn
Base class for all data processing functions in Crunch. |
|
Emitter
Interface for writing outputs from a DoFn . |
|
Pair
A convenience class for two-element Tuple s. |
|
PCollection
A representation of an immutable, distributed collection of elements that is the fundamental target of computations in Crunch. |
|
PObject
A PObject represents a singleton object value that results from a distributed
computation. |
Classes in org.apache.crunch used by org.apache.crunch.contrib.io.jdbc | |
---|---|
Source
A Source represents an input data set that is an input to one or more
MapReduce jobs. |
Classes in org.apache.crunch used by org.apache.crunch.examples | |
---|---|
PCollection
A representation of an immutable, distributed collection of elements that is the fundamental target of computations in Crunch. |
|
PTable
A sub-interface of PCollection that represents an immutable,
distributed multi-map of keys and values. |
Classes in org.apache.crunch used by org.apache.crunch.fn | |
---|---|
DoFn
Base class for all data processing functions in Crunch. |
|
Emitter
Interface for writing outputs from a DoFn . |
|
MapFn
A DoFn for the common case of emitting exactly one value for each
input record. |
|
Pair
A convenience class for two-element Tuple s. |
Classes in org.apache.crunch used by org.apache.crunch.impl.mem | |
---|---|
Pair
A convenience class for two-element Tuple s. |
|
PCollection
A representation of an immutable, distributed collection of elements that is the fundamental target of computations in Crunch. |
|
Pipeline
Manages the state of a pipeline execution. |
|
PipelineResult
Container for the results of a call to run or done on the
Pipeline interface that includes details and statistics about the component
stages of the data pipeline. |
|
PTable
A sub-interface of PCollection that represents an immutable,
distributed multi-map of keys and values. |
|
Source
A Source represents an input data set that is an input to one or more
MapReduce jobs. |
|
TableSource
The interface Source implementations that return a PTable . |
|
Target
A Target represents the output destination of a Crunch job. |
Classes in org.apache.crunch used by org.apache.crunch.impl.mr | |
---|---|
PCollection
A representation of an immutable, distributed collection of elements that is the fundamental target of computations in Crunch. |
|
Pipeline
Manages the state of a pipeline execution. |
|
PipelineResult
Container for the results of a call to run or done on the
Pipeline interface that includes details and statistics about the component
stages of the data pipeline. |
|
PTable
A sub-interface of PCollection that represents an immutable,
distributed multi-map of keys and values. |
|
Source
A Source represents an input data set that is an input to one or more
MapReduce jobs. |
|
SourceTarget
An interface for classes that implement both the Source and the
Target interfaces. |
|
TableSource
The interface Source implementations that return a PTable . |
|
Target
A Target represents the output destination of a Crunch job. |
Classes in org.apache.crunch used by org.apache.crunch.io | |
---|---|
Source
A Source represents an input data set that is an input to one or more
MapReduce jobs. |
|
SourceTarget
An interface for classes that implement both the Source and the
Target interfaces. |
|
TableSource
The interface Source implementations that return a PTable . |
|
Target
A Target represents the output destination of a Crunch job. |
Classes in org.apache.crunch used by org.apache.crunch.lib | |
---|---|
CombineFn
A special DoFn implementation that converts an Iterable of
values into a single value. |
|
DoFn
Base class for all data processing functions in Crunch. |
|
Emitter
Interface for writing outputs from a DoFn . |
|
Pair
A convenience class for two-element Tuple s. |
|
PCollection
A representation of an immutable, distributed collection of elements that is the fundamental target of computations in Crunch. |
|
PObject
A PObject represents a singleton object value that results from a distributed
computation. |
|
PTable
A sub-interface of PCollection that represents an immutable,
distributed multi-map of keys and values. |
|
Tuple3
A convenience class for three-element Tuple s. |
|
Tuple4
A convenience class for four-element Tuple s. |
|
TupleN
A Tuple instance for an arbitrary number of values. |
Classes in org.apache.crunch used by org.apache.crunch.lib.join | |
---|---|
DoFn
Base class for all data processing functions in Crunch. |
|
Emitter
Interface for writing outputs from a DoFn . |
|
Pair
A convenience class for two-element Tuple s. |
|
PTable
A sub-interface of PCollection that represents an immutable,
distributed multi-map of keys and values. |
Classes in org.apache.crunch used by org.apache.crunch.types | |
---|---|
DoFn
Base class for all data processing functions in Crunch. |
|
Emitter
Interface for writing outputs from a DoFn . |
|
GroupingOptions
Options that can be passed to a groupByKey operation in order to
exercise finer control over how the partitioning, grouping, and sorting of
keys is performed. |
|
MapFn
A DoFn for the common case of emitting exactly one value for each
input record. |
|
Pair
A convenience class for two-element Tuple s. |
|
SourceTarget
An interface for classes that implement both the Source and the
Target interfaces. |
|
Tuple
A fixed-size collection of Objects, used in Crunch for representing joins between PCollection s. |
|
Tuple3
A convenience class for three-element Tuple s. |
|
Tuple4
A convenience class for four-element Tuple s. |
|
TupleN
A Tuple instance for an arbitrary number of values. |
Classes in org.apache.crunch used by org.apache.crunch.types.avro | |
---|---|
GroupingOptions
Options that can be passed to a groupByKey operation in order to
exercise finer control over how the partitioning, grouping, and sorting of
keys is performed. |
|
MapFn
A DoFn for the common case of emitting exactly one value for each
input record. |
|
Pair
A convenience class for two-element Tuple s. |
|
SourceTarget
An interface for classes that implement both the Source and the
Target interfaces. |
|
Tuple
A fixed-size collection of Objects, used in Crunch for representing joins between PCollection s. |
|
Tuple3
A convenience class for three-element Tuple s. |
|
Tuple4
A convenience class for four-element Tuple s. |
|
TupleN
A Tuple instance for an arbitrary number of values. |
Classes in org.apache.crunch used by org.apache.crunch.types.writable | |
---|---|
GroupingOptions
Options that can be passed to a groupByKey operation in order to
exercise finer control over how the partitioning, grouping, and sorting of
keys is performed. |
|
MapFn
A DoFn for the common case of emitting exactly one value for each
input record. |
|
Pair
A convenience class for two-element Tuple s. |
|
SourceTarget
An interface for classes that implement both the Source and the
Target interfaces. |
|
Tuple
A fixed-size collection of Objects, used in Crunch for representing joins between PCollection s. |
|
Tuple3
A convenience class for three-element Tuple s. |
|
Tuple4
A convenience class for four-element Tuple s. |
|
TupleN
A Tuple instance for an arbitrary number of values. |
Classes in org.apache.crunch used by org.apache.crunch.util | |
---|---|
Pair
A convenience class for two-element Tuple s. |
|
PCollection
A representation of an immutable, distributed collection of elements that is the fundamental target of computations in Crunch. |
|
PTable
A sub-interface of PCollection that represents an immutable,
distributed multi-map of keys and values. |
|
Source
A Source represents an input data set that is an input to one or more
MapReduce jobs. |
|
TableSource
The interface Source implementations that return a PTable . |
|
Target
A Target represents the output destination of a Crunch job. |
|
Tuple3
A convenience class for three-element Tuple s. |
|
Tuple4
A convenience class for four-element Tuple s. |
|
TupleN
A Tuple instance for an arbitrary number of values. |
|
|||||||||
PREV NEXT | FRAMES NO FRAMES |