Package | Description |
---|---|
org.apache.crunch | Client-facing API and core abstractions. |
org.apache.crunch.impl.dist.collect | |
org.apache.crunch.impl.spark.collect | |
Modifier and Type | Method and Description |
---|---|
ParallelDoOptions | ParallelDoOptions.Builder.build() |
Modifier and Type | Method and Description |
---|---|
<K,V> PTable<K,V> | PCollection.parallelDo(String name, DoFn<S,Pair<K,V>> doFn, PTableType<K,V> type, ParallelDoOptions options) Similar to the other parallelDo instance, but returns a PTable instance instead of a PCollection. |
<T> PCollection<T> | PCollection.parallelDo(String name, DoFn<S,T> doFn, PType<T> type, ParallelDoOptions options) Applies the given doFn to the elements of this PCollection and returns a new PCollection that is the output of this processing. |
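The two rows above describe a method that takes a name, a function, and an options object produced by a builder, and returns a new collection. The sketch below illustrates that call shape using only the JDK. Everything in it (`ParallelDoSketch`, `Options`, `Coll`, `conf`, `getExtraConf`) is a hypothetical stand-in, not a Crunch class. It assumes a one-to-one mapping function for simplicity and omits the `PType` argument.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

/** Hypothetical stand-ins only; these are NOT the org.apache.crunch classes. */
public class ParallelDoSketch {

    /** Immutable options object created through a builder (assumed shape). */
    static final class Options {
        private final Map<String, String> extraConf;

        private Options(Map<String, String> extraConf) {
            // Defensive copy so later builder changes cannot leak in.
            this.extraConf = Collections.unmodifiableMap(new LinkedHashMap<>(extraConf));
        }

        Map<String, String> getExtraConf() {
            return extraConf;
        }

        static Builder builder() {
            return new Builder();
        }

        static final class Builder {
            private final Map<String, String> conf = new LinkedHashMap<>();

            /** Records one key/value setting (hypothetical method). */
            Builder conf(String key, String value) {
                conf.put(key, value);
                return this;
            }

            Options build() {
                return new Options(conf);
            }
        }
    }

    /** In-memory collection with a parallelDo-style method. */
    static final class Coll<S> {
        private final String name;
        private final List<S> elements;
        private final Options options;

        Coll(String name, List<S> elements, Options options) {
            this.name = name;
            this.elements = new ArrayList<>(elements);
            this.options = options;
        }

        /** Applies fn to every element and returns a new collection carrying the options. */
        <T> Coll<T> parallelDo(String name, Function<S, T> fn, Options options) {
            List<T> out = new ArrayList<>();
            for (S element : elements) {
                out.add(fn.apply(element));
            }
            return new Coll<>(name, out, options);
        }

        String getName() {
            return name;
        }

        List<S> getElements() {
            return Collections.unmodifiableList(elements);
        }

        Options getOptions() {
            return options;
        }
    }

    public static void main(String[] args) {
        Options opts = Options.builder().conf("example.key", "example-value").build();
        Coll<String> words =
            new Coll<>("words", Arrays.asList("a", "bb", "ccc"), Options.builder().build());

        // Name, function, and options are passed together, as in the signatures above.
        Coll<Integer> lengths = words.parallelDo("lengths", String::length, opts);

        System.out.println(lengths.getName() + " " + lengths.getElements());
        System.out.println(lengths.getOptions().getExtraConf());
    }
}
```

Passing the options as an explicit trailing argument keeps the per-operation settings next to the operation they affect, and the builder keeps the options object immutable once built.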
Modifier and Type | Method and Description |
---|---|
ParallelDoOptions | PCollectionImpl.getParallelDoOptions() |
Modifier and Type | Method and Description |
---|---|
<S,T> BaseDoCollection<T> | PCollectionFactory.createDoCollection(String name, PCollectionImpl<S> chainingCollection, DoFn<S,T> fn, PType<T> type, ParallelDoOptions options) |
<S,K,V> BaseDoTable<K,V> | PCollectionFactory.createDoTable(String name, PCollectionImpl<S> chainingCollection, DoFn<S,Pair<K,V>> fn, PTableType<K,V> type, ParallelDoOptions options) |
<S> BaseInputCollection<S> | PCollectionFactory.createInputCollection(Source<S> source, String named, DistributedPipeline distributedPipeline, ParallelDoOptions doOpts) |
<K,V> BaseInputTable<K,V> | PCollectionFactory.createInputTable(TableSource<K,V> source, String named, DistributedPipeline distributedPipeline, ParallelDoOptions doOpts) |
<K,V> PTable<K,V> | PCollectionImpl.parallelDo(String name, DoFn<S,Pair<K,V>> fn, PTableType<K,V> type, ParallelDoOptions options) |
<T> PCollection<T> | PCollectionImpl.parallelDo(String name, DoFn<S,T> fn, PType<T> type, ParallelDoOptions options) |
Constructor and Description |
---|
BaseInputCollection(Source<S> source, String name, DistributedPipeline pipeline, ParallelDoOptions doOpts) |
BaseInputTable(TableSource<K,V> source, String name, DistributedPipeline pipeline, ParallelDoOptions doOpts) |
PCollectionImpl(String name, DistributedPipeline pipeline, ParallelDoOptions doOptions) |
PTableBase(String name, DistributedPipeline pipeline, ParallelDoOptions options) |
Modifier and Type | Method and Description |
---|---|
<S,T> BaseDoCollection<T> | SparkCollectFactory.createDoCollection(String name, PCollectionImpl<S> parent, DoFn<S,T> fn, PType<T> type, ParallelDoOptions options) |
<S,K,V> BaseDoTable<K,V> | SparkCollectFactory.createDoTable(String name, PCollectionImpl<S> parent, DoFn<S,Pair<K,V>> fn, PTableType<K,V> type, ParallelDoOptions options) |
<S> BaseInputCollection<S> | SparkCollectFactory.createInputCollection(Source<S> source, String named, DistributedPipeline pipeline, ParallelDoOptions doOpts) |
<K,V> BaseInputTable<K,V> | SparkCollectFactory.createInputTable(TableSource<K,V> source, String named, DistributedPipeline pipeline, ParallelDoOptions doOpts) |
Constructor and Description |
---|
InputTable(TableSource<K,V> source, String named, DistributedPipeline pipeline, ParallelDoOptions doOpts) |
Copyright © 2016 The Apache Software Foundation. All rights reserved.