| Package | Description | 
|---|---|
| org.apache.crunch.impl.dist.collect | |
| org.apache.crunch.impl.mr | A Pipeline implementation that runs on Hadoop MapReduce. | 
| org.apache.crunch.impl.spark | |
| org.apache.crunch.impl.spark.collect | 
| Modifier and Type | Method and Description | 
|---|---|
| DistributedPipeline | PCollectionImpl. getPipeline() | 
| Modifier and Type | Method and Description | 
|---|---|
| <S> BaseInputCollection<S> | PCollectionFactory. createInputCollection(Source<S> source,
                     String named,
                     DistributedPipeline distributedPipeline,
                     ParallelDoOptions doOpts) | 
| <K,V> BaseInputTable<K,V> | PCollectionFactory. createInputTable(TableSource<K,V> source,
                String named,
                DistributedPipeline distributedPipeline,
                ParallelDoOptions doOpts) | 
| Constructor and Description | 
|---|
| BaseInputCollection(Source<S> source,
                   DistributedPipeline pipeline) | 
| BaseInputCollection(Source<S> source,
                   String name,
                   DistributedPipeline pipeline,
                   ParallelDoOptions doOpts) | 
| BaseInputTable(TableSource<K,V> source,
              DistributedPipeline pipeline) | 
| BaseInputTable(TableSource<K,V> source,
              String name,
              DistributedPipeline pipeline,
              ParallelDoOptions doOpts) | 
| EmptyPCollection(DistributedPipeline pipeline,
                PType<T> ptype) | 
| EmptyPTable(DistributedPipeline pipeline,
           PTableType<K,V> ptype) | 
| PCollectionImpl(String name,
               DistributedPipeline pipeline) | 
| PCollectionImpl(String name,
               DistributedPipeline pipeline,
               ParallelDoOptions doOptions) | 
| PTableBase(String name,
          DistributedPipeline pipeline) | 
| PTableBase(String name,
          DistributedPipeline pipeline,
          ParallelDoOptions options) | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | MRPipelinePipeline implementation that is executed within Hadoop MapReduce. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | SparkPipeline | 
| Modifier and Type | Method and Description | 
|---|---|
| <S> BaseInputCollection<S> | SparkCollectFactory. createInputCollection(Source<S> source,
                     String named,
                     DistributedPipeline pipeline,
                     ParallelDoOptions doOpts) | 
| <K,V> BaseInputTable<K,V> | SparkCollectFactory. createInputTable(TableSource<K,V> source,
                String named,
                DistributedPipeline pipeline,
                ParallelDoOptions doOpts) | 
| Constructor and Description | 
|---|
| EmptyPCollection(DistributedPipeline pipeline,
                PType<T> ptype) | 
| EmptyPTable(DistributedPipeline pipeline,
           PTableType<K,V> ptype) | 
| InputTable(TableSource<K,V> source,
          String named,
          DistributedPipeline pipeline,
          ParallelDoOptions doOpts) | 
Copyright © 2017 The Apache Software Foundation. All rights reserved.