|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectorg.apache.crunch.impl.mem.MemPipeline
public class MemPipeline
| Method Summary | ||
|---|---|---|
static
|
collectionOf(Iterable<T> collect)
|
|
static
|
collectionOf(T... ts)
|
|
PipelineResult |
done()
Run any remaining jobs required to generate outputs and then clean up any intermediate data files that were created in this run or previous calls to run. |
|
void |
enableDebug()
Turn on debug logging for jobs that are run from this pipeline. |
|
org.apache.hadoop.conf.Configuration |
getConfiguration()
Returns the Configuration instance associated with this pipeline. |
|
static Pipeline |
getInstance()
|
|
String |
getName()
Returns the name of this pipeline. |
|
|
materialize(PCollection<T> pcollection)
Create the given PCollection and read the data it contains into the returned Collection instance for client use. |
|
|
read(Source<T> source)
Converts the given Source into a PCollection that is
available to jobs run using this Pipeline instance. |
|
|
read(TableSource<K,V> source)
A version of the read method for TableSource instances that map to
PTables. |
|
PCollection<String> |
readTextFile(String pathName)
A convenience method for reading a text file. |
|
PipelineResult |
run()
Constructs and executes a series of MapReduce jobs in order to write data to the output targets. |
|
void |
setConfiguration(org.apache.hadoop.conf.Configuration conf)
Set the Configuration to use with this pipeline. |
|
static
|
tableOf(Iterable<Pair<S,T>> pairs)
|
|
static
|
tableOf(S s,
T t,
Object... more)
|
|
static
|
typedCollectionOf(PType<T> ptype,
Iterable<T> collect)
|
|
static
|
typedCollectionOf(PType<T> ptype,
T... ts)
|
|
static
|
typedTableOf(PTableType<S,T> ptype,
Iterable<Pair<S,T>> pairs)
|
|
static
|
typedTableOf(PTableType<S,T> ptype,
S s,
T t,
Object... more)
|
|
void |
write(PCollection<?> collection,
Target target)
Write the given collection to the given target on the next pipeline run. |
|
|
writeTextFile(PCollection<T> collection,
String pathName)
A convenience method for writing a text file. |
|
| Methods inherited from class java.lang.Object |
|---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Method Detail |
|---|
public static Pipeline getInstance()
public static <T> PCollection<T> collectionOf(T... ts)
public static <T> PCollection<T> collectionOf(Iterable<T> collect)
public static <T> PCollection<T> typedCollectionOf(PType<T> ptype,
T... ts)
public static <T> PCollection<T> typedCollectionOf(PType<T> ptype,
Iterable<T> collect)
public static <S,T> PTable<S,T> tableOf(S s,
T t,
Object... more)
public static <S,T> PTable<S,T> typedTableOf(PTableType<S,T> ptype,
S s,
T t,
Object... more)
public static <S,T> PTable<S,T> tableOf(Iterable<Pair<S,T>> pairs)
public static <S,T> PTable<S,T> typedTableOf(PTableType<S,T> ptype,
Iterable<Pair<S,T>> pairs)
public void setConfiguration(org.apache.hadoop.conf.Configuration conf)
PipelineConfiguration to use with this pipeline.
setConfiguration in interface Pipelinepublic org.apache.hadoop.conf.Configuration getConfiguration()
PipelineConfiguration instance associated with this pipeline.
getConfiguration in interface Pipelinepublic <T> PCollection<T> read(Source<T> source)
PipelineSource into a PCollection that is
available to jobs run using this Pipeline instance.
read in interface Pipelinesource - The source of data
public <K,V> PTable<K,V> read(TableSource<K,V> source)
PipelineTableSource instances that map to
PTables.
read in interface Pipelinesource - The source of the data
public void write(PCollection<?> collection,
Target target)
Pipeline
write in interface Pipelinecollection - The collectiontarget - The output targetpublic PCollection<String> readTextFile(String pathName)
Pipeline
readTextFile in interface Pipeline
public <T> void writeTextFile(PCollection<T> collection,
String pathName)
Pipeline
writeTextFile in interface Pipelinepublic <T> Iterable<T> materialize(PCollection<T> pcollection)
Pipeline
materialize in interface Pipelinepcollection - The PCollection to materialize
public PipelineResult run()
Pipeline
run in interface Pipelinepublic PipelineResult done()
Pipelinerun.
done in interface Pipelinepublic void enableDebug()
Pipeline
enableDebug in interface Pipelinepublic String getName()
Pipeline
getName in interface Pipeline
|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||