Package | Description |
---|---|
org.apache.crunch |
Client-facing API and core abstractions.
|
org.apache.crunch.impl.dist | |
org.apache.crunch.impl.mem |
In-memory Pipeline implementation for rapid prototyping and testing.
|
org.apache.crunch.impl.mr |
A Pipeline implementation that runs on Hadoop MapReduce.
|
org.apache.crunch.impl.spark | |
org.apache.crunch.util |
An assorted set of utilities.
|
Modifier and Type | Field and Description |
---|---|
static PipelineResult |
PipelineResult.DONE |
static PipelineResult |
PipelineResult.EMPTY |
Modifier and Type | Method and Description |
---|---|
PipelineResult |
Pipeline.done()
Run any remaining jobs required to generate outputs and then clean up any
intermediate data files that were created in this run or previous calls to
run . |
PipelineResult |
PipelineExecution.getResult()
Retrieve the result of a pipeline if it has been completed, otherwise
null . |
PipelineResult |
Pipeline.run()
Constructs and executes a series of MapReduce jobs in order to write data
to the output targets.
|
Modifier and Type | Method and Description |
---|---|
PipelineResult |
DistributedPipeline.done() |
Modifier and Type | Method and Description |
---|---|
PipelineResult |
MemPipeline.done() |
PipelineResult |
MemPipeline.run() |
Modifier and Type | Method and Description |
---|---|
PipelineResult |
MRPipeline.run() |
Modifier and Type | Method and Description |
---|---|
PipelineResult |
SparkPipeline.done() |
PipelineResult |
SparkRuntime.get() |
PipelineResult |
SparkRuntime.get(long timeout,
TimeUnit unit) |
PipelineResult |
SparkRuntime.getResult() |
PipelineResult |
SparkPipeline.run() |
Modifier and Type | Method and Description |
---|---|
PipelineResult |
CrunchTool.done() |
PipelineResult |
CrunchTool.run() |
Copyright © 2016 The Apache Software Foundation. All rights reserved.