Package | Description |
---|---|
org.apache.crunch | Client-facing API and core abstractions. |
org.apache.crunch.impl.mem | In-memory Pipeline implementation for rapid prototyping and testing. |
org.apache.crunch.impl.mr | A Pipeline implementation that runs on Hadoop MapReduce. |
org.apache.crunch.io | Data input and output for Pipelines. |
org.apache.crunch.util | An assorted set of utilities. |
Modifier and Type | Interface and Description |
---|---|
interface | TableSourceTarget<K,V>: An interface for classes that implement both the TableSource and the Target interfaces. |
Modifier and Type | Method and Description |
---|---|
<K,V> PTable<K,V> | Pipeline.read(TableSource<K,V> tableSource): A version of the read method for TableSource instances that map to PTables. |
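
The description above implies that `read` is overloaded, with the `TableSource` variant returning a `PTable` rather than a plain collection. The following is a minimal sketch of how such an overload is selected at compile time. All types in it are simplified stand-ins written for this illustration (records are modeled as `Map.Entry`, and `load()` is a hypothetical method); it does not use or reproduce the real Crunch classes.

```java
import java.util.List;
import java.util.Map;

class ReadOverloadSketch {
    // Simplified stand-ins for illustration only; these are NOT the real Crunch types.
    interface Source<T> { List<T> load(); }
    interface TableSource<K, V> extends Source<Map.Entry<K, V>> { }

    static class PCollection<T> {
        final List<T> items;
        PCollection(List<T> items) { this.items = items; }
        String kind() { return "PCollection"; }
    }

    static class PTable<K, V> extends PCollection<Map.Entry<K, V>> {
        PTable(List<Map.Entry<K, V>> items) { super(items); }
        @Override String kind() { return "PTable"; }
    }

    // General overload: any source yields a flat collection.
    static <T> PCollection<T> read(Source<T> source) {
        return new PCollection<>(source.load());
    }

    // Table overload: selected when the argument's static type is a TableSource,
    // because it is the more specific of the two applicable methods.
    static <K, V> PTable<K, V> read(TableSource<K, V> source) {
        return new PTable<>(source.load());
    }

    static String demo() {
        TableSource<String, Integer> counts =
            () -> List.of(Map.entry("a", 1), Map.entry("b", 2));
        Source<String> lines = () -> List.of("x", "y");

        // This assignment compiles only because the table overload is chosen.
        PTable<String, Integer> table = read(counts);
        PCollection<String> flat = read(lines);
        return table.kind() + "," + flat.kind();
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

In this sketch a key/value source passed to `read` comes back typed as a table without any cast, while other sources fall through to the general overload.
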
Modifier and Type | Method and Description |
---|---|
<K,V> PTable<K,V> | MemPipeline.read(TableSource<K,V> source) |
Modifier and Type | Method and Description |
---|---|
<K,V> PTable<K,V> | MRPipeline.read(TableSource<K,V> source) |
Modifier and Type | Method and Description |
---|---|
static <K,V> TableSource<K,V> | From.formattedFile(org.apache.hadoop.fs.Path path, Class<? extends org.apache.hadoop.mapreduce.lib.input.FileInputFormat<?,?>> formatClass, PType<K> keyType, PType<V> valueType): Creates a TableSource<K, V> for reading data from files that have custom FileInputFormat implementations not covered by the provided TableSource and Source factory methods. |
static <K extends org.apache.hadoop.io.Writable,V extends org.apache.hadoop.io.Writable> TableSource<K,V> | From.formattedFile(org.apache.hadoop.fs.Path path, Class<? extends org.apache.hadoop.mapreduce.lib.input.FileInputFormat<K,V>> formatClass, Class<K> keyClass, Class<V> valueClass): Creates a TableSource<K, V> for reading data from files that have custom FileInputFormat<K, V> implementations not covered by the provided TableSource and Source factory methods. |
static <K,V> TableSource<K,V> | From.formattedFile(String pathName, Class<? extends org.apache.hadoop.mapreduce.lib.input.FileInputFormat<?,?>> formatClass, PType<K> keyType, PType<V> valueType): Creates a TableSource<K, V> for reading data from files that have custom FileInputFormat implementations not covered by the provided TableSource and Source factory methods. |
static <K extends org.apache.hadoop.io.Writable,V extends org.apache.hadoop.io.Writable> TableSource<K,V> | From.formattedFile(String pathName, Class<? extends org.apache.hadoop.mapreduce.lib.input.FileInputFormat<K,V>> formatClass, Class<K> keyClass, Class<V> valueClass): Creates a TableSource<K, V> for reading data from files that have custom FileInputFormat<K, V> implementations not covered by the provided TableSource and Source factory methods. |
static <K extends org.apache.hadoop.io.Writable,V extends org.apache.hadoop.io.Writable> TableSource<K,V> | From.sequenceFile(org.apache.hadoop.fs.Path path, Class<K> keyClass, Class<V> valueClass): Creates a TableSource<K, V> instance for the SequenceFile(s) at the given Path. |
static <K,V> TableSource<K,V> | From.sequenceFile(org.apache.hadoop.fs.Path path, PType<K> keyType, PType<V> valueType): Creates a TableSource<K, V> instance for the SequenceFile(s) at the given Path. |
static <K extends org.apache.hadoop.io.Writable,V extends org.apache.hadoop.io.Writable> TableSource<K,V> | From.sequenceFile(String pathName, Class<K> keyClass, Class<V> valueClass): Creates a TableSource<K, V> instance for the SequenceFile(s) at the given path name. |
static <K,V> TableSource<K,V> | From.sequenceFile(String pathName, PType<K> keyType, PType<V> valueType): Creates a TableSource<K, V> instance for the SequenceFile(s) at the given path name. |
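
The factory methods above come in two typing families: one takes `Class<K>`/`Class<V>` arguments with `K` and `V` bounded by `Writable`, the other takes `PType<K>`/`PType<V>` arguments with no bound. The sketch below illustrates what that difference means at a call site. Every type in it (`Writable`, `Text`, `IntWritable`, `PType`, `TableSource`) is a simplified stand-in declared locally for this example, the method bodies are assumptions, and the paths are hypothetical; it is not the Crunch implementation.

```java
class FromOverloadSketch {
    // Simplified stand-ins for illustration only; NOT the real Hadoop or Crunch types.
    interface Writable { }
    static final class Text implements Writable { }
    static final class IntWritable implements Writable { }

    // Stand-in for PType: here it only carries a descriptive name.
    static final class PType<T> {
        final String name;
        PType(String name) { this.name = name; }
    }

    // Stand-in for TableSource: records the path and the key/value type names.
    static final class TableSource<K, V> {
        final String path;
        final String keyType;
        final String valueType;

        TableSource(String path, String keyType, String valueType) {
            this.path = path;
            this.keyType = keyType;
            this.valueType = valueType;
        }

        String describe() {
            return path + " <" + keyType + ", " + valueType + ">";
        }
    }

    // Class-based family: the bounds restrict K and V to Writable implementations.
    static <K extends Writable, V extends Writable> TableSource<K, V> sequenceFile(
            String pathName, Class<K> keyClass, Class<V> valueClass) {
        return new TableSource<>(pathName, keyClass.getSimpleName(), valueClass.getSimpleName());
    }

    // PType-based family: K and V are unbounded; the PType describes each type.
    static <K, V> TableSource<K, V> sequenceFile(
            String pathName, PType<K> keyType, PType<V> valueType) {
        return new TableSource<>(pathName, keyType.name, valueType.name);
    }

    static String demo() {
        TableSource<Text, IntWritable> byClass =
            sequenceFile("/data/counts", Text.class, IntWritable.class);
        TableSource<String, Long> byPType =
            sequenceFile("/data/totals", new PType<String>("string"), new PType<Long>("long"));
        // sequenceFile("/data/bad", String.class, Long.class) would not compile here:
        // String and Long do not implement the Writable stand-in.
        return byClass.describe() + "; " + byPType.describe();
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

The class-based signatures let the compiler reject key or value classes that are not `Writable`, whereas the `PType`-based signatures leave the choice of key and value types open.
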
Modifier and Type | Method and Description |
---|---|
<K,V> PTable<K,V> | CrunchTool.read(TableSource<K,V> tableSource) |
Copyright © 2013 The Apache Software Foundation. All Rights Reserved.