|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
public interface PType<T>
A PType
defines a mapping between a data type that is used in a Crunch pipeline and a
serialization and storage format that is used to read/write data from/to HDFS. Every
PCollection
has an associated PType
that tells Crunch how to read/write data from
that PCollection
.
Method Summary | |
---|---|
Converter |
getConverter()
|
ReadableSourceTarget<T> |
getDefaultFileSource(org.apache.hadoop.fs.Path path)
Returns a SourceTarget that is able to read/write data using the serialization format
specified by this PType . |
T |
getDetachedValue(T value)
Returns a copy of a value (or the value itself) that can safely be retained. |
PTypeFamily |
getFamily()
Returns the PTypeFamily that this PType belongs to. |
MapFn<Object,T> |
getInputMapFn()
|
MapFn<T,Object> |
getOutputMapFn()
|
List<PType> |
getSubTypes()
Returns the sub-types that make up this PType if it is a composite instance, such as a tuple. |
Class<T> |
getTypeClass()
Returns the Java type represented by this PType . |
void |
initialize(org.apache.hadoop.conf.Configuration conf)
Initialize this PType for use within a DoFn. |
Method Detail |
---|
Class<T> getTypeClass()
PType
.
PTypeFamily getFamily()
PTypeFamily
that this PType
belongs to.
MapFn<Object,T> getInputMapFn()
MapFn<T,Object> getOutputMapFn()
Converter getConverter()
void initialize(org.apache.hadoop.conf.Configuration conf)
getDetachedValue(Object)
.
conf
- Configuration objectgetDetachedValue(Object)
T getDetachedValue(T value)
This is useful when iterable values being processed in a DoFn (via a reducer) need to be held
on to for more than the scope of a single iteration, as a reducer (and therefore also a DoFn
that has an Iterable as input) re-use deserialized values. More information on object reuse is
available in the DoFn
class documentation.
value
- The value to be deep-copied
ReadableSourceTarget<T> getDefaultFileSource(org.apache.hadoop.fs.Path path)
SourceTarget
that is able to read/write data using the serialization format
specified by this PType
.
List<PType> getSubTypes()
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |