This project has retired. For details please refer to its Attic page.
ReadableDataImpl (Apache Crunch 0.9.0 API)

org.apache.crunch.io.impl
Class ReadableDataImpl<T>

java.lang.Object
  extended by org.apache.crunch.io.impl.ReadableDataImpl<T>
All Implemented Interfaces:
Serializable, ReadableData<T>
Direct Known Subclasses:
AvroParquetReadableData, AvroReadableData, HFileReadableData, SeqFileReadableData, TextReadableData, TrevniReadableData

public abstract class ReadableDataImpl<T>
extends Object
implements ReadableData<T>

See Also:
Serialized Form

Constructor Summary
protected ReadableDataImpl(List<org.apache.hadoop.fs.Path> paths)

Method Summary
void configure(org.apache.hadoop.conf.Configuration conf)
          Allows this instance to specify any additional configuration settings that may be needed by the job that it is launched in.
protected abstract FileReaderFactory<T> getFileReaderFactory()
Set<SourceTarget<?>> getSourceTargets()
Iterable<T> read(org.apache.hadoop.mapreduce.TaskInputOutputContext<?,?,?,?> ctxt)
          Read the data referenced by this instance within the given context.
ReadableData<T> setParent(SourceTarget<?> parent)

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ReadableDataImpl

protected ReadableDataImpl(List<org.apache.hadoop.fs.Path> paths)

Method Detail

setParent

public ReadableData<T> setParent(SourceTarget<?> parent)

getSourceTargets

public Set<SourceTarget<?>> getSourceTargets()
Specified by:
getSourceTargets in interface ReadableData<T>
Returns:
Any SourceTarget instances that must exist before the data in this instance can be read. Used by the planner in sequencing job processing.
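
The pairing of setParent and getSourceTargets can be illustrated with a small self-contained sketch. The types below (ParentTrackingSketch, Target, Data) are hypothetical stand-ins, not Crunch classes, and the behavior shown (a fluent setter whose argument is later reported as the single prerequisite) is an assumption about how such a pair is commonly written, not a description of Crunch's implementation.

```java
import java.util.Collections;
import java.util.Set;

public class ParentTrackingSketch {
    /** Hypothetical stand-in for SourceTarget<?>; carries only a name. */
    static final class Target {
        final String name;
        Target(String name) { this.name = name; }
    }

    /** Hypothetical stand-in for a ReadableData-like object. */
    static final class Data {
        private Target parent; // stays null until setParent is called

        /** Records the prerequisite and returns this instance for chaining. */
        Data setParent(Target parent) {
            this.parent = parent;
            return this;
        }

        /** Reports what must exist before this data can be read (assumed behavior). */
        Set<Target> getSourceTargets() {
            return parent == null
                ? Collections.<Target>emptySet()
                : Collections.singleton(parent);
        }
    }

    public static void main(String[] args) {
        Target upstream = new Target("upstream-output");
        Data data = new Data();
        System.out.println(data.getSourceTargets().size());             // prints 0
        data.setParent(upstream);
        System.out.println(data.getSourceTargets().contains(upstream)); // prints true
    }
}
```

A planner-like caller could use the returned set to decide which outputs have to be materialized before the reading job is scheduled.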

configure

public void configure(org.apache.hadoop.conf.Configuration conf)
Description copied from interface: ReadableData
Allows this instance to specify any additional configuration settings that may be needed by the job that it is launched in.

Specified by:
configure in interface ReadableData<T>
Parameters:
conf - The Configuration object for the job

getFileReaderFactory

protected abstract FileReaderFactory<T> getFileReaderFactory()

read

public Iterable<T> read(org.apache.hadoop.mapreduce.TaskInputOutputContext<?,?,?,?> ctxt)
                 throws IOException
Description copied from interface: ReadableData
Read the data referenced by this instance within the given context.

Specified by:
read in interface ReadableData<T>
Parameters:
ctxt - The context of the task that is reading the data
Returns:
An iterable reference to the data in this instance
Throws:
IOException - If the data cannot be read
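
The class shape documented here (a protected constructor taking a list of paths, an abstract getFileReaderFactory(), and a concrete read()) resembles the template method pattern. The following is a minimal sketch of that pattern only. ReadTemplateSketch, ReaderFactory, PathBackedData and SegmentData are hypothetical stand-ins that use plain strings instead of Hadoop Path objects and take no task context; they are not the Crunch types, and the eager copy into a list is a simplification rather than what Crunch does.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class ReadTemplateSketch {
    /** Hypothetical stand-in for FileReaderFactory<T>: turns one path into records. */
    interface ReaderFactory<T> {
        Iterator<T> read(String path);
    }

    /** Hypothetical stand-in for the abstract base class. */
    abstract static class PathBackedData<T> {
        private final List<String> paths;

        protected PathBackedData(List<String> paths) {
            this.paths = paths;
        }

        /** Subclasses supply the format-specific reader. */
        protected abstract ReaderFactory<T> getReaderFactory();

        /** Reads every path in order using the subclass's reader. */
        public Iterable<T> read() {
            ReaderFactory<T> factory = getReaderFactory();
            List<T> out = new ArrayList<T>();
            for (String path : paths) {
                Iterator<T> it = factory.read(path);
                while (it.hasNext()) {
                    out.add(it.next());
                }
            }
            return out;
        }
    }

    /** Toy subclass: "reads" a path by splitting its name on '/'. */
    static final class SegmentData extends PathBackedData<String> {
        SegmentData(List<String> paths) {
            super(paths);
        }

        @Override
        protected ReaderFactory<String> getReaderFactory() {
            return new ReaderFactory<String>() {
                @Override
                public Iterator<String> read(String path) {
                    return Arrays.asList(path.split("/")).iterator();
                }
            };
        }
    }

    public static void main(String[] args) {
        // Prints a, b and c on separate lines.
        for (String s : new SegmentData(Arrays.asList("a/b", "c")).read()) {
            System.out.println(s);
        }
    }
}
```

In this arrangement a new file format only needs a subclass that returns a different reader; the iteration over paths stays in the base class.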


Copyright © 2014 The Apache Software Foundation. All Rights Reserved.