java.lang.Object
  org.apache.crunch.contrib.text.AbstractSimpleExtractor<T>
public abstract class AbstractSimpleExtractor<T>
Base class for the common case of Extractor instances that construct a single object from a block of text stored in a String, with support for error handling and reporting.
Constructor Summary | |
---|---|
protected | AbstractSimpleExtractor(T defaultValue) |
protected | AbstractSimpleExtractor(T defaultValue, TokenizerFactory scannerFactory) |
Method Summary | |
---|---|
protected abstract T | doExtract(Tokenizer tokenizer): Subclasses must override this method to return a new instance of the class that this Extractor instance is designed to parse. |
boolean | errorOnLastRecord(): Returns true if the last call to extract on this instance threw an exception that was handled. |
T | extract(String input): Extract a value with the type of this instance. |
T | getDefaultValue(): Returns the default value for this Extractor in case of an error. |
ExtractorStats | getStats(): Return statistics about how many errors this Extractor instance encountered while parsing input data. |
void | initialize(): Perform any initialization required by this Extractor during the start of a map or reduce task. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Methods inherited from interface org.apache.crunch.contrib.text.Extractor |
---|
getPType |
Constructor Detail |
---|
protected AbstractSimpleExtractor(T defaultValue)
protected AbstractSimpleExtractor(T defaultValue, TokenizerFactory scannerFactory)
Method Detail |
---|
public void initialize()
Description copied from interface: Extractor
Perform any initialization required by this Extractor during the start of a map or reduce task.
Specified by: initialize in interface Extractor<T>
public T extract(String input)
Description copied from interface: Extractor
Extract a value with the type of this instance.
Specified by: extract in interface Extractor<T>
public boolean errorOnLastRecord()
Description copied from interface: Extractor
Returns true if the last call to extract on this instance threw an exception that was handled.
Specified by: errorOnLastRecord in interface Extractor<T>
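A caller may want to consult errorOnLastRecord() because a returned default value can be indistinguishable from a legitimately parsed one. The sketch below illustrates that idea only. LenientIntParser and SkipBadRecords are hypothetical stand-ins written for this example, not Crunch classes, and Integer.parseInt replaces Crunch's tokenizer-based parsing.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for an extractor that handles its own parse errors.
// It is NOT a Crunch class; it only mirrors extract() and errorOnLastRecord().
class LenientIntParser {
  private final int defaultValue;
  private boolean errorOnLast;

  LenientIntParser(int defaultValue) {
    this.defaultValue = defaultValue;
  }

  // Returns the parsed value, or the default value if parsing fails.
  int extract(String input) {
    errorOnLast = false;
    try {
      return Integer.parseInt(input.trim());
    } catch (NumberFormatException e) {
      errorOnLast = true;
      return defaultValue;
    }
  }

  // True only if the most recent extract() call hit a handled error.
  boolean errorOnLastRecord() {
    return errorOnLast;
  }
}

class SkipBadRecords {
  // Keeps only the values whose extraction did not trigger a handled error.
  static List<Integer> parseAll(List<String> lines, LenientIntParser parser) {
    List<Integer> out = new ArrayList<>();
    for (String line : lines) {
      int value = parser.extract(line);
      if (!parser.errorOnLastRecord()) {
        out.add(value);
      }
    }
    return out;
  }

  public static void main(String[] args) {
    LenientIntParser parser = new LenientIntParser(0);
    // "0" is a valid record here even though 0 is also the default value.
    System.out.println(parseAll(Arrays.asList("0", "oops", "7"), parser));
  }
}
```

Checking the flag right after each call matters in this sketch because the flag is reset at the start of every extract() invocation.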
public T getDefaultValue()
Description copied from interface: Extractor
Returns the default value for this Extractor in case of an error.
Specified by: getDefaultValue in interface Extractor<T>
public ExtractorStats getStats()
Description copied from interface: Extractor
Return statistics about how many errors this Extractor instance encountered while parsing input data.
Specified by: getStats in interface Extractor<T>
protected abstract T doExtract(Tokenizer tokenizer)
Subclasses must override this method to return a new instance of the class that this Extractor instance is designed to parse.
Any runtime parsing exceptions from the given Tokenizer instance should be thrown so that they may be caught by the error handling logic inside of this class.
Parameters: tokenizer - The Tokenizer instance for the current record
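The contract described for doExtract, where the subclass throws and the base class catches, might look roughly like the following. This is a self-contained sketch of the pattern, not the Crunch source: SimpleExtractorSketch, IntExtractorSketch, and getErrorCount are hypothetical names, java.util.Scanner stands in for Crunch's Tokenizer, and a plain counter stands in for ExtractorStats.

```java
import java.util.Scanner;

// Hypothetical base class illustrating the pattern; NOT the Crunch implementation.
abstract class SimpleExtractorSketch<T> {
  private final T defaultValue;
  private boolean errorOnLast = false;
  private int errorCount = 0; // assumption: a simple counter instead of ExtractorStats

  protected SimpleExtractorSketch(T defaultValue) {
    this.defaultValue = defaultValue;
  }

  // Template method: tokenizes the input, delegates to doExtract, and
  // converts any runtime parsing failure into the default value.
  public T extract(String input) {
    errorOnLast = false;
    try (Scanner scanner = new Scanner(input)) {
      return doExtract(scanner);
    } catch (RuntimeException e) {
      errorOnLast = true;
      errorCount++;
      return defaultValue;
    }
  }

  public boolean errorOnLastRecord() {
    return errorOnLast;
  }

  public T getDefaultValue() {
    return defaultValue;
  }

  public int getErrorCount() {
    return errorCount;
  }

  // Subclasses parse one record and let runtime exceptions propagate.
  protected abstract T doExtract(Scanner scanner);
}

// Hypothetical subclass that reads a single integer from each record.
class IntExtractorSketch extends SimpleExtractorSketch<Integer> {
  IntExtractorSketch(Integer defaultValue) {
    super(defaultValue);
  }

  @Override
  protected Integer doExtract(Scanner scanner) {
    // nextInt() throws a runtime exception on malformed or empty input;
    // it is deliberately not caught here.
    return scanner.nextInt();
  }
}

class ExtractorSketchDemo {
  public static void main(String[] args) {
    IntExtractorSketch extractor = new IntExtractorSketch(-1);
    System.out.println(extractor.extract("42") + " error=" + extractor.errorOnLastRecord());
    System.out.println(extractor.extract("abc") + " error=" + extractor.errorOnLastRecord());
  }
}
```

Keeping the try/catch in the base class means each subclass in this sketch contains only parsing logic, which matches the instruction that doExtract should throw rather than handle its own errors.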