java.lang.Object
  org.apache.crunch.contrib.text.AbstractSimpleExtractor<T>
public abstract class AbstractSimpleExtractor<T> extends Object implements Extractor<T>
Base class for the common case of Extractor instances that construct a single
object from a block of text stored in a String, with support for error handling
and reporting.
| Constructor Summary | |
|---|---|
| protected | AbstractSimpleExtractor(T defaultValue) |
| protected | AbstractSimpleExtractor(T defaultValue, TokenizerFactory scannerFactory) |
| Method Summary | |
|---|---|
| protected abstract T | doExtract(Tokenizer tokenizer): Subclasses must override this method to return a new instance of the class that this Extractor instance is designed to parse. |
| boolean | errorOnLastRecord(): Returns true if the last call to extract on this instance threw an exception that was handled. |
| T | extract(String input): Extract a value with the type of this instance. |
| T | getDefaultValue(): Returns the default value for this Extractor in case of an error. |
| ExtractorStats | getStats(): Return statistics about how many errors this Extractor instance encountered while parsing input data. |
| void | initialize(): Perform any initialization required by this Extractor during the start of a map or reduce task. |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Methods inherited from interface org.apache.crunch.contrib.text.Extractor |
|---|
getPType |
| Constructor Detail |
|---|
protected AbstractSimpleExtractor(T defaultValue)
protected AbstractSimpleExtractor(T defaultValue,
TokenizerFactory scannerFactory)
| Method Detail |
|---|
public void initialize()
Perform any initialization required by this Extractor during the
start of a map or reduce task.
Specified by: initialize in interface Extractor<T>

public T extract(String input)
Extract a value with the type of this instance.
Specified by: extract in interface Extractor<T>

public boolean errorOnLastRecord()
Returns true if the last call to extract on this instance
threw an exception that was handled.
Specified by: errorOnLastRecord in interface Extractor<T>

public T getDefaultValue()
Returns the default value for this Extractor in case of an
error.
Specified by: getDefaultValue in interface Extractor<T>

public ExtractorStats getStats()
Return statistics about how many errors this Extractor instance
encountered while parsing input data.
Specified by: getStats in interface Extractor<T>

protected abstract T doExtract(Tokenizer tokenizer)
Subclasses must override this method to return a new instance of the class
that this Extractor instance is designed to parse.
Any runtime parsing exceptions from the given Tokenizer instance
should be thrown so that they may be caught by the error handling logic
inside of this class.
Parameters: tokenizer - The Tokenizer instance for the current record
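
The contract between extract and doExtract described above can be illustrated with a small, self-contained sketch. It does not use the Crunch classes and is not the library's implementation: ExtractorSketch, Base, IntExtractor, and getErrorCount are hypothetical stand-ins, java.util.Scanner stands in for Tokenizer, and the way the error flag is reset on each call is an assumption based only on the method descriptions on this page.

```java
import java.util.Scanner;

// Hypothetical stand-ins for illustration only; NOT the Crunch classes.
class ExtractorSketch {

  /** Simplified analogue of AbstractSimpleExtractor<T>. */
  abstract static class Base<T> {
    private final T defaultValue;
    private boolean errorOnLast;
    private int errorCount; // assumed stand-in for what getStats() would report

    protected Base(T defaultValue) {
      this.defaultValue = defaultValue;
    }

    /** Delegates to doExtract and falls back to the default value on failure. */
    public T extract(String input) {
      errorOnLast = false; // assumption: the flag describes only the latest call
      try (Scanner tokenizer = new Scanner(input)) {
        return doExtract(tokenizer);
      } catch (RuntimeException e) {
        errorOnLast = true;
        errorCount++;
        return defaultValue;
      }
    }

    public boolean errorOnLastRecord() { return errorOnLast; }

    public T getDefaultValue() { return defaultValue; }

    public int getErrorCount() { return errorCount; }

    /** Subclasses parse one record and let runtime parsing exceptions propagate. */
    protected abstract T doExtract(Scanner tokenizer);
  }

  /** Example subclass that parses a single integer from each record. */
  static final class IntExtractor extends Base<Integer> {
    IntExtractor(Integer defaultValue) { super(defaultValue); }

    @Override
    protected Integer doExtract(Scanner tokenizer) {
      // nextInt() throws a RuntimeException subclass on malformed or empty input;
      // it is deliberately not caught here so the base class can handle it.
      return tokenizer.nextInt();
    }
  }

  public static void main(String[] args) {
    IntExtractor extractor = new IntExtractor(-1);
    System.out.println(extractor.extract("42"));        // 42
    System.out.println(extractor.errorOnLastRecord());  // false
    System.out.println(extractor.extract("oops"));      // -1
    System.out.println(extractor.errorOnLastRecord());  // true
  }
}
```

The subclass contains no try/catch of its own: parsing failures surface as runtime exceptions, and the base class turns them into the default value plus an error flag, which is the division of labor the doExtract description asks for.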