Package | Description |
---|---|
org.apache.crunch.contrib.text |
Modifier and Type | Method and Description |
---|---|
TokenizerFactory | TokenizerFactory.Builder.build(): Returns a new TokenizerFactory with settings determined by this Builder instance. |
static TokenizerFactory | TokenizerFactory.getDefaultInstance(): Returns a default TokenizerFactory that uses whitespace as a delimiter and does not skip any input fields. |
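
The builder and default-instance pattern described above can be sketched in plain Java. This is a simplified stand-in, not the Crunch implementation: the class `SketchTokenizerFactory`, its `delimiter` and `drop` builder methods, and the `tokenize` helper are all hypothetical names chosen for illustration, and the real `TokenizerFactory` may expose different options.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Pattern;

// Hypothetical stand-in for illustration only; not the Crunch TokenizerFactory.
final class SketchTokenizerFactory {
    private final Pattern delimiter;
    private final Set<Integer> dropped;

    private SketchTokenizerFactory(Pattern delimiter, Set<Integer> dropped) {
        this.delimiter = delimiter;
        this.dropped = dropped;
    }

    // Default settings: whitespace delimiter, no input fields skipped.
    static SketchTokenizerFactory getDefaultInstance() {
        return new Builder().build();
    }

    static Builder builder() {
        return new Builder();
    }

    // Splits the input on the delimiter and omits fields whose index was dropped.
    List<String> tokenize(String input) {
        String[] parts = delimiter.split(input.trim());
        List<String> out = new ArrayList<>();
        for (int i = 0; i < parts.length; i++) {
            if (!dropped.contains(i)) {
                out.add(parts[i]);
            }
        }
        return out;
    }

    static final class Builder {
        private String delimiterRegex = "\\s+";
        private final Set<Integer> dropped = new HashSet<>();

        // Sets the delimiter as a regular expression.
        Builder delimiter(String regex) {
            this.delimiterRegex = regex;
            return this;
        }

        // Marks zero-based field indices to leave out of the token stream.
        Builder drop(int... indices) {
            for (int i : indices) {
                dropped.add(i);
            }
            return this;
        }

        // Returns a new factory with the settings held by this builder.
        SketchTokenizerFactory build() {
            return new SketchTokenizerFactory(
                Pattern.compile(delimiterRegex), new HashSet<>(dropped));
        }
    }
}
```

With this sketch, `SketchTokenizerFactory.getDefaultInstance().tokenize("a b  c")` yields the three fields `a`, `b`, `c`, while `SketchTokenizerFactory.builder().delimiter(",").drop(1).build().tokenize("x,y,z")` yields `x` and `z`. Copying the dropped-index set in `build()` keeps each built factory independent of later changes to the builder.
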
Modifier and Type | Method and Description |
---|---|
static <T> Extractor<Collection<T>> | Extractors.xcollect(TokenizerFactory scannerFactory, Extractor<T> extractor) |
static <T extends Tuple> Extractor<T> | Extractors.xcustom(Class<T> clazz, TokenizerFactory scannerFactory, Extractor... extractors): Returns an Extractor for a subclass of Tuple whose constructor takes the given extractor types, using the given TokenizerFactory to parse the sub-fields. |
static <K,V> Extractor<Pair<K,V>> | Extractors.xpair(TokenizerFactory scannerFactory, Extractor<K> one, Extractor<V> two): Returns an Extractor for pairs of the given types that uses the given TokenizerFactory for parsing the sub-fields. |
static <A,B,C,D> Extractor<Tuple4<A,B,C,D>> | Extractors.xquad(TokenizerFactory scannerFactory, Extractor<A> a, Extractor<B> b, Extractor<C> c, Extractor<D> d): Returns an Extractor for quads of the given types that uses the given TokenizerFactory for parsing the sub-fields. |
static <A,B,C> Extractor<Tuple3<A,B,C>> | Extractors.xtriple(TokenizerFactory scannerFactory, Extractor<A> a, Extractor<B> b, Extractor<C> c): Returns an Extractor for triples of the given types that uses the given TokenizerFactory for parsing the sub-fields. |
static Extractor<TupleN> | Extractors.xtupleN(TokenizerFactory scannerFactory, Extractor... extractors): Returns an Extractor for an arbitrary number of types that uses the given TokenizerFactory for parsing the sub-fields. |
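
The composite extractors above share one idea: split a field into sub-fields using the supplied tokenizer settings, then pass each sub-field to its own element extractor. The following is a minimal sketch of that idea for the pair case, using only the Java standard library. `PairExtractorSketch` and its `xpair` method are hypothetical stand-ins; here a delimiter regex plays the role of the TokenizerFactory, `Function<String, T>` plays the role of `Extractor<T>`, and `Map.Entry` plays the role of `Pair`.

```java
import java.util.AbstractMap.SimpleImmutableEntry;
import java.util.Map;
import java.util.function.Function;

// Hypothetical stand-ins for illustration; these are not the Crunch classes.
final class PairExtractorSketch {
    private PairExtractorSketch() {
    }

    // Builds a parser for "left<delimiter>right" strings: the input is split into
    // two sub-fields, and each sub-field is parsed by its own element parser.
    static <K, V> Function<String, Map.Entry<K, V>> xpair(
            String delimiterRegex, Function<String, K> one, Function<String, V> two) {
        return input -> {
            String[] parts = input.split(delimiterRegex, 2);
            if (parts.length < 2) {
                throw new IllegalArgumentException("Expected two sub-fields: " + input);
            }
            return new SimpleImmutableEntry<>(one.apply(parts[0]), two.apply(parts[1]));
        };
    }
}
```

For example, `PairExtractorSketch.<String, Integer>xpair(":", s -> s, Integer::parseInt).apply("clicks:42")` produces an entry with key `clicks` and value `42`. Because the result is itself a `Function<String, ...>`, it can be passed as an element parser to another composite with a different delimiter, which is why each composite in this sketch carries its own sub-field delimiter.
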
Constructor and Description |
---|
AbstractCompositeExtractor(TokenizerFactory scannerFactory, List<Extractor<?>> extractors) |
AbstractSimpleExtractor(T defaultValue, TokenizerFactory scannerFactory) |
Copyright © 2013 The Apache Software Foundation. All Rights Reserved.