Package | Description |
---|---|
org.apache.crunch.contrib.text |
Modifier and Type | Class and Description |
---|---|
class |
AbstractCompositeExtractor<T>
Base class for
Extractor instances that delegates the parsing of fields to other
Extractor instances, primarily used for constructing composite records that implement
the Tuple interface. |
class |
AbstractSimpleExtractor<T>
Base class for the common case
Extractor instances that construct a single
object from a block of text stored in a String , with support for error handling
and reporting. |
Modifier and Type | Method and Description |
---|---|
static Extractor<Boolean> |
Extractors.xboolean()
Returns an Extractor for booleans.
|
static Extractor<Boolean> |
Extractors.xboolean(Boolean defaultValue) |
static <T> Extractor<Collection<T>> |
Extractors.xcollect(TokenizerFactory scannerFactory,
Extractor<T> extractor) |
static <T extends Tuple> |
Extractors.xcustom(Class<T> clazz,
TokenizerFactory scannerFactory,
Extractor... extractors)
Returns an Extractor for a subclass of
Tuple with a constructor that
has the given extractor types that uses the given TokenizerFactory
for parsing the sub-fields. |
static Extractor<Double> |
Extractors.xdouble()
Returns an Extractor for doubles.
|
static Extractor<Double> |
Extractors.xdouble(Double defaultValue) |
static Extractor<Float> |
Extractors.xfloat()
Returns an Extractor for floats.
|
static Extractor<Float> |
Extractors.xfloat(Float defaultValue) |
static Extractor<Integer> |
Extractors.xint()
Returns an Extractor for integers.
|
static Extractor<Integer> |
Extractors.xint(Integer defaultValue)
Returns an Extractor for integers.
|
static Extractor<Long> |
Extractors.xlong()
Returns an Extractor for longs.
|
static Extractor<Long> |
Extractors.xlong(Long defaultValue)
Returns an Extractor for longs.
|
static <K,V> Extractor<Pair<K,V>> |
Extractors.xpair(TokenizerFactory scannerFactory,
Extractor<K> one,
Extractor<V> two)
Returns an Extractor for pairs of the given types that uses the given
TokenizerFactory
for parsing the sub-fields. |
static <A,B,C,D> Extractor<Tuple4<A,B,C,D>> |
Extractors.xquad(TokenizerFactory scannerFactory,
Extractor<A> a,
Extractor<B> b,
Extractor<C> c,
Extractor<D> d)
Returns an Extractor for quads of the given types that uses the given
TokenizerFactory
for parsing the sub-fields. |
static Extractor<String> |
Extractors.xstring()
Returns an Extractor for strings.
|
static Extractor<String> |
Extractors.xstring(String defaultValue) |
static <A,B,C> Extractor<Tuple3<A,B,C>> |
Extractors.xtriple(TokenizerFactory scannerFactory,
Extractor<A> a,
Extractor<B> b,
Extractor<C> c)
Returns an Extractor for triples of the given types that uses the given
TokenizerFactory
for parsing the sub-fields. |
static Extractor<TupleN> |
Extractors.xtupleN(TokenizerFactory scannerFactory,
Extractor... extractors)
Returns an Extractor for an arbitrary number of types that uses the given
TokenizerFactory
for parsing the sub-fields. |
Modifier and Type | Method and Description |
---|---|
static <T> PCollection<T> |
Parse.parse(String groupName,
PCollection<String> input,
Extractor<T> extractor)
Parses the lines of the input
PCollection<String> and returns a PCollection<T> using
the given Extractor<T> . |
static <T> PCollection<T> |
Parse.parse(String groupName,
PCollection<String> input,
PTypeFamily ptf,
Extractor<T> extractor)
Parses the lines of the input
PCollection<String> and returns a PCollection<T> using
the given Extractor<T> that uses the given PTypeFamily . |
static <K,V> PTable<K,V> |
Parse.parseTable(String groupName,
PCollection<String> input,
Extractor<Pair<K,V>> extractor)
Parses the lines of the input
PCollection<String> and returns a PTable<K, V> using
the given Extractor<Pair<K, V>> . |
static <K,V> PTable<K,V> |
Parse.parseTable(String groupName,
PCollection<String> input,
PTypeFamily ptf,
Extractor<Pair<K,V>> extractor)
Parses the lines of the input
PCollection<String> and returns a PTable<K, V> using
the given Extractor<Pair<K, V>> that uses the given PTypeFamily . |
static <T> Extractor<Collection<T>> |
Extractors.xcollect(TokenizerFactory scannerFactory,
Extractor<T> extractor) |
static <T extends Tuple> |
Extractors.xcustom(Class<T> clazz,
TokenizerFactory scannerFactory,
Extractor... extractors)
Returns an Extractor for a subclass of
Tuple with a constructor that
has the given extractor types that uses the given TokenizerFactory
for parsing the sub-fields. |
static <K,V> Extractor<Pair<K,V>> |
Extractors.xpair(TokenizerFactory scannerFactory,
Extractor<K> one,
Extractor<V> two)
Returns an Extractor for pairs of the given types that uses the given
TokenizerFactory
for parsing the sub-fields. |
static <K,V> Extractor<Pair<K,V>> |
Extractors.xpair(TokenizerFactory scannerFactory,
Extractor<K> one,
Extractor<V> two)
Returns an Extractor for pairs of the given types that uses the given
TokenizerFactory
for parsing the sub-fields. |
static <A,B,C,D> Extractor<Tuple4<A,B,C,D>> |
Extractors.xquad(TokenizerFactory scannerFactory,
Extractor<A> a,
Extractor<B> b,
Extractor<C> c,
Extractor<D> d)
Returns an Extractor for quads of the given types that uses the given
TokenizerFactory
for parsing the sub-fields. |
static <A,B,C,D> Extractor<Tuple4<A,B,C,D>> |
Extractors.xquad(TokenizerFactory scannerFactory,
Extractor<A> a,
Extractor<B> b,
Extractor<C> c,
Extractor<D> d)
Returns an Extractor for quads of the given types that uses the given
TokenizerFactory
for parsing the sub-fields. |
static <A,B,C,D> Extractor<Tuple4<A,B,C,D>> |
Extractors.xquad(TokenizerFactory scannerFactory,
Extractor<A> a,
Extractor<B> b,
Extractor<C> c,
Extractor<D> d)
Returns an Extractor for quads of the given types that uses the given
TokenizerFactory
for parsing the sub-fields. |
static <A,B,C,D> Extractor<Tuple4<A,B,C,D>> |
Extractors.xquad(TokenizerFactory scannerFactory,
Extractor<A> a,
Extractor<B> b,
Extractor<C> c,
Extractor<D> d)
Returns an Extractor for quads of the given types that uses the given
TokenizerFactory
for parsing the sub-fields. |
static <A,B,C> Extractor<Tuple3<A,B,C>> |
Extractors.xtriple(TokenizerFactory scannerFactory,
Extractor<A> a,
Extractor<B> b,
Extractor<C> c)
Returns an Extractor for triples of the given types that uses the given
TokenizerFactory
for parsing the sub-fields. |
static <A,B,C> Extractor<Tuple3<A,B,C>> |
Extractors.xtriple(TokenizerFactory scannerFactory,
Extractor<A> a,
Extractor<B> b,
Extractor<C> c)
Returns an Extractor for triples of the given types that uses the given
TokenizerFactory
for parsing the sub-fields. |
static <A,B,C> Extractor<Tuple3<A,B,C>> |
Extractors.xtriple(TokenizerFactory scannerFactory,
Extractor<A> a,
Extractor<B> b,
Extractor<C> c)
Returns an Extractor for triples of the given types that uses the given
TokenizerFactory
for parsing the sub-fields. |
static Extractor<TupleN> |
Extractors.xtupleN(TokenizerFactory scannerFactory,
Extractor... extractors)
Returns an Extractor for an arbitrary number of types that uses the given
TokenizerFactory
for parsing the sub-fields. |
Constructor and Description |
---|
AbstractCompositeExtractor(TokenizerFactory scannerFactory,
List<Extractor<?>> extractors) |
Copyright © 2016 The Apache Software Foundation. All rights reserved.