| Package | Description | 
|---|---|
| org.apache.crunch.contrib.text | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | AbstractCompositeExtractor<T>Base class for  Extractorinstances that delegates the parsing of fields to otherExtractorinstances, primarily used for constructing composite records that implement
 theTupleinterface. | 
| class  | AbstractSimpleExtractor<T>Base class for the common case  Extractorinstances that construct a single
 object from a block of text stored in aString, with support for error handling
 and reporting. | 
| Modifier and Type | Method and Description | 
|---|---|
| static Extractor<Boolean> | Extractors. xboolean()Returns an Extractor for booleans. | 
| static Extractor<Boolean> | Extractors. xboolean(Boolean defaultValue) | 
| static <T> Extractor<Collection<T>> | Extractors. xcollect(TokenizerFactory scannerFactory,
        Extractor<T> extractor) | 
| static <T extends Tuple> | Extractors. xcustom(Class<T> clazz,
       TokenizerFactory scannerFactory,
       Extractor... extractors)Returns an Extractor for a subclass of  Tuplewith a constructor that
 has the given extractor types that uses the givenTokenizerFactoryfor parsing the sub-fields. | 
| static Extractor<Double> | Extractors. xdouble()Returns an Extractor for doubles. | 
| static Extractor<Double> | Extractors. xdouble(Double defaultValue) | 
| static Extractor<Float> | Extractors. xfloat()Returns an Extractor for floats. | 
| static Extractor<Float> | Extractors. xfloat(Float defaultValue) | 
| static Extractor<Integer> | Extractors. xint()Returns an Extractor for integers. | 
| static Extractor<Integer> | Extractors. xint(Integer defaultValue)Returns an Extractor for integers. | 
| static Extractor<Long> | Extractors. xlong()Returns an Extractor for longs. | 
| static Extractor<Long> | Extractors. xlong(Long defaultValue)Returns an Extractor for longs. | 
| static <K,V> Extractor<Pair<K,V>> | Extractors. xpair(TokenizerFactory scannerFactory,
     Extractor<K> one,
     Extractor<V> two)Returns an Extractor for pairs of the given types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| static <A,B,C,D> Extractor<Tuple4<A,B,C,D>> | Extractors. xquad(TokenizerFactory scannerFactory,
     Extractor<A> a,
     Extractor<B> b,
     Extractor<C> c,
     Extractor<D> d)Returns an Extractor for quads of the given types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| static Extractor<String> | Extractors. xstring()Returns an Extractor for strings. | 
| static Extractor<String> | Extractors. xstring(String defaultValue) | 
| static <A,B,C> Extractor<Tuple3<A,B,C>> | Extractors. xtriple(TokenizerFactory scannerFactory,
       Extractor<A> a,
       Extractor<B> b,
       Extractor<C> c)Returns an Extractor for triples of the given types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| static Extractor<TupleN> | Extractors. xtupleN(TokenizerFactory scannerFactory,
       Extractor... extractors)Returns an Extractor for an arbitrary number of types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| Modifier and Type | Method and Description | 
|---|---|
| static <T> PCollection<T> | Parse. parse(String groupName,
     PCollection<String> input,
     Extractor<T> extractor)Parses the lines of the input  PCollection<String>and returns aPCollection<T>using
 the givenExtractor<T>. | 
| static <T> PCollection<T> | Parse. parse(String groupName,
     PCollection<String> input,
     PTypeFamily ptf,
     Extractor<T> extractor)Parses the lines of the input  PCollection<String>and returns aPCollection<T>using
 the givenExtractor<T>that uses the givenPTypeFamily. | 
| static <K,V> PTable<K,V> | Parse. parseTable(String groupName,
          PCollection<String> input,
          Extractor<Pair<K,V>> extractor)Parses the lines of the input  PCollection<String>and returns aPTable<K, V>using
 the givenExtractor<Pair<K, V>>. | 
| static <K,V> PTable<K,V> | Parse. parseTable(String groupName,
          PCollection<String> input,
          PTypeFamily ptf,
          Extractor<Pair<K,V>> extractor)Parses the lines of the input  PCollection<String>and returns aPTable<K, V>using
 the givenExtractor<Pair<K, V>>that uses the givenPTypeFamily. | 
| static <T> Extractor<Collection<T>> | Extractors. xcollect(TokenizerFactory scannerFactory,
        Extractor<T> extractor) | 
| static <T extends Tuple> | Extractors. xcustom(Class<T> clazz,
       TokenizerFactory scannerFactory,
       Extractor... extractors)Returns an Extractor for a subclass of  Tuplewith a constructor that
 has the given extractor types that uses the givenTokenizerFactoryfor parsing the sub-fields. | 
| static <K,V> Extractor<Pair<K,V>> | Extractors. xpair(TokenizerFactory scannerFactory,
     Extractor<K> one,
     Extractor<V> two)Returns an Extractor for pairs of the given types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| static <K,V> Extractor<Pair<K,V>> | Extractors. xpair(TokenizerFactory scannerFactory,
     Extractor<K> one,
     Extractor<V> two)Returns an Extractor for pairs of the given types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| static <A,B,C,D> Extractor<Tuple4<A,B,C,D>> | Extractors. xquad(TokenizerFactory scannerFactory,
     Extractor<A> a,
     Extractor<B> b,
     Extractor<C> c,
     Extractor<D> d)Returns an Extractor for quads of the given types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| static <A,B,C,D> Extractor<Tuple4<A,B,C,D>> | Extractors. xquad(TokenizerFactory scannerFactory,
     Extractor<A> a,
     Extractor<B> b,
     Extractor<C> c,
     Extractor<D> d)Returns an Extractor for quads of the given types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| static <A,B,C,D> Extractor<Tuple4<A,B,C,D>> | Extractors. xquad(TokenizerFactory scannerFactory,
     Extractor<A> a,
     Extractor<B> b,
     Extractor<C> c,
     Extractor<D> d)Returns an Extractor for quads of the given types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| static <A,B,C,D> Extractor<Tuple4<A,B,C,D>> | Extractors. xquad(TokenizerFactory scannerFactory,
     Extractor<A> a,
     Extractor<B> b,
     Extractor<C> c,
     Extractor<D> d)Returns an Extractor for quads of the given types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| static <A,B,C> Extractor<Tuple3<A,B,C>> | Extractors. xtriple(TokenizerFactory scannerFactory,
       Extractor<A> a,
       Extractor<B> b,
       Extractor<C> c)Returns an Extractor for triples of the given types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| static <A,B,C> Extractor<Tuple3<A,B,C>> | Extractors. xtriple(TokenizerFactory scannerFactory,
       Extractor<A> a,
       Extractor<B> b,
       Extractor<C> c)Returns an Extractor for triples of the given types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| static <A,B,C> Extractor<Tuple3<A,B,C>> | Extractors. xtriple(TokenizerFactory scannerFactory,
       Extractor<A> a,
       Extractor<B> b,
       Extractor<C> c)Returns an Extractor for triples of the given types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| static Extractor<TupleN> | Extractors. xtupleN(TokenizerFactory scannerFactory,
       Extractor... extractors)Returns an Extractor for an arbitrary number of types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| Constructor and Description | 
|---|
| AbstractCompositeExtractor(TokenizerFactory scannerFactory,
                          List<Extractor<?>> extractors) | 
Copyright © 2017 The Apache Software Foundation. All rights reserved.