| Package | Description | 
|---|---|
| org.apache.crunch.contrib.text | 
| Modifier and Type | Method and Description | 
|---|---|
| TokenizerFactory | TokenizerFactory.Builder. build()Returns a new  TokenizerFactorywith settings determined by thisBuilderinstance. | 
| static TokenizerFactory | TokenizerFactory. getDefaultInstance()Returns a default  TokenizerFactorythat uses whitespace as a delimiter and does
 not skip any input fields. | 
| Modifier and Type | Method and Description | 
|---|---|
| static <T> Extractor<Collection<T>> | Extractors. xcollect(TokenizerFactory scannerFactory,
        Extractor<T> extractor) | 
| static <T extends Tuple> | Extractors. xcustom(Class<T> clazz,
       TokenizerFactory scannerFactory,
       Extractor... extractors)Returns an Extractor for a subclass of  Tuplewith a constructor that
 has the given extractor types that uses the givenTokenizerFactoryfor parsing the sub-fields. | 
| static <K,V> Extractor<Pair<K,V>> | Extractors. xpair(TokenizerFactory scannerFactory,
     Extractor<K> one,
     Extractor<V> two)Returns an Extractor for pairs of the given types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| static <A,B,C,D> Extractor<Tuple4<A,B,C,D>> | Extractors. xquad(TokenizerFactory scannerFactory,
     Extractor<A> a,
     Extractor<B> b,
     Extractor<C> c,
     Extractor<D> d)Returns an Extractor for quads of the given types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| static <A,B,C> Extractor<Tuple3<A,B,C>> | Extractors. xtriple(TokenizerFactory scannerFactory,
       Extractor<A> a,
       Extractor<B> b,
       Extractor<C> c)Returns an Extractor for triples of the given types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| static Extractor<TupleN> | Extractors. xtupleN(TokenizerFactory scannerFactory,
       Extractor... extractors)Returns an Extractor for an arbitrary number of types that uses the given  TokenizerFactoryfor parsing the sub-fields. | 
| Constructor and Description | 
|---|
| AbstractCompositeExtractor(TokenizerFactory scannerFactory,
                          List<Extractor<?>> extractors) | 
Copyright © 2017 The Apache Software Foundation. All rights reserved.