java.lang.Object
  org.apache.crunch.contrib.text.TokenizerFactory
public class TokenizerFactory
Factory class that constructs Tokenizer
instances for input strings that use a fixed
set of delimiters, skip patterns, locales, and sets of indices to keep or drop.
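As a rough illustration of what "indices to keep or drop" means for delimited input, the following self-contained sketch filters the fields of a string by position. It is not the Crunch implementation: the class name `IndexFilterSketch`, the `select` helper, and the choice of zero-based indices are all assumptions made for this example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical helper for illustration only; not part of the Crunch API.
class IndexFilterSketch {

    // Splits the input on a delimiter regex and returns either the fields at
    // the given positions (keep == true) or every other field (keep == false).
    // Positions are zero-based here, which is an assumption of this sketch.
    static List<String> select(String input, String delimiterRegex,
                               Set<Integer> indices, boolean keep) {
        String[] fields = input.split(delimiterRegex);
        List<String> selected = new ArrayList<>();
        for (int i = 0; i < fields.length; i++) {
            if (indices.contains(i) == keep) {
                selected.add(fields[i]);
            }
        }
        return selected;
    }

    public static void main(String[] args) {
        Set<Integer> indices = new TreeSet<>(Arrays.asList(0, 2));
        System.out.println(select("a,b,c,d", ",", indices, true));   // keeps positions 0 and 2
        System.out.println(select("a,b,c,d", ",", indices, false));  // drops positions 0 and 2
    }
}
```

A single set of positions plus a keep/drop flag is enough to express both modes, which is why the sketch takes one `Set<Integer>` rather than two.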
Nested Class Summary | |
---|---|
static class | TokenizerFactory.Builder: A class for constructing new TokenizerFactory instances using the Builder pattern. |
Method Summary | |
---|---|
static TokenizerFactory.Builder | builder(): Factory method for creating a TokenizerFactory.Builder instance. |
Tokenizer | create(String input): Returns a Tokenizer instance that wraps the input string and uses the delimiter, skip, and locale settings for this TokenizerFactory instance. |
static TokenizerFactory | getDefaultInstance(): Returns a default TokenizerFactory that uses whitespace as a delimiter and does not skip any input fields. |
Methods inherited from class java.lang.Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Method Detail |
---|
public static TokenizerFactory getDefaultInstance()
Returns a default TokenizerFactory that uses whitespace as a delimiter and does not skip any input fields.
Returns: the default TokenizerFactory
public Tokenizer create(String input)
Returns a Tokenizer instance that wraps the input string and uses the delimiter, skip, and locale settings for this TokenizerFactory instance.
Parameters: input - The input string
Returns: a Tokenizer instance with appropriate settings
public static TokenizerFactory.Builder builder()
Factory method for creating a TokenizerFactory.Builder instance.
Returns: a TokenizerFactory.Builder
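The create(String) description above amounts to wrapping an input string in a scanner configured with a fixed delimiter and locale. A minimal sketch of that idea using only java.util.Scanner might look like the following. The class `ScannerFactorySketch` and its `readAll` helper are hypothetical names introduced for illustration; skip patterns and index filtering are left out, and nothing here should be read as the actual Crunch code.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Scanner;

// Hypothetical stand-in for illustration only; not the Crunch implementation.
class ScannerFactorySketch {
    private final String delimiter; // regex handed to Scanner.useDelimiter
    private final Locale locale;    // locale used when parsing numbers

    ScannerFactorySketch(String delimiter, Locale locale) {
        this.delimiter = delimiter;
        this.locale = locale;
    }

    // Wraps the input string in a Scanner that uses this factory's settings.
    Scanner create(String input) {
        Scanner scanner = new Scanner(input);
        scanner.useDelimiter(delimiter);
        scanner.useLocale(locale);
        return scanner;
    }

    // Convenience helper: reads every remaining token into a list.
    static List<String> readAll(Scanner scanner) {
        List<String> tokens = new ArrayList<>();
        while (scanner.hasNext()) {
            tokens.add(scanner.next());
        }
        return tokens;
    }

    public static void main(String[] args) {
        ScannerFactorySketch factory = new ScannerFactorySketch(",", Locale.US);
        System.out.println(readAll(factory.create("a,b,c"))); // prints [a, b, c]
    }
}
```

Holding the delimiter and locale in the factory means each call to `create` only needs the input string, so the same settings can be reused across many records.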