java.lang.Object
  org.apache.crunch.contrib.text.TokenizerFactory
public class TokenizerFactory
Factory class that constructs Tokenizer instances for input strings that use a fixed
set of delimiters, skip patterns, locales, and sets of indices to keep or drop.
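As an illustration of what "indices to keep or drop" can mean in practice, here is a minimal, self-contained sketch built on `java.util.Scanner`. The class `FieldFilterSketch` and its `tokens` method are hypothetical and are not part of Crunch; zero-based field numbering is also an assumption made for this sketch.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Scanner;
import java.util.Set;

public class FieldFilterSketch {
    // Hypothetical helper (not the Crunch API): splits the input on a
    // delimiter pattern and keeps or drops fields by zero-based index.
    public static List<String> tokens(String input, String delimiter,
                                      Set<Integer> indices, boolean keep) {
        List<String> out = new ArrayList<>();
        try (Scanner scanner = new Scanner(input)) {
            scanner.useDelimiter(delimiter);
            scanner.useLocale(Locale.ROOT);
            int index = 0;
            while (scanner.hasNext()) {
                String token = scanner.next();
                // keep == true: emit only the listed indices.
                // keep == false: emit every field except the listed indices.
                if (indices.contains(index) == keep) {
                    out.add(token);
                }
                index++;
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Set<Integer> indices = new HashSet<>(Arrays.asList(0, 2));
        System.out.println(tokens("a,b,c,d", ",", indices, true));  // [a, c]
        System.out.println(tokens("a,b,c,d", ",", indices, false)); // [b, d]
    }
}
```

Fixing the delimiter and index set once and reusing them for every input line is the reason a factory is convenient here: the per-line work reduces to wrapping the string.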
| Nested Class Summary | |
|---|---|
| static class | TokenizerFactory.Builder: A class for constructing new TokenizerFactory instances using the Builder pattern. |
| Method Summary | |
|---|---|
| static TokenizerFactory.Builder | builder(): Factory method for creating a TokenizerFactory.Builder instance. |
| Tokenizer | create(String input): Returns a Tokenizer instance that wraps the input string and uses the delimiter, skip, and locale settings for this TokenizerFactory instance. |
| static TokenizerFactory | getDefaultInstance(): Returns a default TokenizerFactory that uses whitespace as a delimiter and does not skip any input fields. |
| Methods inherited from class java.lang.Object |
|---|
| equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Method Detail |
|---|
public static TokenizerFactory getDefaultInstance()
Returns a default TokenizerFactory that uses whitespace as a delimiter and does
not skip any input fields.
Returns: a TokenizerFactory

public Tokenizer create(String input)
Returns a Tokenizer instance that wraps the input string and uses the delimiter,
skip, and locale settings for this TokenizerFactory instance.
Parameters: input - The input string
Returns: a Tokenizer instance with appropriate settings

public static TokenizerFactory.Builder builder()
Factory method for creating a TokenizerFactory.Builder instance.
Returns: a TokenizerFactory.Builder
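
The delimiter, skip, and locale settings mentioned for create(String input) correspond to standard `java.util.Scanner` features. The sketch below shows one plausible way a factory could apply such fixed settings to each new input string. `ScannerSettingsSketch` is a hypothetical stand-in written for illustration, not the Crunch implementation, and the way the skip pattern is applied (once, at the start of the input) is an assumption.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Scanner;

public class ScannerSettingsSketch {
    // Hypothetical fixed settings, chosen once and reused for every input.
    private final String delimiter; // regex that separates fields
    private final String skip;      // regex skipped at the start, or null
    private final Locale locale;    // controls how numbers are parsed

    public ScannerSettingsSketch(String delimiter, String skip, Locale locale) {
        this.delimiter = delimiter;
        this.skip = skip;
        this.locale = locale;
    }

    // Wraps the input string in a Scanner configured with the fixed settings.
    public Scanner create(String input) {
        Scanner scanner = new Scanner(input);
        scanner.useLocale(locale);
        scanner.useDelimiter(delimiter);
        if (skip != null) {
            // Scanner.skip throws NoSuchElementException if the pattern
            // does not match at the current position.
            scanner.skip(skip);
        }
        return scanner;
    }

    // Reads consecutive double-valued fields until a non-double token or end of input.
    public static List<Double> readDoubles(Scanner scanner) {
        List<Double> out = new ArrayList<>();
        while (scanner.hasNextDouble()) {
            out.add(scanner.nextDouble());
        }
        return out;
    }

    public static void main(String[] args) {
        ScannerSettingsSketch factory =
            new ScannerSettingsSketch(";", "#\\s*", Locale.US);
        try (Scanner scanner = factory.create("# 1.5;2.5;3.5")) {
            System.out.println(readDoubles(scanner)); // [1.5, 2.5, 3.5]
        }
    }
}
```

The locale matters for numeric fields because `Scanner` parses decimal and grouping separators according to it, so fixing it in the factory keeps parsing consistent across machines with different default locales.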