Package | Description |
---|---|
gov.sandia.cognition.text.token | Provides text tokenization algorithms. |

Modifier and Type | Class and Description |
---|---|
class | AbstractCharacterBasedTokenizer: An abstract implementation of a tokenizer that considers each character individually. |
class | AbstractTokenizer: Abstract implementation of the Tokenizer interface. |
class | LetterNumberTokenizer: A tokenizer that creates tokens from sequences of letters and numbers, treating everything else as a delimiter. |
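
The behavior described for LetterNumberTokenizer can be illustrated with a small stand-alone sketch. This is not the library's implementation and does not use its API: the class name `LetterNumberSketch` and the method `tokenize` are hypothetical, and the sketch assumes that a mixed run such as `abc123` stays together as one token and that delimiters are discarded.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-alone sketch; NOT the library's LetterNumberTokenizer.
final class LetterNumberSketch {

    // Splits text into maximal runs of letters and digits. Every other
    // character is treated as a delimiter and dropped (an assumption).
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            if (Character.isLetterOrDigit(cp)) {
                // Still inside a token: keep accumulating.
                current.appendCodePoint(cp);
            } else if (current.length() > 0) {
                // Hit a delimiter: close the token in progress.
                tokens.add(current.toString());
                current.setLength(0);
            }
            i += Character.charCount(cp);
        }
        // Flush a token that runs to the end of the input.
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // prints [Mr, Cogsworth, s, 2, clocks]
        System.out.println(tokenize("Mr. Cogsworth's 2 clocks!"));
    }
}
```

Note that the apostrophe acts as a delimiter here, so `Cogsworth's` yields two tokens. Whether the actual class behaves the same way on such input should be checked against its own documentation.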