Package | Description |
---|---|
gov.sandia.cognition.text.term |
Provides term representing text content in documents.
|
gov.sandia.cognition.text.token |
Provides text tokenization algorithms.
|
Modifier and Type | Method and Description |
---|---|
Term |
TermFactory.createTerm(Token token)
Creates a new term from the given token.
|
Modifier and Type | Class and Description |
---|---|
class |
DefaultToken
A default implementation of the
Token interface. |
Modifier and Type | Method and Description |
---|---|
java.lang.Iterable<Token> |
AbstractCharacterBasedTokenizer.tokenize(java.io.Reader reader) |
java.lang.Iterable<Token> |
Tokenizer.tokenize(java.io.Reader reader)
Converts the string from the given reader into an ordered list of tokens.
|
java.lang.Iterable<Token> |
AbstractTokenizer.tokenize(java.lang.String s) |
java.lang.Iterable<Token> |
Tokenizer.tokenize(java.lang.String s)
Convert the given string into a corresponding ordered list of tokens.
|
java.lang.Iterable<Token> |
AbstractTokenizer.tokenize(Textual textual) |
java.lang.Iterable<Token> |
Tokenizer.tokenize(Textual textual)
Convert the given string into a corresponding ordered list of tokens.
|