| Package | Description |
|---|---|
| gov.sandia.cognition.text.convert | Provides classes for converting objects to a textual representation. |
| gov.sandia.cognition.text.document | Provides representations for textual documents. |
| gov.sandia.cognition.text.document.extractor | Provides extractors for pulling textual documents out of files. |


| Modifier and Type | Method and Description |
|---|---|
| Textual | DocumentFieldConcatenator.evaluate(Document document) |
| Field | DocumentSingleFieldConverter.evaluate(Document input) |


| Modifier and Type | Class and Description |
|---|---|
| class | AbstractDocument: An abstract implementation of the Document interface. |
| class | DefaultDocument: A default implementation of the Document interface. |


| Modifier and Type | Method and Description |
|---|---|
| Document | AbstractSingleDocumentExtractor.extractDocument(java.io.File file) |
| Document | SingleDocumentExtractor.extractDocument(java.io.File file): Attempts to extract a document from the given file. |
| Document | AbstractSingleDocumentExtractor.extractDocument(java.net.URI uri) |
| Document | SingleDocumentExtractor.extractDocument(java.net.URI uri): Attempts to extract a document from the given URI. |
| Document | SingleDocumentExtractor.extractDocument(java.net.URLConnection connection): Attempts to extract a document from the given connection. |
| Document | TextDocumentExtractor.extractDocument(java.net.URLConnection connection) |


| Modifier and Type | Method and Description |
|---|---|
| java.lang.Iterable<? extends Document> | AbstractDocumentExtractor.extractAll(java.io.File file) |
| java.util.List<? extends Document> | AbstractSingleDocumentExtractor.extractAll(java.io.File file) |
| java.lang.Iterable<? extends Document> | DocumentExtractor.extractAll(java.io.File file): Attempts to extract all of the documents from the given file. |
| java.lang.Iterable<? extends Document> | AbstractDocumentExtractor.extractAll(java.net.URI uri) |
| java.util.List<? extends Document> | AbstractSingleDocumentExtractor.extractAll(java.net.URI uri) |
| java.lang.Iterable<? extends Document> | DocumentExtractor.extractAll(java.net.URI uri): Attempts to extract all of the documents from the given URI. |
| java.util.List<? extends Document> | AbstractSingleDocumentExtractor.extractAll(java.net.URLConnection connection) |
| java.lang.Iterable<? extends Document> | DocumentExtractor.extractAll(java.net.URLConnection connection): Attempts to extract all of the documents from the given connection. |
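
The tables above show the single-document extractor narrowing the return type of extractAll from Iterable<? extends Document> to List<? extends Document>. The following is a minimal, self-contained sketch of how such a covariant override can work. It does not use the library: Doc, SimpleDoc, DocExtractor, SingleDocExtractor, AbstractSingleDocExtractor, and PlainTextExtractor are hypothetical stand-ins, and both the interface hierarchy and the singleton-list implementation of extractAll are assumptions made for illustration, not the library's confirmed behavior.

```java
import java.io.File;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.util.Collections;
import java.util.List;

// Entry point: writes a temporary file and extracts it both ways.
class ExtractorSketch {
    public static void main(String[] args) throws IOException {
        File file = File.createTempFile("sketch", ".txt");
        file.deleteOnExit();
        Files.write(file.toPath(), "hello".getBytes(StandardCharsets.UTF_8));

        AbstractSingleDocExtractor extractor = new PlainTextExtractor();
        Doc single = extractor.extractDocument(file);
        List<? extends Doc> all = extractor.extractAll(file);

        System.out.println(single.text()); // prints "hello"
        System.out.println(all.size());    // prints "1"
    }
}

// Hypothetical stand-in for a document type.
interface Doc {
    String text();
}

// Hypothetical minimal document holding only its text.
final class SimpleDoc implements Doc {
    private final String text;

    SimpleDoc(String text) {
        this.text = text;
    }

    @Override
    public String text() {
        return text;
    }
}

// Hypothetical general extractor: a source may yield many documents.
interface DocExtractor {
    Iterable<? extends Doc> extractAll(File file) throws IOException;
}

// Hypothetical extractor for sources that hold exactly one document.
// Assumption: it extends the general extractor interface.
interface SingleDocExtractor extends DocExtractor {
    Doc extractDocument(File file) throws IOException;
}

// Assumed adapter: implements extractAll in terms of extractDocument and
// narrows the return type from Iterable to List (a covariant override).
abstract class AbstractSingleDocExtractor implements SingleDocExtractor {
    @Override
    public List<? extends Doc> extractAll(File file) throws IOException {
        return Collections.singletonList(extractDocument(file));
    }
}

// Hypothetical concrete extractor that reads the whole file as UTF-8 text.
class PlainTextExtractor extends AbstractSingleDocExtractor {
    @Override
    public Doc extractDocument(File file) throws IOException {
        byte[] bytes = Files.readAllBytes(file.toPath());
        return new SimpleDoc(new String(bytes, StandardCharsets.UTF_8));
    }
}
```

Under this design, callers that hold the abstract single-document type get indexed access to the result, while callers that hold only the general extractor interface still see a plain Iterable.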