Package | Description |
---|---|
gov.sandia.cognition.text.convert | Provides classes for converting objects to a textual representation. |
gov.sandia.cognition.text.document | Provides representations for textual documents. |
gov.sandia.cognition.text.document.extractor | Provides extractors for pulling textual documents out of files. |

Modifier and Type | Method and Description |
---|---|
Textual | DocumentFieldConcatenator.evaluate(Document document) |
Field | DocumentSingleFieldConverter.evaluate(Document input) |
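
Both `evaluate` methods listed above take a `Document` and return a narrower view of it: one `Field`, or a single `Textual` value. The following is a minimal sketch of that shape only, not the library's code. Every type in it (`SketchField`, `SketchDocument`, `SingleFieldSketch`, `FieldConcatenatorSketch`) is a hypothetical stand-in declared locally, and the field lookup by name, the separator, and the skipping of missing fields are assumptions made for illustration.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical stand-in for a named field; NOT the gov.sandia.cognition type.
final class SketchField {
    final String name;
    final String text;

    SketchField(String name, String text) {
        this.name = name;
        this.text = text;
    }
}

// Hypothetical stand-in for a document: a set of fields looked up by name.
final class SketchDocument {
    private final Map<String, SketchField> fields = new LinkedHashMap<>();

    void addField(SketchField field) {
        fields.put(field.name, field);
    }

    SketchField getField(String name) {
        return fields.get(name);
    }
}

// Same shape as a single-field converter: document in, one field out.
final class SingleFieldSketch {
    private final String fieldName;

    SingleFieldSketch(String fieldName) {
        this.fieldName = fieldName;
    }

    SketchField evaluate(SketchDocument input) {
        return input.getField(fieldName); // null if the field is absent
    }
}

// Same shape as a field concatenator: document in, one text value out.
final class FieldConcatenatorSketch {
    private final String separator;
    private final String[] fieldNames;

    FieldConcatenatorSketch(String separator, String... fieldNames) {
        this.separator = separator;
        this.fieldNames = fieldNames;
    }

    String evaluate(SketchDocument document) {
        StringBuilder result = new StringBuilder();
        boolean first = true;
        for (String name : fieldNames) {
            SketchField field = document.getField(name);
            if (field == null) {
                continue; // assumption: missing fields are skipped
            }
            if (!first) {
                result.append(separator);
            }
            result.append(field.text);
            first = false;
        }
        return result.toString();
    }
}
```

With a document holding a `title` and a `body` field, `new SingleFieldSketch("title").evaluate(doc)` would return the title field, and `new FieldConcatenatorSketch("\n", "title", "body").evaluate(doc)` would return both texts joined by a newline.
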

Modifier and Type | Class and Description |
---|---|
class | AbstractDocument: An abstract implementation of the Document interface. |
class | DefaultDocument: A default implementation of the Document interface. |

Modifier and Type | Method and Description |
---|---|
Document | AbstractSingleDocumentExtractor.extractDocument(java.io.File file) |
Document | SingleDocumentExtractor.extractDocument(java.io.File file): Attempts to extract a document from the given file. |
Document | AbstractSingleDocumentExtractor.extractDocument(java.net.URI uri) |
Document | SingleDocumentExtractor.extractDocument(java.net.URI uri): Attempts to extract a document from the given URI. |
Document | SingleDocumentExtractor.extractDocument(java.net.URLConnection connection): Attempts to extract a document from the given connection. |
Document | TextDocumentExtractor.extractDocument(java.net.URLConnection connection) |
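
The table above shows the abstract class supplying the `File` and `URI` overloads while the concrete text extractor supplies the `URLConnection` one. One plausible reading is a delegation chain, where each overload converts its argument and forwards to the next. The sketch below illustrates that pattern under that assumption; it is not taken from the library. `TextDoc`, `SingleExtractorSketch`, and `PlainTextExtractorSketch` are hypothetical names, and reading the stream as UTF-8 is an arbitrary choice.

```java
import java.io.File;
import java.io.IOException;
import java.io.InputStream;
import java.net.URI;
import java.net.URLConnection;
import java.nio.charset.StandardCharsets;

// Hypothetical stand-in for a document: just the extracted text.
final class TextDoc {
    final String body;

    TextDoc(String body) {
        this.body = body;
    }
}

// Assumed delegation chain: File -> URI -> URLConnection.
abstract class SingleExtractorSketch {
    TextDoc extractDocument(File file) throws IOException {
        return extractDocument(file.toURI());
    }

    TextDoc extractDocument(URI uri) throws IOException {
        return extractDocument(uri.toURL().openConnection());
    }

    // Subclasses only need to handle an open connection.
    abstract TextDoc extractDocument(URLConnection connection) throws IOException;
}

// Concrete extractor that treats the whole stream as plain text.
final class PlainTextExtractorSketch extends SingleExtractorSketch {
    @Override
    TextDoc extractDocument(URLConnection connection) throws IOException {
        try (InputStream in = connection.getInputStream()) {
            // Assumption: the content is UTF-8 encoded (readAllBytes needs Java 9+).
            return new TextDoc(new String(in.readAllBytes(), StandardCharsets.UTF_8));
        }
    }
}
```

With this layout a new format only has to implement the `URLConnection` overload; callers can still pass a `File` or a `URI`.
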

Modifier and Type | Method and Description |
---|---|
java.lang.Iterable<? extends Document> | AbstractDocumentExtractor.extractAll(java.io.File file) |
java.util.List<? extends Document> | AbstractSingleDocumentExtractor.extractAll(java.io.File file) |
java.lang.Iterable<? extends Document> | DocumentExtractor.extractAll(java.io.File file): Attempts to extract all of the documents from the given file. |
java.lang.Iterable<? extends Document> | AbstractDocumentExtractor.extractAll(java.net.URI uri) |
java.util.List<? extends Document> | AbstractSingleDocumentExtractor.extractAll(java.net.URI uri) |
java.lang.Iterable<? extends Document> | DocumentExtractor.extractAll(java.net.URI uri): Attempts to extract all of the documents from the given URI. |
java.util.List<? extends Document> | AbstractSingleDocumentExtractor.extractAll(java.net.URLConnection connection) |
java.lang.Iterable<? extends Document> | DocumentExtractor.extractAll(java.net.URLConnection connection): Attempts to extract all of the documents from the given connection. |
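
In the listing above, the single-document extractor narrows the return type of `extractAll` from `Iterable<? extends Document>` to `List<? extends Document>`. Java permits this as a covariant override because `List` is a subtype of `Iterable`. The sketch below shows the mechanism with locally declared, hypothetical types (`Doc`, `MultiExtractorSketch`, `OneDocExtractorSketch`); wrapping the one extracted document in a singleton list is an assumption about how such a class might behave, not a statement about the library's implementation.

```java
import java.io.File;
import java.io.IOException;
import java.util.Collections;
import java.util.List;

// Hypothetical stand-in for a document.
interface Doc {
    String text();
}

// General contract: a source may yield any number of documents.
interface MultiExtractorSketch {
    Iterable<? extends Doc> extractAll(File file) throws IOException;
}

// Extractor for sources that always hold exactly one document.
abstract class OneDocExtractorSketch implements MultiExtractorSketch {
    abstract Doc extractDocument(File file) throws IOException;

    // Covariant override: List is an Iterable, so narrowing the return type is legal.
    @Override
    public List<? extends Doc> extractAll(File file) throws IOException {
        // Assumption: the single document is wrapped in an immutable one-element list.
        return Collections.singletonList(extractDocument(file));
    }
}
```

Callers holding the narrower type get `size()` and `get(0)` directly, while code written against the general interface can still iterate the result.
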