public interface DocumentExtractor
Modifier and Type | Method and Description |
---|---|
boolean |
canExtract(java.io.File file)
Determines if the given file can be extracted by this extractor.
|
boolean |
canExtract(java.net.URI uri)
Determines if the given file can be extracted by this extractor.
|
boolean |
canExtract(java.net.URLConnection connection)
Determines if the given file can be extracted by this extractor.
|
java.lang.Iterable<? extends Document> |
extractAll(java.io.File file)
Attempts to extract all of the documents from the given file.
|
java.lang.Iterable<? extends Document> |
extractAll(java.net.URI uri)
Attempts to extract all of the documents from the given file.
|
java.lang.Iterable<? extends Document> |
extractAll(java.net.URLConnection connection)
Attempts to extract all of the documents from the given file.
|
boolean canExtract(java.io.File file) throws java.io.IOException
file
- The file to extract.java.io.IOException
- If there is an IO error.boolean canExtract(java.net.URI uri) throws java.io.IOException
uri
- The URI of the file to extract.java.io.IOException
- If there is an IO error.boolean canExtract(java.net.URLConnection connection) throws java.io.IOException
connection
- The connection to the file to extract.java.io.IOException
- If there is an IO error.java.lang.Iterable<? extends Document> extractAll(java.io.File file) throws DocumentExtractionException, java.io.IOException
file
- The file to extract.DocumentExtractionException
- If there is an error extracting data from the file.java.io.IOException
- If there is an IO error.java.lang.Iterable<? extends Document> extractAll(java.net.URI uri) throws DocumentExtractionException, java.io.IOException
uri
- The URI of the file to extract.DocumentExtractionException
- If there is an error extracting data from the file.java.io.IOException
- If there is an IO error.java.lang.Iterable<? extends Document> extractAll(java.net.URLConnection connection) throws DocumentExtractionException, java.io.IOException
connection
- The connection to the file to extract.DocumentExtractionException
- If there is an error extracting data from the file.java.io.IOException
- If there is an IO error.