public class NGramFilter extends AbstractCloneableSerializable implements TermFilter
Modifier and Type | Field and Description |
---|---|
static int |
DEFAULT_SIZE
The default is a bigram.
|
protected int |
size
The size of the n-gram.
|
Constructor and Description |
---|
NGramFilter()
Creates a new
NGramFilter with the default size. |
NGramFilter(int size)
Creates a new
NGramFilter with the given size. |
Modifier and Type | Method and Description |
---|---|
NGramFilter |
clone()
This makes public the clone method on the
Object class and
removes the exception that it throws. |
java.util.Collection<TermOccurrence> |
filterTerms(java.lang.Iterable<? extends TermOccurrence> terms)
Filters the given list of terms into a new list of terms based on some
internal criteria for what constitutes a term.
|
int |
getSize()
Gets the size of the n-gram created by the filter.
|
void |
setSize(int size)
Sets the size of the n-gram created by the filter.
|
public static final int DEFAULT_SIZE
protected int size
public NGramFilter()
NGramFilter
with the default size.public NGramFilter(int size)
NGramFilter
with the given size.size
- The size of the n-grams to create. Must be greater than 1.public NGramFilter clone()
AbstractCloneableSerializable
Object
class and
removes the exception that it throws. Its default behavior is to
automatically create a clone of the exact type of object that the
clone is called on and to copy all primitives but to keep all references,
which means it is a shallow copy.
Extensions of this class may want to override this method (but call
super.clone()
to implement a "smart copy". That is, to target
the most common use case for creating a copy of the object. Because of
the default behavior being a shallow copy, extending classes only need
to handle fields that need to have a deeper copy (or those that need to
be reset). Some of the methods in ObjectUtil
may be helpful in
implementing a custom clone method.
Note: The contract of this method is that you must use
super.clone()
as the basis for your implementation.clone
in interface CloneableSerializable
clone
in class AbstractCloneableSerializable
public java.util.Collection<TermOccurrence> filterTerms(java.lang.Iterable<? extends TermOccurrence> terms)
TermFilter
filterTerms
in interface TermFilter
terms
- The terms to filter.public int getSize()
public void setSize(int size)
size
- The size of the n-gram created by the filter. Must be greater than
1.