OutputType
- The output category type for the training data.@PublicationReference(author={"David A. Cieslak","Nitesh V. Chawla"}, title="Increasing Skew Insensitivity of Decision Trees with Hellinger Distance", type=TechnicalReport, year=2008, publication="Notre Dame University Computer Science and Engineering Technical Reports", url="http://www.cse.nd.edu/Reports/2008/TR-2008-06.pdf") public class VectorThresholdHellingerDistanceLearner<OutputType> extends AbstractVectorThresholdMaximumGainLearner<OutputType>
DEFAULT_MIN_SPLIT_SIZE, dimensionsToConsider, minSplitSize
Constructor and Description |
---|
VectorThresholdHellingerDistanceLearner()
Creates a new
VectorThresholdHellingerDistanceLearner . |
VectorThresholdHellingerDistanceLearner(int minSplitSize)
Creates a new
VectorThresholdHellingerDistanceLearner . |
Modifier and Type | Method and Description |
---|---|
VectorThresholdHellingerDistanceLearner<OutputType> |
clone()
This makes public the clone method on the
Object class and
removes the exception that it throws. |
double |
computeSplitGain(DefaultDataDistribution<OutputType> baseCounts,
DefaultDataDistribution<OutputType> positiveCounts,
DefaultDataDistribution<OutputType> negativeCounts)
Computes the split gain by computing the mean Hellinger distance for the
given split.
|
computeBestGainAndThreshold, computeBestGainAndThreshold, getDimensionsToConsider, getMinSplitSize, learn, setDimensionsToConsider, setMinSplitSize
public VectorThresholdHellingerDistanceLearner()
VectorThresholdHellingerDistanceLearner
.public VectorThresholdHellingerDistanceLearner(int minSplitSize)
VectorThresholdHellingerDistanceLearner
.minSplitSize
- The minimum split size. Must be positive.public VectorThresholdHellingerDistanceLearner<OutputType> clone()
AbstractCloneableSerializable
Object
class and
removes the exception that it throws. Its default behavior is to
automatically create a clone of the exact type of object that the
clone is called on and to copy all primitives but to keep all references,
which means it is a shallow copy.
Extensions of this class may want to override this method (but call
super.clone()
to implement a "smart copy". That is, to target
the most common use case for creating a copy of the object. Because of
the default behavior being a shallow copy, extending classes only need
to handle fields that need to have a deeper copy (or those that need to
be reset). Some of the methods in ObjectUtil
may be helpful in
implementing a custom clone method.
Note: The contract of this method is that you must use
super.clone()
as the basis for your implementation.clone
in interface CloneableSerializable
clone
in class AbstractVectorThresholdMaximumGainLearner<OutputType>
public double computeSplitGain(DefaultDataDistribution<OutputType> baseCounts, DefaultDataDistribution<OutputType> positiveCounts, DefaultDataDistribution<OutputType> negativeCounts)
computeSplitGain
in class AbstractVectorThresholdMaximumGainLearner<OutputType>
baseCounts
- The histogram of counts before the split.positiveCounts
- The counts on the positive side of the threshold.negativeCounts
- The counts on the negative side of the threshold.