Class DatasetSplitter
Utility class for creating training / test / cross validation indexes from the original index.
Inherited Members
Namespace: Lucene.Net.Classification.Utils
Assembly: Lucene.Net.Classification.dll
Syntax
public class DatasetSplitter
Constructors
DatasetSplitter(double, double)
Create a DatasetSplitter by giving test and cross validation IDXs sizes
Declaration
public DatasetSplitter(double testRatio, double crossValidationRatio)
Parameters
Type | Name | Description |
---|---|---|
double | testRatio | the ratio of the original index to be used for the test IDX as a double between 0.0 and 1.0 |
double | crossValidationRatio | the ratio of the original index to be used for the c.v. IDX as a double between 0.0 and 1.0 |
Methods
Split(AtomicReader, Directory, Directory, Directory, Analyzer, params string[])
Split a given index into 3 indexes for training, test and cross validation tasks respectively
Declaration
public virtual void Split(AtomicReader originalIndex, Directory trainingIndex, Directory testIndex, Directory crossValidationIndex, Analyzer analyzer, params string[] fieldNames)
Parameters
Type | Name | Description |
---|---|---|
AtomicReader | originalIndex | an Lucene.Net.Index.AtomicReader on the source index |
Directory | trainingIndex | a Lucene.Net.Store.Directory used to write the training index |
Directory | testIndex | a Lucene.Net.Store.Directory used to write the test index |
Directory | crossValidationIndex | a Lucene.Net.Store.Directory used to write the cross validation index |
Analyzer | analyzer | Lucene.Net.Analysis.Analyzer used to create the new docs |
string[] | fieldNames | names of fields that need to be put in the new indexes or |
Exceptions
Type | Condition |
---|---|
IOException | if any writing operation fails on any of the indexes |