Class DatasetSplitter
Utility class for creating training / test / cross validation indexes from the original index.
Inheritance
Inherited Members
Namespace: Lucene.Net.Classification.Utils
Assembly: Lucene.Net.Classification.dll
Syntax
public class DatasetSplitter
  Constructors
| Improve this Doc View SourceDatasetSplitter(Double, Double)
Create a DatasetSplitter by giving test and cross validation IDXs sizes
Declaration
public DatasetSplitter(double testRatio, double crossValidationRatio)
  Parameters
| Type | Name | Description | 
|---|---|---|
| System.Double | testRatio | the ratio of the original index to be used for the test IDX as a System.Double between 0.0 and 1.0  | 
      
| System.Double | crossValidationRatio | the ratio of the original index to be used for the c.v. IDX as a System.Double between 0.0 and 1.0  | 
      
Methods
| Improve this Doc View SourceSplit(AtomicReader, Directory, Directory, Directory, Analyzer, String[])
Split a given index into 3 indexes for training, test and cross validation tasks respectively
Declaration
public virtual void Split(AtomicReader originalIndex, Directory trainingIndex, Directory testIndex, Directory crossValidationIndex, Analyzer analyzer, params string[] fieldNames)
  Parameters
| Type | Name | Description | 
|---|---|---|
| AtomicReader | originalIndex | an AtomicReader on the source index  | 
      
| Directory | trainingIndex | a Directory used to write the training index  | 
      
| Directory | testIndex | a Directory used to write the test index  | 
      
| Directory | crossValidationIndex | a Directory used to write the cross validation index  | 
      
| Analyzer | analyzer | Analyzer used to create the new docs  | 
      
| System.String[] | fieldNames | names of fields that need to be put in the new indexes or   | 
      
Exceptions
| Type | Condition | 
|---|---|
| System.IO.IOException | if any writing operation fails on any of the indexes  |