Fork me on GitHub
  • API

    Show / Hide Table of Contents

    Class DatasetSplitter

    Utility class for creating training / test / cross validation indexes from the original index.

    Inheritance
    object
    DatasetSplitter
    Inherited Members
    object.Equals(object)
    object.Equals(object, object)
    object.GetHashCode()
    object.GetType()
    object.MemberwiseClone()
    object.ReferenceEquals(object, object)
    object.ToString()
    Namespace: Lucene.Net.Classification.Utils
    Assembly: Lucene.Net.Classification.dll
    Syntax
    public class DatasetSplitter

    Constructors

    DatasetSplitter(double, double)

    Create a DatasetSplitter by giving test and cross validation IDXs sizes

    Declaration
    public DatasetSplitter(double testRatio, double crossValidationRatio)
    Parameters
    Type Name Description
    double testRatio

    the ratio of the original index to be used for the test IDX as a double between 0.0 and 1.0

    double crossValidationRatio

    the ratio of the original index to be used for the c.v. IDX as a double between 0.0 and 1.0

    Methods

    Split(AtomicReader, Directory, Directory, Directory, Analyzer, params string[])

    Split a given index into 3 indexes for training, test and cross validation tasks respectively

    Declaration
    public virtual void Split(AtomicReader originalIndex, Directory trainingIndex, Directory testIndex, Directory crossValidationIndex, Analyzer analyzer, params string[] fieldNames)
    Parameters
    Type Name Description
    AtomicReader originalIndex

    an Lucene.Net.Index.AtomicReader on the source index

    Directory trainingIndex

    a Lucene.Net.Store.Directory used to write the training index

    Directory testIndex

    a Lucene.Net.Store.Directory used to write the test index

    Directory crossValidationIndex

    a Lucene.Net.Store.Directory used to write the cross validation index

    Analyzer analyzer

    Lucene.Net.Analysis.Analyzer used to create the new docs

    string[] fieldNames

    names of fields that need to be put in the new indexes or null if all should be used

    Exceptions
    Type Condition
    IOException

    if any writing operation fails on any of the indexes

    Back to top Copyright © 2024 The Apache Software Foundation, Licensed under the Apache License, Version 2.0
    Apache Lucene.Net, Lucene.Net, Apache, the Apache feather logo, and the Apache Lucene.Net project logo are trademarks of The Apache Software Foundation.
    All other marks mentioned may be trademarks or registered trademarks of their respective owners.