Show / Hide Table of Contents

    Class ExtractReuters

    Split the Reuters SGML documents into Simple Text files containing: Title, Date, Dateline, Body

    Inheritance
    System.Object
    ExtractReuters
    Namespace: Lucene.Net.Benchmarks.Utils
    Assembly: Lucene.Net.Benchmark.dll
    Syntax
    public class ExtractReuters : object

    Constructors

    | Improve this Doc View Source

    ExtractReuters(DirectoryInfo, DirectoryInfo)

    Declaration
    public ExtractReuters(DirectoryInfo reutersDir, DirectoryInfo outputDir)
    Parameters
    Type Name Description
    DirectoryInfo reutersDir
    DirectoryInfo outputDir

    Methods

    | Improve this Doc View Source

    Extract()

    Declaration
    public virtual void Extract()
    | Improve this Doc View Source

    ExtractFile(FileInfo)

    Override if you wish to change what is extracted

    Declaration
    protected virtual void ExtractFile(FileInfo sgmFile)
    Parameters
    Type Name Description
    FileInfo sgmFile
    | Improve this Doc View Source

    Main(String[])

    Declaration
    public static void Main(string[] args)
    Parameters
    Type Name Description
    System.String[] args
    • Improve this Doc
    • View Source
    Back to top Copyright © 2020 Licensed to the Apache Software Foundation (ASF)