Class ExtractReuters
Split the Reuters SGML documents into Simple Text files containing: Title, Date, Dateline, Body
Inheritance
System.Object
ExtractReuters
Inherited Members
System.Object.Equals(System.Object)
System.Object.Equals(System.Object, System.Object)
System.Object.GetHashCode()
System.Object.GetType()
System.Object.MemberwiseClone()
System.Object.ReferenceEquals(System.Object, System.Object)
System.Object.ToString()
Namespace: Lucene.Net.Benchmarks.Utils
Assembly: Lucene.Net.Benchmark.dll
Syntax
public class ExtractReuters
Constructors
| Improve this Doc View SourceExtractReuters(DirectoryInfo, DirectoryInfo)
Declaration
public ExtractReuters(DirectoryInfo reutersDir, DirectoryInfo outputDir)
Parameters
Type | Name | Description |
---|---|---|
System.IO.DirectoryInfo | reutersDir | |
System.IO.DirectoryInfo | outputDir |
Methods
| Improve this Doc View SourceExtract()
Declaration
public virtual void Extract()
ExtractFile(FileInfo)
Override if you wish to change what is extracted
Declaration
protected virtual void ExtractFile(FileInfo sgmFile)
Parameters
Type | Name | Description |
---|---|---|
System.IO.FileInfo | sgmFile |
Main(String[])
Declaration
public static void Main(string[] args)
Parameters
Type | Name | Description |
---|---|---|
System.String[] | args |