Class TrecContentSource
Implements a ContentSource over the TREC collection.
Implements
System.IDisposable
Inherited Members
System.Object.Equals(System.Object)
System.Object.Equals(System.Object, System.Object)
System.Object.GetHashCode()
System.Object.GetType()
System.Object.MemberwiseClone()
System.Object.ReferenceEquals(System.Object, System.Object)
System.Object.ToString()
Namespace: Lucene.Net.Benchmarks.ByTask.Feeds
Assembly: Lucene.Net.Benchmark.dll
Syntax
public class TrecContentSource : ContentSource, IDisposable
Remarks
Supports the following configuration parameters (on top of ContentSource):
- work.dir: specifies the working directory. Required if "docs.dir" denotes a relative path (default=work).
- docs.dir: specifies the directory where the TREC files reside. Can be set to a relative path if "work.dir" is also specified (default=trec).
- trec.doc.parser: specifies the TrecDocParser class to use for parsing the content of the TREC documents (default=TrecGov2Parser).
- html.parser: specifies the IHTMLParser class to use for parsing the HTML parts of the TREC documents (default=DemoHTMLParser).
- content.source.encoding: if not specified, ISO-8859-1 is used.
- if true, do not append iteration number to docname
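As a rough illustration, the parameters listed above could be written as plain name=value lines in a benchmark algorithm (.alg) file. This is only a sketch: the values are the documented defaults, the placement in an .alg file and the `#` comment lines are assumptions about the surrounding setup, and the two parser settings are left out so that their defaults apply.

```properties
# Working directory; needed here because docs.dir below is a relative path.
work.dir=work
# Directory holding the TREC files (relative, so it depends on work.dir).
docs.dir=trec
# Same as the documented fallback; shown only to make the setting visible.
content.source.encoding=ISO-8859-1
```

Setting docs.dir to an absolute path instead would remove the need for work.dir, per the descriptions above.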
Fields
DOC
Declaration
public static readonly string DOC
Field Value
Type | Description |
---|---|
System.String |
DOCNO
Declaration
public static readonly string DOCNO
Field Value
Type | Description |
---|---|
System.String |
NEW_LINE
separator between lines in the buffer
Declaration
public static readonly string NEW_LINE
Field Value
Type | Description |
---|---|
System.String |
TERMINATING_DOC
Declaration
public static readonly string TERMINATING_DOC
Field Value
Type | Description |
---|---|
System.String |
TERMINATING_DOCNO
Declaration
public static readonly string TERMINATING_DOCNO
Field Value
Type | Description |
---|---|
System.String |
Methods
Dispose(Boolean)
Declaration
protected override void Dispose(bool disposing)
Parameters
Type | Name | Description |
---|---|---|
System.Boolean | disposing |
Overrides
GetNextDocData(DocData)
Declaration
public override DocData GetNextDocData(DocData docData)
Parameters
Type | Name | Description |
---|---|---|
DocData | docData |
Returns
Type | Description |
---|---|
DocData |
Overrides
ParseDate(String)
Declaration
public virtual DateTime? ParseDate(string dateStr)
Parameters
Type | Name | Description |
---|---|---|
System.String | dateStr |
Returns
Type | Description |
---|---|
System.Nullable<System.DateTime> |
ResetInputs()
Declaration
public override void ResetInputs()
Overrides
SetConfig(Config)
Declaration
public override void SetConfig(Config config)
Parameters
Type | Name | Description |
---|---|---|
Config | config |
Overrides