    Class TrecGov2Parser

    Parser for the GOV2 collection format

    Inheritance
    object
    TrecDocParser
    TrecGov2Parser
    Inherited Members
    TrecDocParser.DEFAULT_PATH_TYPE
    TrecDocParser.PathType(FileInfo)
    TrecDocParser.StripTags(StringBuilder, int)
    TrecDocParser.StripTags(string, int)
    TrecDocParser.Extract(StringBuilder, string, string, int, string[])
    object.Equals(object)
    object.Equals(object, object)
    object.GetHashCode()
    object.GetType()
    object.MemberwiseClone()
    object.ReferenceEquals(object, object)
    object.ToString()
    Namespace: Lucene.Net.Benchmarks.ByTask.Feeds
    Assembly: Lucene.Net.Benchmark.dll
    Syntax
    public class TrecGov2Parser : TrecDocParser

    Methods

    Parse(DocData, string, TrecContentSource, StringBuilder, ParsePathType)

    Parses the text prepared in docBuf into a result DocData. No synchronization is required.

    Declaration
    public override DocData Parse(DocData docData, string name, TrecContentSource trecSrc, StringBuilder docBuf, TrecDocParser.ParsePathType pathType)
    Parameters
    Type                         Name      Description
    DocData                      docData   Reusable result.
    string                       name      Name to set on the result.
    TrecContentSource            trecSrc   The calling TREC content source.
    StringBuilder                docBuf    Text to parse.
    TrecDocParser.ParsePathType  pathType  Type of the parsed file, or UNKNOWN if it cannot be determined. Parsers may use it to alter their behavior according to the file path type.

    Returns
    Type Description
    DocData
    Overrides
    TrecDocParser.Parse(DocData, string, TrecContentSource, StringBuilder, TrecDocParser.ParsePathType)
    Copyright © 2024 The Apache Software Foundation, Licensed under the Apache License, Version 2.0
    Apache Lucene.Net, Lucene.Net, Apache, the Apache feather logo, and the Apache Lucene.Net project logo are trademarks of The Apache Software Foundation.
    All other marks mentioned may be trademarks or registered trademarks of their respective owners.