Show / Hide Table of Contents

    Class ClassicAnalyzer

    Filters ClassicTokenizer with ClassicFilter, LowerCaseFilter and StopFilter, using a list of English stop words.

    You must specify the required LuceneVersion compatibility when creating ClassicAnalyzer:

    • As of 3.1, StopFilter correctly handles Unicode 4.0 supplementary characters in stopwords
    • As of 2.9, StopFilter preserves position increments
    • As of 2.4, Tokens incorrectly identified as acronyms are corrected (see LUCENE-1068)
    ClassicAnalyzer was named StandardAnalyzer in Lucene versions prior to 3.1. As of 3.1, StandardAnalyzer implements Unicode text segmentation, as specified by UAX#29.

    Inheritance
    System.Object
    Analyzer
    StopwordAnalyzerBase
    ClassicAnalyzer
    Inherited Members
    StopwordAnalyzerBase.m_stopwords
    StopwordAnalyzerBase.m_matchVersion
    StopwordAnalyzerBase.StopwordSet
    StopwordAnalyzerBase.LoadStopwordSet(Boolean, Type, String, String)
    StopwordAnalyzerBase.LoadStopwordSet(FileInfo, LuceneVersion)
    StopwordAnalyzerBase.LoadStopwordSet(TextReader, LuceneVersion)
    Lucene.Net.Analysis.Analyzer.NewAnonymous(Func<, , >)
    Lucene.Net.Analysis.Analyzer.NewAnonymous(Func<, , >, Lucene.Net.Analysis.ReuseStrategy)
    Lucene.Net.Analysis.Analyzer.NewAnonymous(Func<, , >, Func<, , >)
    Lucene.Net.Analysis.Analyzer.NewAnonymous(Func<, , >, Func<, , >, Lucene.Net.Analysis.ReuseStrategy)
    Analyzer.GetTokenStream(String, TextReader)
    Analyzer.GetTokenStream(String, String)
    Analyzer.InitReader(String, TextReader)
    Analyzer.GetPositionIncrementGap(String)
    Analyzer.GetOffsetGap(String)
    Analyzer.Strategy
    Analyzer.Dispose()
    Analyzer.Dispose(Boolean)
    Analyzer.GLOBAL_REUSE_STRATEGY
    Analyzer.PER_FIELD_REUSE_STRATEGY
    Namespace: Lucene.Net.Analysis.Standard
    Assembly: Lucene.Net.Analysis.Common.dll
    Syntax
    public sealed class ClassicAnalyzer : StopwordAnalyzerBase

    Constructors

    | Improve this Doc View Source

    ClassicAnalyzer(LuceneVersion)

    Builds an analyzer with the default stop words (STOP_WORDS_SET).

    Declaration
    public ClassicAnalyzer(LuceneVersion matchVersion)
    Parameters
    Type Name Description
    LuceneVersion matchVersion

    Lucene compatibility version - See ClassicAnalyzer

    | Improve this Doc View Source

    ClassicAnalyzer(LuceneVersion, CharArraySet)

    Builds an analyzer with the given stop words.

    Declaration
    public ClassicAnalyzer(LuceneVersion matchVersion, CharArraySet stopWords)
    Parameters
    Type Name Description
    LuceneVersion matchVersion

    Lucene compatibility version - See ClassicAnalyzer

    CharArraySet stopWords

    stop words

    | Improve this Doc View Source

    ClassicAnalyzer(LuceneVersion, TextReader)

    Builds an analyzer with the stop words from the given reader.

    Declaration
    public ClassicAnalyzer(LuceneVersion matchVersion, TextReader stopwords)
    Parameters
    Type Name Description
    LuceneVersion matchVersion

    Lucene compatibility version - See ClassicAnalyzer

    TextReader stopwords

    to read stop words from

    See Also
    GetWordSet(TextReader, LuceneVersion)

    Fields

    | Improve this Doc View Source

    DEFAULT_MAX_TOKEN_LENGTH

    Default maximum allowed token length

    Declaration
    public const int DEFAULT_MAX_TOKEN_LENGTH = null
    Field Value
    Type Description
    System.Int32
    | Improve this Doc View Source

    STOP_WORDS_SET

    An unmodifiable set containing some common English words that are usually not useful for searching.

    Declaration
    public static readonly CharArraySet STOP_WORDS_SET
    Field Value
    Type Description
    CharArraySet

    Properties

    | Improve this Doc View Source

    MaxTokenLength

    Gets or sets maximum allowed token length. If a token is seen that exceeds this length then it is discarded. This setting only takes effect the next time tokenStream or tokenStream is called.

    Declaration
    public int MaxTokenLength { get; set; }
    Property Value
    Type Description
    System.Int32

    Methods

    | Improve this Doc View Source

    CreateComponents(String, TextReader)

    Declaration
    protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)
    Parameters
    Type Name Description
    System.String fieldName
    TextReader reader
    Returns
    Type Description
    TokenStreamComponents
    • Improve this Doc
    • View Source
    Back to top Copyright © 2020 Licensed to the Apache Software Foundation (ASF)