Show / Hide Table of Contents

    Class BulgarianAnalyzer

    Analyzer for Bulgarian.

    This analyzer implements light-stemming as specified by: Searching Strategies for the Bulgarian Language http://members.unine.ch/jacques.savoy/Papers/BUIR.pdf

    Inheritance
    System.Object
    Analyzer
    StopwordAnalyzerBase
    BulgarianAnalyzer
    Inherited Members
    StopwordAnalyzerBase.m_stopwords
    StopwordAnalyzerBase.m_matchVersion
    StopwordAnalyzerBase.StopwordSet
    StopwordAnalyzerBase.LoadStopwordSet(Boolean, Type, String, String)
    StopwordAnalyzerBase.LoadStopwordSet(FileInfo, LuceneVersion)
    StopwordAnalyzerBase.LoadStopwordSet(TextReader, LuceneVersion)
    Lucene.Net.Analysis.Analyzer.NewAnonymous(Func<, , >)
    Lucene.Net.Analysis.Analyzer.NewAnonymous(Func<, , >, Lucene.Net.Analysis.ReuseStrategy)
    Lucene.Net.Analysis.Analyzer.NewAnonymous(Func<, , >, Func<, , >)
    Lucene.Net.Analysis.Analyzer.NewAnonymous(Func<, , >, Func<, , >, Lucene.Net.Analysis.ReuseStrategy)
    Analyzer.GetTokenStream(String, TextReader)
    Analyzer.GetTokenStream(String, String)
    Analyzer.InitReader(String, TextReader)
    Analyzer.GetPositionIncrementGap(String)
    Analyzer.GetOffsetGap(String)
    Analyzer.Strategy
    Analyzer.Dispose()
    Analyzer.Dispose(Boolean)
    Analyzer.GLOBAL_REUSE_STRATEGY
    Analyzer.PER_FIELD_REUSE_STRATEGY
    Namespace: Lucene.Net.Analysis.Bg
    Assembly: Lucene.Net.Analysis.Common.dll
    Syntax
    public sealed class BulgarianAnalyzer : StopwordAnalyzerBase

    Constructors

    | Improve this Doc View Source

    BulgarianAnalyzer(LuceneVersion)

    Builds an analyzer with the default stop words: DEFAULT_STOPWORD_FILE.

    Declaration
    public BulgarianAnalyzer(LuceneVersion matchVersion)
    Parameters
    Type Name Description
    LuceneVersion matchVersion
    | Improve this Doc View Source

    BulgarianAnalyzer(LuceneVersion, CharArraySet)

    Builds an analyzer with the given stop words.

    Declaration
    public BulgarianAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords)
    Parameters
    Type Name Description
    LuceneVersion matchVersion
    CharArraySet stopwords
    | Improve this Doc View Source

    BulgarianAnalyzer(LuceneVersion, CharArraySet, CharArraySet)

    Builds an analyzer with the given stop words and a stem exclusion set. If a stem exclusion set is provided this analyzer will add a SetKeywordMarkerFilter before BulgarianStemFilter.

    Declaration
    public BulgarianAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords, CharArraySet stemExclusionSet)
    Parameters
    Type Name Description
    LuceneVersion matchVersion
    CharArraySet stopwords
    CharArraySet stemExclusionSet

    Fields

    | Improve this Doc View Source

    DEFAULT_STOPWORD_FILE

    File containing default Bulgarian stopwords.

    Default stopword list is from http://members.unine.ch/jacques.savoy/clef/index.html The stopword list is BSD-Licensed.

    Declaration
    public const string DEFAULT_STOPWORD_FILE = null
    Field Value
    Type Description
    System.String

    Properties

    | Improve this Doc View Source

    DefaultStopSet

    Returns an unmodifiable instance of the default stop-words set.

    Declaration
    public static CharArraySet DefaultStopSet { get; }
    Property Value
    Type Description
    CharArraySet

    an unmodifiable instance of the default stop-words set.

    Methods

    | Improve this Doc View Source

    CreateComponents(String, TextReader)

    Creates a TokenStreamComponents which tokenizes all the text in the provided .

    Declaration
    protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)
    Parameters
    Type Name Description
    System.String fieldName
    TextReader reader
    Returns
    Type Description
    TokenStreamComponents

    A TokenStreamComponents built from an StandardTokenizer filtered with StandardFilter, LowerCaseFilter, StopFilter, SetKeywordMarkerFilter if a stem exclusion set is provided and BulgarianStemFilter.

    • Improve this Doc
    • View Source
    Back to top Copyright © 2020 Licensed to the Apache Software Foundation (ASF)