Fork me on GitHub
  • API

    Show / Hide Table of Contents

    Class UkrainianMorfologikAnalyzer

    A dictionary-based Lucene.Net.Analysis.Analyzer for Ukrainian.

    Inheritance
    object
    Analyzer
    StopwordAnalyzerBase
    UkrainianMorfologikAnalyzer
    Implements
    IDisposable
    Inherited Members
    StopwordAnalyzerBase.StopwordSet
    Analyzer.NewAnonymous(Func<string, TextReader, TokenStreamComponents>)
    Analyzer.NewAnonymous(Func<string, TextReader, TokenStreamComponents>, ReuseStrategy)
    Analyzer.NewAnonymous(Func<string, TextReader, TokenStreamComponents>, Func<string, TextReader, TextReader>)
    Analyzer.NewAnonymous(Func<string, TextReader, TokenStreamComponents>, Func<string, TextReader, TextReader>, ReuseStrategy)
    Analyzer.GetTokenStream(string, TextReader)
    Analyzer.GetTokenStream(string, string)
    Analyzer.GetPositionIncrementGap(string)
    Analyzer.GetOffsetGap(string)
    Analyzer.Strategy
    Analyzer.Dispose()
    Analyzer.GLOBAL_REUSE_STRATEGY
    Analyzer.PER_FIELD_REUSE_STRATEGY
    object.Equals(object)
    object.Equals(object, object)
    object.GetHashCode()
    object.GetType()
    object.ReferenceEquals(object, object)
    object.ToString()
    Namespace: Lucene.Net.Analysis.Uk
    Assembly: Lucene.Net.Analysis.Morfologik.dll
    Syntax
    public sealed class UkrainianMorfologikAnalyzer : StopwordAnalyzerBase, IDisposable

    Constructors

    UkrainianMorfologikAnalyzer(LuceneVersion)

    Builds an analyzer with the default stop words: DEFAULT_STOPWORD_FILE.

    Declaration
    public UkrainianMorfologikAnalyzer(LuceneVersion matchVersion)
    Parameters
    Type Name Description
    LuceneVersion matchVersion

    Lucene.Net.Util.LuceneVersion to match.

    UkrainianMorfologikAnalyzer(LuceneVersion, CharArraySet)

    Builds an analyzer with the given stop words.

    Declaration
    public UkrainianMorfologikAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords)
    Parameters
    Type Name Description
    LuceneVersion matchVersion

    Lucene.Net.Util.LuceneVersion to match.

    CharArraySet stopwords

    A stopword set.

    UkrainianMorfologikAnalyzer(LuceneVersion, CharArraySet, CharArraySet)

    Builds an analyzer with the given stop words. If a non-empty stem exclusion set is provided this analyzer will add a Lucene.Net.Analysis.Miscellaneous.SetKeywordMarkerFilter before stemming.

    Declaration
    public UkrainianMorfologikAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords, CharArraySet stemExclusionSet)
    Parameters
    Type Name Description
    LuceneVersion matchVersion

    Lucene.Net.Util.LuceneVersion to match.

    CharArraySet stopwords

    A stopword set.

    CharArraySet stemExclusionSet

    A set of terms not to be stemmed.

    Fields

    DEFAULT_STOPWORD_FILE

    File containing default Ukrainian stopwords.

    Declaration
    public const string DEFAULT_STOPWORD_FILE = "stopwords.txt"
    Field Value
    Type Description
    string

    Properties

    DefaultStopSet

    Returns an unmodifiable instance of the default stop words set.

    Declaration
    public static CharArraySet DefaultStopSet { get; }
    Property Value
    Type Description
    CharArraySet

    Default stop words set.

    Methods

    CreateComponents(string, TextReader)

    Creates a Lucene.Net.Analysis.TokenStreamComponents which tokenizes all the text in the provided TextReader.

    Declaration
    protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)
    Parameters
    Type Name Description
    string fieldName
    TextReader reader
    Returns
    Type Description
    TokenStreamComponents

    A Lucene.Net.Analysis.TokenStreamComponents built from a Lucene.Net.Analysis.Standard.StandardTokenizer filtered with Lucene.Net.Analysis.Core.LowerCaseFilter, Lucene.Net.Analysis.Core.StopFilter, Lucene.Net.Analysis.Miscellaneous.SetKeywordMarkerFilter if a stem exclusion set is provided and MorfologikFilter on the Ukrainian dictionary.

    Overrides
    Analyzer.CreateComponents(string, TextReader)

    InitReader(string, TextReader)

    Override this if you want to add a Lucene.Net.Analysis.CharFilter chain.

    The default implementation returns reader unchanged.
    Declaration
    protected override TextReader InitReader(string fieldName, TextReader reader)
    Parameters
    Type Name Description
    string fieldName

    Lucene.Net.Index.IIndexableField name being indexed

    TextReader reader

    original TextReader

    Returns
    Type Description
    TextReader

    reader, optionally decorated with Lucene.Net.Analysis.CharFilter(s)

    Overrides
    Analyzer.InitReader(string, TextReader)

    Implements

    IDisposable
    Back to top Copyright © 2024 The Apache Software Foundation, Licensed under the Apache License, Version 2.0
    Apache Lucene.Net, Lucene.Net, Apache, the Apache feather logo, and the Apache Lucene.Net project logo are trademarks of The Apache Software Foundation.
    All other marks mentioned may be trademarks or registered trademarks of their respective owners.