Fork me on GitHub
  • API

    Show / Hide Table of Contents

    Class GreekAnalyzer

    Lucene.Net.Analysis.Analyzer for the Greek language.

    Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified.

    You must specify the required Lucene.Net.Util.LuceneVersion compatibility when creating GreekAnalyzer:

    • As of 3.1, StandardFilter and GreekStemmer are used by default.
    • As of 2.9, StopFilter preserves position increments

    NOTE: This class uses the same Lucene.Net.Util.LuceneVersion dependent settings as StandardAnalyzer.

    Inheritance
    object
    Analyzer
    StopwordAnalyzerBase
    GreekAnalyzer
    Implements
    IDisposable
    Inherited Members
    StopwordAnalyzerBase.StopwordSet
    Analyzer.NewAnonymous(Func<string, TextReader, TokenStreamComponents>)
    Analyzer.NewAnonymous(Func<string, TextReader, TokenStreamComponents>, ReuseStrategy)
    Analyzer.NewAnonymous(Func<string, TextReader, TokenStreamComponents>, Func<string, TextReader, TextReader>)
    Analyzer.NewAnonymous(Func<string, TextReader, TokenStreamComponents>, Func<string, TextReader, TextReader>, ReuseStrategy)
    Analyzer.GetTokenStream(string, TextReader)
    Analyzer.GetTokenStream(string, string)
    Analyzer.GetPositionIncrementGap(string)
    Analyzer.GetOffsetGap(string)
    Analyzer.Strategy
    Analyzer.Dispose()
    Analyzer.GLOBAL_REUSE_STRATEGY
    Analyzer.PER_FIELD_REUSE_STRATEGY
    object.Equals(object)
    object.Equals(object, object)
    object.GetHashCode()
    object.GetType()
    object.ReferenceEquals(object, object)
    object.ToString()
    Namespace: Lucene.Net.Analysis.El
    Assembly: Lucene.Net.Analysis.Common.dll
    Syntax
    public sealed class GreekAnalyzer : StopwordAnalyzerBase, IDisposable

    Constructors

    GreekAnalyzer(LuceneVersion)

    Builds an analyzer with the default stop words.

    Declaration
    public GreekAnalyzer(LuceneVersion matchVersion)
    Parameters
    Type Name Description
    LuceneVersion matchVersion

    Lucene compatibility version, See Lucene.Net.Util.LuceneVersion

    GreekAnalyzer(LuceneVersion, CharArraySet)

    Builds an analyzer with the given stop words.

    NOTE: The stopwords set should be pre-processed with the logic of GreekLowerCaseFilter for best results.

    Declaration
    public GreekAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords)
    Parameters
    Type Name Description
    LuceneVersion matchVersion

    Lucene compatibility version, See Lucene.Net.Util.LuceneVersion

    CharArraySet stopwords

    a stopword set

    Fields

    DEFAULT_STOPWORD_FILE

    File containing default Greek stopwords.

    Declaration
    public const string DEFAULT_STOPWORD_FILE = "stopwords.txt"
    Field Value
    Type Description
    string

    Properties

    DefaultStopSet

    Returns a set of default Greek-stopwords

    Declaration
    public static CharArraySet DefaultStopSet { get; }
    Property Value
    Type Description
    CharArraySet

    a set of default Greek-stopwords

    Methods

    CreateComponents(string, TextReader)

    Creates Lucene.Net.Analysis.TokenStreamComponents used to tokenize all the text in the provided TextReader.

    Declaration
    protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)
    Parameters
    Type Name Description
    string fieldName
    TextReader reader
    Returns
    Type Description
    TokenStreamComponents

    Lucene.Net.Analysis.TokenStreamComponents built from a StandardTokenizer filtered with GreekLowerCaseFilter, StandardFilter, StopFilter, and GreekStemFilter

    Overrides
    Analyzer.CreateComponents(string, TextReader)

    Implements

    IDisposable
    Back to top Copyright © 2024 The Apache Software Foundation, Licensed under the Apache License, Version 2.0
    Apache Lucene.Net, Lucene.Net, Apache, the Apache feather logo, and the Apache Lucene.Net project logo are trademarks of The Apache Software Foundation.
    All other marks mentioned may be trademarks or registered trademarks of their respective owners.