Fork me on GitHub
  • API

    Show / Hide Table of Contents

    Namespace Lucene.Net.Analysis.El

    Analyzer for Greek.

    Classes

    GreekAnalyzer

    Lucene.Net.Analysis.Analyzer for the Greek language.

    Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified.

    You must specify the required Lucene.Net.Util.LuceneVersion compatibility when creating GreekAnalyzer:

    • As of 3.1, StandardFilter and GreekStemmer are used by default.
    • As of 2.9, StopFilter preserves position increments

    NOTE: This class uses the same Lucene.Net.Util.LuceneVersion dependent settings as StandardAnalyzer.

    GreekLowerCaseFilter

    Normalizes token text to lower case, removes some Greek diacritics, and standardizes final sigma to sigma.

    You must specify the required Lucene.Net.Util.LuceneVersion compatibility when creating GreekLowerCaseFilter:

    • As of 3.1, supplementary characters are properly lowercased.

    GreekLowerCaseFilterFactory

    Factory for GreekLowerCaseFilter.

    <fieldType name="text_glc" class="solr.TextField" positionIncrementGap="100">
      <analyzer>
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.GreekLowerCaseFilterFactory"/>
      </analyzer>
    </fieldType>

    GreekStemFilter

    A Lucene.Net.Analysis.TokenFilter that applies GreekStemmer to stem Greek words.

    To prevent terms from being stemmed use an instance of SetKeywordMarkerFilter or a custom Lucene.Net.Analysis.TokenFilter that sets the KeywordAttribute before this Lucene.Net.Analysis.TokenStream.

    NOTE: Input is expected to be casefolded for Greek (including folding of final sigma to sigma), and with diacritics removed. This can be achieved by using either GreekLowerCaseFilter or ICUFoldingFilter before GreekStemFilter.

    This is a Lucene.NET EXPERIMENTAL API, use at your own risk

    GreekStemFilterFactory

    Factory for GreekStemFilter.

    <fieldType name="text_gstem" class="solr.TextField" positionIncrementGap="100">
      <analyzer>
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.GreekLowerCaseFilterFactory"/>
        <filter class="solr.GreekStemFilterFactory"/>
      </analyzer>
    </fieldType>

    GreekStemmer

    A stemmer for Greek words, according to: Development of a Stemmer for the Greek Language. Georgios Ntais

    NOTE: Input is expected to be casefolded for Greek (including folding of final sigma to sigma), and with diacritics removed. This can be achieved with either GreekLowerCaseFilter or ICUFoldingFilter.

    This is a Lucene.NET EXPERIMENTAL API, use at your own risk

    • Improve this Doc
    Back to top Copyright © 2020 The Apache Software Foundation, Licensed under the Apache License, Version 2.0
    Apache Lucene.Net, Lucene.Net, Apache, the Apache feather logo, and the Apache Lucene.Net project logo are trademarks of The Apache Software Foundation.
    All other marks mentioned may be trademarks or registered trademarks of their respective owners.