Namespace Lucene.Net.Analysis.El
Analyzer for Greek.
Classes
GreekAnalyzer
Lucene.Net.Analysis.Analyzer for the Greek language.
Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified.
You must specify the required Lucene.Net.Util.LuceneVersion compatibility when creating GreekAnalyzer:
- As of 3.1, StandardFilter and GreekStemmer are used by default.
- As of 2.9, StopFilter preserves position increments.
NOTE: This class uses the same Lucene.Net.Util.LuceneVersion-dependent settings as StandardAnalyzer.
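The following is a minimal sketch of constructing GreekAnalyzer, once with the default stopwords and once with an external list. It assumes Lucene.NET 4.8 and the LuceneVersion.LUCENE_48 compatibility level. The class GreekAnalyzerSketch, its method names, and the field name "body" are hypothetical and exist only for illustration.

```csharp
using System.Collections.Generic;
using System.IO;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.El;
using Lucene.Net.Analysis.TokenAttributes;
using Lucene.Net.Analysis.Util;
using Lucene.Net.Util;

public static class GreekAnalyzerSketch
{
    // Assumption: the 4.8 compatibility level is the one you want.
    private const LuceneVersion MatchVersion = LuceneVersion.LUCENE_48;

    // Uses the default Greek stopword set.
    public static Analyzer CreateDefault()
    {
        return new GreekAnalyzer(MatchVersion);
    }

    // Uses an external stopword list instead of the default set.
    public static Analyzer CreateWithStopwords(params string[] stopwords)
    {
        // Third argument is ignoreCase; false keeps matching exact.
        var stopSet = new CharArraySet(MatchVersion, stopwords, false);
        return new GreekAnalyzer(MatchVersion, stopSet);
    }

    // Hypothetical helper: runs the analyzer over text and collects the emitted terms.
    public static IList<string> Analyze(Analyzer analyzer, string text)
    {
        var terms = new List<string>();
        using (TokenStream stream = analyzer.GetTokenStream("body", new StringReader(text)))
        {
            ICharTermAttribute term = stream.AddAttribute<ICharTermAttribute>();
            stream.Reset();
            while (stream.IncrementToken())
            {
                terms.Add(term.ToString());
            }
            stream.End();
        }
        return terms;
    }
}
```

As far as the analysis chain is concerned, stopwords appear to be matched after lowercasing, so an external list is probably best supplied in lowercase, without diacritics, and with final sigma written as sigma.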
GreekLowerCaseFilter
Normalizes token text to lower case, removes some Greek diacritics, and standardizes final sigma to sigma.
You must specify the required Lucene.Net.Util.LuceneVersion compatibility when creating GreekLowerCaseFilter:
- As of 3.1, supplementary characters are properly lowercased.
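A short sketch of applying GreekLowerCaseFilter on its own may help show the normalization described above. It assumes Lucene.NET 4.8 and uses WhitespaceTokenizer purely as a convenient token source. The class GreekLowerCaseSketch and its Normalize method are hypothetical names.

```csharp
using System.Collections.Generic;
using System.IO;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.Core;
using Lucene.Net.Analysis.El;
using Lucene.Net.Analysis.TokenAttributes;
using Lucene.Net.Util;

public static class GreekLowerCaseSketch
{
    // Splits text on whitespace and returns each token after Greek lowercasing.
    public static IList<string> Normalize(string text)
    {
        // Assumption: the 4.8 compatibility level is the one you want.
        const LuceneVersion matchVersion = LuceneVersion.LUCENE_48;

        var terms = new List<string>();
        Tokenizer source = new WhitespaceTokenizer(matchVersion, new StringReader(text));

        // Disposing the filter also disposes the tokenizer it wraps.
        using (TokenStream stream = new GreekLowerCaseFilter(matchVersion, source))
        {
            ICharTermAttribute term = stream.AddAttribute<ICharTermAttribute>();
            stream.Reset();
            while (stream.IncrementToken())
            {
                terms.Add(term.ToString());
            }
            stream.End();
        }
        return terms;
    }
}
```

With this sketch, Normalize("Οδός") should yield the single term "οδοσ": the capital is lowercased, the tonos is dropped, and the final sigma becomes a plain sigma.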
GreekLowerCaseFilterFactory
Factory for GreekLowerCaseFilter.
<fieldType name="text_glc" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.GreekLowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
GreekStemFilter
A Lucene.Net.Analysis.TokenFilter that applies GreekStemmer to stem Greek words.
To prevent terms from being stemmed, use an instance of SetKeywordMarkerFilter, or a custom Lucene.Net.Analysis.TokenFilter that sets the KeywordAttribute, before this Lucene.Net.Analysis.TokenStream.
NOTE: Input is expected to be casefolded for Greek (including folding of final sigma to sigma), and with diacritics removed. This can be achieved by using either GreekLowerCaseFilter or ICUFoldingFilter before GreekStemFilter.
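The ordering requirements above can be sketched as a custom analysis chain: lowercase first, then mark protected terms, then stem. This assumes Lucene.NET 4.8. The class GreekStemChainSketch and its Create method are hypothetical names, and the protected terms are whatever the caller passes in.

```csharp
using System.IO;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.El;
using Lucene.Net.Analysis.Miscellaneous;
using Lucene.Net.Analysis.Standard;
using Lucene.Net.Analysis.Util;
using Lucene.Net.Util;

public static class GreekStemChainSketch
{
    // Builds an analyzer that stems Greek text but leaves protectedTerms untouched.
    public static Analyzer Create(params string[] protectedTerms)
    {
        // Assumption: the 4.8 compatibility level is the one you want.
        const LuceneVersion matchVersion = LuceneVersion.LUCENE_48;

        // Protected terms are compared against already-lowercased tokens,
        // so they should be given in normalized form (for example "αθηνα").
        var keywords = new CharArraySet(matchVersion, protectedTerms, false);

        return Analyzer.NewAnonymous(createComponents: (fieldName, reader) =>
        {
            Tokenizer source = new StandardTokenizer(matchVersion, reader);
            // 1. Casefold and strip diacritics, as GreekStemFilter expects.
            TokenStream result = new GreekLowerCaseFilter(matchVersion, source);
            // 2. Set KeywordAttribute on protected terms before stemming.
            result = new SetKeywordMarkerFilter(result, keywords);
            // 3. Stem everything that was not marked as a keyword.
            result = new GreekStemFilter(result);
            return new TokenStreamComponents(source, result);
        });
    }
}
```

Placing SetKeywordMarkerFilter after GreekLowerCaseFilter is a design choice in this sketch: it lets the protected list be written once in normalized form, regardless of how the terms are capitalized or accented in the input.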
GreekStemFilterFactory
Factory for GreekStemFilter.
<fieldType name="text_gstem" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.GreekLowerCaseFilterFactory"/>
    <filter class="solr.GreekStemFilterFactory"/>
  </analyzer>
</fieldType>
GreekStemmer
A stemmer for Greek words, according to: Development of a Stemmer for the Greek Language, by Georgios Ntais.
NOTE: Input is expected to be casefolded for Greek (including folding of final sigma to sigma), and with diacritics removed. This can be achieved with either GreekLowerCaseFilter or ICUFoldingFilter.
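For cases where a full token stream is not needed, the stemmer can also be called directly on a character buffer. The sketch below assumes Lucene.NET 4.8 and assumes the caller has already normalized the word as described in the note above. The class GreekStemmerSketch and its StemWord method are hypothetical names.

```csharp
using Lucene.Net.Analysis.El;

public static class GreekStemmerSketch
{
    // Stems a single word that is already lowercased, free of diacritics,
    // and written with sigma in place of final sigma.
    public static string StemWord(string normalizedWord)
    {
        var stemmer = new GreekStemmer();
        char[] buffer = normalizedWord.ToCharArray();

        // Stem works on the buffer and returns the length of the stemmed word.
        int newLength = stemmer.Stem(buffer, buffer.Length);

        return new string(buffer, 0, newLength);
    }
}
```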