Namespace Lucene.Net.Analysis.El
Analyzer for Greek.
Classes
GreekAnalyzer
Analyzer for the Greek language.
Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified.
You must specify the required LuceneVersion compatibility when creating GreekAnalyzer:
- As of 3.1, StandardFilter and GreekStemmer are used by default.
- As of 2.9, StopFilter preserves position increments.
NOTE: This class uses the same LuceneVersion-dependent settings as StandardAnalyzer.
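The stopword handling and the position-increment behaviour described above can be illustrated with a small, library-independent sketch. It is not the Lucene.Net implementation: the function name `stop_filter`, the tuple output, and the three sample stopwords are assumptions made for illustration only, and the sample list is not the analyzer's default stopword set.

```python
from typing import Iterable, Iterator, Set, Tuple


def stop_filter(tokens: Iterable[str], stopwords: Set[str]) -> Iterator[Tuple[str, int]]:
    """Drop stopwords and report a position increment for each kept token.

    The increment is 1 plus the number of stopwords skipped since the
    previously emitted token, so the gaps left by removed words stay visible.
    """
    skipped = 0
    for token in tokens:
        if token in stopwords:
            skipped += 1  # remember the gap instead of discarding it
            continue
        yield token, 1 + skipped
        skipped = 0


if __name__ == "__main__":
    # Hypothetical stopword list and already-normalized input tokens.
    sample_stopwords = {"ο", "και", "η"}
    sample_tokens = ["ο", "σκυλοσ", "και", "η", "γατα"]
    for term, increment in stop_filter(sample_tokens, sample_stopwords):
        print(term, increment)
```

Keeping the increments lets phrase and proximity queries account for the removed words rather than treating the surviving tokens as adjacent.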
GreekLowerCaseFilter
Normalizes token text to lower case, removes some Greek diacritics, and standardizes final sigma to sigma.
You must specify the required LuceneVersion compatibility when creating GreekLowerCaseFilter:
- As of 3.1, supplementary characters are properly lowercased.
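The normalization performed by GreekLowerCaseFilter can be approximated outside the library as follows. This is a rough sketch under stated assumptions: the helper name `fold_greek` is hypothetical, and it strips every combining mark via Unicode decomposition, whereas the filter itself removes only some Greek diacritics.

```python
import unicodedata


def fold_greek(text: str) -> str:
    """Approximate Greek case folding: lower case, no accents, no final sigma."""
    lowered = text.lower()
    # Decompose accented letters into base letter + combining marks.
    decomposed = unicodedata.normalize("NFD", lowered)
    # Drop the combining marks (Unicode category Mn), keeping the base letters.
    stripped = "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")
    # Standardize final sigma (U+03C2) to regular sigma (U+03C3).
    return stripped.replace("\u03c2", "\u03c3")


if __name__ == "__main__":
    print(fold_greek("Όμορφος"))
```

Folding final sigma means that a word matches regardless of whether the sigma ended up word-final after tokenization.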
GreekLowerCaseFilterFactory
Factory for GreekLowerCaseFilter.
<fieldType name="text_glc" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.GreekLowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
GreekStemFilter
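A TokenFilter that applies GreekStemmer to stem Greek words.
To prevent terms from being stemmed, use an instance of SetKeywordMarkerFilter or a custom TokenFilter that sets the KeywordAttribute before this TokenStream.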
NOTE: Input is expected to be casefolded for Greek (including folding of final sigma to sigma), and with diacritics removed. This can be achieved by using either GreekLowerCaseFilter or ICUFoldingFilter before GreekStemFilter.
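The idea of protecting selected terms from stemming can be sketched independently of the library. Everything in this example is hypothetical: `stem_unless_protected` and `toy_stem` are illustrative names, and `toy_stem` simply truncates a term to four characters as a stand-in for a stemmer. It does not implement the GreekStemmer algorithm.

```python
from typing import Callable, Iterable, List, Set


def toy_stem(term: str) -> str:
    """Placeholder stemmer: keeps the first four characters (not real Greek stemming)."""
    return term[:4]


def stem_unless_protected(
    tokens: Iterable[str],
    protected: Set[str],
    stem: Callable[[str], str],
) -> List[str]:
    """Apply `stem` to every token except those marked as protected keywords."""
    return [token if token in protected else stem(token) for token in tokens]


if __name__ == "__main__":
    # Tokens are assumed to be casefolded already, as the stemmer expects.
    tokens = ["ανθρωποσ", "αθηνα"]
    print(stem_unless_protected(tokens, {"αθηνα"}, toy_stem))
```

In an analysis chain the protection step has to run before the stemming step, which is why the keyword marker is placed ahead of the stem filter.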
GreekStemFilterFactory
Factory for GreekStemFilter.
<fieldType name="text_gstem" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.GreekLowerCaseFilterFactory"/>
    <filter class="solr.GreekStemFilterFactory"/>
  </analyzer>
</fieldType>
GreekStemmer
A stemmer for Greek words, according to: Development of a Stemmer for the Greek Language, by Georgios Ntais.
NOTE: Input is expected to be casefolded for Greek (including folding of final sigma to sigma), and with diacritics removed. This can be achieved with either GreekLowerCaseFilter or ICUFoldingFilter.