Namespace Lucene.Net.Analysis.Ru
Analyzer for Russian.
Classes
RussianAnalyzer
Lucene.
Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified.
You must specify the required Lucene.
- As of 3.1, Standard
Tokenizer is used, Snowball stemming is done with SnowballFilter , and Snowball stopwords are used by default.
RussianLetterTokenizer
A Russian
You must specify the required Lucene.
- As of 3.1, Char
Tokenizer uses an int based API to normalize and detect token characters. See IsToken and Normalize(Int32) for details.Char(Int32)
RussianLetterTokenizerFactory
RussianLightStemFilter
A Lucene.
To prevent terms from being stemmed use an instance of
Set
RussianLightStemFilterFactory
Factory for Russian
<fieldType name="text_rulgtstem" class="solr.TextField" positionIncrementGap="100">
<analyzer>
<tokenizer class="solr.StandardTokenizerFactory"/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.RussianLightStemFilterFactory"/>
</analyzer>
</fieldType>
RussianLightStemmer
Light Stemmer for Russian.
This stemmer implements the following algorithm:
Indexing and Searching Strategies for the Russian Language.
Ljiljana Dolamic and Jacques Savoy.