Class RussianAnalyzer
Analyzer for the Russian language.
Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified.
You must specify the required LuceneVersion compatibility when creating RussianAnalyzer:
- As of 3.1, StandardTokenizer is used, Snowball stemming is done with SnowballFilter, and Snowball stopwords are used by default.
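As a minimal sketch of the analysis chain described above, the following program runs a short Russian phrase through the analyzer and prints each resulting term. The compatibility version `LuceneVersion.LUCENE_48`, the field name `"body"`, and the sample text are assumptions chosen for illustration; adjust them to match your index.

```csharp
using System;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.Ru;
using Lucene.Net.Analysis.TokenAttributes;
using Lucene.Net.Util;

public static class RussianAnalyzerDemo
{
    public static void Main()
    {
        // Assumption: LUCENE_48 is the compatibility version in use.
        using (var analyzer = new RussianAnalyzer(LuceneVersion.LUCENE_48))
        // "body" is a hypothetical field name; the text is sample input.
        using (TokenStream stream = analyzer.GetTokenStream("body", "Вместе с тем о силе электромагнитной энергии"))
        {
            ICharTermAttribute term = stream.AddAttribute<ICharTermAttribute>();

            // The TokenStream contract requires Reset() before the first IncrementToken().
            stream.Reset();
            while (stream.IncrementToken())
            {
                // Each term has been lowercased, stop-filtered, and stemmed.
                Console.WriteLine(term.ToString());
            }
            stream.End();
        }
    }
}
```

Stopwords from the default set should be absent from the output, and the remaining terms should appear in their stemmed form.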
Inherited Members
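Lucene.Net.Analysis.Analyzer.NewAnonymous(Func<String, TextReader, TokenStreamComponents>)
Lucene.Net.Analysis.Analyzer.NewAnonymous(Func<String, TextReader, TokenStreamComponents>, Lucene.Net.Analysis.ReuseStrategy)
Lucene.Net.Analysis.Analyzer.NewAnonymous(Func<String, TextReader, TokenStreamComponents>, Func<String, TextReader, TextReader>)
Lucene.Net.Analysis.Analyzer.NewAnonymous(Func<String, TextReader, TokenStreamComponents>, Func<String, TextReader, TextReader>, Lucene.Net.Analysis.ReuseStrategy)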
Namespace: Lucene.Net.Analysis.Ru
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public sealed class RussianAnalyzer : StopwordAnalyzerBase
Constructors
RussianAnalyzer(LuceneVersion)
Declaration
public RussianAnalyzer(LuceneVersion matchVersion)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion | |
RussianAnalyzer(LuceneVersion, CharArraySet)
Builds an analyzer with the given stop words.
Declaration
public RussianAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion | Lucene compatibility version |
CharArraySet | stopwords | a stopword set |
RussianAnalyzer(LuceneVersion, CharArraySet, CharArraySet)
Builds an analyzer with the given stop words and a set of words that are excluded from stemming.
Declaration
public RussianAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords, CharArraySet stemExclusionSet)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion | Lucene compatibility version |
CharArraySet | stopwords | a stopword set |
CharArraySet | stemExclusionSet | a set of words not to be stemmed |
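The three-argument constructor can be exercised as in the sketch below. The factory class, the specific stopwords, and the excluded word are hypothetical values picked for illustration, and the `CharArraySet` constructor taking a version, a string collection, and an ignore-case flag is assumed to be available in the Lucene.NET version in use.

```csharp
using Lucene.Net.Analysis.Ru;
using Lucene.Net.Analysis.Util;
using Lucene.Net.Util;

// Hypothetical helper class, not part of Lucene.NET.
public static class CustomRussianAnalyzerFactory
{
    public static RussianAnalyzer Create()
    {
        // Assumption: LUCENE_48 is the compatibility version in use.
        var version = LuceneVersion.LUCENE_48;

        // Illustrative stopwords; these replace the default set entirely.
        var stopwords = new CharArraySet(version, new[] { "и", "в", "на" }, true);

        // Words listed here are marked as keywords and skip the stemmer.
        var stemExclusions = new CharArraySet(version, new[] { "москва" }, true);

        return new RussianAnalyzer(version, stopwords, stemExclusions);
    }
}
```

Because the supplied stopword set replaces the default one, a caller who only wants to protect a few words from stemming would typically pass `RussianAnalyzer.DefaultStopSet` as the second argument instead.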
Fields
DEFAULT_STOPWORD_FILE
File containing default Russian stopwords.
Declaration
public const string DEFAULT_STOPWORD_FILE = null
Field Value
Type | Description |
---|---|
System.String | |
Properties
DefaultStopSet
Returns an unmodifiable instance of the default stop-words set.
Declaration
public static CharArraySet DefaultStopSet { get; }
Property Value
Type | Description |
---|---|
CharArraySet | an unmodifiable instance of the default stop-words set. |
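A short sketch of inspecting the default set follows. The class name and the probe word `"и"` are assumptions for illustration; the sketch only reads from the set, since the returned instance is unmodifiable.

```csharp
using System;
using Lucene.Net.Analysis.Ru;
using Lucene.Net.Analysis.Util;

// Hypothetical demo class, not part of Lucene.NET.
public static class DefaultStopSetDemo
{
    public static void Main()
    {
        CharArraySet defaults = RussianAnalyzer.DefaultStopSet;

        // Report how many stopwords ship with the analyzer.
        Console.WriteLine("Default stopword count: " + defaults.Count);

        // Probe for a common Russian conjunction (illustrative choice).
        Console.WriteLine("Contains \"и\": " + defaults.Contains("и"));
    }
}
```

To extend the defaults rather than replace them, copy the entries into a new `CharArraySet` and add to the copy; attempting to modify the returned instance directly is expected to fail.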
Methods
CreateComponents(String, TextReader)
Creates TokenStreamComponents used to tokenize all the text in the provided TextReader.
Declaration
protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)
Parameters
Type | Name | Description |
---|---|---|
System.String | fieldName | |
TextReader | reader | |
Returns
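Type | Description |
---|---|
TokenStreamComponents | TokenStreamComponents built from a StandardTokenizer filtered with StandardFilter, LowerCaseFilter, StopFilter, SetKeywordMarkerFilter if a stem exclusion set is provided, and SnowballFilter |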