Class RussianAnalyzer
Analyzer for Russian language.
Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified.
You must specify the required LuceneVersion compatibility when creating RussianAnalyzer:
- As of 3.1, StandardTokenizer is used, Snowball stemming is done with SnowballFilter, and Snowball stopwords are used by default.
 
Implements
System.IDisposable
  Inherited Members
      System.Object.Equals(System.Object)
    
    
      System.Object.Equals(System.Object, System.Object)
    
    
      System.Object.GetHashCode()
    
    
      System.Object.GetType()
    
    
      System.Object.MemberwiseClone()
    
    
      System.Object.ReferenceEquals(System.Object, System.Object)
    
    
      System.Object.ToString()
    
  Namespace: Lucene.Net.Analysis.Ru
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public sealed class RussianAnalyzer : StopwordAnalyzerBase, IDisposable
  Constructors
| Improve this Doc View SourceRussianAnalyzer(LuceneVersion)
Declaration
public RussianAnalyzer(LuceneVersion matchVersion)
  Parameters
| Type | Name | Description | 
|---|---|---|
| LuceneVersion | matchVersion | 
RussianAnalyzer(LuceneVersion, CharArraySet)
Builds an analyzer with the given stop words
Declaration
public RussianAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords)
  Parameters
| Type | Name | Description | 
|---|---|---|
| LuceneVersion | matchVersion | lucene compatibility version  | 
      
| CharArraySet | stopwords | a stopword set  | 
      
RussianAnalyzer(LuceneVersion, CharArraySet, CharArraySet)
Builds an analyzer with the given stop words
Declaration
public RussianAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords, CharArraySet stemExclusionSet)
  Parameters
| Type | Name | Description | 
|---|---|---|
| LuceneVersion | matchVersion | lucene compatibility version  | 
      
| CharArraySet | stopwords | a stopword set  | 
      
| CharArraySet | stemExclusionSet | a set of words not to be stemmed  | 
      
Fields
| Improve this Doc View SourceDEFAULT_STOPWORD_FILE
File containing default Russian stopwords.
Declaration
public const string DEFAULT_STOPWORD_FILE = "russian_stop.txt"
  Field Value
| Type | Description | 
|---|---|
| System.String | 
Properties
| Improve this Doc View SourceDefaultStopSet
Returns an unmodifiable instance of the default stop-words set.
Declaration
public static CharArraySet DefaultStopSet { get; }
  Property Value
| Type | Description | 
|---|---|
| CharArraySet | an unmodifiable instance of the default stop-words set.  | 
      
Methods
| Improve this Doc View SourceCreateComponents(String, TextReader)
Creates TokenStreamComponents used to tokenize all the text in the provided System.IO.TextReader.
Declaration
protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)
  Parameters
| Type | Name | Description | 
|---|---|---|
| System.String | fieldName | |
| System.IO.TextReader | reader | 
Returns
| Type | Description | 
|---|---|
| TokenStreamComponents | TokenStreamComponents built from a StandardTokenizer filtered with StandardFilter, LowerCaseFilter, StopFilter , SetKeywordMarkerFilter if a stem exclusion set is provided, and SnowballFilter | 
Overrides
Implements
      System.IDisposable