Class BulgarianAnalyzer
Lucene.Net.Analysis.Analyzer for Bulgarian.
This analyzer implements light-stemming as specified by: Searching Strategies for the Bulgarian Language http://members.unine.ch/jacques.savoy/Papers/BUIR.pdf
Implements
Inherited Members
Namespace: Lucene.Net.Analysis.Bg
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public sealed class BulgarianAnalyzer : StopwordAnalyzerBase, IDisposableConstructors
| Improve this Doc View SourceBulgarianAnalyzer(LuceneVersion)
Builds an analyzer with the default stop words: DEFAULT_STOPWORD_FILE.
Declaration
public BulgarianAnalyzer(LuceneVersion matchVersion)Parameters
| Type | Name | Description | 
|---|---|---|
| Lucene.Net.Util.LuceneVersion | matchVersion | 
BulgarianAnalyzer(LuceneVersion, CharArraySet)
Builds an analyzer with the given stop words.
Declaration
public BulgarianAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords)Parameters
| Type | Name | Description | 
|---|---|---|
| Lucene.Net.Util.LuceneVersion | matchVersion | |
| CharArraySet | stopwords | 
BulgarianAnalyzer(LuceneVersion, CharArraySet, CharArraySet)
Builds an analyzer with the given stop words and a stem exclusion set. If a stem exclusion set is provided this analyzer will add a SetKeywordMarkerFilter before BulgarianStemFilter.
Declaration
public BulgarianAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords, CharArraySet stemExclusionSet)Parameters
| Type | Name | Description | 
|---|---|---|
| Lucene.Net.Util.LuceneVersion | matchVersion | |
| CharArraySet | stopwords | |
| CharArraySet | stemExclusionSet | 
Fields
| Improve this Doc View SourceDEFAULT_STOPWORD_FILE
File containing default Bulgarian stopwords.
Default stopword list is from http://members.unine.ch/jacques.savoy/clef/index.html The stopword list is BSD-Licensed.
Declaration
public const string DEFAULT_STOPWORD_FILE = "stopwords.txt"Field Value
| Type | Description | 
|---|---|
| System.String | 
Properties
| Improve this Doc View SourceDefaultStopSet
Returns an unmodifiable instance of the default stop-words set.
Declaration
public static CharArraySet DefaultStopSet { get; }Property Value
| Type | Description | 
|---|---|
| CharArraySet | an unmodifiable instance of the default stop-words set. | 
Methods
| Improve this Doc View SourceCreateComponents(String, TextReader)
Creates a Lucene.Net.Analysis.TokenStreamComponents which tokenizes all the text in the provided System.IO.TextReader.
Declaration
protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)Parameters
| Type | Name | Description | 
|---|---|---|
| System.String | fieldName | |
| System.IO.TextReader | reader | 
Returns
| Type | Description | 
|---|---|
| Lucene.Net.Analysis.TokenStreamComponents | A Lucene.Net.Analysis.TokenStreamComponents built from an StandardTokenizer filtered with StandardFilter, LowerCaseFilter, StopFilter, SetKeywordMarkerFilter if a stem exclusion set is provided and BulgarianStemFilter. |