Class BrazilianAnalyzer
Lucene.Net.Analysis.Analyzer for Brazilian Portuguese language.
Supports an external list of stopwords (words that will not be indexed at all) and an external list of exclusions (words that will not be stemmed, but indexed).
NOTE: This class uses the same Lucene.Net.Util.LuceneVersion dependent settings as StandardAnalyzer.
Implements
Inherited Members
Namespace: Lucene.Net.Analysis.Br
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public sealed class BrazilianAnalyzer : StopwordAnalyzerBase, IDisposable
Constructors
BrazilianAnalyzer(LuceneVersion)
Builds an analyzer with the default stop words (DefaultStopSet).
Declaration
public BrazilianAnalyzer(LuceneVersion matchVersion)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion |
BrazilianAnalyzer(LuceneVersion, CharArraySet)
Builds an analyzer with the given stop words
Declaration
public BrazilianAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion | lucene compatibility version |
CharArraySet | stopwords | a stopword set |
BrazilianAnalyzer(LuceneVersion, CharArraySet, CharArraySet)
Builds an analyzer with the given stop words and stemming exclusion words
Declaration
public BrazilianAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords, CharArraySet stemExclusionSet)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion | lucene compatibility version |
CharArraySet | stopwords | a stopword set |
CharArraySet | stemExclusionSet | a set of terms not to be stemmed |
Fields
DEFAULT_STOPWORD_FILE
File containing default Brazilian Portuguese stopwords.
Declaration
public const string DEFAULT_STOPWORD_FILE = "stopwords.txt"
Field Value
Type | Description |
---|---|
string |
Properties
DefaultStopSet
Returns an unmodifiable instance of the default stop-words set.
Declaration
public static CharArraySet DefaultStopSet { get; }
Property Value
Type | Description |
---|---|
CharArraySet | an unmodifiable instance of the default stop-words set. |
Methods
CreateComponents(string, TextReader)
Creates Lucene.Net.Analysis.TokenStreamComponents used to tokenize all the text in the provided TextReader.
Declaration
protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)
Parameters
Type | Name | Description |
---|---|---|
string | fieldName | |
TextReader | reader |
Returns
Type | Description |
---|---|
TokenStreamComponents | Lucene.Net.Analysis.TokenStreamComponents built from a StandardTokenizer filtered with LowerCaseFilter, StandardFilter, StopFilter, and BrazilianStemFilter. |