Class PortugueseAnalyzer
Lucene.Net.Analysis.Analyzer for Portuguese.
You must specify the required Lucene.Net.Util.LuceneVersion compatibility when creating PortugueseAnalyzer:
- As of 3.6, PortugueseLightStemFilter is used for less aggressive stemming.
Implements
Inherited Members
Namespace: Lucene.Net.Analysis.Pt
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public sealed class PortugueseAnalyzer : StopwordAnalyzerBase, IDisposable
Constructors
| Improve this Doc View SourcePortugueseAnalyzer(LuceneVersion)
Builds an analyzer with the default stop words: DEFAULT_STOPWORD_FILE.
Declaration
public PortugueseAnalyzer(LuceneVersion matchVersion)
Parameters
Type | Name | Description |
---|---|---|
Lucene.Net.Util.LuceneVersion | matchVersion |
PortugueseAnalyzer(LuceneVersion, CharArraySet)
Builds an analyzer with the given stop words.
Declaration
public PortugueseAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords)
Parameters
Type | Name | Description |
---|---|---|
Lucene.Net.Util.LuceneVersion | matchVersion | lucene compatibility version |
CharArraySet | stopwords | a stopword set |
PortugueseAnalyzer(LuceneVersion, CharArraySet, CharArraySet)
Builds an analyzer with the given stop words. If a non-empty stem exclusion set is provided this analyzer will add a SetKeywordMarkerFilter before stemming.
Declaration
public PortugueseAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords, CharArraySet stemExclusionSet)
Parameters
Type | Name | Description |
---|---|---|
Lucene.Net.Util.LuceneVersion | matchVersion | lucene compatibility version |
CharArraySet | stopwords | a stopword set |
CharArraySet | stemExclusionSet | a set of terms not to be stemmed |
Fields
| Improve this Doc View SourceDEFAULT_STOPWORD_FILE
File containing default Portuguese stopwords.
Declaration
public const string DEFAULT_STOPWORD_FILE = "portuguese_stop.txt"
Field Value
Type | Description |
---|---|
System.String |
Properties
| Improve this Doc View SourceDefaultStopSet
Returns an unmodifiable instance of the default stop words set.
Declaration
public static CharArraySet DefaultStopSet { get; }
Property Value
Type | Description |
---|---|
CharArraySet | default stop words set. |
Methods
| Improve this Doc View SourceCreateComponents(String, TextReader)
Creates a Lucene.Net.Analysis.TokenStreamComponents which tokenizes all the text in the provided System.IO.TextReader.
Declaration
protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)
Parameters
Type | Name | Description |
---|---|---|
System.String | fieldName | |
System.IO.TextReader | reader |
Returns
Type | Description |
---|---|
Lucene.Net.Analysis.TokenStreamComponents | A Lucene.Net.Analysis.TokenStreamComponents built from an StandardTokenizer filtered with StandardFilter, LowerCaseFilter, StopFilter , SetKeywordMarkerFilter if a stem exclusion set is provided and PortugueseLightStemFilter. |