Class IndonesianAnalyzer
Lucene.Net.Analysis.Analyzer for Indonesian (Bahasa)
Implements
Inherited Members
Namespace: Lucene.Net.Analysis.Id
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public sealed class IndonesianAnalyzer : StopwordAnalyzerBase, IDisposable
Constructors
IndonesianAnalyzer(LuceneVersion)
Builds an analyzer with the default stop words: DEFAULT_STOPWORD_FILE.
Declaration
public IndonesianAnalyzer(LuceneVersion matchVersion)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion |
IndonesianAnalyzer(LuceneVersion, CharArraySet)
Builds an analyzer with the given stop words
Declaration
public IndonesianAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion | lucene compatibility version |
CharArraySet | stopwords | a stopword set |
IndonesianAnalyzer(LuceneVersion, CharArraySet, CharArraySet)
Builds an analyzer with the given stop word. If a none-empty stem exclusion set is provided this analyzer will add a SetKeywordMarkerFilter before IndonesianStemFilter.
Declaration
public IndonesianAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords, CharArraySet stemExclusionSet)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion | lucene compatibility version |
CharArraySet | stopwords | a stopword set |
CharArraySet | stemExclusionSet | a set of terms not to be stemmed |
Fields
DEFAULT_STOPWORD_FILE
File containing default Indonesian stopwords.
Declaration
public const string DEFAULT_STOPWORD_FILE = "stopwords.txt"
Field Value
Type | Description |
---|---|
string |
Properties
DefaultStopSet
Returns an unmodifiable instance of the default stop-words set.
Declaration
public static CharArraySet DefaultStopSet { get; }
Property Value
Type | Description |
---|---|
CharArraySet | an unmodifiable instance of the default stop-words set. |
Methods
CreateComponents(string, TextReader)
Creates Lucene.Net.Analysis.TokenStreamComponents used to tokenize all the text in the provided TextReader.
Declaration
protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)
Parameters
Type | Name | Description |
---|---|---|
string | fieldName | |
TextReader | reader |
Returns
Type | Description |
---|---|
TokenStreamComponents | Lucene.Net.Analysis.TokenStreamComponents built from an StandardTokenizer filtered with StandardFilter, LowerCaseFilter, StopFilter, SetKeywordMarkerFilter if a stem exclusion set is provided and IndonesianStemFilter. |