Class ThaiAnalyzer
Lucene.Net.Analysis.Analyzer for the Thai language. It uses ICU4N.Text.BreakIterator to break text into words.
You must specify the required Lucene.Net.Util.LuceneVersion compatibility when creating ThaiAnalyzer:
- As of 3.6, a set of Thai stopwords is used by default
Inheritance
System.Object
Lucene.Net.Analysis.Analyzer
Lucene.Net.Analysis.Util.StopwordAnalyzerBase
ThaiAnalyzer
Implements
System.IDisposable
Inherited Members
Lucene.Net.Analysis.Util.StopwordAnalyzerBase.m_stopwords
Lucene.Net.Analysis.Util.StopwordAnalyzerBase.m_matchVersion
Lucene.Net.Analysis.Util.StopwordAnalyzerBase.StopwordSet
Lucene.Net.Analysis.Util.StopwordAnalyzerBase.LoadStopwordSet(System.Boolean, System.Type, System.String, System.String)
Lucene.Net.Analysis.Util.StopwordAnalyzerBase.LoadStopwordSet(System.IO.FileInfo, Lucene.Net.Util.LuceneVersion)
Lucene.Net.Analysis.Util.StopwordAnalyzerBase.LoadStopwordSet(System.IO.TextReader, Lucene.Net.Util.LuceneVersion)
Lucene.Net.Analysis.Analyzer.Strategy
Lucene.Net.Analysis.Analyzer.Dispose()
Lucene.Net.Analysis.Analyzer.GLOBAL_REUSE_STRATEGY
Lucene.Net.Analysis.Analyzer.PER_FIELD_REUSE_STRATEGY
System.Object.Equals(System.Object)
System.Object.Equals(System.Object, System.Object)
System.Object.GetHashCode()
System.Object.GetType()
System.Object.MemberwiseClone()
System.Object.ReferenceEquals(System.Object, System.Object)
System.Object.ToString()
Namespace: Lucene.Net.Analysis.Th
Assembly: Lucene.Net.ICU.dll
Syntax
public sealed class ThaiAnalyzer : StopwordAnalyzerBase, IDisposable
Constructors
ThaiAnalyzer(LuceneVersion)
Builds an analyzer with the default stop words.
Declaration
public ThaiAnalyzer(LuceneVersion matchVersion)
Parameters
Type | Name | Description |
---|---|---|
Lucene.Net.Util.LuceneVersion | matchVersion | Lucene compatibility version |
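As a minimal sketch of how this constructor might be used, the example below builds an analyzer with the default stop words and collects the terms it emits for one string. The class name ThaiAnalyzerExample, the helper Tokenize, the field name "body", and the choice of LuceneVersion.LUCENE_48 are illustrative assumptions, not part of this API.

```csharp
using System.Collections.Generic;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.Th;
using Lucene.Net.Analysis.TokenAttributes;
using Lucene.Net.Util;

public static class ThaiAnalyzerExample
{
    // Hypothetical helper: returns the terms produced for a single string.
    public static List<string> Tokenize(string text)
    {
        var terms = new List<string>();

        // LUCENE_48 is an assumed compatibility version; use the one your index requires.
        using (var analyzer = new ThaiAnalyzer(LuceneVersion.LUCENE_48))
        using (TokenStream stream = analyzer.GetTokenStream("body", text))
        {
            ICharTermAttribute term = stream.AddAttribute<ICharTermAttribute>();

            stream.Reset();                   // required before the first IncrementToken()
            while (stream.IncrementToken())
            {
                terms.Add(term.ToString());   // copy the term text; the attribute is reused
            }
            stream.End();                     // signals end of stream before disposal
        }

        return terms;
    }
}
```

Because the default Thai stop words are applied, the returned list can be shorter than the number of words in the input.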
ThaiAnalyzer(LuceneVersion, CharArraySet)
Builds an analyzer with the given stop words.
Declaration
public ThaiAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords)
Parameters
Type | Name | Description |
---|---|---|
Lucene.Net.Util.LuceneVersion | matchVersion | Lucene compatibility version |
Lucene.Net.Analysis.Util.CharArraySet | stopwords | a stopword set |
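The following sketch shows one way a custom stop word set could be passed to this constructor. The class name ThaiStopwordsExample, the helper Create, and LuceneVersion.LUCENE_48 are assumptions made for illustration. The caller supplies the words, and the supplied set is used in place of the default one.

```csharp
using System.Collections.Generic;
using Lucene.Net.Analysis.Th;
using Lucene.Net.Analysis.Util;
using Lucene.Net.Util;

public static class ThaiStopwordsExample
{
    // Hypothetical helper: builds an analyzer that filters only the given words.
    public static ThaiAnalyzer Create(IList<string> words)
    {
        // Copy into a List<string> so the CharArraySet constructor receives a plain collection.
        var source = new List<string>(words);

        // ignoreCase is false here; Thai script has no letter case.
        var stopwords = new CharArraySet(LuceneVersion.LUCENE_48, source, false);

        return new ThaiAnalyzer(LuceneVersion.LUCENE_48, stopwords);
    }
}
```

The returned analyzer implements System.IDisposable, so callers would normally wrap it in a using statement or dispose it when indexing or searching is finished.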
Fields
DEFAULT_STOPWORD_FILE
File containing default Thai stopwords.
Declaration
public const string DEFAULT_STOPWORD_FILE = "stopwords.txt"
Field Value
Type | Description |
---|---|
System.String |
Properties
DefaultStopSet
Returns an unmodifiable instance of the default stop words set.
Declaration
public static CharArraySet DefaultStopSet { get; }
Property Value
Type | Description |
---|---|
Lucene.Net.Analysis.Util.CharArraySet | default stop words set. |
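A short sketch of reading the default set is given below, for example to check whether a word would be filtered by the default configuration. The class name DefaultStopSetExample and both helper methods are hypothetical. The set is documented as unmodifiable, so the sketch only reads from it.

```csharp
using Lucene.Net.Analysis.Th;
using Lucene.Net.Analysis.Util;

public static class DefaultStopSetExample
{
    // Hypothetical helper: true if the word is in the default Thai stop word set.
    public static bool IsDefaultStopword(string word)
    {
        CharArraySet defaults = ThaiAnalyzer.DefaultStopSet;
        return defaults.Contains(word);
    }

    // Hypothetical helper: number of entries in the default set.
    public static int DefaultStopwordCount()
    {
        return ThaiAnalyzer.DefaultStopSet.Count;
    }
}
```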
Methods
CreateComponents(String, TextReader)
Creates Lucene.Net.Analysis.TokenStreamComponents used to tokenize all the text in the provided System.IO.TextReader.
Declaration
protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)
Parameters
Type | Name | Description |
---|---|---|
System.String | fieldName | |
System.IO.TextReader | reader | |
Returns
Type | Description |
---|---|
Lucene.Net.Analysis.TokenStreamComponents | Lucene.Net.Analysis.TokenStreamComponents built from a Lucene.Net.Analysis.Standard.StandardTokenizer filtered with Lucene.Net.Analysis.Standard.StandardFilter, Lucene.Net.Analysis.Core.LowerCaseFilter, ThaiWordFilter, and Lucene.Net.Analysis.Core.StopFilter |
Overrides
Lucene.Net.Analysis.Analyzer.CreateComponents(System.String, System.IO.TextReader)