Class SmartChineseWordTokenFilterFactory
Factory for the SmartChineseAnalyzer WordTokenFilter.
Note: this class currently emits tokens for punctuation. To remove them, either add a Lucene.Net.Analysis.Miscellaneous.WordDelimiterFilter after this filter (with concatenate off), or use the SmartChinese stoplist with a Lucene.Net.Analysis.Core.StopFilterFactory via:
words="org/apache/lucene/analysis/cn/smart/stopwords.txt"
Note
This API is experimental and might change in incompatible ways in the next release.
Inherited Members
TokenFilterFactory.AvailableTokenFilters
TokenFilterFactory.ReloadTokenFilters()
AbstractAnalysisFactory.LUCENE_MATCH_VERSION_PARAM
AbstractAnalysisFactory.m_luceneMatchVersion
AbstractAnalysisFactory.OriginalArgs
AbstractAnalysisFactory.AssureMatchVersion()
AbstractAnalysisFactory.LuceneMatchVersion
AbstractAnalysisFactory.GetClassArg()
AbstractAnalysisFactory.IsExplicitLuceneMatchVersion
Namespace: Lucene.Net.Analysis.Cn.Smart
Assembly: Lucene.Net.Analysis.SmartCn.dll
Syntax
[Obsolete("Use HMMChineseTokenizerFactory instead")]
public class SmartChineseWordTokenFilterFactory : TokenFilterFactory
Constructors
SmartChineseWordTokenFilterFactory(IDictionary<string, string>)
Creates a new SmartChineseWordTokenFilterFactory.
Declaration
public SmartChineseWordTokenFilterFactory(IDictionary<string, string> args)
Parameters
Type | Name | Description |
---|---|---|
IDictionary<string, string> | args |
Methods
Create(TokenStream)
Transforms the specified input Lucene.Net.Analysis.TokenStream.
Declaration
public override TokenStream Create(TokenStream input)
Parameters
Type | Name | Description |
---|---|---|
TokenStream | input |
Returns
Type | Description |
---|---|
TokenStream |
Overrides
Lucene.Net.Analysis.Util.TokenFilterFactory.Create(Lucene.Net.Analysis.TokenStream)
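Example

The following is a minimal sketch of how the constructor and Create(TokenStream) might be used together, not a definitive usage pattern. It assumes the companion SmartChineseSentenceTokenizerFactory (not documented on this page) is available in the same namespace and is also marked obsolete, that an empty argument dictionary is acceptable to both factories, and that the sample text is purely illustrative. Punctuation tokens are expected to appear in the output, as described in the note above.

```csharp
using System;
using System.Collections.Generic;
using System.IO;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.Cn.Smart;
using Lucene.Net.Analysis.TokenAttributes;

public static class SmartChineseFactoryExample
{
    public static void Main()
    {
        // The factory on this page is marked [Obsolete]; the sentence tokenizer
        // factory is assumed to be as well, so the warning is suppressed here.
#pragma warning disable CS0618
        var tokenizerFactory = new SmartChineseSentenceTokenizerFactory(
            new Dictionary<string, string>());
        var filterFactory = new SmartChineseWordTokenFilterFactory(
            new Dictionary<string, string>());
#pragma warning restore CS0618

        // Split the input into sentences first (sample text is illustrative).
        Tokenizer sentences = tokenizerFactory.Create(new StringReader("我是中国人。"));

        // Wrap the sentence stream so that each sentence is segmented into words.
        using (TokenStream words = filterFactory.Create(sentences))
        {
            ICharTermAttribute term = words.AddAttribute<ICharTermAttribute>();

            // Standard TokenStream consumption: Reset, IncrementToken loop, End.
            words.Reset();
            while (words.IncrementToken())
            {
                Console.WriteLine(term.ToString());
            }
            words.End();
        }
    }
}
```

Because the class is obsolete, new code would normally use HMMChineseTokenizerFactory instead, as the Obsolete attribute in the Syntax section recommends.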