Class SmartChineseWordTokenFilterFactory
Factory for the SmartChineseAnalyzer WordTokenFilter
Note: this class will currently emit tokens for punctuation. So you should either add a WordDelimiterFilter after to remove these (with concatenate off), or use the SmartChinese stoplist with a StopFilterFactory via:
words="org/apache/lucene/analysis/cn/smart/stopwords.txt"
This is a Lucene.NET EXPERIMENTAL API, use at your own risk
Inheritance
System.Object
SmartChineseWordTokenFilterFactory
Inherited Members
System.Object.Equals(System.Object)
System.Object.Equals(System.Object, System.Object)
System.Object.GetHashCode()
System.Object.GetType()
System.Object.MemberwiseClone()
System.Object.ReferenceEquals(System.Object, System.Object)
System.Object.ToString()
Namespace: Lucene.Net.Analysis.Cn.Smart
Assembly: Lucene.Net.Analysis.SmartCn.dll
Syntax
[Obsolete("Use HMMChineseTokenizerFactory instead")]
public class SmartChineseWordTokenFilterFactory : TokenFilterFactory
Constructors
| Improve this Doc View SourceSmartChineseWordTokenFilterFactory(IDictionary<String, String>)
Creates a new SmartChineseWordTokenFilterFactory
Declaration
public SmartChineseWordTokenFilterFactory(IDictionary<string, string> args)
Parameters
Type | Name | Description |
---|---|---|
System.Collections.Generic.IDictionary<System.String, System.String> | args |
Methods
| Improve this Doc View SourceCreate(TokenStream)
Declaration
public override TokenStream Create(TokenStream input)
Parameters
Type | Name | Description |
---|---|---|
TokenStream | input |
Returns
Type | Description |
---|---|
TokenStream |