Class ChineseFilter
A TokenFilter with a stop word table.
- Numeric tokens are removed.
- English tokens must be larger than 1 character.
- One Chinese character as one Chinese word.
- Add Chinese stop words, such as \ue400
- Dictionary based Chinese word extraction
- Intelligent Chinese word extraction
Implements
IDisposable
Inherited Members
Namespace: Lucene.Net.Analysis.Cn
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public sealed class ChineseFilter : TokenFilter, IDisposable
Constructors
| Improve this Doc View SourceChineseFilter(TokenStream)
Declaration
public ChineseFilter(TokenStream in)
Parameters
Type | Name | Description |
---|---|---|
TokenStream | in |
Fields
| Improve this Doc View SourceSTOP_WORDS
Declaration
public static readonly string[] STOP_WORDS
Field Value
Type | Description |
---|---|
System.String[] |
Methods
| Improve this Doc View SourceIncrementToken()
Declaration
public override bool IncrementToken()
Returns
Type | Description |
---|---|
System.Boolean |
Overrides
Implements
IDisposable