Class JapaneseKatakanaStemFilter
A TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC). Only katakana words longer than a minimum length are stemmed (default is four).
Implements
IDisposable
Inherited Members
Namespace: Lucene.Net.Analysis.Ja
Assembly: Lucene.Net.Analysis.Kuromoji.dll
Syntax
public sealed class JapaneseKatakanaStemFilter : TokenFilter, IDisposable
Remarks
Note that only full-width katakana characters are supported. Please use a CJKWidthFilter to convert half-width katakana to full-width before using this filter.
In order to prevent terms from being stemmed, use an instance of SetKeywordMarkerFilter or a custom TokenFilter that sets the IKeywordAttribute before this TokenStream.
Constructors
| Improve this Doc View SourceJapaneseKatakanaStemFilter(TokenStream)
Declaration
public JapaneseKatakanaStemFilter(TokenStream input)
Parameters
Type | Name | Description |
---|---|---|
TokenStream | input |
JapaneseKatakanaStemFilter(TokenStream, Int32)
Declaration
public JapaneseKatakanaStemFilter(TokenStream input, int minimumLength)
Parameters
Type | Name | Description |
---|---|---|
TokenStream | input | |
System.Int32 | minimumLength |
Fields
| Improve this Doc View SourceDEFAULT_MINIMUM_LENGTH
Declaration
public static readonly int DEFAULT_MINIMUM_LENGTH
Field Value
Type | Description |
---|---|
System.Int32 |
Methods
| Improve this Doc View SourceIncrementToken()
Declaration
public override bool IncrementToken()
Returns
Type | Description |
---|---|
System.Boolean |
Overrides
Implements
IDisposable