Class JapaneseKatakanaStemFilter

A TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC). Only katakana words longer than a minimum length are stemmed (default is four).

Inheritance

System.Object

AttributeSource

TokenStream

TokenFilter

JapaneseKatakanaStemFilter

Implements

IDisposable

Inherited Members

TokenFilter.m_input

TokenFilter.End()

TokenFilter.Dispose(Boolean)

TokenFilter.Reset()

TokenStream.Dispose()

AttributeSource.GetAttributeFactory()

AttributeSource.GetAttributeClassesEnumerator()

AttributeSource.GetAttributeImplsEnumerator()

AttributeSource.AddAttributeImpl(Attribute)

AttributeSource.AddAttribute<T>()

AttributeSource.HasAttributes

AttributeSource.HasAttribute<T>()

AttributeSource.GetAttribute<T>()

AttributeSource.ClearAttributes()

AttributeSource.CaptureState()

AttributeSource.RestoreState(AttributeSource.State)

AttributeSource.GetHashCode()

AttributeSource.Equals(Object)

AttributeSource.ReflectAsString(Boolean)

AttributeSource.ReflectWith(IAttributeReflector)

AttributeSource.CloneAttributes()

AttributeSource.CopyTo(AttributeSource)

AttributeSource.ToString()

Namespace: Lucene.Net.Analysis.Ja

Assembly: Lucene.Net.Analysis.Kuromoji.dll

Syntax

public sealed class JapaneseKatakanaStemFilter : TokenFilter, IDisposable

Remarks

Note that only full-width katakana characters are supported. Please use a CJKWidthFilter to convert half-width katakana to full-width before using this filter.

In order to prevent terms from being stemmed, use an instance of SetKeywordMarkerFilter or a custom TokenFilter that sets the IKeywordAttribute before this TokenStream.

Constructors

| Improve this Doc View Source

JapaneseKatakanaStemFilter(TokenStream)

Declaration

public JapaneseKatakanaStemFilter(TokenStream input)

Parameters

Type	Name	Description
TokenStream	input

| Improve this Doc View Source

JapaneseKatakanaStemFilter(TokenStream, Int32)

Declaration

public JapaneseKatakanaStemFilter(TokenStream input, int minimumLength)

Parameters

Type	Name	Description
TokenStream	input
System.Int32	minimumLength

Fields

| Improve this Doc View Source

DEFAULT_MINIMUM_LENGTH

Declaration

public static readonly int DEFAULT_MINIMUM_LENGTH

Field Value

Type	Description
System.Int32

Methods

| Improve this Doc View Source

IncrementToken()

Declaration

public override bool IncrementToken()

Returns

Type	Description
System.Boolean

Overrides

TokenStream.IncrementToken()

Implements

IDisposable