Show / Hide Table of Contents

    Class JapaneseKatakanaStemFilter

    A TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC). Only katakana words longer than a minimum length are stemmed (default is four).

    Inheritance
    System.Object
    AttributeSource
    TokenStream
    TokenFilter
    JapaneseKatakanaStemFilter
    Implements
    IDisposable
    Inherited Members
    TokenFilter.m_input
    TokenFilter.End()
    TokenFilter.Dispose(Boolean)
    TokenFilter.Reset()
    TokenStream.Dispose()
    AttributeSource.GetAttributeFactory()
    AttributeSource.GetAttributeClassesEnumerator()
    AttributeSource.GetAttributeImplsEnumerator()
    AttributeSource.AddAttributeImpl(Attribute)
    AttributeSource.AddAttribute<T>()
    AttributeSource.HasAttributes
    AttributeSource.HasAttribute<T>()
    AttributeSource.GetAttribute<T>()
    AttributeSource.ClearAttributes()
    AttributeSource.CaptureState()
    AttributeSource.RestoreState(AttributeSource.State)
    AttributeSource.GetHashCode()
    AttributeSource.Equals(Object)
    AttributeSource.ReflectAsString(Boolean)
    AttributeSource.ReflectWith(IAttributeReflector)
    AttributeSource.CloneAttributes()
    AttributeSource.CopyTo(AttributeSource)
    AttributeSource.ToString()
    Namespace: Lucene.Net.Analysis.Ja
    Assembly: Lucene.Net.Analysis.Kuromoji.dll
    Syntax
    public sealed class JapaneseKatakanaStemFilter : TokenFilter, IDisposable
    Remarks

    Note that only full-width katakana characters are supported. Please use a CJKWidthFilter to convert half-width katakana to full-width before using this filter.

    In order to prevent terms from being stemmed, use an instance of SetKeywordMarkerFilter or a custom TokenFilter that sets the IKeywordAttribute before this TokenStream.

    Constructors

    | Improve this Doc View Source

    JapaneseKatakanaStemFilter(TokenStream)

    Declaration
    public JapaneseKatakanaStemFilter(TokenStream input)
    Parameters
    Type Name Description
    TokenStream input
    | Improve this Doc View Source

    JapaneseKatakanaStemFilter(TokenStream, Int32)

    Declaration
    public JapaneseKatakanaStemFilter(TokenStream input, int minimumLength)
    Parameters
    Type Name Description
    TokenStream input
    System.Int32 minimumLength

    Fields

    | Improve this Doc View Source

    DEFAULT_MINIMUM_LENGTH

    Declaration
    public static readonly int DEFAULT_MINIMUM_LENGTH
    Field Value
    Type Description
    System.Int32

    Methods

    | Improve this Doc View Source

    IncrementToken()

    Declaration
    public override bool IncrementToken()
    Returns
    Type Description
    System.Boolean
    Overrides
    TokenStream.IncrementToken()

    Implements

    IDisposable
    • Improve this Doc
    • View Source
    Back to top Copyright © 2020 Licensed to the Apache Software Foundation (ASF)