
    Class RussianLetterTokenizer

    A RussianLetterTokenizer is a Lucene.Net.Analysis.Tokenizer that extends LetterTokenizer by also allowing the basic Latin digits 0-9.

    You must specify the required Lucene.Net.Util.LuceneVersion compatibility when creating RussianLetterTokenizer:

    • As of 3.1, CharTokenizer uses an int based API to normalize and detect token characters. See IsTokenChar(int) and Normalize(int) for details.
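
As a rough illustration of passing the compatibility version at construction time, the sketch below tokenizes a string and collects the resulting terms. The class name `RussianLetterTokenizerExample` and its `Tokenize` helper are hypothetical names introduced only for this example, and `LuceneVersion.LUCENE_48` is an assumed choice of version; substitute the version your index was built against.

```csharp
using System.Collections.Generic;
using System.IO;
using Lucene.Net.Analysis.Ru;
using Lucene.Net.Analysis.TokenAttributes;
using Lucene.Net.Util;

// Hypothetical helper class, not part of Lucene.NET.
public static class RussianLetterTokenizerExample
{
    // Returns the terms the tokenizer produces for the given text.
    public static List<string> Tokenize(string text)
    {
        var terms = new List<string>();

        // The class is marked [Obsolete]; suppress the warning for this sketch only.
#pragma warning disable CS0618
        using (var tokenizer = new RussianLetterTokenizer(
            LuceneVersion.LUCENE_48,      // assumed compatibility version
            new StringReader(text)))
#pragma warning restore CS0618
        {
            // The term attribute exposes the text of the current token.
            ICharTermAttribute termAtt = tokenizer.AddAttribute<ICharTermAttribute>();

            tokenizer.Reset();            // must be called before IncrementToken()
            while (tokenizer.IncrementToken())
            {
                terms.Add(termAtt.ToString());
            }
            tokenizer.End();              // signals end of stream
        }

        return terms;
    }
}
```

A call such as `RussianLetterTokenizerExample.Tokenize("Привет мир 2024")` walks the usual Reset / IncrementToken / End sequence; the tokenizer is disposed by the `using` block.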
    Inheritance
    object
    AttributeSource
    TokenStream
    Tokenizer
    CharTokenizer
    RussianLetterTokenizer
    Implements
    IDisposable
    Inherited Members
    CharTokenizer.Normalize(int)
    CharTokenizer.IncrementToken()
    CharTokenizer.End()
    CharTokenizer.Reset()
    Tokenizer.m_input
    Tokenizer.Dispose(bool)
    Tokenizer.CorrectOffset(int)
    Tokenizer.SetReader(TextReader)
    TokenStream.Dispose()
    AttributeSource.GetAttributeFactory()
    AttributeSource.GetAttributeClassesEnumerator()
    AttributeSource.GetAttributeImplsEnumerator()
    AttributeSource.AddAttributeImpl(Attribute)
    AttributeSource.AddAttribute<T>()
    AttributeSource.HasAttributes
    AttributeSource.HasAttribute<T>()
    AttributeSource.GetAttribute<T>()
    AttributeSource.ClearAttributes()
    AttributeSource.CaptureState()
    AttributeSource.RestoreState(AttributeSource.State)
    AttributeSource.GetHashCode()
    AttributeSource.Equals(object)
    AttributeSource.ReflectAsString(bool)
    AttributeSource.ReflectWith(IAttributeReflector)
    AttributeSource.CloneAttributes()
    AttributeSource.CopyTo(AttributeSource)
    AttributeSource.ToString()
    object.Equals(object, object)
    object.GetType()
    object.MemberwiseClone()
    object.ReferenceEquals(object, object)
    Namespace: Lucene.Net.Analysis.Ru
    Assembly: Lucene.Net.Analysis.Common.dll
    Syntax
    [Obsolete("(3.1) Use StandardTokenizer instead, which has the same functionality.")]
    public class RussianLetterTokenizer : CharTokenizer, IDisposable
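
Since the class is marked obsolete in favor of StandardTokenizer, new code would normally use that tokenizer instead. The sketch below shows one possible replacement; the class name `StandardTokenizerMigrationExample` and its `Tokenize` helper are hypothetical, and `LuceneVersion.LUCENE_48` is an assumed version. StandardTokenizer applies its own word-break rules, so it is worth checking its output against your own data (for example, text with punctuation inside numbers) before switching.

```csharp
using System.Collections.Generic;
using System.IO;
using Lucene.Net.Analysis.Standard;
using Lucene.Net.Analysis.TokenAttributes;
using Lucene.Net.Util;

// Hypothetical helper class, not part of Lucene.NET.
public static class StandardTokenizerMigrationExample
{
    // Returns the terms StandardTokenizer produces for the given text.
    public static List<string> Tokenize(string text)
    {
        var terms = new List<string>();

        using (var tokenizer = new StandardTokenizer(
            LuceneVersion.LUCENE_48,      // assumed compatibility version
            new StringReader(text)))
        {
            ICharTermAttribute termAtt = tokenizer.AddAttribute<ICharTermAttribute>();

            tokenizer.Reset();
            while (tokenizer.IncrementToken())
            {
                terms.Add(termAtt.ToString());
            }
            tokenizer.End();
        }

        return terms;
    }
}
```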

    Constructors

    RussianLetterTokenizer(LuceneVersion, AttributeFactory, TextReader)

    Construct a new RussianLetterTokenizer using a given Lucene.Net.Util.AttributeSource.AttributeFactory.

    Declaration
    public RussianLetterTokenizer(LuceneVersion matchVersion, AttributeSource.AttributeFactory factory, TextReader @in)
    Parameters
    Type                              Name          Description
    LuceneVersion                     matchVersion  Lucene compatibility version
    AttributeSource.AttributeFactory  factory       the attribute factory to use for this Lucene.Net.Analysis.Tokenizer
    TextReader                        in            the input to split up into tokens

    RussianLetterTokenizer(LuceneVersion, TextReader)

    Construct a new RussianLetterTokenizer.

    Declaration
    public RussianLetterTokenizer(LuceneVersion matchVersion, TextReader @in)
    Parameters
    Type           Name          Description
    LuceneVersion  matchVersion  Lucene compatibility version
    TextReader     in            the input to split up into tokens

    Methods

    IsTokenChar(int)

    Collects only characters that satisfy IsLetter(int) or are basic Latin digits 0-9.

    Declaration
    protected override bool IsTokenChar(int c)
    Parameters
    Type Name Description
    int c
    Returns
    Type Description
    bool
    Overrides
    CharTokenizer.IsTokenChar(int)
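
To make the rule concrete, here is a standalone sketch of a predicate that accepts Unicode letters plus the basic Latin digits 0-9, as the class summary describes. It is not the library's implementation: `TokenCharSketch` is a hypothetical name, and the letter check uses the .NET `CharUnicodeInfo` categories as an assumed stand-in for whatever `IsLetter(int)` the library relies on.

```csharp
using System.Globalization;

// Hypothetical standalone sketch, not the Lucene.NET source.
public static class TokenCharSketch
{
    // Returns true when the code point is a letter or a basic Latin digit 0-9.
    public static bool IsTokenChar(int codePoint)
    {
        // Basic Latin digits are accepted in addition to letters.
        if (codePoint >= '0' && codePoint <= '9')
        {
            return true;
        }

        // ConvertFromUtf32 throws for values outside the valid code point
        // range and for lone surrogates; callers of this sketch must pass
        // valid code points.
        string s = char.ConvertFromUtf32(codePoint);

        // Letter categories: Lu, Ll, Lt, Lm, Lo.
        switch (CharUnicodeInfo.GetUnicodeCategory(s, 0))
        {
            case UnicodeCategory.UppercaseLetter:
            case UnicodeCategory.LowercaseLetter:
            case UnicodeCategory.TitlecaseLetter:
            case UnicodeCategory.ModifierLetter:
            case UnicodeCategory.OtherLetter:
                return true;
            default:
                return false;
        }
    }
}
```

Under this rule, Cyrillic and Latin letters and the digits 0-9 extend the current token, while whitespace and punctuation end it.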

    Copyright © 2024 The Apache Software Foundation, Licensed under the Apache License, Version 2.0
    Apache Lucene.Net, Lucene.Net, Apache, the Apache feather logo, and the Apache Lucene.Net project logo are trademarks of The Apache Software Foundation.
    All other marks mentioned may be trademarks or registered trademarks of their respective owners.