Show / Hide Table of Contents

    Class Lucene43EdgeNGramTokenizer

    Old version of EdgeNGramTokenizer which doesn't handle correctly supplementary characters.

    Inheritance
    System.Object
    AttributeSource
    TokenStream
    Tokenizer
    Lucene43EdgeNGramTokenizer
    Implements
    IDisposable
    Inherited Members
    Tokenizer.m_input
    Tokenizer.Dispose(Boolean)
    Tokenizer.CorrectOffset(Int32)
    Tokenizer.SetReader(TextReader)
    TokenStream.Dispose()
    AttributeSource.GetAttributeFactory()
    AttributeSource.GetAttributeClassesEnumerator()
    AttributeSource.GetAttributeImplsEnumerator()
    AttributeSource.AddAttributeImpl(Attribute)
    AttributeSource.AddAttribute<T>()
    AttributeSource.HasAttributes
    AttributeSource.HasAttribute<T>()
    AttributeSource.GetAttribute<T>()
    AttributeSource.ClearAttributes()
    AttributeSource.CaptureState()
    AttributeSource.RestoreState(AttributeSource.State)
    AttributeSource.GetHashCode()
    AttributeSource.Equals(Object)
    AttributeSource.ReflectAsString(Boolean)
    AttributeSource.ReflectWith(IAttributeReflector)
    AttributeSource.CloneAttributes()
    AttributeSource.CopyTo(AttributeSource)
    AttributeSource.ToString()
    Namespace: Lucene.Net.Analysis.NGram
    Assembly: Lucene.Net.Analysis.Common.dll
    Syntax
    public sealed class Lucene43EdgeNGramTokenizer : Tokenizer, IDisposable

    Constructors

    | Improve this Doc View Source

    Lucene43EdgeNGramTokenizer(LuceneVersion, AttributeSource.AttributeFactory, TextReader, Lucene43EdgeNGramTokenizer.Side, Int32, Int32)

    Creates Lucene43EdgeNGramTokenizer that can generate n-grams in the sizes of the given range

    Declaration
    public Lucene43EdgeNGramTokenizer(LuceneVersion version, AttributeSource.AttributeFactory factory, TextReader input, Lucene43EdgeNGramTokenizer.Side side, int minGram, int maxGram)
    Parameters
    Type Name Description
    LuceneVersion version

    the Lucene match version - See LuceneVersion

    AttributeSource.AttributeFactory factory

    AttributeSource.AttributeFactory to use

    TextReader input

    holding the input to be tokenized

    Lucene43EdgeNGramTokenizer.Side side

    the Lucene43EdgeNGramTokenizer.Side from which to chop off an n-gram

    System.Int32 minGram

    the smallest n-gram to generate

    System.Int32 maxGram

    the largest n-gram to generate

    | Improve this Doc View Source

    Lucene43EdgeNGramTokenizer(LuceneVersion, AttributeSource.AttributeFactory, TextReader, Int32, Int32)

    Creates Lucene43EdgeNGramTokenizer that can generate n-grams in the sizes of the given range

    Declaration
    public Lucene43EdgeNGramTokenizer(LuceneVersion version, AttributeSource.AttributeFactory factory, TextReader input, int minGram, int maxGram)
    Parameters
    Type Name Description
    LuceneVersion version

    the Lucene match version - See LuceneVersion

    AttributeSource.AttributeFactory factory

    AttributeSource.AttributeFactory to use

    TextReader input

    holding the input to be tokenized

    System.Int32 minGram

    the smallest n-gram to generate

    System.Int32 maxGram

    the largest n-gram to generate

    | Improve this Doc View Source

    Lucene43EdgeNGramTokenizer(LuceneVersion, AttributeSource.AttributeFactory, TextReader, String, Int32, Int32)

    Creates Lucene43EdgeNGramTokenizer that can generate n-grams in the sizes of the given range

    Declaration
    public Lucene43EdgeNGramTokenizer(LuceneVersion version, AttributeSource.AttributeFactory factory, TextReader input, string sideLabel, int minGram, int maxGram)
    Parameters
    Type Name Description
    LuceneVersion version

    the Lucene match version - See LuceneVersion

    AttributeSource.AttributeFactory factory

    AttributeSource.AttributeFactory to use

    TextReader input

    holding the input to be tokenized

    System.String sideLabel

    the name of the Lucene43EdgeNGramTokenizer.Side from which to chop off an n-gram

    System.Int32 minGram

    the smallest n-gram to generate

    System.Int32 maxGram

    the largest n-gram to generate

    | Improve this Doc View Source

    Lucene43EdgeNGramTokenizer(LuceneVersion, TextReader, Lucene43EdgeNGramTokenizer.Side, Int32, Int32)

    Creates Lucene43EdgeNGramTokenizer that can generate n-grams in the sizes of the given range

    Declaration
    public Lucene43EdgeNGramTokenizer(LuceneVersion version, TextReader input, Lucene43EdgeNGramTokenizer.Side side, int minGram, int maxGram)
    Parameters
    Type Name Description
    LuceneVersion version

    the Lucene match version - See LuceneVersion

    TextReader input

    holding the input to be tokenized

    Lucene43EdgeNGramTokenizer.Side side

    the Lucene43EdgeNGramTokenizer.Side from which to chop off an n-gram

    System.Int32 minGram

    the smallest n-gram to generate

    System.Int32 maxGram

    the largest n-gram to generate

    | Improve this Doc View Source

    Lucene43EdgeNGramTokenizer(LuceneVersion, TextReader, Int32, Int32)

    Creates Lucene43EdgeNGramTokenizer that can generate n-grams in the sizes of the given range

    Declaration
    public Lucene43EdgeNGramTokenizer(LuceneVersion version, TextReader input, int minGram, int maxGram)
    Parameters
    Type Name Description
    LuceneVersion version

    the Lucene match version - See LuceneVersion

    TextReader input

    holding the input to be tokenized

    System.Int32 minGram

    the smallest n-gram to generate

    System.Int32 maxGram

    the largest n-gram to generate

    | Improve this Doc View Source

    Lucene43EdgeNGramTokenizer(LuceneVersion, TextReader, String, Int32, Int32)

    Creates Lucene43EdgeNGramTokenizer that can generate n-grams in the sizes of the given range

    Declaration
    public Lucene43EdgeNGramTokenizer(LuceneVersion version, TextReader input, string sideLabel, int minGram, int maxGram)
    Parameters
    Type Name Description
    LuceneVersion version

    the Lucene match version - See LuceneVersion

    TextReader input

    holding the input to be tokenized

    System.String sideLabel

    the name of the Lucene43EdgeNGramTokenizer.Side from which to chop off an n-gram

    System.Int32 minGram

    the smallest n-gram to generate

    System.Int32 maxGram

    the largest n-gram to generate

    Fields

    | Improve this Doc View Source

    DEFAULT_MAX_GRAM_SIZE

    Declaration
    public const int DEFAULT_MAX_GRAM_SIZE = null
    Field Value
    Type Description
    System.Int32
    | Improve this Doc View Source

    DEFAULT_MIN_GRAM_SIZE

    Declaration
    public const int DEFAULT_MIN_GRAM_SIZE = null
    Field Value
    Type Description
    System.Int32
    | Improve this Doc View Source

    DEFAULT_SIDE

    Declaration
    public const Lucene43EdgeNGramTokenizer.Side DEFAULT_SIDE = Lucene43EdgeNGramTokenizer.Side.FRONT
    Field Value
    Type Description
    Lucene43EdgeNGramTokenizer.Side

    Methods

    | Improve this Doc View Source

    End()

    Declaration
    public override void End()
    Overrides
    TokenStream.End()
    | Improve this Doc View Source

    GetSide(String)

    Declaration
    public static Lucene43EdgeNGramTokenizer.Side GetSide(string sideName)
    Parameters
    Type Name Description
    System.String sideName
    Returns
    Type Description
    Lucene43EdgeNGramTokenizer.Side
    | Improve this Doc View Source

    IncrementToken()

    Returns the next token in the stream, or null at EOS.

    Declaration
    public override bool IncrementToken()
    Returns
    Type Description
    System.Boolean
    Overrides
    TokenStream.IncrementToken()
    | Improve this Doc View Source

    Reset()

    Declaration
    public override void Reset()
    Overrides
    Tokenizer.Reset()

    Implements

    IDisposable
    • Improve this Doc
    • View Source
    Back to top Copyright © 2020 Licensed to the Apache Software Foundation (ASF)