    Class IndicNormalizer

    Normalizes the Unicode representation of text in Indian languages.

    Follows the guidelines in Unicode 5.2, chapter 6, "South Asian Scripts I", and the graphical decompositions from http://ldc.upenn.edu/myl/IndianScriptsUnicode.html

    Inheritance
    object
    IndicNormalizer
    Inherited Members
    object.Equals(object)
    object.Equals(object, object)
    object.GetHashCode()
    object.GetType()
    object.MemberwiseClone()
    object.ReferenceEquals(object, object)
    object.ToString()
    Namespace: Lucene.Net.Analysis.In
    Assembly: Lucene.Net.Analysis.Common.dll
    Syntax
    public class IndicNormalizer

    Methods

    Normalize(char[], int)

    Normalizes the input text in place and returns the new length. The returned length is always less than or equal to len, so only the first (returned length) characters of text remain valid after the call.

    Declaration
    public virtual int Normalize(char[] text, int len)
    Parameters
    Type    Name    Description
    char[]  text    input text
    int     len     valid length

    Returns
    Type    Description
    int     normalized length
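
    The length-returning contract of Normalize(char[], int) is easy to misuse, because characters past the returned length stay in the buffer as leftovers. The following is a minimal sketch of one way to wrap such a call, not the library's own code. NormalizeDemo, NormalizeToString, and RemoveHyphens are hypothetical names introduced for illustration; RemoveHyphens is a stand-in with the same signature so that the block runs without any package reference.

```csharp
using System;

public static class NormalizeDemo
{
    // Copies the input into a scratch buffer, runs an in-place normalizer over it,
    // and returns only the prefix that the normalizer reports as valid.
    public static string NormalizeToString(Func<char[], int, int> normalize, string input)
    {
        char[] buffer = input.ToCharArray();
        int newLen = normalize(buffer, buffer.Length);
        if (newLen < 0 || newLen > buffer.Length)
            throw new InvalidOperationException("normalizer returned an out-of-range length");
        // Characters at index newLen and beyond are leftovers and must be ignored.
        return new string(buffer, 0, newLen);
    }

    // Hypothetical stand-in with the same signature as Normalize(char[], int):
    // it deletes '-' characters in place and returns the shortened length.
    public static int RemoveHyphens(char[] text, int len)
    {
        int write = 0;
        for (int read = 0; read < len; read++)
        {
            if (text[read] != '-')
                text[write++] = text[read];
        }
        return write;
    }

    public static void Main()
    {
        // Assumption: with Lucene.Net.Analysis.Common referenced, the delegate could
        // instead be new Lucene.Net.Analysis.In.IndicNormalizer().Normalize
        Console.WriteLine(NormalizeToString(RemoveHyphens, "a-b-c")); // prints "abc"
    }
}
```

    Passing the normalizer as a delegate keeps the sketch independent of the assembly. In real code, calling Normalize directly on a reusable buffer and reading only the first (returned length) characters follows the same pattern.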

    Copyright © 2024 The Apache Software Foundation, Licensed under the Apache License, Version 2.0
    Apache Lucene.Net, Lucene.Net, Apache, the Apache feather logo, and the Apache Lucene.Net project logo are trademarks of The Apache Software Foundation.
    All other marks mentioned may be trademarks or registered trademarks of their respective owners.