Fork me on GitHub
  • API

    Show / Hide Table of Contents

    Class IndicNormalizer

    Normalizes the Unicode representation of text in Indian languages.

    Follows guidelines from Unicode 5.2, chapter 6, South Asian Scripts I and graphical decompositions from http://ldc.upenn.edu/myl/IndianScriptsUnicode.html

    Inheritance
    System.Object
    IndicNormalizer
    Inherited Members
    System.Object.Equals(System.Object)
    System.Object.Equals(System.Object, System.Object)
    System.Object.GetHashCode()
    System.Object.GetType()
    System.Object.MemberwiseClone()
    System.Object.ReferenceEquals(System.Object, System.Object)
    System.Object.ToString()
    Namespace: Lucene.Net.Analysis.In
    Assembly: Lucene.Net.Analysis.Common.dll
    Syntax
    public class IndicNormalizer

    Methods

    | Improve this Doc View Source

    Normalize(Char[], Int32)

    Normalizes input text, and returns the new length. The length will always be less than or equal to the existing length.

    Declaration
    public virtual int Normalize(char[] text, int len)
    Parameters
    Type Name Description
    System.Char[] text

    input text

    System.Int32 len

    valid length

    Returns
    Type Description
    System.Int32

    normalized length

    • Improve this Doc
    • View Source
    Back to top Copyright © 2020 The Apache Software Foundation, Licensed under the Apache License, Version 2.0
    Apache Lucene.Net, Lucene.Net, Apache, the Apache feather logo, and the Apache Lucene.Net project logo are trademarks of The Apache Software Foundation.
    All other marks mentioned may be trademarks or registered trademarks of their respective owners.