Show / Hide Table of Contents

    Class IndicNormalizer

    Normalizes the Unicode representation of text in Indian languages.

    Follows guidelines from Unicode 5.2, chapter 6, South Asian Scripts I and graphical decompositions from http://ldc.upenn.edu/myl/IndianScriptsUnicode.html

    Inheritance
    System.Object
    IndicNormalizer
    Namespace: Lucene.Net.Analysis.In
    Assembly: Lucene.Net.Analysis.Common.dll
    Syntax
    public class IndicNormalizer : object

    Methods

    | Improve this Doc View Source

    Normalize(Char[], Int32)

    Normalizes input text, and returns the new length. The length will always be less than or equal to the existing length.

    Declaration
    public virtual int Normalize(char[] text, int len)
    Parameters
    Type Name Description
    System.Char[] text

    input text

    System.Int32 len

    valid length

    Returns
    Type Description
    System.Int32

    normalized length

    • Improve this Doc
    • View Source
    Back to top Copyright © 2020 Licensed to the Apache Software Foundation (ASF)