Class IndicNormalizer

Normalizes the Unicode representation of text in Indian languages.

Follows guidelines from Unicode 5.2, chapter 6, South Asian Scripts I and graphical decompositions from http://ldc.upenn.edu/myl/IndianScriptsUnicode.html

Inheritance

object

IndicNormalizer

Inherited Members

object.Equals(object)

object.Equals(object, object)

object.GetHashCode()

object.GetType()

object.MemberwiseClone()

object.ReferenceEquals(object, object)

object.ToString()

Namespace: Lucene.Net.Analysis.In

Assembly: Lucene.Net.Analysis.Common.dll

Syntax

public class IndicNormalizer

Methods

Normalize(char[], int)

Normalizes input text, and returns the new length. The length will always be less than or equal to the existing length.

Declaration

public virtual int Normalize(char[] text, int len)

Parameters

Type	Name	Description
char[]	text	input text
int	len	valid length

Returns

Type	Description
int	normalized length