Class IndicNormalizer
Normalizes the Unicode representation of text in Indian languages.
Follows guidelines from Unicode 5.2, chapter 6, South Asian Scripts I and graphical decompositions from http://ldc.upenn.edu/myl/IndianScriptsUnicode.html
Inherited Members
Namespace: Lucene.Net.Analysis.In
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public class IndicNormalizer
Methods
Normalize(char[], int)
Normalizes input text, and returns the new length. The length will always be less than or equal to the existing length.
Declaration
public virtual int Normalize(char[] text, int len)
Parameters
Type | Name | Description |
---|---|---|
char[] | text | input text |
int | len | valid length |
Returns
Type | Description |
---|---|
int | normalized length |