Class IndicNormalizer
Normalizes the Unicode representation of text in Indian languages.
Follows the guidelines in Unicode 5.2, chapter 6, South Asian Scripts I, and the graphical decompositions from http://ldc.upenn.edu/myl/IndianScriptsUnicode.html
Inheritance
System.Object
IndicNormalizer
Namespace: Lucene.Net.Analysis.In
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public class IndicNormalizer : object
Methods
Normalize(Char[], Int32)
Normalizes the input text in place and returns the new length. The returned length is always less than or equal to the existing length.
Declaration
public virtual int Normalize(char[] text, int len)
Parameters
Type | Name | Description |
---|---|---|
System.Char[] | text | Buffer holding the input text; normalized in place |
System.Int32 | len | Number of valid characters in text |
Returns
Type | Description |
---|---|
System.Int32 | Length of the normalized text in the buffer |
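The following is a minimal usage sketch, not taken from the Lucene.NET sources. It assumes the project references the Lucene.Net.Analysis.Common assembly and that IndicNormalizer can be created with its default constructor. The IndicNormalizerExample class and its NormalizeString helper are hypothetical names introduced for illustration. Because Normalize works on a char[] and returns only a length, the caller should read just the first returned-length characters of the buffer afterwards.

```csharp
using System;
using Lucene.Net.Analysis.In;

public static class IndicNormalizerExample
{
    // Hypothetical helper (not part of Lucene.NET): copies the string into a
    // scratch buffer, normalizes the buffer, and rebuilds a string from the
    // valid prefix reported by Normalize.
    public static string NormalizeString(string input)
    {
        char[] buffer = input.ToCharArray();

        // Assumption: the default constructor is available.
        var normalizer = new IndicNormalizer();

        // Normalize returns the new valid length, which is documented to be
        // less than or equal to the length passed in.
        int newLength = normalizer.Normalize(buffer, buffer.Length);

        // Characters past newLength are stale and must be ignored.
        return new string(buffer, 0, newLength);
    }

    public static void Main()
    {
        // Sample input: DEVANAGARI LETTER A followed by DEVANAGARI VOWEL SIGN AA.
        string original = "\u0905\u093E";
        string normalized = NormalizeString(original);

        Console.WriteLine($"original length: {original.Length}, normalized length: {normalized.Length}");
    }
}
```

Copying into a scratch buffer keeps the caller's string untouched. Code that already owns a reusable char[] (for example, a token filter's term buffer) could call Normalize on that buffer directly and then shorten its own length to the returned value.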