Class PersianNormalizer
Normalizer for Persian.
Normalization is done in place for efficiency, operating directly on a term buffer.
Normalization is defined as:
- Normalization of the various heh + hamza forms and heh goal to heh.
- Normalization of Farsi yeh and yeh barree to Arabic yeh.
- Normalization of Persian keheh to Arabic kaf.
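The three rules above can be sketched as a single in-place pass over a char buffer. This is a minimal illustration, not the library's source: the class name `PersianNormalizationSketch` is hypothetical, the code points are taken from the Unicode Arabic block, and it assumes that every combining hamza above is simply dropped, which is what folds heh + hamza into heh.

```csharp
using System;

public static class PersianNormalizationSketch
{
    // Code points from the Unicode Arabic block.
    const char Yeh = '\u064A';
    const char FarsiYeh = '\u06CC';
    const char YehBarree = '\u06D2';
    const char Keheh = '\u06A9';
    const char Kaf = '\u0643';
    const char HamzaAbove = '\u0654';
    const char HehYeh = '\u06C0';
    const char HehGoal = '\u06C1';
    const char Heh = '\u0647';

    // Rewrites the first len chars of s in place and returns the new length.
    public static int Normalize(char[] s, int len)
    {
        int write = 0; // next position to write; never runs ahead of read
        for (int read = 0; read < len; read++)
        {
            char c = s[read];
            switch (c)
            {
                case FarsiYeh:
                case YehBarree:
                    s[write++] = Yeh;   // Farsi yeh, yeh barree -> Arabic yeh
                    break;
                case Keheh:
                    s[write++] = Kaf;   // Persian keheh -> Arabic kaf
                    break;
                case HehYeh:
                case HehGoal:
                    s[write++] = Heh;   // heh with yeh above, heh goal -> heh
                    break;
                case HamzaAbove:
                    // Assumption: the combining mark is removed, so the
                    // buffer shrinks by one char.
                    break;
                default:
                    s[write++] = c;     // everything else is copied unchanged
                    break;
            }
        }
        return write;
    }
}
```

Because a removed hamza shortens the text, the returned length can be smaller than `len`; only the chars before that index are meaningful afterwards.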
Inheritance
System.Object
PersianNormalizer
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public class PersianNormalizer : object
Fields
FARSI_YEH
Declaration
public const char FARSI_YEH = '\u06CC'
Field Value
Type | Description
System.Char | Arabic letter Farsi yeh (U+06CC)
HAMZA_ABOVE
Declaration
public const char HAMZA_ABOVE = '\u0654'
Field Value
Type | Description
System.Char | Arabic hamza above, a combining mark (U+0654)
HEH
Declaration
public const char HEH = '\u0647'
Field Value
Type | Description
System.Char | Arabic letter heh (U+0647)
HEH_GOAL
Declaration
public const char HEH_GOAL = '\u06C1'
Field Value
Type | Description
System.Char | Arabic letter heh goal (U+06C1)
HEH_YEH
Declaration
public const char HEH_YEH = '\u06C0'
Field Value
Type | Description
System.Char | Arabic letter heh with yeh above (U+06C0)
KAF
Declaration
public const char KAF = '\u0643'
Field Value
Type | Description
System.Char | Arabic letter kaf (U+0643)
KEHEH
Declaration
public const char KEHEH = '\u06A9'
Field Value
Type | Description
System.Char | Arabic letter keheh (U+06A9)
YEH
Declaration
public const char YEH = '\u064A'
Field Value
Type | Description
System.Char | Arabic letter yeh (U+064A)
YEH_BARREE
Declaration
public const char YEH_BARREE = '\u06D2'
Field Value
Type | Description
System.Char | Arabic letter yeh barree (U+06D2)
Methods
Normalize(Char[], Int32)
Normalizes an input buffer of Persian text in place.
Declaration
public virtual int Normalize(char[] s, int len)
Parameters
Type | Name | Description
System.Char[] | s | input buffer
System.Int32 | len | length of input buffer
Returns
Type | Description
System.Int32 | length of input buffer after normalization
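Since the method rewrites `s` and returns a new length, callers should read only the first returned-length chars of the buffer afterwards. The sketch below shows that calling pattern with a hypothetical helper, `TermBufferExample.Apply`, which accepts any function with the same shape as `Normalize(char[], int)`, so it does not depend on the Lucene.Net assembly.

```csharp
using System;

public static class TermBufferExample
{
    // Hypothetical helper: runs an in-place normalizer with the same shape as
    // Normalize(char[], int) over a copy of the term and returns the valid prefix.
    public static string Apply(string term, Func<char[], int, int> normalize)
    {
        char[] buffer = term.ToCharArray();
        int newLength = normalize(buffer, buffer.Length);
        // Chars at index >= newLength are leftovers from the original text
        // and must be ignored.
        return new string(buffer, 0, newLength);
    }
}
```

With Lucene.Net.Analysis.Common referenced, the delegate could presumably be supplied as the method group `new PersianNormalizer().Normalize`.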