Class PersianNormalizer
Normalizer for Persian.
Normalization is done in-place for efficiency, operating on a termbuffer.
Normalization is defined as:
- Normalization of various heh + hamza forms and heh goal to heh.
- Normalization of farsi yeh and yeh barree to arabic yeh
- Normalization of persian keheh to arabic kaf
Inheritance
System.Object
PersianNormalizer
Inherited Members
System.Object.Equals(System.Object)
System.Object.Equals(System.Object, System.Object)
System.Object.GetHashCode()
System.Object.GetType()
System.Object.MemberwiseClone()
System.Object.ReferenceEquals(System.Object, System.Object)
System.Object.ToString()
Namespace: Lucene.Net.Analysis.Fa
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public class PersianNormalizer
Fields
| Improve this Doc View SourceFARSI_YEH
Declaration
public const char FARSI_YEH = 'ی'
Field Value
Type | Description |
---|---|
System.Char |
HAMZA_ABOVE
Declaration
public const char HAMZA_ABOVE = 'ٔ'
Field Value
Type | Description |
---|---|
System.Char |
HEH
Declaration
public const char HEH = 'ه'
Field Value
Type | Description |
---|---|
System.Char |
HEH_GOAL
Declaration
public const char HEH_GOAL = 'ہ'
Field Value
Type | Description |
---|---|
System.Char |
HEH_YEH
Declaration
public const char HEH_YEH = 'ۀ'
Field Value
Type | Description |
---|---|
System.Char |
KAF
Declaration
public const char KAF = 'ك'
Field Value
Type | Description |
---|---|
System.Char |
KEHEH
Declaration
public const char KEHEH = 'ک'
Field Value
Type | Description |
---|---|
System.Char |
YEH
Declaration
public const char YEH = 'ي'
Field Value
Type | Description |
---|---|
System.Char |
YEH_BARREE
Declaration
public const char YEH_BARREE = 'ے'
Field Value
Type | Description |
---|---|
System.Char |
Methods
| Improve this Doc View SourceNormalize(Char[], Int32)
Normalize an input buffer of Persian text
Declaration
public virtual int Normalize(char[] s, int len)
Parameters
Type | Name | Description |
---|---|---|
System.Char[] | s | input buffer |
System.Int32 | len | length of input buffer |
Returns
Type | Description |
---|---|
System.Int32 | length of input buffer after normalization |