Class ArabicStemmer
Stemmer for Arabic.
Stemming is done in-place for efficiency, operating on a termbuffer.
Stemming is defined as:
- Removal of attached definite article, conjunction, and prepositions.
- Stemming of common suffixes.
Inheritance
System.Object
ArabicStemmer
Inherited Members
System.Object.Equals(System.Object)
System.Object.Equals(System.Object, System.Object)
System.Object.GetHashCode()
System.Object.GetType()
System.Object.MemberwiseClone()
System.Object.ReferenceEquals(System.Object, System.Object)
System.Object.ToString()
Namespace: Lucene.Net.Analysis.Ar
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public class ArabicStemmer
Fields
| Improve this Doc View SourceALEF
Declaration
public const char ALEF = 'ا'
Field Value
Type | Description |
---|---|
System.Char |
BEH
Declaration
public const char BEH = 'ب'
Field Value
Type | Description |
---|---|
System.Char |
FEH
Declaration
public const char FEH = 'ف'
Field Value
Type | Description |
---|---|
System.Char |
HEH
Declaration
public const char HEH = 'ه'
Field Value
Type | Description |
---|---|
System.Char |
KAF
Declaration
public const char KAF = 'ك'
Field Value
Type | Description |
---|---|
System.Char |
LAM
Declaration
public const char LAM = 'ل'
Field Value
Type | Description |
---|---|
System.Char |
NOON
Declaration
public const char NOON = 'ن'
Field Value
Type | Description |
---|---|
System.Char |
prefixes
Declaration
public static readonly char[][] prefixes
Field Value
Type | Description |
---|---|
System.Char[][] |
suffixes
Declaration
public static readonly char[][] suffixes
Field Value
Type | Description |
---|---|
System.Char[][] |
TEH
Declaration
public const char TEH = 'ت'
Field Value
Type | Description |
---|---|
System.Char |
TEH_MARBUTA
Declaration
public const char TEH_MARBUTA = 'ة'
Field Value
Type | Description |
---|---|
System.Char |
WAW
Declaration
public const char WAW = 'و'
Field Value
Type | Description |
---|---|
System.Char |
YEH
Declaration
public const char YEH = 'ي'
Field Value
Type | Description |
---|---|
System.Char |
Methods
| Improve this Doc View SourceStem(Char[], Int32)
Stem an input buffer of Arabic text.
Declaration
public virtual int Stem(char[] s, int len)
Parameters
Type | Name | Description |
---|---|---|
System.Char[] | s | input buffer |
System.Int32 | len | length of input buffer |
Returns
Type | Description |
---|---|
System.Int32 | length of input buffer after normalization |
StemPrefix(Char[], Int32)
Stem a prefix off an Arabic word.
Declaration
public virtual int StemPrefix(char[] s, int len)
Parameters
Type | Name | Description |
---|---|---|
System.Char[] | s | input buffer |
System.Int32 | len | length of input buffer |
Returns
Type | Description |
---|---|
System.Int32 | new length of input buffer after stemming. |
StemSuffix(Char[], Int32)
Stem suffix(es) off an Arabic word.
Declaration
public virtual int StemSuffix(char[] s, int len)
Parameters
Type | Name | Description |
---|---|---|
System.Char[] | s | input buffer |
System.Int32 | len | length of input buffer |
Returns
Type | Description |
---|---|
System.Int32 | new length of input buffer after stemming |