Class ArabicStemmer
Stemmer for Arabic.
Stemming is done in-place for efficiency, operating on a term buffer. Stemming is defined as:
- Removal of attached definite article, conjunction, and prepositions.
- Stemming of common suffixes.
Namespace: Lucene.Net.Analysis.Ar
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public class ArabicStemmer
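The in-place contract described in the summary can be sketched without the library: the caller owns a `char[]` buffer plus a logical length, and each step shifts characters inside the buffer and returns the new length. The names below (`InPlaceStemSketch`, `DeleteN`, `StripArticle`, `StripTehMarbuta`) and the minimum-length checks are assumptions made for illustration; this is not the Lucene.NET implementation.

```csharp
using System;

// Hypothetical illustration of in-place stemming on a term buffer.
// Not the Lucene.NET code; names and length thresholds are assumptions.
public static class InPlaceStemSketch
{
    // Removes `count` chars starting at `pos` by shifting the tail left.
    // Returns the new logical length; the array itself is never resized.
    public static int DeleteN(char[] s, int pos, int len, int count)
    {
        Array.Copy(s, pos + count, s, pos, len - pos - count);
        return len - count;
    }

    // Strips a leading ALEF + LAM (the definite article) when at least
    // two characters would remain afterwards (assumed threshold).
    public static int StripArticle(char[] s, int len)
    {
        if (len >= 4 && s[0] == '\u0627' && s[1] == '\u0644')
        {
            return DeleteN(s, 0, len, 2);
        }
        return len;
    }

    // Strips a trailing TEH MARBUTA when at least two characters would
    // remain afterwards (assumed threshold). Only the length changes.
    public static int StripTehMarbuta(char[] s, int len)
    {
        if (len >= 3 && s[len - 1] == '\u0629')
        {
            return len - 1;
        }
        return len;
    }
}
```

A caller reads the result with `new string(buffer, 0, newLength)`; characters past the returned length are stale and should be ignored.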
Fields
ALEF
Declaration
public const char ALEF = 'ا'
Field Value
Type | Description |
---|---|
char |
BEH
Declaration
public const char BEH = 'ب'
Field Value
Type | Description |
---|---|
char |
FEH
Declaration
public const char FEH = 'ف'
Field Value
Type | Description |
---|---|
char |
HEH
Declaration
public const char HEH = 'ه'
Field Value
Type | Description |
---|---|
char |
KAF
Declaration
public const char KAF = 'ك'
Field Value
Type | Description |
---|---|
char |
LAM
Declaration
public const char LAM = 'ل'
Field Value
Type | Description |
---|---|
char |
NOON
Declaration
public const char NOON = 'ن'
Field Value
Type | Description |
---|---|
char |
TEH
Declaration
public const char TEH = 'ت'
Field Value
Type | Description |
---|---|
char |
TEH_MARBUTA
Declaration
public const char TEH_MARBUTA = 'ة'
Field Value
Type | Description |
---|---|
char |
WAW
Declaration
public const char WAW = 'و'
Field Value
Type | Description |
---|---|
char |
YEH
Declaration
public const char YEH = 'ي'
Field Value
Type | Description |
---|---|
char |
Properties
Prefixes
Declaration
public static IList<char[]> Prefixes { get; }
Property Value
Type | Description |
---|---|
IList<char[]> |
Suffixes
Declaration
public static IList<char[]> Suffixes { get; }
Property Value
Type | Description |
---|---|
IList<char[]> |
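Because both properties are static lists of `char[]`, they can be inspected without creating a stemmer. The sketch below assumes the Lucene.Net.Analysis.Common assembly is referenced; `AffixListing`, `Describe`, and `PrintAll` are hypothetical helper names, not part of the library.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using Lucene.Net.Analysis.Ar;

// Hypothetical helper for inspecting the affix lists.
public static class AffixListing
{
    // Joins each char[] entry into a string, separated by commas.
    public static string Describe(IList<char[]> affixes)
    {
        return string.Join(", ", affixes.Select(a => new string(a)));
    }

    // Writes both lists exposed by ArabicStemmer to the console.
    public static void PrintAll()
    {
        Console.WriteLine("Prefixes: " + Describe(ArabicStemmer.Prefixes));
        Console.WriteLine("Suffixes: " + Describe(ArabicStemmer.Suffixes));
    }
}
```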
Methods
Stem(char[], int)
Stem an input buffer of Arabic text.
Declaration
public virtual int Stem(char[] s, int len)
Parameters
Type | Name | Description |
---|---|---|
char[] | s | input buffer |
int | len | length of input buffer |
Returns
Type | Description |
---|---|
int | new length of input buffer after stemming |
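A minimal usage sketch for `Stem`, assuming the Lucene.Net.Analysis.Common assembly is referenced. `StemUsageExample` and `StemWord` are hypothetical names. The point it shows is that `Stem` mutates the caller's buffer and that only the first returned-length characters are meaningful afterwards.

```csharp
using System;
using Lucene.Net.Analysis.Ar;

// Hypothetical wrapper around ArabicStemmer.Stem for string input.
public static class StemUsageExample
{
    // Copies `word` into a scratch buffer, stems it in place,
    // and returns the stemmed prefix of that buffer as a new string.
    public static string StemWord(string word)
    {
        var stemmer = new ArabicStemmer();
        char[] buffer = word.ToCharArray();               // Stem writes into this array
        int newLen = stemmer.Stem(buffer, buffer.Length); // returned value is the valid length
        return new string(buffer, 0, newLen);             // ignore chars past newLen
    }
}
```

In an analysis chain the buffer would normally be a token's term buffer rather than a copy of a string; the copy here only keeps the example short.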
StemPrefix(char[], int)
Stem a prefix off an Arabic word.
Declaration
public virtual int StemPrefix(char[] s, int len)
Parameters
Type | Name | Description |
---|---|---|
char[] | s | input buffer |
int | len | length of input buffer |
Returns
Type | Description |
---|---|
int | new length of input buffer after stemming |
StemSuffix(char[], int)
Stem suffix(es) off an Arabic word.
Declaration
public virtual int StemSuffix(char[] s, int len)
Parameters
Type | Name | Description |
---|---|---|
char[] | s | input buffer |
int | len | length of input buffer |
Returns
Type | Description |
---|---|
int | new length of input buffer after stemming |
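`StemPrefix` and `StemSuffix` can also be called on their own, for example to strip prefixes while leaving suffixes intact. The sketch below assumes the Lucene.Net.Analysis.Common assembly is referenced; `AffixStepsExample` and its method names are hypothetical. It makes no claim about how `Stem` combines the two steps internally.

```csharp
using System;
using Lucene.Net.Analysis.Ar;

// Hypothetical helpers that call the two affix steps separately.
public static class AffixStepsExample
{
    // Applies only prefix removal to a copy of `word`.
    public static string StripPrefixOnly(string word)
    {
        var stemmer = new ArabicStemmer();
        char[] buffer = word.ToCharArray();
        int len = stemmer.StemPrefix(buffer, buffer.Length);
        return new string(buffer, 0, len);
    }

    // Applies prefix removal, then suffix removal, on the same buffer.
    // The length returned by the first call is passed to the second.
    public static string StripPrefixThenSuffix(string word)
    {
        var stemmer = new ArabicStemmer();
        char[] buffer = word.ToCharArray();
        int len = stemmer.StemPrefix(buffer, buffer.Length);
        len = stemmer.StemSuffix(buffer, len);
        return new string(buffer, 0, len);
    }
}
```

Since all three methods are declared `virtual`, a subclass could override either step; that is a design option the declarations allow, not something this page documents further.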