Fork me on GitHub
  • API

    Show / Hide Table of Contents

    Class ArabicStemmer

    Stemmer for Arabic.

    Stemming is done in-place for efficiency, operating on a termbuffer.

    Stemming is defined as:
    • Removal of attached definite article, conjunction, and prepositions.
    • Stemming of common suffixes.
    Inheritance
    object
    ArabicStemmer
    Inherited Members
    object.Equals(object)
    object.Equals(object, object)
    object.GetHashCode()
    object.GetType()
    object.MemberwiseClone()
    object.ReferenceEquals(object, object)
    object.ToString()
    Namespace: Lucene.Net.Analysis.Ar
    Assembly: Lucene.Net.Analysis.Common.dll
    Syntax
    public class ArabicStemmer

    Fields

    ALEF

    Stemmer for Arabic.

    Stemming is done in-place for efficiency, operating on a termbuffer.

    Stemming is defined as:
    • Removal of attached definite article, conjunction, and prepositions.
    • Stemming of common suffixes.
    Declaration
    public const char ALEF = 'ا'
    Field Value
    Type Description
    char

    BEH

    Stemmer for Arabic.

    Stemming is done in-place for efficiency, operating on a termbuffer.

    Stemming is defined as:
    • Removal of attached definite article, conjunction, and prepositions.
    • Stemming of common suffixes.
    Declaration
    public const char BEH = 'ب'
    Field Value
    Type Description
    char

    FEH

    Stemmer for Arabic.

    Stemming is done in-place for efficiency, operating on a termbuffer.

    Stemming is defined as:
    • Removal of attached definite article, conjunction, and prepositions.
    • Stemming of common suffixes.
    Declaration
    public const char FEH = 'ف'
    Field Value
    Type Description
    char

    HEH

    Stemmer for Arabic.

    Stemming is done in-place for efficiency, operating on a termbuffer.

    Stemming is defined as:
    • Removal of attached definite article, conjunction, and prepositions.
    • Stemming of common suffixes.
    Declaration
    public const char HEH = 'ه'
    Field Value
    Type Description
    char

    KAF

    Stemmer for Arabic.

    Stemming is done in-place for efficiency, operating on a termbuffer.

    Stemming is defined as:
    • Removal of attached definite article, conjunction, and prepositions.
    • Stemming of common suffixes.
    Declaration
    public const char KAF = 'ك'
    Field Value
    Type Description
    char

    LAM

    Stemmer for Arabic.

    Stemming is done in-place for efficiency, operating on a termbuffer.

    Stemming is defined as:
    • Removal of attached definite article, conjunction, and prepositions.
    • Stemming of common suffixes.
    Declaration
    public const char LAM = 'ل'
    Field Value
    Type Description
    char

    NOON

    Stemmer for Arabic.

    Stemming is done in-place for efficiency, operating on a termbuffer.

    Stemming is defined as:
    • Removal of attached definite article, conjunction, and prepositions.
    • Stemming of common suffixes.
    Declaration
    public const char NOON = 'ن'
    Field Value
    Type Description
    char

    TEH

    Stemmer for Arabic.

    Stemming is done in-place for efficiency, operating on a termbuffer.

    Stemming is defined as:
    • Removal of attached definite article, conjunction, and prepositions.
    • Stemming of common suffixes.
    Declaration
    public const char TEH = 'ت'
    Field Value
    Type Description
    char

    TEH_MARBUTA

    Stemmer for Arabic.

    Stemming is done in-place for efficiency, operating on a termbuffer.

    Stemming is defined as:
    • Removal of attached definite article, conjunction, and prepositions.
    • Stemming of common suffixes.
    Declaration
    public const char TEH_MARBUTA = 'ة'
    Field Value
    Type Description
    char

    WAW

    Stemmer for Arabic.

    Stemming is done in-place for efficiency, operating on a termbuffer.

    Stemming is defined as:
    • Removal of attached definite article, conjunction, and prepositions.
    • Stemming of common suffixes.
    Declaration
    public const char WAW = 'و'
    Field Value
    Type Description
    char

    YEH

    Stemmer for Arabic.

    Stemming is done in-place for efficiency, operating on a termbuffer.

    Stemming is defined as:
    • Removal of attached definite article, conjunction, and prepositions.
    • Stemming of common suffixes.
    Declaration
    public const char YEH = 'ي'
    Field Value
    Type Description
    char

    Properties

    Prefixes

    Stemmer for Arabic.

    Stemming is done in-place for efficiency, operating on a termbuffer.

    Stemming is defined as:
    • Removal of attached definite article, conjunction, and prepositions.
    • Stemming of common suffixes.
    Declaration
    public static IList<char[]> Prefixes { get; }
    Property Value
    Type Description
    IList<char[]>

    Suffixes

    Stemmer for Arabic.

    Stemming is done in-place for efficiency, operating on a termbuffer.

    Stemming is defined as:
    • Removal of attached definite article, conjunction, and prepositions.
    • Stemming of common suffixes.
    Declaration
    public static IList<char[]> Suffixes { get; }
    Property Value
    Type Description
    IList<char[]>

    Methods

    Stem(char[], int)

    Stem an input buffer of Arabic text.

    Declaration
    public virtual int Stem(char[] s, int len)
    Parameters
    Type Name Description
    char[] s

    input buffer

    int len

    length of input buffer

    Returns
    Type Description
    int

    length of input buffer after normalization

    StemPrefix(char[], int)

    Stem a prefix off an Arabic word.

    Declaration
    public virtual int StemPrefix(char[] s, int len)
    Parameters
    Type Name Description
    char[] s

    input buffer

    int len

    length of input buffer

    Returns
    Type Description
    int

    new length of input buffer after stemming.

    StemSuffix(char[], int)

    Stem suffix(es) off an Arabic word.

    Declaration
    public virtual int StemSuffix(char[] s, int len)
    Parameters
    Type Name Description
    char[] s

    input buffer

    int len

    length of input buffer

    Returns
    Type Description
    int

    new length of input buffer after stemming

    Back to top Copyright © 2024 The Apache Software Foundation, Licensed under the Apache License, Version 2.0
    Apache Lucene.Net, Lucene.Net, Apache, the Apache feather logo, and the Apache Lucene.Net project logo are trademarks of The Apache Software Foundation.
    All other marks mentioned may be trademarks or registered trademarks of their respective owners.