Fork me on GitHub
  • API

    Show / Hide Table of Contents

    Class ArabicStemmer

    Stemmer for Arabic.

    Stemming is done in-place for efficiency, operating on a termbuffer.

    Stemming is defined as:

    • Removal of attached definite article, conjunction, and prepositions.
    • Stemming of common suffixes.

    Inheritance
    System.Object
    ArabicStemmer
    Inherited Members
    System.Object.Equals(System.Object)
    System.Object.Equals(System.Object, System.Object)
    System.Object.GetHashCode()
    System.Object.GetType()
    System.Object.MemberwiseClone()
    System.Object.ReferenceEquals(System.Object, System.Object)
    System.Object.ToString()
    Namespace: Lucene.Net.Analysis.Ar
    Assembly: Lucene.Net.Analysis.Common.dll
    Syntax
    public class ArabicStemmer

    Fields

    | Improve this Doc View Source

    ALEF

    Declaration
    public const char ALEF = 'ا'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    BEH

    Declaration
    public const char BEH = 'ب'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    FEH

    Declaration
    public const char FEH = 'ف'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    HEH

    Declaration
    public const char HEH = 'ه'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    KAF

    Declaration
    public const char KAF = 'ك'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    LAM

    Declaration
    public const char LAM = 'ل'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    NOON

    Declaration
    public const char NOON = 'ن'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    TEH

    Declaration
    public const char TEH = 'ت'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    TEH_MARBUTA

    Declaration
    public const char TEH_MARBUTA = 'ة'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    WAW

    Declaration
    public const char WAW = 'و'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    YEH

    Declaration
    public const char YEH = 'ي'
    Field Value
    Type Description
    System.Char

    Properties

    | Improve this Doc View Source

    Prefixes

    Declaration
    public static IList<char[]> Prefixes { get; }
    Property Value
    Type Description
    System.Collections.Generic.IList<System.Char[]>
    | Improve this Doc View Source

    Suffixes

    Declaration
    public static IList<char[]> Suffixes { get; }
    Property Value
    Type Description
    System.Collections.Generic.IList<System.Char[]>

    Methods

    | Improve this Doc View Source

    Stem(Char[], Int32)

    Stem an input buffer of Arabic text.

    Declaration
    public virtual int Stem(char[] s, int len)
    Parameters
    Type Name Description
    System.Char[] s

    input buffer

    System.Int32 len

    length of input buffer

    Returns
    Type Description
    System.Int32

    length of input buffer after normalization

    | Improve this Doc View Source

    StemPrefix(Char[], Int32)

    Stem a prefix off an Arabic word.

    Declaration
    public virtual int StemPrefix(char[] s, int len)
    Parameters
    Type Name Description
    System.Char[] s

    input buffer

    System.Int32 len

    length of input buffer

    Returns
    Type Description
    System.Int32

    new length of input buffer after stemming.

    | Improve this Doc View Source

    StemSuffix(Char[], Int32)

    Stem suffix(es) off an Arabic word.

    Declaration
    public virtual int StemSuffix(char[] s, int len)
    Parameters
    Type Name Description
    System.Char[] s

    input buffer

    System.Int32 len

    length of input buffer

    Returns
    Type Description
    System.Int32

    new length of input buffer after stemming

    • Improve this Doc
    • View Source
    Back to top Copyright © 2020 The Apache Software Foundation, Licensed under the Apache License, Version 2.0
    Apache Lucene.Net, Lucene.Net, Apache, the Apache feather logo, and the Apache Lucene.Net project logo are trademarks of The Apache Software Foundation.
    All other marks mentioned may be trademarks or registered trademarks of their respective owners.