• API

    Show / Hide Table of Contents

    Class PersianNormalizer

    Normalizer for Persian.

    Normalization is done in-place for efficiency, operating on a termbuffer.

    Normalization is defined as:

    • Normalization of various heh + hamza forms and heh goal to heh.
    • Normalization of farsi yeh and yeh barree to arabic yeh
    • Normalization of persian keheh to arabic kaf

    Inheritance
    System.Object
    PersianNormalizer
    Inherited Members
    System.Object.Equals(System.Object)
    System.Object.Equals(System.Object, System.Object)
    System.Object.GetHashCode()
    System.Object.GetType()
    System.Object.MemberwiseClone()
    System.Object.ReferenceEquals(System.Object, System.Object)
    System.Object.ToString()
    Namespace: Lucene.Net.Analysis.Fa
    Assembly: Lucene.Net.Analysis.Common.dll
    Syntax
    public class PersianNormalizer

    Fields

    | Improve this Doc View Source

    FARSI_YEH

    Declaration
    public const char FARSI_YEH = 'ی'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    HAMZA_ABOVE

    Declaration
    public const char HAMZA_ABOVE = 'ٔ'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    HEH

    Declaration
    public const char HEH = 'ه'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    HEH_GOAL

    Declaration
    public const char HEH_GOAL = 'ہ'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    HEH_YEH

    Declaration
    public const char HEH_YEH = 'ۀ'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    KAF

    Declaration
    public const char KAF = 'ك'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    KEHEH

    Declaration
    public const char KEHEH = 'ک'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    YEH

    Declaration
    public const char YEH = 'ي'
    Field Value
    Type Description
    System.Char
    | Improve this Doc View Source

    YEH_BARREE

    Declaration
    public const char YEH_BARREE = 'ے'
    Field Value
    Type Description
    System.Char

    Methods

    | Improve this Doc View Source

    Normalize(Char[], Int32)

    Normalize an input buffer of Persian text

    Declaration
    public virtual int Normalize(char[] s, int len)
    Parameters
    Type Name Description
    System.Char[] s

    input buffer

    System.Int32 len

    length of input buffer

    Returns
    Type Description
    System.Int32

    length of input buffer after normalization

    • Improve this Doc
    • View Source
    Back to top Copyright © 2020 Licensed to the Apache Software Foundation (ASF)