
    Class PersianNormalizer

    Normalizer for Persian.

    Normalization is done in place for efficiency, operating directly on a term buffer.

    Normalization is defined as:

    • Normalization of the various heh + hamza forms and heh goal to heh.
    • Normalization of Farsi yeh and yeh barree to Arabic yeh.
    • Normalization of Persian keheh to Arabic kaf.
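
The rules above can be sketched as a single pass over a character buffer. This is a minimal illustration, not the Lucene.NET source: the class name `PersianNormalizationSketch` is hypothetical, the code points are assumed to match the constants listed under Fields, and dropping the combining hamza above is an assumption about how "heh + hamza" is reduced to heh.

```csharp
using System;

// Hypothetical stand-in for illustration only; not the library implementation.
public static class PersianNormalizationSketch
{
    // Code points assumed to match the constants documented under Fields.
    private const char Yeh = '\u064A';
    private const char FarsiYeh = '\u06CC';
    private const char YehBarree = '\u06D2';
    private const char Keheh = '\u06A9';
    private const char Kaf = '\u0643';
    private const char HamzaAbove = '\u0654';
    private const char HehYeh = '\u06C0';
    private const char HehGoal = '\u06C1';
    private const char Heh = '\u0647';

    // Rewrites the first len chars of s in place and returns the new length.
    public static int Normalize(char[] s, int len)
    {
        int write = 0;
        for (int read = 0; read < len; read++)
        {
            char c = s[read];
            switch (c)
            {
                case FarsiYeh:
                case YehBarree:
                    s[write++] = Yeh;   // Farsi yeh, yeh barree -> Arabic yeh
                    break;
                case Keheh:
                    s[write++] = Kaf;   // Persian keheh -> Arabic kaf
                    break;
                case HehYeh:
                case HehGoal:
                    s[write++] = Heh;   // heh with yeh above, heh goal -> heh
                    break;
                case HamzaAbove:
                    // Assumption: the combining hamza is dropped, so the
                    // sequence heh + hamza above collapses to plain heh.
                    break;
                default:
                    s[write++] = c;     // every other character is kept
                    break;
            }
        }
        return write;
    }
}
```

Because the buffer can shrink, a caller should treat only the first `write` characters as valid after the call.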

    Inheritance
    System.Object
    PersianNormalizer
    Inherited Members
    System.Object.Equals(System.Object)
    System.Object.Equals(System.Object, System.Object)
    System.Object.GetHashCode()
    System.Object.GetType()
    System.Object.MemberwiseClone()
    System.Object.ReferenceEquals(System.Object, System.Object)
    System.Object.ToString()
    Namespace: Lucene.Net.Analysis.Fa
    Assembly: Lucene.Net.Analysis.Common.dll
    Syntax
    public class PersianNormalizer

    Fields


    FARSI_YEH

    Declaration
    public const char FARSI_YEH = 'ی'
    Field Value
    Type Description
    System.Char

    HAMZA_ABOVE

    Declaration
    public const char HAMZA_ABOVE = 'ٔ'
    Field Value
    Type Description
    System.Char

    HEH

    Declaration
    public const char HEH = 'ه'
    Field Value
    Type Description
    System.Char

    HEH_GOAL

    Declaration
    public const char HEH_GOAL = 'ہ'
    Field Value
    Type Description
    System.Char

    HEH_YEH

    Declaration
    public const char HEH_YEH = 'ۀ'
    Field Value
    Type Description
    System.Char

    KAF

    Declaration
    public const char KAF = 'ك'
    Field Value
    Type Description
    System.Char

    KEHEH

    Declaration
    public const char KEHEH = 'ک'
    Field Value
    Type Description
    System.Char

    YEH

    Declaration
    public const char YEH = 'ي'
    Field Value
    Type Description
    System.Char

    YEH_BARREE

    Declaration
    public const char YEH_BARREE = 'ے'
    Field Value
    Type Description
    System.Char

    Methods


    Normalize(Char[], Int32)

    Normalizes an input buffer of Persian text.

    Declaration
    public virtual int Normalize(char[] s, int len)
    Parameters
    Type Name Description
    System.Char[] s input buffer
    System.Int32 len length of input buffer

    Returns
    Type Description
    System.Int32 length of input buffer after normalization
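
A short usage sketch of the in-place contract, assuming a project that references the Lucene.Net.Analysis.Common assembly named above. The sample input (keheh followed by Farsi yeh) and the variable names are illustrative only.

```csharp
using System;
using Lucene.Net.Analysis.Fa;

// Illustrative input: Persian keheh followed by Farsi yeh.
char[] buffer = "\u06A9\u06CC".ToCharArray();

var normalizer = new PersianNormalizer();

// The buffer is modified in place; the return value is the new valid length.
int newLength = normalizer.Normalize(buffer, buffer.Length);

// Read back only the first newLength chars, since the text may have shrunk.
string normalized = new string(buffer, 0, newLength);
Console.WriteLine(normalized);
```

Passing the returned length to the `string` constructor, rather than `buffer.Length`, avoids picking up stale characters left past the end of the normalized text.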

    Copyright © 2020 The Apache Software Foundation, Licensed under the Apache License, Version 2.0
    Apache Lucene.Net, Lucene.Net, Apache, the Apache feather logo, and the Apache Lucene.Net project logo are trademarks of The Apache Software Foundation.
    All other marks mentioned may be trademarks or registered trademarks of their respective owners.