Fork me on GitHub
  • API

    Show / Hide Table of Contents

    Class NLPLemmatizerOp

    Supply OpenNLP Lemmatizer tools.

    Both a dictionary-based lemmatizer and a MaxEnt lemmatizer are supported. If both are configured, the dictionary-based lemmatizer is tried first, and then the MaxEnt lemmatizer is consulted for out-of-vocabulary tokens.

    The MaxEnt implementation requires binary models from OpenNLP project on SourceForge.

    Inheritance
    System.Object
    NLPLemmatizerOp
    Namespace: Lucene.Net.Analysis.OpenNlp.Tools
    Assembly: Lucene.Net.Analysis.OpenNLP.dll
    Syntax
    public class NLPLemmatizerOp : object

    Constructors

    | Improve this Doc View Source

    NLPLemmatizerOp(Stream, LemmatizerModel)

    Declaration
    public NLPLemmatizerOp(Stream dictionary, LemmatizerModel lemmatizerModel)
    Parameters
    Type Name Description
    Stream dictionary
    LemmatizerModel lemmatizerModel

    Methods

    | Improve this Doc View Source

    Lemmatize(String[], String[])

    Declaration
    public virtual string[] Lemmatize(string[] words, string[] postags)
    Parameters
    Type Name Description
    System.String[] words
    System.String[] postags
    Returns
    Type Description
    System.String[]
    • Improve this Doc
    • View Source
    Back to top Copyright © 2020 The Apache Software Foundation, Licensed under the Apache License, Version 2.0
    Apache Lucene.Net, Lucene.Net, Apache, the Apache feather logo, and the Apache Lucene.Net project logo are trademarks of The Apache Software Foundation.
    All other marks mentioned may be trademarks or registered trademarks of their respective owners.