Class NLPLemmatizerOp
Supply OpenNLP Lemmatizer tools.
Both a dictionary-based lemmatizer and a MaxEnt lemmatizer are supported. If both are configured, the dictionary-based lemmatizer is tried first, and then the MaxEnt lemmatizer is consulted for out-of-vocabulary tokens.
The MaxEnt implementation requires binary models from OpenNLP project on SourceForge.
Inheritance
System.Object
NLPLemmatizerOp
Namespace: Lucene.Net.Analysis.OpenNlp.Tools
Assembly: Lucene.Net.Analysis.OpenNLP.dll
Syntax
public class NLPLemmatizerOp : object
Constructors
| Improve this Doc View SourceNLPLemmatizerOp(Stream, LemmatizerModel)
Declaration
public NLPLemmatizerOp(Stream dictionary, LemmatizerModel lemmatizerModel)
Parameters
Type | Name | Description |
---|---|---|
Stream | dictionary | |
LemmatizerModel | lemmatizerModel |
Methods
| Improve this Doc View SourceLemmatize(String[], String[])
Declaration
public virtual string[] Lemmatize(string[] words, string[] postags)
Parameters
Type | Name | Description |
---|---|---|
System.String[] | words | |
System.String[] | postags |
Returns
Type | Description |
---|---|
System.String[] |