Class NLPLemmatizerOp
Supply OpenNLP Lemmatizer tools.
Both a dictionary-based lemmatizer and a MaxEnt lemmatizer are supported. If both are configured, the dictionary-based lemmatizer is tried first, and then the MaxEnt lemmatizer is consulted for out-of-vocabulary tokens. The MaxEnt implementation requires binary models from OpenNLP project on SourceForge.Inherited Members
Namespace: Lucene.Net.Analysis.OpenNlp.Tools
Assembly: Lucene.Net.Analysis.OpenNLP.dll
Syntax
public class NLPLemmatizerOp
Constructors
NLPLemmatizerOp(Stream, LemmatizerModel)
Supply OpenNLP Lemmatizer tools.
Both a dictionary-based lemmatizer and a MaxEnt lemmatizer are supported. If both are configured, the dictionary-based lemmatizer is tried first, and then the MaxEnt lemmatizer is consulted for out-of-vocabulary tokens. The MaxEnt implementation requires binary models from OpenNLP project on SourceForge.Declaration
public NLPLemmatizerOp(Stream dictionary, LemmatizerModel lemmatizerModel)
Parameters
Type | Name | Description |
---|---|---|
Stream | dictionary | |
LemmatizerModel | lemmatizerModel |
Methods
Lemmatize(string[], string[])
Supply OpenNLP Lemmatizer tools.
Both a dictionary-based lemmatizer and a MaxEnt lemmatizer are supported. If both are configured, the dictionary-based lemmatizer is tried first, and then the MaxEnt lemmatizer is consulted for out-of-vocabulary tokens. The MaxEnt implementation requires binary models from OpenNLP project on SourceForge.Declaration
public virtual string[] Lemmatize(string[] words, string[] postags)
Parameters
Type | Name | Description |
---|---|---|
string[] | words | |
string[] | postags |
Returns
Type | Description |
---|---|
string[] |