Namespace Lucene.Net.Analysis.OpenNlp.Tools
Classes
NLPChunkerOp
Supply OpenNLP Chunking tool Requires binary models from OpenNLP project on SourceForge.
NLPLemmatizerOp
Supply OpenNLP Lemmatizer tools.
Both a dictionary-based lemmatizer and a MaxEnt lemmatizer are supported. If both are configured, the dictionary-based lemmatizer is tried first, and then the MaxEnt lemmatizer is consulted for out-of-vocabulary tokens.
The MaxEnt implementation requires binary models from OpenNLP project on SourceForge.
NLPNERTaggerOp
Supply OpenNLP Named Entity Resolution tool Requires binary models from OpenNLP project on SourceForge.
Usage: from the OpenNLP documentation:
"The NameFinderME class is not thread safe, it must only be called from one thread. To use multiple threads multiple NameFinderME instances sharing the same model instance can be created. The input text should be segmented into documents, sentences and tokens. To perform entity detection an application calls the find method for every sentence in the document. After every document clearAdaptiveData must be called to clear the adaptive data in the feature generators. Not calling clearAdaptiveData can lead to a sharp drop in the detection rate after a few documents."
NLPPOSTaggerOp
Supply OpenNLP Parts-Of-Speech Tagging tool. Requires binary models from OpenNLP project on SourceForge.
NLPSentenceDetectorOp
Supply OpenNLP Sentence Detector tool. Requires binary models from OpenNLP project on SourceForge.
NLPTokenizerOp
Supply OpenNLP Sentence Tokenizer tool. Requires binary models from OpenNLP project on SourceForge.
OpenNLPOpsFactory
Supply OpenNLP Named Entity Recognizer. Cache model file objects. Assumes model files are thread-safe.