Namespace Lucene.Net.Analysis.Ja.Dict
Classes
BinaryDictionary
Base class for a binary-encoded in-memory dictionary.
CharacterDefinition
Character category data.
ConnectionCosts
N-gram connection cost data.
Dictionary
TokenInfoDictionary
Binary dictionary implementation for a known-word dictionary model: words are encoded into an FST that maps each word to a list of wordIDs.
TokenInfoFST
Thin wrapper around an FST with root-arc caching for Japanese.
Depending on fasterButMoreRam, root arcs are cached either for kana only (191 arcs) or for both kana and han (28,607 arcs). The latter is faster but uses more RAM.
UnknownDictionary
Dictionary for unknown-word handling.
UserDictionary
Class for building a user dictionary, which allows custom segmentation of phrases.
Interfaces
IDictionary
Dictionary interface for retrieving morphological data by id.