Enum JapaneseTokenizerMode

Tokenization mode: this determines how the tokenizer handles compound and unknown words.

public enum JapaneseTokenizerMode

Name	Description
EXTENDED	Extended mode outputs unigrams for unknown words. Note This API is experimental and might change in incompatible ways in the next release.
NORMAL	Ordinary segmentation: no decomposition for compounds,
SEARCH	Segmentation geared towards search: this includes a decompounding process for long nouns, also including the full compound token as a synonym.