Class HyphenationCompoundWordTokenFilterFactory
Factory for HyphenationCompoundWordTokenFilter.
This factory accepts the following parameters:
hyphenator (mandatory): path to the FOP XML hyphenation pattern. See http://offo.sourceforge.net/hyphenation/.
encoding (optional): encoding of the XML hyphenation file. Defaults to UTF-8.
dictionary (optional): dictionary of words. Defaults to no dictionary.
minWordSize (optional): minimal word length that gets decomposed. Defaults to 5.
minSubwordSize (optional): minimum length of subwords. Defaults to 2.
maxSubwordSize (optional): maximum length of subwords. Defaults to 15.
onlyLongestMatch (optional): if true, adds only the longest matching subword to the stream. Defaults to false.
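How the size parameters interact can be easier to see in code. The following is a simplified, self-contained sketch and not the filter's actual implementation. `SubwordSketch`, `Decompose`, and the caller-supplied `points` array are hypothetical names. The array stands in for the hyphenation points that the real filter derives from the pattern file, and the sketch assumes that each candidate subword runs from one hyphenation point to a later one.

```csharp
using System;
using System.Collections.Generic;

public static class SubwordSketch
{
    // Hypothetical helper illustrating the documented parameters.
    // points: candidate split offsets into word, ascending, including 0 and word.Length.
    // dictionary: accepted subwords, or null to accept every candidate.
    public static List<string> Decompose(
        string word, int[] points, ISet<string> dictionary,
        int minWordSize = 5, int minSubwordSize = 2,
        int maxSubwordSize = 15, bool onlyLongestMatch = false)
    {
        var result = new List<string>();

        // Words shorter than minWordSize are not decomposed at all.
        if (word.Length < minWordSize) return result;

        for (int i = 0; i < points.Length; i++)
        {
            string longest = "";
            for (int j = i + 1; j < points.Length; j++)
            {
                int length = points[j] - points[i];
                if (length < minSubwordSize) continue; // too short, try a later end point
                if (length > maxSubwordSize) break;    // later end points are longer still

                string part = word.Substring(points[i], length);
                if (dictionary != null && !dictionary.Contains(part)) continue;

                if (!onlyLongestMatch) result.Add(part);
                else if (part.Length > longest.Length) longest = part;
            }

            // With onlyLongestMatch, emit at most one subword per start point.
            if (longest.Length > 0) result.Add(longest);
        }
        return result;
    }
}
```

For example, calling `Decompose("rindfleisch", new[] { 0, 4, 11 }, dictionary)` with a dictionary that contains `rind` and `fleisch` yields those two subwords, while a four-letter word yields nothing because it is shorter than the default `minWordSize` of 5.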
<fieldType name="text_hyphncomp" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.HyphenationCompoundWordTokenFilterFactory" hyphenator="hyphenator.xml" encoding="UTF-8"
            dictionary="dictionary.txt" minWordSize="5" minSubwordSize="2" maxSubwordSize="15" onlyLongestMatch="false"/>
  </analyzer>
</fieldType>
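The same configuration can also be built in code through the constructor, Inform, and Create members documented on this page. The sketch below makes several assumptions that this page does not confirm: that the Lucene.NET 4.8 packages are referenced, that the factory expects a `luceneMatchVersion` argument as other 4.x analysis factories do, that `FilesystemResourceLoader` accepts a `DirectoryInfo`, and that `hyphenator.xml` and `dictionary.txt` exist in the current directory. `HyphenationFactoryExample` and `BuildArgs` are hypothetical names.

```csharp
using System;
using System.Collections.Generic;
using System.IO;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.Compound;
using Lucene.Net.Analysis.Core;
using Lucene.Net.Analysis.TokenAttributes;
using Lucene.Net.Analysis.Util;
using Lucene.Net.Util;

public static class HyphenationFactoryExample
{
    // Same settings as the XML configuration, expressed as factory arguments.
    public static IDictionary<string, string> BuildArgs()
    {
        return new Dictionary<string, string>
        {
            // Assumption: the factory requires a match version argument.
            ["luceneMatchVersion"] = "LUCENE_48",
            ["hyphenator"] = "hyphenator.xml",
            ["encoding"] = "UTF-8",
            ["dictionary"] = "dictionary.txt",
            ["minWordSize"] = "5",
            ["minSubwordSize"] = "2",
            ["maxSubwordSize"] = "15",
            ["onlyLongestMatch"] = "false"
        };
    }

    public static void Main()
    {
        var factory = new HyphenationCompoundWordTokenFilterFactory(BuildArgs());

        // Inform must run before Create so the pattern and dictionary files are loaded.
        // Assumption: both files are resolved relative to the current directory.
        factory.Inform(new FilesystemResourceLoader(new DirectoryInfo(".")));

        using (TextReader reader = new StringReader("Rindfleischetikettierung"))
        using (TokenStream stream = factory.Create(
                   new WhitespaceTokenizer(LuceneVersion.LUCENE_48, reader)))
        {
            var term = stream.AddAttribute<ICharTermAttribute>();
            stream.Reset();
            while (stream.IncrementToken())
            {
                // Prints each token the filter emits, one per line.
                Console.WriteLine(term.ToString());
            }
            stream.End();
        }
    }
}
```

Which subwords are printed depends entirely on the pattern file and dictionary supplied, so no expected output is shown.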
Inheritance
System.Object
AbstractAnalysisFactory
TokenFilterFactory
HyphenationCompoundWordTokenFilterFactory
Implements
IResourceLoaderAware
Inherited Members
System.Object.Equals(System.Object)
System.Object.Equals(System.Object, System.Object)
System.Object.GetHashCode()
System.Object.GetType()
System.Object.MemberwiseClone()
System.Object.ReferenceEquals(System.Object, System.Object)
System.Object.ToString()
Namespace: Lucene.Net.Analysis.Compound
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public class HyphenationCompoundWordTokenFilterFactory : TokenFilterFactory, IResourceLoaderAware
Constructors
HyphenationCompoundWordTokenFilterFactory(IDictionary<String, String>)
Creates a new HyphenationCompoundWordTokenFilterFactory.
Declaration
public HyphenationCompoundWordTokenFilterFactory(IDictionary<string, string> args)
Parameters
Type | Name | Description |
---|---|---|
System.Collections.Generic.IDictionary<System.String, System.String> | args | Factory configuration as name/value pairs; see the parameter list at the top of this page. |
Methods
Create(TokenStream)
Declaration
public override TokenStream Create(TokenStream input)
Parameters
Type | Name | Description |
---|---|---|
TokenStream | input | The token stream to wrap with the filter. |
Returns
Type | Description |
---|---|
TokenStream | A HyphenationCompoundWordTokenFilter that wraps the input stream. |
Overrides
TokenFilterFactory.Create(TokenStream)
Inform(IResourceLoader)
Declaration
public virtual void Inform(IResourceLoader loader)
Parameters
Type | Name | Description |
---|---|---|
IResourceLoader | loader | Resource loader used to read the hyphenator and dictionary files. |