Class GermanAnalyzer
Analyzer for German language.
Supports an external list of stopwords (words that will not be indexed at all) and an external list of exclusions (word that will not be stemmed, but indexed). A default set of stopwords is used unless an alternative list is specified, but the exclusion list is empty by default.
You must specify the required LuceneVersion compatibility when creating GermanAnalyzer:
NOTE: This class uses the same LuceneVersion dependent settings as StandardAnalyzer.
Implements
Inherited Members
Namespace: Lucene.Net.Analysis.De
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public sealed class GermanAnalyzer : StopwordAnalyzerBase, IDisposable
Constructors
| Improve this Doc View SourceGermanAnalyzer(LuceneVersion)
Builds an analyzer with the default stop words: DefaultStopSet.
Declaration
public GermanAnalyzer(LuceneVersion matchVersion)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion |
GermanAnalyzer(LuceneVersion, CharArraySet)
Builds an analyzer with the given stop words
Declaration
public GermanAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion | lucene compatibility version |
CharArraySet | stopwords | a stopword set |
GermanAnalyzer(LuceneVersion, CharArraySet, CharArraySet)
Builds an analyzer with the given stop words
Declaration
public GermanAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords, CharArraySet stemExclusionSet)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion | lucene compatibility version |
CharArraySet | stopwords | a stopword set |
CharArraySet | stemExclusionSet | a stemming exclusion set |
Fields
| Improve this Doc View SourceDEFAULT_STOPWORD_FILE
File containing default German stopwords.
Declaration
public const string DEFAULT_STOPWORD_FILE = "german_stop.txt"
Field Value
Type | Description |
---|---|
System.String |
Properties
| Improve this Doc View SourceDefaultStopSet
Returns a set of default German-stopwords
Declaration
public static CharArraySet DefaultStopSet { get; }
Property Value
Type | Description |
---|---|
CharArraySet | a set of default German-stopwords |
Methods
| Improve this Doc View SourceCreateComponents(String, TextReader)
Creates TokenStreamComponents used to tokenize all the text in the provided System.IO.TextReader.
Declaration
protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)
Parameters
Type | Name | Description |
---|---|---|
System.String | fieldName | |
System.IO.TextReader | reader |
Returns
Type | Description |
---|---|
TokenStreamComponents | TokenStreamComponents built from a StandardTokenizer filtered with StandardFilter, LowerCaseFilter, StopFilter, SetKeywordMarkerFilter if a stem exclusion set is provided, GermanNormalizationFilter and GermanLightStemFilter |