Class DutchAnalyzer

Lucene.Net.Analysis.Analyzer for the Dutch language.

Supports an external list of stopwords (words that are not indexed at all), an external list of exclusions (words that are indexed but not stemmed), and an external list of word-stem pairs that override the stemming algorithm (dictionary stemming). A default set of stopwords is used unless an alternative list is specified; the exclusion list is empty by default.

You must specify the required Lucene.Net.Util.LuceneVersion compatibility when creating DutchAnalyzer:

As of 3.6, DutchAnalyzer(LuceneVersion, CharArraySet) and DutchAnalyzer(LuceneVersion, CharArraySet, CharArraySet) also populate the default entries for the stem override dictionary.
As of 3.1, Snowball stemming is done with SnowballFilter, LowerCaseFilter is applied before StopFilter, and Snowball stopwords are used by default.
As of 2.9, StopFilter preserves position increments.

NOTE: This class uses the same Lucene.Net.Util.LuceneVersion dependent settings as StandardAnalyzer.
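
The following is a minimal usage sketch, not taken from the original documentation. It assumes a project that references the Lucene.NET 4.8 packages (Lucene.Net and Lucene.Net.Analysis.Common). The field name "body", the sample sentence, and the class name DutchAnalyzerDemo are illustrative only.

```csharp
using System;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.Nl;
using Lucene.Net.Analysis.TokenAttributes;
using Lucene.Net.Util;

public static class DutchAnalyzerDemo
{
    public static void Main()
    {
        // Default stopwords; the match version selects version-dependent behavior.
        using (var analyzer = new DutchAnalyzer(LuceneVersion.LUCENE_48))
        // "body" is a hypothetical field name used only for this example.
        using (TokenStream stream = analyzer.GetTokenStream("body", "De katten lopen door de tuin"))
        {
            ICharTermAttribute term = stream.AddAttribute<ICharTermAttribute>();

            stream.Reset();                      // required before the first IncrementToken()
            while (stream.IncrementToken())
            {
                Console.WriteLine(term.ToString()); // one analyzed term per line
            }
            stream.End();                        // finalize end-of-stream state
        }
    }
}
```

Stopwords such as "de" should not appear in the output, and the remaining terms are lowercased and stemmed. The exact stems depend on the Snowball Dutch stemmer and the match version in use.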

Inheritance

System.Object

Lucene.Net.Analysis.Analyzer

DutchAnalyzer

Implements

System.IDisposable

Inherited Members

Analyzer.NewAnonymous(Func<String, TextReader, TokenStreamComponents>)

Analyzer.NewAnonymous(Func<String, TextReader, TokenStreamComponents>, ReuseStrategy)

Analyzer.NewAnonymous(Func<String, TextReader, TokenStreamComponents>, Func<String, TextReader, TextReader>)

Analyzer.NewAnonymous(Func<String, TextReader, TokenStreamComponents>, Func<String, TextReader, TextReader>, ReuseStrategy)

Analyzer.GetTokenStream(String, TextReader)

Analyzer.GetTokenStream(String, String)

Analyzer.InitReader(String, TextReader)

Analyzer.GetPositionIncrementGap(String)

Analyzer.GetOffsetGap(String)

Lucene.Net.Analysis.Analyzer.Strategy

Lucene.Net.Analysis.Analyzer.Dispose()

Analyzer.Dispose(Boolean)

Lucene.Net.Analysis.Analyzer.GLOBAL_REUSE_STRATEGY

Lucene.Net.Analysis.Analyzer.PER_FIELD_REUSE_STRATEGY

System.Object.Equals(System.Object)

System.Object.Equals(System.Object, System.Object)

System.Object.GetHashCode()

System.Object.GetType()

System.Object.MemberwiseClone()

System.Object.ReferenceEquals(System.Object, System.Object)

System.Object.ToString()

Namespace: Lucene.Net.Analysis.Nl

Assembly: Lucene.Net.Analysis.Common.dll

Syntax

public sealed class DutchAnalyzer : Analyzer, IDisposable

Constructors

DutchAnalyzer(LuceneVersion)

Builds an analyzer with the default stop words (DefaultStopSet) and a few default entries for the stem override dictionary. The stem exclusion table is empty.

Declaration

public DutchAnalyzer(LuceneVersion matchVersion)

Parameters

Type	Name	Description
Lucene.Net.Util.LuceneVersion	matchVersion

DutchAnalyzer(LuceneVersion, CharArraySet)

Declaration

public DutchAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords)

Parameters

Type	Name	Description
Lucene.Net.Util.LuceneVersion	matchVersion
CharArraySet	stopwords

DutchAnalyzer(LuceneVersion, CharArraySet, CharArraySet)

Declaration

public DutchAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords, CharArraySet stemExclusionTable)

Parameters

Type	Name	Description
Lucene.Net.Util.LuceneVersion	matchVersion
CharArraySet	stopwords
CharArraySet	stemExclusionTable
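
As a hedged illustration of this overload, the sketch below builds a custom stopword set and a stem exclusion set and passes both to the constructor. It assumes Lucene.NET 4.8, where CharArraySet lives in Lucene.Net.Analysis.Util. The word lists and the DutchAnalyzerFactory helper are hypothetical, chosen only for the example.

```csharp
using Lucene.Net.Analysis.Nl;
using Lucene.Net.Analysis.Util;
using Lucene.Net.Util;

// Hypothetical helper class; not part of Lucene.NET.
public static class DutchAnalyzerFactory
{
    public static DutchAnalyzer Create()
    {
        const LuceneVersion version = LuceneVersion.LUCENE_48;

        // Words dropped from the token stream entirely (illustrative list).
        var stopwords = new CharArraySet(version, new[] { "de", "het", "een" }, true);

        // Words that are indexed as-is and skipped by the stemmer (illustrative list).
        var stemExclusions = new CharArraySet(version, new[] { "fietsen" }, true);

        return new DutchAnalyzer(version, stopwords, stemExclusions);
    }
}
```

The final `true` argument makes each set case-insensitive. Because the analyzer lowercases tokens before the stop filter runs, lowercase entries are sufficient either way.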

DutchAnalyzer(LuceneVersion, CharArraySet, CharArraySet, CharArrayMap<String>)

Declaration

public DutchAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords, CharArraySet stemExclusionTable, CharArrayMap<string> stemOverrideDict)

Parameters

Type	Name	Description
Lucene.Net.Util.LuceneVersion	matchVersion
CharArraySet	stopwords
CharArraySet	stemExclusionTable
CharArrayMap<System.String>	stemOverrideDict

Fields

DEFAULT_STOPWORD_FILE

File containing default Dutch stopwords.

Declaration

public const string DEFAULT_STOPWORD_FILE = "dutch_stop.txt"

Field Value

Type	Description
System.String

Properties

DefaultStopSet

Returns an unmodifiable instance of the default stop-words set.

Declaration

public static CharArraySet DefaultStopSet { get; }

Property Value

Type	Description
CharArraySet	an unmodifiable instance of the default stop-words set.

Methods

CreateComponents(String, TextReader)

Returns a (possibly reused) Lucene.Net.Analysis.TokenStream which tokenizes all the text in the provided System.IO.TextReader.

Declaration

protected override TokenStreamComponents CreateComponents(string fieldName, TextReader aReader)

Parameters

Type	Name	Description
System.String	fieldName
System.IO.TextReader	aReader

Returns

Type	Description
Lucene.Net.Analysis.TokenStreamComponents	A Lucene.Net.Analysis.TokenStream built from a StandardTokenizer filtered with StandardFilter, LowerCaseFilter, StopFilter, SetKeywordMarkerFilter (only if a stem exclusion set is provided), StemmerOverrideFilter, and SnowballFilter.

Overrides

Analyzer.CreateComponents(String, TextReader)
