Class CzechAnalyzer
Lucene.Net.Analysis.Analyzer for the Czech language.
Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified.
You must specify the required Lucene.Net.Util.LuceneVersion compatibility when creating CzechAnalyzer:
- As of 3.1, words are stemmed with CzechStemFilter
- As of 2.9, StopFilter preserves position increments
- As of 2.4, tokens incorrectly identified as acronyms are corrected (see LUCENE-1068)

Namespace: Lucene.Net.Analysis.Cz
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public sealed class CzechAnalyzer : StopwordAnalyzerBase, IDisposable
  Constructors
CzechAnalyzer(LuceneVersion)
Builds an analyzer with the default stop words (DefaultStopSet).
Declaration
public CzechAnalyzer(LuceneVersion matchVersion)
  Parameters
| Type | Name | Description | 
|---|---|---|
| LuceneVersion | matchVersion | Lucene.Net.Util.LuceneVersion to match  | 
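
As a rough illustration of this constructor, the following sketch builds an analyzer with the default stop words and prints the terms it produces. It assumes the Lucene.Net.Analysis.Common package is referenced; LuceneVersion.LUCENE_48, the field name "content", and the sample sentence are assumptions chosen for the example, not values prescribed by this page.

```csharp
using System;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.Cz;
using Lucene.Net.Analysis.TokenAttributes;
using Lucene.Net.Util;

public static class CzechAnalyzerBasicExample
{
    public static void Main()
    {
        // Assumed compatibility version; use the one your index was built with.
        using (var analyzer = new CzechAnalyzer(LuceneVersion.LUCENE_48))
        // "content" is a hypothetical field name; the text is illustrative only.
        using (TokenStream stream = analyzer.GetTokenStream("content", "Praha je hlavní město České republiky"))
        {
            ICharTermAttribute term = stream.AddAttribute<ICharTermAttribute>();

            stream.Reset();                 // required before the first IncrementToken()
            while (stream.IncrementToken())
            {
                // Each term has passed through lowercasing, stop word removal, and stemming.
                Console.WriteLine(term.ToString());
            }
            stream.End();                   // finalizes end-of-stream state
        }
    }
}
```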
      
CzechAnalyzer(LuceneVersion, CharArraySet)
Builds an analyzer with the given stop words.
Declaration
public CzechAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords)
  Parameters
| Type | Name | Description | 
|---|---|---|
| LuceneVersion | matchVersion | Lucene.Net.Util.LuceneVersion to match  | 
| CharArraySet | stopwords | a stopword set  | 
      
CzechAnalyzer(LuceneVersion, CharArraySet, CharArraySet)
Builds an analyzer with the given stop words and a set of words to be excluded from the CzechStemFilter.
Declaration
public CzechAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords, CharArraySet stemExclusionTable)
  Parameters
| Type | Name | Description | 
|---|---|---|
| LuceneVersion | matchVersion | Lucene.Net.Util.LuceneVersion to match  | 
| CharArraySet | stopwords | a stopword set  | 
| CharArraySet | stemExclusionTable | a stemming exclusion set  | 
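
A minimal sketch of the three-argument constructor, assuming LuceneVersion.LUCENE_48 and the CharArraySet(LuceneVersion, IEnumerable<string>, bool) constructor from Lucene.Net.Analysis.Util. The word lists are hypothetical and far too short for real use; they only show how the two sets are passed.

```csharp
using Lucene.Net.Analysis.Cz;
using Lucene.Net.Analysis.Util;
using Lucene.Net.Util;

public static class CzechAnalyzerCustomSetsExample
{
    public static CzechAnalyzer Create()
    {
        // Assumed compatibility version for this example.
        const LuceneVersion version = LuceneVersion.LUCENE_48;

        // Hypothetical stop words: these terms are dropped and never indexed.
        var stopwords = new CharArraySet(version, new[] { "a", "je", "v" }, true);

        // Hypothetical stem exclusions: these terms are indexed but left unstemmed.
        // Entries are lowercase because lowercasing happens earlier in the chain.
        var stemExclusions = new CharArraySet(version, new[] { "praha" }, true);

        return new CzechAnalyzer(version, stopwords, stemExclusions);
    }
}
```

The two-argument constructor is used the same way, without the stem exclusion set.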
      
Fields
DEFAULT_STOPWORD_FILE
File containing default Czech stopwords.
Declaration
public const string DEFAULT_STOPWORD_FILE = "stopwords.txt"
  Field Value
| Type | Description | 
|---|---|
| string | |
Properties
DefaultStopSet
Returns a set of default Czech stopwords.
Declaration
public static CharArraySet DefaultStopSet { get; }
  Property Value
| Type | Description | 
|---|---|
| CharArraySet | a set of default Czech stopwords  | 
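
One possible use of this property is to start from the default stop words and add a few of your own. The sketch below copies the set first, on the assumption that the shared default set should not be (and possibly cannot be) modified in place. LuceneVersion.LUCENE_48 and the extra word are assumptions made for the example.

```csharp
using System;
using Lucene.Net.Analysis.Cz;
using Lucene.Net.Analysis.Util;
using Lucene.Net.Util;

public static class CzechStopSetExample
{
    public static CzechAnalyzer CreateWithExtraStopwords()
    {
        // Assumed compatibility version for this example.
        const LuceneVersion version = LuceneVersion.LUCENE_48;

        CharArraySet defaults = CzechAnalyzer.DefaultStopSet;
        Console.WriteLine($"Default Czech stop words: {defaults.Count}");

        // Copy into a new, modifiable set rather than touching the shared default.
        var extended = new CharArraySet(version, defaults, true);
        extended.Add("např");   // hypothetical extra stop word

        return new CzechAnalyzer(version, extended);
    }
}
```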
      
Methods
CreateComponents(string, TextReader)
Creates Lucene.Net.Analysis.TokenStreamComponents used to tokenize all the text in the provided TextReader.
Declaration
protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)
  Parameters
| Type | Name | Description | 
|---|---|---|
| string | fieldName | |
| TextReader | reader | |
Returns
| Type | Description | 
|---|---|
| TokenStreamComponents | Lucene.Net.Analysis.TokenStreamComponents built from a StandardTokenizer filtered with StandardFilter, LowerCaseFilter, StopFilter, and CzechStemFilter (only if the version is >= LUCENE_31). If the version is >= LUCENE_31 and a stem exclusion set is provided via CzechAnalyzer(LuceneVersion, CharArraySet, CharArraySet), a SetKeywordMarkerFilter is added before CzechStemFilter.  |
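
The filter chain described above can be approximated by hand, which may help when a custom analyzer needs a slightly different order or an extra filter. This is a sketch under stated assumptions, not the actual CzechAnalyzer source: it assumes a match version of at least LUCENE_31, so the version checks are omitted, and it uses Analyzer.NewAnonymous to avoid declaring a subclass. The class and method names are hypothetical.

```csharp
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.Core;
using Lucene.Net.Analysis.Cz;
using Lucene.Net.Analysis.Miscellaneous;
using Lucene.Net.Analysis.Standard;
using Lucene.Net.Analysis.Util;
using Lucene.Net.Util;

public static class CzechPipelineSketch
{
    // Hypothetical helper: builds an analyzer with roughly the same chain
    // as the one documented for CreateComponents.
    public static Analyzer Create(LuceneVersion version, CharArraySet stopwords, CharArraySet stemExclusions)
    {
        return Analyzer.NewAnonymous(createComponents: (fieldName, reader) =>
        {
            Tokenizer source = new StandardTokenizer(version, reader);
            TokenStream result = new StandardFilter(version, source);
            result = new LowerCaseFilter(version, result);
            result = new StopFilter(version, result, stopwords);

            // Mark excluded terms as keywords so the stemmer leaves them alone.
            if (stemExclusions.Count > 0)
            {
                result = new SetKeywordMarkerFilter(result, stemExclusions);
            }

            result = new CzechStemFilter(result);
            return new TokenStreamComponents(source, result);
        });
    }
}
```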