Class FrenchAnalyzer
Analyzer for French language.
Supports an external list of stopwords (words that will not be indexed at all) and an external list of exclusions (word that will not be stemmed, but indexed). A default set of stopwords is used unless an alternative list is specified, but the exclusion list is empty by default.
You must specify the required Lucene
- As of 3.6, French
Light is used for less aggressive stemming.Stem Filter - As of 3.1, Snowball stemming is done with Snowball
Filter , LowerCase is used prior to StopFilter Filter , and ElisionFilter and Snowball stopwords are used by default. - As of 2.9, Stop
Filter preserves position increments
NOTE: This class uses the same Lucene
Inherited Members
Namespace: Lucene.Net.Analysis.Fr
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public sealed class FrenchAnalyzer : StopwordAnalyzerBase
Constructors
| Improve this Doc View SourceFrenchAnalyzer(LuceneVersion)
Builds an analyzer with the default stop words (Default
Declaration
public FrenchAnalyzer(LuceneVersion matchVersion)
Parameters
Type | Name | Description |
---|---|---|
Lucene |
matchVersion |
FrenchAnalyzer(LuceneVersion, CharArraySet)
Builds an analyzer with the given stop words
Declaration
public FrenchAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords)
Parameters
Type | Name | Description |
---|---|---|
Lucene |
matchVersion | lucene compatibility version |
Char |
stopwords | a stopword set |
FrenchAnalyzer(LuceneVersion, CharArraySet, CharArraySet)
Builds an analyzer with the given stop words
Declaration
public FrenchAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords, CharArraySet stemExclutionSet)
Parameters
Type | Name | Description |
---|---|---|
Lucene |
matchVersion | lucene compatibility version |
Char |
stopwords | a stopword set |
Char |
stemExclutionSet | a stemming exclusion set |
Fields
| Improve this Doc View SourceDEFAULT_ARTICLES
Default set of articles for Elision
Declaration
public static readonly CharArraySet DEFAULT_ARTICLES
Field Value
Type | Description |
---|---|
Char |
DEFAULT_STOPWORD_FILE
File containing default French stopwords.
Declaration
public const string DEFAULT_STOPWORD_FILE = null
Field Value
Type | Description |
---|---|
System. |
Properties
| Improve this Doc View SourceDefaultStopSet
Returns an unmodifiable instance of the default stop-words set.
Declaration
public static CharArraySet DefaultStopSet { get; }
Property Value
Type | Description |
---|---|
Char |
an unmodifiable instance of the default stop-words set. |
Methods
| Improve this Doc View SourceCreateComponents(String, TextReader)
Creates
Token
Declaration
protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)
Parameters
Type | Name | Description |
---|---|---|
System. |
fieldName | |
Text |
reader |
Returns
Type | Description |
---|---|
Token |
Token |