Class StopAnalyzer
Filters LetterTokenizer with LowerCaseFilter and StopFilter.
You must specify the required LuceneVersion compatibility when creating StopAnalyzer:
- As of 3.1, StopFilter correctly handles Unicode 4.0 supplementary characters in stopwords
- As of 2.9, position increments are preserved
Inherited Members
Lucene.Net.Analysis.Analyzer.NewAnonymous(Func<, , >)
Lucene.Net.Analysis.Analyzer.NewAnonymous(Func<, , >, Lucene.Net.Analysis.ReuseStrategy)
Lucene.Net.Analysis.Analyzer.NewAnonymous(Func<, , >, Func<, , >)
Lucene.Net.Analysis.Analyzer.NewAnonymous(Func<, , >, Func<, , >, Lucene.Net.Analysis.ReuseStrategy)
Namespace: Lucene.Net.Analysis.Core
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public sealed class StopAnalyzer : StopwordAnalyzerBase
Constructors
| Improve this Doc View SourceStopAnalyzer(LuceneVersion)
Builds an analyzer which removes words in ENGLISH_STOP_WORDS_SET.
Declaration
public StopAnalyzer(LuceneVersion matchVersion)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion | See LuceneVersion |
StopAnalyzer(LuceneVersion, FileInfo)
Builds an analyzer with the stop words from the given file.
Declaration
public StopAnalyzer(LuceneVersion matchVersion, FileInfo stopwordsFile)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion | See LuceneVersion |
FileInfo | stopwordsFile | File to load stop words from |
See Also
| Improve this Doc View SourceStopAnalyzer(LuceneVersion, CharArraySet)
Builds an analyzer with the stop words from the given set.
Declaration
public StopAnalyzer(LuceneVersion matchVersion, CharArraySet stopWords)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion | See LuceneVersion |
CharArraySet | stopWords | Set of stop words |
StopAnalyzer(LuceneVersion, TextReader)
Builds an analyzer with the stop words from the given reader.
Declaration
public StopAnalyzer(LuceneVersion matchVersion, TextReader stopwords)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion | See LuceneVersion |
TextReader | stopwords |
See Also
Fields
| Improve this Doc View SourceENGLISH_STOP_WORDS_SET
An unmodifiable set containing some common English words that are not usually useful for searching.
Declaration
public static readonly CharArraySet ENGLISH_STOP_WORDS_SET
Field Value
Type | Description |
---|---|
CharArraySet |
Methods
| Improve this Doc View SourceCreateComponents(String, TextReader)
Creates
TokenStreamComponents
used to tokenize all the text in the provided
Declaration
protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)
Parameters
Type | Name | Description |
---|---|---|
System.String | fieldName | |
TextReader | reader |
Returns
Type | Description |
---|---|
TokenStreamComponents | TokenStreamComponents built from a LowerCaseTokenizer filtered with StopFilter |