Class StopFilter
Removes stop words from a token stream.
You must specify the required Lucene.Net.Util.LuceneVersion compatibility when creating StopFilter:
- As of 3.1, StopFilter correctly handles Unicode 4.0 supplementary characters in stopwords and position increments are preserved
Inheritance
Implements
Inherited Members
Namespace: Lucene.Net.Analysis.Core
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public sealed class StopFilter : FilteringTokenFilter, IDisposable
Constructors
| Improve this Doc View SourceStopFilter(LuceneVersion, TokenStream, CharArraySet)
Constructs a filter which removes words from the input Lucene.Net.Analysis.TokenStream that are named in the CharArraySet.
Declaration
public StopFilter(LuceneVersion matchVersion, TokenStream in, CharArraySet stopWords)
Parameters
Type | Name | Description |
---|---|---|
Lucene.Net.Util.LuceneVersion | matchVersion | Lucene version to enable correct Unicode 4.0 behavior in the stop set if Version > 3.0. See Lucene.Net.Util.LuceneVersion> for details. |
Lucene.Net.Analysis.TokenStream | in | Input Lucene.Net.Analysis.TokenStream |
CharArraySet | stopWords | A CharArraySet representing the stopwords. |
See Also
Methods
| Improve this Doc View SourceAccept()
Returns the next input Token whose Term is not a stop word.
Declaration
protected override bool Accept()
Returns
Type | Description |
---|---|
System.Boolean |
Overrides
| Improve this Doc View SourceMakeStopSet(LuceneVersion, String[])
Builds a CharArraySet from an array of stop words,
appropriate for passing into the StopFilter constructor.
This permits this stopWords
construction to be cached once when
an Lucene.Net.Analysis.Analyzer is constructed.
Declaration
public static CharArraySet MakeStopSet(LuceneVersion matchVersion, params string[] stopWords)
Parameters
Type | Name | Description |
---|---|---|
Lucene.Net.Util.LuceneVersion | matchVersion | Lucene.Net.Util.LuceneVersion to enable correct Unicode 4.0 behavior in the returned set if Version > 3.0 |
System.String[] | stopWords | An array of stopwords |
Returns
Type | Description |
---|---|
CharArraySet |
See Also
| Improve this Doc View SourceMakeStopSet(LuceneVersion, String[], Boolean)
Creates a stopword set from the given stopword array.
Declaration
public static CharArraySet MakeStopSet(LuceneVersion matchVersion, string[] stopWords, bool ignoreCase)
Parameters
Type | Name | Description |
---|---|---|
Lucene.Net.Util.LuceneVersion | matchVersion | Lucene.Net.Util.LuceneVersion to enable correct Unicode 4.0 behavior in the returned set if Version > 3.0 |
System.String[] | stopWords | An array of stopwords |
System.Boolean | ignoreCase | If true, all words are lower cased first. |
Returns
Type | Description |
---|---|
CharArraySet | a Set (CharArraySet) containing the words |
MakeStopSet<T1>(LuceneVersion, IList<T1>)
Builds a CharArraySet from an array of stop words,
appropriate for passing into the StopFilter constructor.
This permits this stopWords
construction to be cached once when
an Lucene.Net.Analysis.Analyzer is constructed.
Declaration
public static CharArraySet MakeStopSet<T1>(LuceneVersion matchVersion, IList<T1> stopWords)
Parameters
Type | Name | Description |
---|---|---|
Lucene.Net.Util.LuceneVersion | matchVersion | Lucene.Net.Util.LuceneVersion to enable correct Unicode 4.0 behavior in the returned set if Version > 3.0 |
System.Collections.Generic.IList<T1> | stopWords | A List of System.Strings or char[] or any other ToString()-able list representing the stopwords |
Returns
Type | Description |
---|---|
CharArraySet | A Set (CharArraySet) containing the words |
Type Parameters
Name | Description |
---|---|
T1 |
See Also
| Improve this Doc View SourceMakeStopSet<T1>(LuceneVersion, IList<T1>, Boolean)
Creates a stopword set from the given stopword list.
Declaration
public static CharArraySet MakeStopSet<T1>(LuceneVersion matchVersion, IList<T1> stopWords, bool ignoreCase)
Parameters
Type | Name | Description |
---|---|---|
Lucene.Net.Util.LuceneVersion | matchVersion | Lucene.Net.Util.LuceneVersion to enable correct Unicode 4.0 behavior in the returned set if Version > 3.0 |
System.Collections.Generic.IList<T1> | stopWords | A List of System.Strings or char[] or any other ToString()-able list representing the stopwords |
System.Boolean | ignoreCase | if true, all words are lower cased first |
Returns
Type | Description |
---|---|
CharArraySet | A Set (CharArraySet) containing the words |
Type Parameters
Name | Description |
---|---|
T1 |