Class UAX29URLEmailAnalyzer
Filters UAX29URLEmailTokenizer with StandardFilter, LowerCaseFilter and StopFilter, using a list of English stop words.
You must specify the required LuceneVersion compatibility when creating UAX29URLEmailAnalyzer
Inherited Members
Namespace: Lucene.Net.Analysis.Standard
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public sealed class UAX29URLEmailAnalyzer : StopwordAnalyzerBase
Constructors
| Improve this Doc View SourceUAX29URLEmailAnalyzer(LuceneVersion)
Builds an analyzer with the default stop words (STOP_WORDS_SET.
Declaration
public UAX29URLEmailAnalyzer(LuceneVersion matchVersion)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion | Lucene version to match - See UAX29URLEmailAnalyzer |
UAX29URLEmailAnalyzer(LuceneVersion, CharArraySet)
Builds an analyzer with the given stop words.
Declaration
public UAX29URLEmailAnalyzer(LuceneVersion matchVersion, CharArraySet stopWords)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion | Lucene version to match - See UAX29URLEmailAnalyzer |
CharArraySet | stopWords | stop words |
UAX29URLEmailAnalyzer(LuceneVersion, TextReader)
Builds an analyzer with the stop words from the given reader.
Declaration
public UAX29URLEmailAnalyzer(LuceneVersion matchVersion, TextReader stopwords)
Parameters
Type | Name | Description |
---|---|---|
LuceneVersion | matchVersion | Lucene version to match - See UAX29URLEmailAnalyzer |
TextReader | stopwords |
See Also
Fields
| Improve this Doc View SourceDEFAULT_MAX_TOKEN_LENGTH
Default maximum allowed token length
Declaration
public const int DEFAULT_MAX_TOKEN_LENGTH = null
Field Value
Type | Description |
---|---|
System.Int32 |
STOP_WORDS_SET
An unmodifiable set containing some common English words that are usually not useful for searching.
Declaration
public static readonly CharArraySet STOP_WORDS_SET
Field Value
Type | Description |
---|---|
CharArraySet |
Properties
| Improve this Doc View SourceMaxTokenLength
Set maximum allowed token length. If a token is seen that exceeds this length then it is discarded. This setting only takes effect the next time tokenStream or tokenStream is called.
Declaration
public int MaxTokenLength { get; set; }
Property Value
Type | Description |
---|---|
System.Int32 |
Methods
| Improve this Doc View SourceCreateComponents(String, TextReader)
Declaration
protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)
Parameters
Type | Name | Description |
---|---|---|
System.String | fieldName | |
TextReader | reader |
Returns
Type | Description |
---|---|
TokenStreamComponents |