Fork me on GitHub
  • API

    Show / Hide Table of Contents

    Class UAX29URLEmailAnalyzer

    Filters UAX29URLEmailTokenizer with StandardFilter, LowerCaseFilter and StopFilter, using a list of English stop words.

    You must specify the required Lucene.Net.Util.LuceneVersion compatibility when creating UAX29URLEmailAnalyzer

    Inheritance
    object
    Analyzer
    StopwordAnalyzerBase
    UAX29URLEmailAnalyzer
    Implements
    IDisposable
    Inherited Members
    StopwordAnalyzerBase.StopwordSet
    Analyzer.NewAnonymous(Func<string, TextReader, TokenStreamComponents>)
    Analyzer.NewAnonymous(Func<string, TextReader, TokenStreamComponents>, ReuseStrategy)
    Analyzer.NewAnonymous(Func<string, TextReader, TokenStreamComponents>, Func<string, TextReader, TextReader>)
    Analyzer.NewAnonymous(Func<string, TextReader, TokenStreamComponents>, Func<string, TextReader, TextReader>, ReuseStrategy)
    Analyzer.GetTokenStream(string, TextReader)
    Analyzer.GetTokenStream(string, string)
    Analyzer.GetPositionIncrementGap(string)
    Analyzer.GetOffsetGap(string)
    Analyzer.Strategy
    Analyzer.Dispose()
    Analyzer.GLOBAL_REUSE_STRATEGY
    Analyzer.PER_FIELD_REUSE_STRATEGY
    object.Equals(object)
    object.Equals(object, object)
    object.GetHashCode()
    object.GetType()
    object.ReferenceEquals(object, object)
    object.ToString()
    Namespace: Lucene.Net.Analysis.Standard
    Assembly: Lucene.Net.Analysis.Common.dll
    Syntax
    public sealed class UAX29URLEmailAnalyzer : StopwordAnalyzerBase, IDisposable

    Constructors

    UAX29URLEmailAnalyzer(LuceneVersion)

    Builds an analyzer with the default stop words (STOP_WORDS_SET.

    Declaration
    public UAX29URLEmailAnalyzer(LuceneVersion matchVersion)
    Parameters
    Type Name Description
    LuceneVersion matchVersion

    Lucene version to match - See UAX29URLEmailAnalyzer

    UAX29URLEmailAnalyzer(LuceneVersion, CharArraySet)

    Builds an analyzer with the given stop words.

    Declaration
    public UAX29URLEmailAnalyzer(LuceneVersion matchVersion, CharArraySet stopWords)
    Parameters
    Type Name Description
    LuceneVersion matchVersion

    Lucene version to match - See UAX29URLEmailAnalyzer

    CharArraySet stopWords

    stop words

    UAX29URLEmailAnalyzer(LuceneVersion, TextReader)

    Builds an analyzer with the stop words from the given reader.

    Declaration
    public UAX29URLEmailAnalyzer(LuceneVersion matchVersion, TextReader stopwords)
    Parameters
    Type Name Description
    LuceneVersion matchVersion

    Lucene version to match - See UAX29URLEmailAnalyzer

    TextReader stopwords

    TextReader to read stop words from

    See Also
    GetWordSet(TextReader, LuceneVersion)

    Fields

    DEFAULT_MAX_TOKEN_LENGTH

    Default maximum allowed token length

    Declaration
    public const int DEFAULT_MAX_TOKEN_LENGTH = 255
    Field Value
    Type Description
    int

    STOP_WORDS_SET

    An unmodifiable set containing some common English words that are usually not useful for searching.

    Declaration
    public static readonly CharArraySet STOP_WORDS_SET
    Field Value
    Type Description
    CharArraySet

    Properties

    MaxTokenLength

    Set maximum allowed token length. If a token is seen that exceeds this length then it is discarded. This setting only takes effect the next time tokenStream or tokenStream is called.

    Declaration
    public int MaxTokenLength { get; set; }
    Property Value
    Type Description
    int

    Methods

    CreateComponents(string, TextReader)

    Creates a new Lucene.Net.Analysis.TokenStreamComponents instance for this analyzer.

    Declaration
    protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)
    Parameters
    Type Name Description
    string fieldName

    the name of the fields content passed to the Lucene.Net.Analysis.TokenStreamComponents sink as a reader

    TextReader reader

    the reader passed to the Lucene.Net.Analysis.Tokenizer constructor

    Returns
    Type Description
    TokenStreamComponents

    the Lucene.Net.Analysis.TokenStreamComponents for this analyzer.

    Overrides
    Analyzer.CreateComponents(string, TextReader)

    Implements

    IDisposable
    Back to top Copyright © 2024 The Apache Software Foundation, Licensed under the Apache License, Version 2.0
    Apache Lucene.Net, Lucene.Net, Apache, the Apache feather logo, and the Apache Lucene.Net project logo are trademarks of The Apache Software Foundation.
    All other marks mentioned may be trademarks or registered trademarks of their respective owners.