Show / Hide Table of Contents

    Class TokenizerFactory

    Abstract parent class for analysis factories that create Tokenizer instances.

    Inheritance
    System.Object
    AbstractAnalysisFactory
    TokenizerFactory
    ArabicLetterTokenizerFactory
    CJKTokenizerFactory
    ChineseTokenizerFactory
    KeywordTokenizerFactory
    LetterTokenizerFactory
    LowerCaseTokenizerFactory
    WhitespaceTokenizerFactory
    EdgeNGramTokenizerFactory
    NGramTokenizerFactory
    PathHierarchyTokenizerFactory
    PatternTokenizerFactory
    RussianLetterTokenizerFactory
    ClassicTokenizerFactory
    StandardTokenizerFactory
    UAX29URLEmailTokenizerFactory
    WikipediaTokenizerFactory
    Inherited Members
    AbstractAnalysisFactory.LUCENE_MATCH_VERSION_PARAM
    AbstractAnalysisFactory.m_luceneMatchVersion
    AbstractAnalysisFactory.OriginalArgs
    AbstractAnalysisFactory.AssureMatchVersion()
    AbstractAnalysisFactory.LuceneMatchVersion
    AbstractAnalysisFactory.Require(IDictionary<String, String>, String)
    AbstractAnalysisFactory.Require(IDictionary<String, String>, String, ICollection<String>)
    AbstractAnalysisFactory.Require(IDictionary<String, String>, String, ICollection<String>, Boolean)
    AbstractAnalysisFactory.Get(IDictionary<String, String>, String, String)
    AbstractAnalysisFactory.Get(IDictionary<String, String>, String, ICollection<String>)
    AbstractAnalysisFactory.Get(IDictionary<String, String>, String, ICollection<String>, String)
    AbstractAnalysisFactory.Get(IDictionary<String, String>, String, ICollection<String>, String, Boolean)
    AbstractAnalysisFactory.RequireInt32(IDictionary<String, String>, String)
    AbstractAnalysisFactory.GetInt32(IDictionary<String, String>, String, Int32)
    AbstractAnalysisFactory.RequireBoolean(IDictionary<String, String>, String)
    AbstractAnalysisFactory.GetBoolean(IDictionary<String, String>, String, Boolean)
    AbstractAnalysisFactory.RequireSingle(IDictionary<String, String>, String)
    AbstractAnalysisFactory.GetSingle(IDictionary<String, String>, String, Single)
    AbstractAnalysisFactory.RequireChar(IDictionary<String, String>, String)
    AbstractAnalysisFactory.GetChar(IDictionary<String, String>, String, Char)
    AbstractAnalysisFactory.GetSet(IDictionary<String, String>, String)
    AbstractAnalysisFactory.GetPattern(IDictionary<String, String>, String)
    AbstractAnalysisFactory.GetCulture(IDictionary<String, String>, String, CultureInfo)
    AbstractAnalysisFactory.GetWordSet(IResourceLoader, String, Boolean)
    AbstractAnalysisFactory.GetLines(IResourceLoader, String)
    AbstractAnalysisFactory.GetSnowballWordSet(IResourceLoader, String, Boolean)
    AbstractAnalysisFactory.SplitFileNames(String)
    AbstractAnalysisFactory.GetClassArg()
    AbstractAnalysisFactory.IsExplicitLuceneMatchVersion
    Namespace: Lucene.Net.Analysis.Util
    Assembly: Lucene.Net.Analysis.Common.dll
    Syntax
    public abstract class TokenizerFactory : AbstractAnalysisFactory

    Constructors

    | Improve this Doc View Source

    TokenizerFactory(IDictionary<String, String>)

    Initialize this factory via a set of key-value pairs.

    Declaration
    protected TokenizerFactory(IDictionary<string, string> args)
    Parameters
    Type Name Description
    IDictionary<System.String, System.String> args

    Properties

    | Improve this Doc View Source

    AvailableTokenizers

    returns a list of all available tokenizer names from the host project's referenced assemblies

    Declaration
    public static ICollection<string> AvailableTokenizers { get; }
    Property Value
    Type Description
    ICollection<System.String>

    Methods

    | Improve this Doc View Source

    Create(AttributeSource.AttributeFactory, TextReader)

    Creates a TokenStream of the specified input using the given AttributeSource.AttributeFactory

    Declaration
    public abstract Tokenizer Create(AttributeSource.AttributeFactory factory, TextReader input)
    Parameters
    Type Name Description
    AttributeSource.AttributeFactory factory
    TextReader input
    Returns
    Type Description
    Tokenizer
    | Improve this Doc View Source

    Create(TextReader)

    Creates a TokenStream of the specified input using the default attribute factory.

    Declaration
    public Tokenizer Create(TextReader input)
    Parameters
    Type Name Description
    TextReader input
    Returns
    Type Description
    Tokenizer
    | Improve this Doc View Source

    ForName(String, IDictionary<String, String>)

    looks up a tokenizer by name from the host project's referenced assemblies

    Declaration
    public static TokenizerFactory ForName(string name, IDictionary<string, string> args)
    Parameters
    Type Name Description
    System.String name
    IDictionary<System.String, System.String> args
    Returns
    Type Description
    TokenizerFactory
    | Improve this Doc View Source

    LookupClass(String)

    looks up a tokenizer class by name from the host project's referenced assemblies

    Declaration
    public static Type LookupClass(string name)
    Parameters
    Type Name Description
    System.String name
    Returns
    Type Description
    Type
    | Improve this Doc View Source

    ReloadTokenizers()

    Reloads the factory list. Changes to the factories are visible after the method ends, all iterators (AvailableTokenizers,...) stay consistent.

    NOTE: Only new factories are added, existing ones are never removed or replaced.

    This method is expensive and should only be called for discovery of new factories on the given classpath/classloader!

    Declaration
    public static void ReloadTokenizers()
    • Improve this Doc
    • View Source
    Back to top Copyright © 2020 Licensed to the Apache Software Foundation (ASF)