Class HindiAnalyzer

Analyzer for Hindi.

You must specify the required Lucene.Net.Util.LuceneVersion compatibility when creating a HindiAnalyzer:

As of 3.6, StandardTokenizer is used for tokenization.

Inheritance

System.Object

Lucene.Net.Analysis.Analyzer

StopwordAnalyzerBase

HindiAnalyzer

Implements

System.IDisposable

Inherited Members

StopwordAnalyzerBase.m_stopwords

StopwordAnalyzerBase.m_matchVersion

StopwordAnalyzerBase.StopwordSet

StopwordAnalyzerBase.LoadStopwordSet(Boolean, Type, String, String)

StopwordAnalyzerBase.LoadStopwordSet(FileInfo, LuceneVersion)

StopwordAnalyzerBase.LoadStopwordSet(TextReader, LuceneVersion)

Analyzer.NewAnonymous(Func<String, TextReader, TokenStreamComponents>)

Analyzer.NewAnonymous(Func<String, TextReader, TokenStreamComponents>, ReuseStrategy)

Analyzer.NewAnonymous(Func<String, TextReader, TokenStreamComponents>, Func<String, TextReader, TextReader>)

Analyzer.NewAnonymous(Func<String, TextReader, TokenStreamComponents>, Func<String, TextReader, TextReader>, ReuseStrategy)

Analyzer.GetTokenStream(String, TextReader)

Analyzer.GetTokenStream(String, String)

Analyzer.InitReader(String, TextReader)

Analyzer.GetPositionIncrementGap(String)

Analyzer.GetOffsetGap(String)

Lucene.Net.Analysis.Analyzer.Strategy

Lucene.Net.Analysis.Analyzer.Dispose()

Analyzer.Dispose(Boolean)

Lucene.Net.Analysis.Analyzer.GLOBAL_REUSE_STRATEGY

Lucene.Net.Analysis.Analyzer.PER_FIELD_REUSE_STRATEGY

System.Object.Equals(System.Object)

System.Object.Equals(System.Object, System.Object)

System.Object.GetHashCode()

System.Object.GetType()

System.Object.MemberwiseClone()

System.Object.ReferenceEquals(System.Object, System.Object)

System.Object.ToString()

Namespace: Lucene.Net.Analysis.Hi

Assembly: Lucene.Net.Analysis.Common.dll

Syntax

public sealed class HindiAnalyzer : StopwordAnalyzerBase, IDisposable

Constructors

HindiAnalyzer(LuceneVersion)

Builds an analyzer with the default stop words: DEFAULT_STOPWORD_FILE.

Declaration

public HindiAnalyzer(LuceneVersion version)

Parameters

Type	Name	Description
Lucene.Net.Util.LuceneVersion	version	Lucene compatibility version

HindiAnalyzer(LuceneVersion, CharArraySet)

Builds an analyzer with the given stop words.

Declaration

public HindiAnalyzer(LuceneVersion version, CharArraySet stopwords)

Parameters

Type	Name	Description
Lucene.Net.Util.LuceneVersion	version	Lucene compatibility version
CharArraySet	stopwords	a stopword set

HindiAnalyzer(LuceneVersion, CharArraySet, CharArraySet)

Builds an analyzer with the given stop words and a stemming exclusion set. Terms in the exclusion set are not stemmed.

Declaration

public HindiAnalyzer(LuceneVersion version, CharArraySet stopwords, CharArraySet stemExclusionSet)

Parameters

Type	Name	Description
Lucene.Net.Util.LuceneVersion	version	Lucene compatibility version
CharArraySet	stopwords	a stopword set
CharArraySet	stemExclusionSet	a stemming exclusion set
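
The three constructors above can be exercised as in the following minimal sketch. It assumes Lucene.NET 4.8 (hence LuceneVersion.LUCENE_48); the class name HindiAnalyzerConstruction and the sample Hindi words are illustrative only and do not come from the library.

```csharp
using Lucene.Net.Analysis.Hi;
using Lucene.Net.Analysis.Util;
using Lucene.Net.Util;

// Hypothetical demo class, not part of Lucene.NET.
public static class HindiAnalyzerConstruction
{
    public static void Main()
    {
        // Assumption: the application targets Lucene.NET 4.8 behavior.
        var version = LuceneVersion.LUCENE_48;

        // Custom stop word set; the words are sample values only.
        var stopwords = new CharArraySet(version, 4, false);
        stopwords.Add("और");
        stopwords.Add("का");

        // Terms that should bypass the stemmer; sample value only.
        var stemExclusions = new CharArraySet(version, 4, false);
        stemExclusions.Add("दिल्ली");

        // 1. Default stop words.
        using (var defaultAnalyzer = new HindiAnalyzer(version))
        // 2. Caller-supplied stop words.
        using (var customStops = new HindiAnalyzer(version, stopwords))
        // 3. Caller-supplied stop words plus a stemming exclusion set.
        using (var withExclusions = new HindiAnalyzer(version, stopwords, stemExclusions))
        {
            // The analyzers are now ready to be passed to an IndexWriterConfig
            // or a query parser; they are disposed when the block exits.
        }
    }
}
```

Because HindiAnalyzer implements System.IDisposable, wrapping each instance in a using statement releases its resources when it is no longer needed.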

Fields

DEFAULT_STOPWORD_FILE

File containing default Hindi stopwords.

The default stopword list is from http://members.unine.ch/jacques.savoy/clef/index.html. The stopword list is BSD-licensed.

Declaration

public const string DEFAULT_STOPWORD_FILE = "stopwords.txt"

Field Value

Type	Description
System.String

Properties

DefaultStopSet

Returns an unmodifiable instance of the default stop-words set.

Declaration

public static CharArraySet DefaultStopSet { get; }

Property Value

Type	Description
CharArraySet	an unmodifiable instance of the default stop-words set.
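
Since the returned set is unmodifiable, adding words to it directly is not an option. One way to extend the defaults, sketched below, is to copy the entries into a new CharArraySet and pass that set to the constructor. The class name ExtendedHindiStopwords and the added word are hypothetical, and LuceneVersion.LUCENE_48 is an assumption about the target version.

```csharp
using Lucene.Net.Analysis.Hi;
using Lucene.Net.Analysis.Util;
using Lucene.Net.Util;

// Hypothetical demo class, not part of Lucene.NET.
public static class ExtendedHindiStopwords
{
    public static HindiAnalyzer Create()
    {
        // Assumption: the application targets Lucene.NET 4.8 behavior.
        var version = LuceneVersion.LUCENE_48;

        // Start from an empty, modifiable set sized for the defaults.
        var extended = new CharArraySet(
            version, HindiAnalyzer.DefaultStopSet.Count + 1, false);

        // Copy every default stop word into the modifiable set.
        foreach (string word in HindiAnalyzer.DefaultStopSet)
        {
            extended.Add(word);
        }

        // Add an application-specific stop word (sample value only).
        extended.Add("श्री");

        return new HindiAnalyzer(version, extended);
    }
}
```

The caller owns the returned analyzer and should dispose it when finished.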

Methods

CreateComponents(String, TextReader)

Creates Lucene.Net.Analysis.TokenStreamComponents used to tokenize all the text in the provided System.IO.TextReader.

Declaration

protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)

Parameters

Type	Name	Description
System.String	fieldName
System.IO.TextReader	reader

Returns

Type	Description
Lucene.Net.Analysis.TokenStreamComponents	A Lucene.Net.Analysis.TokenStreamComponents built from a StandardTokenizer filtered with LowerCaseFilter, IndicNormalizationFilter, HindiNormalizationFilter, SetKeywordMarkerFilter (only if a stem exclusion set is provided), HindiStemFilter, and a Hindi stop word filter.

Overrides

Analyzer.CreateComponents(String, TextReader)
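
CreateComponents is protected, so callers normally reach it indirectly through the inherited Analyzer.GetTokenStream(String, String). The following sketch shows the usual consume loop for inspecting the terms the analyzer produces. The field name "body", the sample sentence, and the class name HindiTokenDump are assumptions made for illustration; LuceneVersion.LUCENE_48 is assumed as the target version.

```csharp
using System;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.Hi;
using Lucene.Net.Analysis.TokenAttributes;
using Lucene.Net.Util;

// Hypothetical demo class, not part of Lucene.NET.
public static class HindiTokenDump
{
    public static void Main()
    {
        using (var analyzer = new HindiAnalyzer(LuceneVersion.LUCENE_48))
        // "body" is an arbitrary field name; the text is a sample sentence.
        using (TokenStream stream = analyzer.GetTokenStream("body", "यह एक परीक्षण वाक्य है"))
        {
            // The term attribute exposes the text of the current token.
            ICharTermAttribute term = stream.AddAttribute<ICharTermAttribute>();

            stream.Reset();                    // required before the first IncrementToken()
            while (stream.IncrementToken())    // advances to the next token, false at end
            {
                Console.WriteLine(term.ToString());
            }
            stream.End();                      // finalizes end-of-stream state
        }
    }
}
```

The printed terms reflect the full filter chain (lowercasing, normalization, stop word removal, stemming), so they may differ from the surface forms in the input.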