Class CJKAnalyzer
An Lucene.Net.Analysis.Analyzer that tokenizes text with StandardTokenizer, normalizes content with CJKWidthFilter, folds case with LowerCaseFilter, forms bigrams of CJK with CJKBigramFilter, and filters stopwords with StopFilter
Implements
System.IDisposable
  Inherited Members
      Lucene.Net.Analysis.Analyzer.Strategy
    
    
      Lucene.Net.Analysis.Analyzer.Dispose()
    
    
    
      Lucene.Net.Analysis.Analyzer.GLOBAL_REUSE_STRATEGY
    
    
      Lucene.Net.Analysis.Analyzer.PER_FIELD_REUSE_STRATEGY
    
    
      System.Object.Equals(System.Object)
    
    
      System.Object.Equals(System.Object, System.Object)
    
    
      System.Object.GetHashCode()
    
    
      System.Object.GetType()
    
    
      System.Object.MemberwiseClone()
    
    
      System.Object.ReferenceEquals(System.Object, System.Object)
    
    
      System.Object.ToString()
    
  Namespace: Lucene.Net.Analysis.Cjk
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
public sealed class CJKAnalyzer : StopwordAnalyzerBase, IDisposableConstructors
| Improve this Doc View SourceCJKAnalyzer(LuceneVersion)
Builds an analyzer which removes words in DefaultStopSet.
Declaration
public CJKAnalyzer(LuceneVersion matchVersion)Parameters
| Type | Name | Description | 
|---|---|---|
| Lucene.Net.Util.LuceneVersion | matchVersion | 
CJKAnalyzer(LuceneVersion, CharArraySet)
Builds an analyzer with the given stop words
Declaration
public CJKAnalyzer(LuceneVersion matchVersion, CharArraySet stopwords)Parameters
| Type | Name | Description | 
|---|---|---|
| Lucene.Net.Util.LuceneVersion | matchVersion | lucene compatibility version | 
| CharArraySet | stopwords | a stopword set | 
Fields
| Improve this Doc View SourceDEFAULT_STOPWORD_FILE
File containing default CJK stopwords.
Currently it contains some common English words that are not usually useful for searching and some double-byte interpunctions.
Declaration
public const string DEFAULT_STOPWORD_FILE = "stopwords.txt"Field Value
| Type | Description | 
|---|---|
| System.String | 
Properties
| Improve this Doc View SourceDefaultStopSet
Returns an unmodifiable instance of the default stop-words set.
Declaration
public static CharArraySet DefaultStopSet { get; }Property Value
| Type | Description | 
|---|---|
| CharArraySet | an unmodifiable instance of the default stop-words set. | 
Methods
| Improve this Doc View SourceCreateComponents(String, TextReader)
Declaration
protected override TokenStreamComponents CreateComponents(string fieldName, TextReader reader)Parameters
| Type | Name | Description | 
|---|---|---|
| System.String | fieldName | |
| System.IO.TextReader | reader | 
Returns
| Type | Description | 
|---|---|
| Lucene.Net.Analysis.TokenStreamComponents | 
Overrides
Implements
      System.IDisposable