Show / Hide Table of Contents

    Namespace Lucene.Net.Analysis.CharFilters

    Classes

    BaseCharFilter

    Base utility class for implementing a CharFilter. You subclass this, and then record mappings by calling AddOffCorrectMap(Int32, Int32), and then invoke the correct method to correct an offset.

    HTMLStripCharFilter

    A CharFilter that wraps another System.IO.TextReader and attempts to strip out HTML constructs.

    HTMLStripCharFilterFactory

    Factory for HTMLStripCharFilter.

    <fieldType name="text_html" class="solr.TextField" positionIncrementGap="100">
      <analyzer>
        <charFilter class="solr.HTMLStripCharFilterFactory" escapedTags="a, title" />
        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
      </analyzer>
    </fieldType>

    MappingCharFilter

    Simplistic CharFilter that applies the mappings contained in a NormalizeCharMap to the character stream, and correcting the resulting changes to the offsets. Matching is greedy (longest pattern matching at a given point wins). Replacement is allowed to be the empty string.

    MappingCharFilterFactory

    Factory for MappingCharFilter.

    <fieldType name="text_map" class="solr.TextField" positionIncrementGap="100">
      <analyzer>
        <charFilter class="solr.MappingCharFilterFactory" mapping="mapping.txt"/>
        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
      </analyzer>
    </fieldType>

    @since Solr 1.4

    NormalizeCharMap

    Holds a map of System.String input to System.String output, to be used with NormalizeCharMap.Builder. Use the MappingCharFilter to create this.

    NormalizeCharMap.Builder

    Builds an NormalizeCharMap.

    Call add() until you have added all the mappings, then call build() to get a NormalizeCharMap @lucene.experimental

    Back to top Copyright © 2019 Licensed to the Apache Software Foundation (ASF)