Namespace Lucene.Net.Analysis.CharFilters
Classes
BaseCharFilter
Base utility class for implementing a CharFilter. You subclass this, and then record mappings by calling AddOffCorrectMap(Int32, Int32), and then invoke the correct method to correct an offset.
HTMLStripCharFilter
A CharFilter that wraps another System.IO.TextReader and attempts to strip out HTML constructs.
HTMLStripCharFilterFactory
Factory for HTMLStripCharFilter.
<fieldType name="text_html" class="solr.TextField" positionIncrementGap="100">
<analyzer>
<charFilter class="solr.HTMLStripCharFilterFactory" escapedTags="a, title" />
<tokenizer class="solr.WhitespaceTokenizerFactory"/>
</analyzer>
</fieldType>
MappingCharFilter
Simplistic CharFilter that applies the mappings contained in a NormalizeCharMap to the character stream, and correcting the resulting changes to the offsets. Matching is greedy (longest pattern matching at a given point wins). Replacement is allowed to be the empty string.
MappingCharFilterFactory
Factory for MappingCharFilter.
<fieldType name="text_map" class="solr.TextField" positionIncrementGap="100">
<analyzer>
<charFilter class="solr.MappingCharFilterFactory" mapping="mapping.txt"/>
<tokenizer class="solr.WhitespaceTokenizerFactory"/>
</analyzer>
</fieldType>
@since Solr 1.4
NormalizeCharMap
Holds a map of System.String input to System.String output, to be used with NormalizeCharMap.Builder. Use the MappingCharFilter to create this.
NormalizeCharMap.Builder
Builds an NormalizeCharMap.
Call add() until you have added all the mappings, then call build() to get a NormalizeCharMap @lucene.experimental