Show / Hide Table of Contents

    Namespace Lucene.Net.Analysis.Wikipedia

    Tokenizer that is aware of Wikipedia syntax.

    Classes

    WikipediaTokenizer

    Extension of StandardTokenizer that is aware of Wikipedia syntax. It is based off of the Wikipedia tutorial available at http://en.wikipedia.org/wiki/Wikipedia:Tutorial, but it may not be complete.

    This is a Lucene.NET EXPERIMENTAL API, use at your own risk

    WikipediaTokenizerFactory

    Factory for WikipediaTokenizer.

    <fieldType name="text_wiki" class="solr.TextField" positionIncrementGap="100">
      <analyzer>
        <tokenizer class="solr.WikipediaTokenizerFactory"/>
      </analyzer>
    </fieldType>
    • Improve this Doc
    Back to top Copyright © 2020 Licensed to the Apache Software Foundation (ASF)