Fork me on GitHub
  • API

    Show / Hide Table of Contents

    Namespace Lucene.Net.Analysis.Wikipedia

    Tokenizer that is aware of Wikipedia syntax.

    Classes

    WikipediaTokenizer

    Extension of StandardTokenizer that is aware of Wikipedia syntax. It is based off of the Wikipedia tutorial available at http://en.wikipedia.org/wiki/Wikipedia:Tutorial, but it may not be complete.

    This is a Lucene.NET EXPERIMENTAL API, use at your own risk

    WikipediaTokenizerFactory

    Factory for WikipediaTokenizer.

    <fieldType name="text_wiki" class="solr.TextField" positionIncrementGap="100">
      <analyzer>
        <tokenizer class="solr.WikipediaTokenizerFactory"/>
      </analyzer>
    </fieldType>
    • Improve this Doc
    Back to top Copyright © 2020 The Apache Software Foundation, Licensed under the Apache License, Version 2.0
    Apache Lucene.Net, Lucene.Net, Apache, the Apache feather logo, and the Apache Lucene.Net project logo are trademarks of The Apache Software Foundation.
    All other marks mentioned may be trademarks or registered trademarks of their respective owners.