    Namespace Lucene.Net.Analysis.Lv

    Analyzer for Latvian.

    Classes

    LatvianAnalyzer

    Lucene.Net.Analysis.Analyzer for Latvian.

    LatvianStemFilter

    A Lucene.Net.Analysis.TokenFilter that applies LatvianStemmer to stem Latvian words.

    To prevent terms from being stemmed, place an instance of SetKeywordMarkerFilter, or a custom Lucene.Net.Analysis.TokenFilter that sets the KeywordAttribute, before this Lucene.Net.Analysis.TokenStream.

    LatvianStemFilterFactory

    Factory for LatvianStemFilter.

    <fieldType name="text_lvstem" class="solr.TextField" positionIncrementGap="100">
      <analyzer>
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.LowerCaseFilterFactory"/>
        <filter class="solr.LatvianStemFilterFactory"/>
      </analyzer>
    </fieldType>
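
    To keep selected terms unstemmed in a configuration like the one above, a keyword-marking filter can be placed ahead of the stem filter. The following is a sketch only: the field type name text_lvstem_protected and the file name protwords.txt are assumptions made for illustration, while solr.KeywordMarkerFilterFactory is the stock factory that marks the listed terms as keywords.

    ```xml
    <fieldType name="text_lvstem_protected" class="solr.TextField" positionIncrementGap="100">
      <analyzer>
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.LowerCaseFilterFactory"/>
        <!-- Assumed file: protwords.txt lists, one per line, the terms that must not be stemmed -->
        <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
        <filter class="solr.LatvianStemFilterFactory"/>
      </analyzer>
    </fieldType>
    ```

    The marker filter sits after lowercasing, so the protected list is matched against lowercased tokens, and before the stem filter, so the keyword flag is already set when stemming runs.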

    LatvianStemmer

    Light stemmer for Latvian.

    This is a light version of the algorithm in Karlis Kreslin's PhD thesis "A stemming algorithm for Latvian", with the following modifications:

    • Only noun and adjective morphology is explicitly stemmed.
    • Length and vowel checks on the resulting stems are stricter (suffix stripping for verbs and other parts of speech is removed).
    • Only the primary inflectional suffixes are removed: case and number for nouns; case, number, gender, and definiteness for adjectives.
    • Palatalization is handled only when a declension II, V, or VI noun suffix is removed.

    Copyright © 2020 The Apache Software Foundation, Licensed under the Apache License, Version 2.0
    Apache Lucene.Net, Lucene.Net, Apache, the Apache feather logo, and the Apache Lucene.Net project logo are trademarks of The Apache Software Foundation.
    All other marks mentioned may be trademarks or registered trademarks of their respective owners.