Show / Hide Table of Contents

    Namespace Lucene.Net.Analysis.Lv

    Analyzer for Latvian.

    Classes

    LatvianAnalyzer

    Analyzer for Latvian.

    LatvianStemFilter

    A TokenFilter that applies LatvianStemmer to stem Latvian words.

    To prevent terms from being stemmed use an instance of SetKeywordMarkerFilter or a custom TokenFilter that sets the KeywordAttribute before this TokenStream.

    LatvianStemFilterFactory

    Factory for LatvianStemFilter.

    <fieldType name="text_lvstem" class="solr.TextField" positionIncrementGap="100">
      <analyzer>
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.LowerCaseFilterFactory"/>
        <filter class="solr.LatvianStemFilterFactory"/>
      </analyzer>
    </fieldType>

    LatvianStemmer

    Light stemmer for Latvian.

    This is a light version of the algorithm in Karlis Kreslin's PhD thesis A stemming algorithm for Latvian with the following modifications:

    • Only explicitly stems noun and adjective morphology
    • Stricter length/vowel checks for the resulting stems (verb etc suffix stripping is removed)
    • Removes only the primary inflectional suffixes: case and number for nouns ; case, number, gender, and definitiveness for adjectives.
    • Palatalization is only handled when a declension II,V,VI noun suffix is removed.

    • Improve this Doc
    Back to top Copyright © 2020 Licensed to the Apache Software Foundation (ASF)