Lucene.Net  3.0.3
Lucene.Net is a port of the Lucene search engine library, written in C# and targeted at .NET runtime users.
 All Classes Namespaces Files Functions Variables Typedefs Enumerations Properties Pages
Protected Member Functions | List of all members
Lucene.Net.Analysis.De.GermanDIN2Stemmer Class Reference

A stemmer for the german language that uses the DIN-5007-2 "Phone Book" rules for handling umlaut characters. More...

Inherits Lucene.Net.Analysis.De.GermanStemmer.

Protected Member Functions

override void Substitute (StringBuilder buffer)
 Do some substitutions for the term to reduce overstemming:
 

Additional Inherited Members

- Protected Attributes inherited from Lucene.Net.Analysis.De.GermanStemmer
int substCount = 0
 Amount of characters that are removed with Substitute() while stemming.
 

Detailed Description

A stemmer for the german language that uses the DIN-5007-2 "Phone Book" rules for handling umlaut characters.

Definition at line 34 of file GermanDIN2Stemmer.cs.

Member Function Documentation

override void Lucene.Net.Analysis.De.GermanDIN2Stemmer.Substitute ( StringBuilder  buffer)
protectedvirtual

Do some substitutions for the term to reduce overstemming:

  • Substitute Umlauts with their corresponding vowel: äöü -> aou, "ß" is substituted by "ss"
  • Substitute a second char of a pair of equal characters with an asterisk: ?? -> ?*
  • Substitute some common character combinations with a token: sch/ch/ei/ie/ig/st -> $/В§/%/&/#/!

Reimplemented from Lucene.Net.Analysis.De.GermanStemmer.

Definition at line 36 of file GermanDIN2Stemmer.cs.


The documentation for this class was generated from the following file: