Lucene.Net
3.0.3
Lucene.Net is a .NET port of the Java Lucene Indexing Library
|
CharStream adds CorrectOffset functionality over System.IO.TextReader. All Tokenizers accept a CharStream instead of System.IO.TextReader as input, which enables arbitrary character based filtering before tokenization. The CorrectOffset method fixed offsets to account for removal or insertion of characters, so that the offsets reported in the tokens match the character offsets of the original Reader. More...
Inherits StreamReader.
Inherited by Lucene.Net.Analysis.CharFilter, and Lucene.Net.Analysis.CharReader.
Public Member Functions | |
abstract int | CorrectOffset (int currentOff) |
Called by CharFilter(s) and Tokenizer to correct token offset. | |
Protected Member Functions | |
CharStream (System.IO.StreamReader reader) | |
CharStream adds CorrectOffset functionality over System.IO.TextReader. All Tokenizers accept a CharStream instead of System.IO.TextReader as input, which enables arbitrary character based filtering before tokenization. The CorrectOffset method fixed offsets to account for removal or insertion of characters, so that the offsets reported in the tokens match the character offsets of the original Reader.
Definition at line 30 of file CharStream.cs.
|
protected |
Definition at line 32 of file CharStream.cs.
|
pure virtual |
Called by CharFilter(s) and Tokenizer to correct token offset.
currentOff | offset as seen in the output |
Implemented in Lucene.Net.Analysis.CharFilter, and Lucene.Net.Analysis.CharReader.