Namespace Lucene.Net.Codecs.Compressing
StoredFieldsFormat that allows cross-document and cross-field compression of stored fields.
Classes
CompressingStoredFieldsFormat
A StoredFieldsFormat that is very similar to Lucene40StoredFieldsFormat but compresses documents in chunks in order to improve the compression ratio.
For a chunk size of chunkSize
bytes, this StoredFieldsFormat
does not support documents larger than (231 - chunkSize
)
bytes. In case this is a problem, you should use another format, such as
Lucene40StoredFieldsFormat.
For optimal performance, you should use a MergePolicy that returns segments that have the biggest byte size first.
CompressingStoredFieldsIndexReader
Random-access reader for CompressingStoredFieldsIndexWriter.
CompressingStoredFieldsIndexWriter
Efficient index format for block-based Codecs.
this writer generates a file which can be loaded into memory using memory-efficient data structures to quickly locate the block that contains any document.
In order to have a compact in-memory representation, for every block of 1024 chunks, this index computes the average number of bytes per chunk and for every chunk, only stores the difference between
Data is written as follows:
Notes
CompressingStoredFieldsReader
StoredFieldsReader impl for CompressingStoredFieldsFormat.
CompressingStoredFieldsWriter
StoredFieldsWriter impl for CompressingStoredFieldsFormat.
CompressingTermVectorsFormat
A TermVectorsFormat that compresses chunks of documents together in order to improve the compression ratio.
CompressingTermVectorsReader
TermVectorsReader for CompressingTermVectorsFormat.
CompressingTermVectorsWriter
TermVectorsWriter for CompressingTermVectorsFormat.
CompressionMode
A compression mode. Tells how much effort should be spent on compression and decompression of stored fields.
Compressor
A data compressor.
Decompressor
A decompressor.
LZ4
LZ4 compression and decompression routines.
http://code.google.com/p/lz4/ http://fastcompression.blogspot.fr/p/lz4.html