Class EnwikiContentSource
A ContentSource which reads the English Wikipedia dump. You can read
the .bz2
file directly (it will be decompressed on the fly). Config
properties:
- keep.image.only.docsfalse|true (default true).
- docs.file<path to the file>
Implements
Inherited Members
Namespace: Lucene.Net.Benchmarks.ByTask.Feeds
Assembly: Lucene.Net.Benchmark.dll
Syntax
public class EnwikiContentSource : ContentSource, IDisposable
Constructors
EnwikiContentSource()
A ContentSource which reads the English Wikipedia dump. You can read
the .bz2
file directly (it will be decompressed on the fly). Config
properties:
- keep.image.only.docsfalse|true (default true).
- docs.file<path to the file>
Declaration
public EnwikiContentSource()
Methods
Dispose(bool)
Called when reading from this content source is no longer required.
Declaration
protected override void Dispose(bool disposing)
Parameters
Type | Name | Description |
---|---|---|
bool | disposing |
Overrides
GetNextDocData(DocData)
Returns the next DocData from the content source. Implementations must account for multi-threading, as multiple threads can call this method simultaneously.
Declaration
public override DocData GetNextDocData(DocData docData)
Parameters
Type | Name | Description |
---|---|---|
DocData | docData |
Returns
Type | Description |
---|---|
DocData |
Overrides
OpenInputStream()
Open the input stream.
Declaration
protected virtual Stream OpenInputStream()
Returns
Type | Description |
---|---|
Stream |
ResetInputs()
Resets the input for this content source, so that the test would behave as if it was just started, input-wise.
NOTE: the default implementation resets the number of bytes and items generated since the last reset, so it's important to callbase.ResetInputs()
in case you override this method.
Declaration
public override void ResetInputs()
Overrides
SetConfig(Config)
Sets the Config for this content source. If you override this
method, you must call base.SetConfig(config)
.
Declaration
public override void SetConfig(Config config)
Parameters
Type | Name | Description |
---|---|---|
Config | config |