Class EnwikiContentSource
A ContentSource which reads the English Wikipedia dump. You can read
the .bz2 file directly (it will be decompressed on the fly). Config
properties:
- keep.image.only.docsfalse|true (default true).
- docs.file<path to the file>
Implements
Inherited Members
Namespace: Lucene.Net.Benchmarks.ByTask.Feeds
Assembly: Lucene.Net.Benchmark.dll
Syntax
public class EnwikiContentSource : ContentSource, IDisposable
Constructors
EnwikiContentSource()
A ContentSource which reads the English Wikipedia dump. You can read
the .bz2 file directly (it will be decompressed on the fly). Config
properties:
- keep.image.only.docsfalse|true (default true).
- docs.file<path to the file>
Declaration
public EnwikiContentSource()
Methods
Dispose(bool)
Called when reading from this content source is no longer required.
Declaration
protected override void Dispose(bool disposing)
Parameters
| Type | Name | Description |
|---|---|---|
| bool | disposing |
Overrides
GetNextDocData(DocData)
Returns the next DocData from the content source. Implementations must account for multi-threading, as multiple threads can call this method simultaneously.
Declaration
public override DocData GetNextDocData(DocData docData)
Parameters
| Type | Name | Description |
|---|---|---|
| DocData | docData |
Returns
| Type | Description |
|---|---|
| DocData |
Overrides
OpenInputStream()
Open the input stream.
Declaration
protected virtual Stream OpenInputStream()
Returns
| Type | Description |
|---|---|
| Stream |
ResetInputs()
Resets the input for this content source, so that the test would behave as if it was just started, input-wise.
NOTE: the default implementation resets the number of bytes and items generated since the last reset, so it's important to callbase.ResetInputs() in case you override this method.
Declaration
public override void ResetInputs()
Overrides
SetConfig(Config)
Sets the Config for this content source. If you override this
method, you must call base.SetConfig(config).
Declaration
public override void SetConfig(Config config)
Parameters
| Type | Name | Description |
|---|---|---|
| Config | config |