Class EnwikiContentSource
A ContentSource that reads the English Wikipedia dump. You can read the .bz2 file directly (it will be decompressed on the fly).
Config properties:
- keep.image.only.docs=false|true (default true).
- docs.file=<path to the file>
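As a rough illustration, the two properties above might be set as follows in the properties portion of a benchmark algorithm file. This is only a sketch: the path is a hypothetical placeholder, and the rest of the algorithm file (including how this content source is selected) is omitted.

```
# Hypothetical placeholder path; point this at your own copy of the dump.
# A .bz2 file can be given directly and is decompressed on the fly.
docs.file=/path/to/enwiki-pages-articles.xml.bz2

# Override the default (true) for keep.image.only.docs.
keep.image.only.docs=false
```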
Inheritance
System.Object
ContentItemsSource
ContentSource
EnwikiContentSource
Assembly: Lucene.Net.Benchmark.dll
Syntax
public class EnwikiContentSource : ContentSource
Constructors
EnwikiContentSource()
Declaration
public EnwikiContentSource()
Methods
Dispose(Boolean)
Declaration
protected override void Dispose(bool disposing)
Parameters
Type | Name | Description
System.Boolean | disposing |
Overrides
GetNextDocData(DocData)
Declaration
public override DocData GetNextDocData(DocData docData)
Parameters
Type | Name | Description
DocData | docData |
Returns
DocData
Overrides
OpenInputStream()
Declaration
protected virtual Stream OpenInputStream()
Returns
System.IO.Stream
ResetInputs()
Declaration
public override void ResetInputs()
Overrides
SetConfig(Config)
Declaration
public override void SetConfig(Config config)
Parameters
Type | Name | Description
Config | config |
Overrides