Class CommonTermsQuery

A query that executes high-frequency terms in a optional sub-query to prevent slow queries due to "common" terms like stopwords. This query builds 2 queries off the Add(Term) added terms: low-frequency terms are added to a required boolean clause and high-frequency terms are added to an optional boolean clause. The optional clause is only executed if the required "low-frequency" clause matches. Scores produced by this query will be slightly different than plain Lucene.Net.Search.BooleanQuery scorer mainly due to differences in the Coord(int, int) number of leaf queries in the required boolean clause. In most cases, high-frequency terms are unlikely to significantly contribute to the document score unless at least one of the low-frequency terms are matched. This query can improve query execution times significantly if applicable.

CommonTermsQuery has several advantages over stopword filtering at index or query time since a term can be "classified" based on the actual document frequency in the index and can prevent slow queries even across domains without specialized stopword files.

Note: if the query only contains high-frequency terms the query is rewritten into a plain conjunction query ie. all high-frequency terms need to match in order to match a document.

Collection initializer note: To create and populate a CommonTermsQuery in a single statement, you can use the following example as a guide:

var query = new CommonTermsQuery() {
    new Term("field", "microsoft"), 
    new Term("field", "office")
};

Creates a new CommonTermsQuery

Type	Name	Description
Occur	highFreqOccur	Lucene.Net.Search.Occur used for high frequency terms
Occur	lowFreqOccur	Lucene.Net.Search.Occur used for low frequency terms
float	maxTermFrequency	a value in [0..1) (or absolute number >=1) representing the maximum threshold of a terms document frequency to be considered a low frequency term.

Type	Condition
ArgumentException	if Lucene.Net.Search.Occur.MUST_NOT is pass as `lowFreqOccur` or `highFreqOccur`

Type	Name	Description
Occur	highFreqOccur	Lucene.Net.Search.Occur used for high frequency terms
Occur	lowFreqOccur	Lucene.Net.Search.Occur used for low frequency terms
float	maxTermFrequency	a value in [0..1) (or absolute number >=1) representing the maximum threshold of a terms document frequency to be considered a low frequency term.
bool	disableCoord	disables Coord(int, int) in scoring for the low / high frequency sub-queries

Type	Condition
ArgumentException	if Lucene.Net.Search.Occur.MUST_NOT is pass as `lowFreqOccur` or `highFreqOccur`

A query that executes high-frequency terms in a optional sub-query to prevent slow queries due to "common" terms like stopwords. This query builds 2 queries off the Add(Term) added terms: low-frequency terms are added to a required boolean clause and high-frequency terms are added to an optional boolean clause. The optional clause is only executed if the required "low-frequency" clause matches. Scores produced by this query will be slightly different than plain Lucene.Net.Search.BooleanQuery scorer mainly due to differences in the Coord(int, int) number of leaf queries in the required boolean clause. In most cases, high-frequency terms are unlikely to significantly contribute to the document score unless at least one of the low-frequency terms are matched. This query can improve query execution times significantly if applicable.

CommonTermsQuery has several advantages over stopword filtering at index or query time since a term can be "classified" based on the actual document frequency in the index and can prevent slow queries even across domains without specialized stopword files.

Note: if the query only contains high-frequency terms the query is rewritten into a plain conjunction query ie. all high-frequency terms need to match in order to match a document.

Collection initializer note: To create and populate a CommonTermsQuery in a single statement, you can use the following example as a guide:

var query = new CommonTermsQuery() {
    new Term("field", "microsoft"), 
    new Term("field", "office")
};

Type	Description
bool

A query that executes high-frequency terms in a optional sub-query to prevent slow queries due to "common" terms like stopwords. This query builds 2 queries off the Add(Term) added terms: low-frequency terms are added to a required boolean clause and high-frequency terms are added to an optional boolean clause. The optional clause is only executed if the required "low-frequency" clause matches. Scores produced by this query will be slightly different than plain Lucene.Net.Search.BooleanQuery scorer mainly due to differences in the Coord(int, int) number of leaf queries in the required boolean clause. In most cases, high-frequency terms are unlikely to significantly contribute to the document score unless at least one of the low-frequency terms are matched. This query can improve query execution times significantly if applicable.

CommonTermsQuery has several advantages over stopword filtering at index or query time since a term can be "classified" based on the actual document frequency in the index and can prevent slow queries even across domains without specialized stopword files.

Note: if the query only contains high-frequency terms the query is rewritten into a plain conjunction query ie. all high-frequency terms need to match in order to match a document.

Collection initializer note: To create and populate a CommonTermsQuery in a single statement, you can use the following example as a guide:

var query = new CommonTermsQuery() {
    new Term("field", "microsoft"), 
    new Term("field", "office")
};

Type	Description
float

A query that executes high-frequency terms in a optional sub-query to prevent slow queries due to "common" terms like stopwords. This query builds 2 queries off the Add(Term) added terms: low-frequency terms are added to a required boolean clause and high-frequency terms are added to an optional boolean clause. The optional clause is only executed if the required "low-frequency" clause matches. Scores produced by this query will be slightly different than plain Lucene.Net.Search.BooleanQuery scorer mainly due to differences in the Coord(int, int) number of leaf queries in the required boolean clause. In most cases, high-frequency terms are unlikely to significantly contribute to the document score unless at least one of the low-frequency terms are matched. This query can improve query execution times significantly if applicable.

CommonTermsQuery has several advantages over stopword filtering at index or query time since a term can be "classified" based on the actual document frequency in the index and can prevent slow queries even across domains without specialized stopword files.

Note: if the query only contains high-frequency terms the query is rewritten into a plain conjunction query ie. all high-frequency terms need to match in order to match a document.

Collection initializer note: To create and populate a CommonTermsQuery in a single statement, you can use the following example as a guide:

var query = new CommonTermsQuery() {
    new Term("field", "microsoft"), 
    new Term("field", "office")
};

Type	Description
float

A query that executes high-frequency terms in a optional sub-query to prevent slow queries due to "common" terms like stopwords. This query builds 2 queries off the Add(Term) added terms: low-frequency terms are added to a required boolean clause and high-frequency terms are added to an optional boolean clause. The optional clause is only executed if the required "low-frequency" clause matches. Scores produced by this query will be slightly different than plain Lucene.Net.Search.BooleanQuery scorer mainly due to differences in the Coord(int, int) number of leaf queries in the required boolean clause. In most cases, high-frequency terms are unlikely to significantly contribute to the document score unless at least one of the low-frequency terms are matched. This query can improve query execution times significantly if applicable.

CommonTermsQuery has several advantages over stopword filtering at index or query time since a term can be "classified" based on the actual document frequency in the index and can prevent slow queries even across domains without specialized stopword files.

Note: if the query only contains high-frequency terms the query is rewritten into a plain conjunction query ie. all high-frequency terms need to match in order to match a document.

Collection initializer note: To create and populate a CommonTermsQuery in a single statement, you can use the following example as a guide:

var query = new CommonTermsQuery() {
    new Term("field", "microsoft"), 
    new Term("field", "office")
};

Type	Description
Occur

A query that executes high-frequency terms in a optional sub-query to prevent slow queries due to "common" terms like stopwords. This query builds 2 queries off the Add(Term) added terms: low-frequency terms are added to a required boolean clause and high-frequency terms are added to an optional boolean clause. The optional clause is only executed if the required "low-frequency" clause matches. Scores produced by this query will be slightly different than plain Lucene.Net.Search.BooleanQuery scorer mainly due to differences in the Coord(int, int) number of leaf queries in the required boolean clause. In most cases, high-frequency terms are unlikely to significantly contribute to the document score unless at least one of the low-frequency terms are matched. This query can improve query execution times significantly if applicable.

CommonTermsQuery has several advantages over stopword filtering at index or query time since a term can be "classified" based on the actual document frequency in the index and can prevent slow queries even across domains without specialized stopword files.

Note: if the query only contains high-frequency terms the query is rewritten into a plain conjunction query ie. all high-frequency terms need to match in order to match a document.

Collection initializer note: To create and populate a CommonTermsQuery in a single statement, you can use the following example as a guide:

var query = new CommonTermsQuery() {
    new Term("field", "microsoft"), 
    new Term("field", "office")
};

Type	Description
float

A query that executes high-frequency terms in a optional sub-query to prevent slow queries due to "common" terms like stopwords. This query builds 2 queries off the Add(Term) added terms: low-frequency terms are added to a required boolean clause and high-frequency terms are added to an optional boolean clause. The optional clause is only executed if the required "low-frequency" clause matches. Scores produced by this query will be slightly different than plain Lucene.Net.Search.BooleanQuery scorer mainly due to differences in the Coord(int, int) number of leaf queries in the required boolean clause. In most cases, high-frequency terms are unlikely to significantly contribute to the document score unless at least one of the low-frequency terms are matched. This query can improve query execution times significantly if applicable.

CommonTermsQuery has several advantages over stopword filtering at index or query time since a term can be "classified" based on the actual document frequency in the index and can prevent slow queries even across domains without specialized stopword files.

Note: if the query only contains high-frequency terms the query is rewritten into a plain conjunction query ie. all high-frequency terms need to match in order to match a document.

Collection initializer note: To create and populate a CommonTermsQuery in a single statement, you can use the following example as a guide:

var query = new CommonTermsQuery() {
    new Term("field", "microsoft"), 
    new Term("field", "office")
};

Type	Description
float

A query that executes high-frequency terms in a optional sub-query to prevent slow queries due to "common" terms like stopwords. This query builds 2 queries off the Add(Term) added terms: low-frequency terms are added to a required boolean clause and high-frequency terms are added to an optional boolean clause. The optional clause is only executed if the required "low-frequency" clause matches. Scores produced by this query will be slightly different than plain Lucene.Net.Search.BooleanQuery scorer mainly due to differences in the Coord(int, int) number of leaf queries in the required boolean clause. In most cases, high-frequency terms are unlikely to significantly contribute to the document score unless at least one of the low-frequency terms are matched. This query can improve query execution times significantly if applicable.

CommonTermsQuery has several advantages over stopword filtering at index or query time since a term can be "classified" based on the actual document frequency in the index and can prevent slow queries even across domains without specialized stopword files.

Note: if the query only contains high-frequency terms the query is rewritten into a plain conjunction query ie. all high-frequency terms need to match in order to match a document.

Collection initializer note: To create and populate a CommonTermsQuery in a single statement, you can use the following example as a guide:

var query = new CommonTermsQuery() {
    new Term("field", "microsoft"), 
    new Term("field", "office")
};

Type	Description
Occur

A query that executes high-frequency terms in a optional sub-query to prevent slow queries due to "common" terms like stopwords. This query builds 2 queries off the Add(Term) added terms: low-frequency terms are added to a required boolean clause and high-frequency terms are added to an optional boolean clause. The optional clause is only executed if the required "low-frequency" clause matches. Scores produced by this query will be slightly different than plain Lucene.Net.Search.BooleanQuery scorer mainly due to differences in the Coord(int, int) number of leaf queries in the required boolean clause. In most cases, high-frequency terms are unlikely to significantly contribute to the document score unless at least one of the low-frequency terms are matched. This query can improve query execution times significantly if applicable.

CommonTermsQuery has several advantages over stopword filtering at index or query time since a term can be "classified" based on the actual document frequency in the index and can prevent slow queries even across domains without specialized stopword files.

Note: if the query only contains high-frequency terms the query is rewritten into a plain conjunction query ie. all high-frequency terms need to match in order to match a document.

Collection initializer note: To create and populate a CommonTermsQuery in a single statement, you can use the following example as a guide:

var query = new CommonTermsQuery() {
    new Term("field", "microsoft"), 
    new Term("field", "office")
};

Type	Description
float

A query that executes high-frequency terms in a optional sub-query to prevent slow queries due to "common" terms like stopwords. This query builds 2 queries off the Add(Term) added terms: low-frequency terms are added to a required boolean clause and high-frequency terms are added to an optional boolean clause. The optional clause is only executed if the required "low-frequency" clause matches. Scores produced by this query will be slightly different than plain Lucene.Net.Search.BooleanQuery scorer mainly due to differences in the Coord(int, int) number of leaf queries in the required boolean clause. In most cases, high-frequency terms are unlikely to significantly contribute to the document score unless at least one of the low-frequency terms are matched. This query can improve query execution times significantly if applicable.

CommonTermsQuery has several advantages over stopword filtering at index or query time since a term can be "classified" based on the actual document frequency in the index and can prevent slow queries even across domains without specialized stopword files.

Note: if the query only contains high-frequency terms the query is rewritten into a plain conjunction query ie. all high-frequency terms need to match in order to match a document.

Collection initializer note: To create and populate a CommonTermsQuery in a single statement, you can use the following example as a guide:

var query = new CommonTermsQuery() {
    new Term("field", "microsoft"), 
    new Term("field", "office")
};

Type	Description
IList<Term>

Gets or Sets a minimum number of the high frequent optional BooleanClauses which must be satisfied in order to produce a match on the low frequency terms query part. This method accepts a float value in the range [0..1) as a fraction of the actual query terms in the low frequent clause or a number >=1 as an absolut number of clauses that need to match.

By default no optional clauses are necessary for a match (unless there are no required clauses). If this method is used, then the specified number of clauses is required.

Type	Description
float

Returns true iff Coord(int, int) is disabled in scoring for the high and low frequency query instance. The top level query will always disable coords.

Type	Description
bool

Gets or Sets a minimum number of the low frequent optional BooleanClauses which must be satisfied in order to produce a match on the low frequency terms query part. This method accepts a float value in the range [0..1) as a fraction of the actual query terms in the low frequent clause or a number >=1 as an absolut number of clauses that need to match.

By default no optional clauses are necessary for a match (unless there are no required clauses). If this method is used, then the specified number of clauses is required.

Type	Description
float

Adds a term to the CommonTermsQuery

Type	Name	Description
Term	term	the term to add

A query that executes high-frequency terms in a optional sub-query to prevent slow queries due to "common" terms like stopwords. This query builds 2 queries off the Add(Term) added terms: low-frequency terms are added to a required boolean clause and high-frequency terms are added to an optional boolean clause. The optional clause is only executed if the required "low-frequency" clause matches. Scores produced by this query will be slightly different than plain Lucene.Net.Search.BooleanQuery scorer mainly due to differences in the Coord(int, int) number of leaf queries in the required boolean clause. In most cases, high-frequency terms are unlikely to significantly contribute to the document score unless at least one of the low-frequency terms are matched. This query can improve query execution times significantly if applicable.

CommonTermsQuery has several advantages over stopword filtering at index or query time since a term can be "classified" based on the actual document frequency in the index and can prevent slow queries even across domains without specialized stopword files.

Note: if the query only contains high-frequency terms the query is rewritten into a plain conjunction query ie. all high-frequency terms need to match in order to match a document.

Collection initializer note: To create and populate a CommonTermsQuery in a single statement, you can use the following example as a guide:

var query = new CommonTermsQuery() {
    new Term("field", "microsoft"), 
    new Term("field", "office")
};

Type	Name	Description
int	maxDoc
TermContext[]	contextArray
Term[]	queryTerms

Type	Description
Query

A query that executes high-frequency terms in a optional sub-query to prevent slow queries due to "common" terms like stopwords. This query builds 2 queries off the Add(Term) added terms: low-frequency terms are added to a required boolean clause and high-frequency terms are added to an optional boolean clause. The optional clause is only executed if the required "low-frequency" clause matches. Scores produced by this query will be slightly different than plain Lucene.Net.Search.BooleanQuery scorer mainly due to differences in the Coord(int, int) number of leaf queries in the required boolean clause. In most cases, high-frequency terms are unlikely to significantly contribute to the document score unless at least one of the low-frequency terms are matched. This query can improve query execution times significantly if applicable.

CommonTermsQuery has several advantages over stopword filtering at index or query time since a term can be "classified" based on the actual document frequency in the index and can prevent slow queries even across domains without specialized stopword files.

Note: if the query only contains high-frequency terms the query is rewritten into a plain conjunction query ie. all high-frequency terms need to match in order to match a document.

Collection initializer note: To create and populate a CommonTermsQuery in a single statement, you can use the following example as a guide:

var query = new CommonTermsQuery() {
    new Term("field", "microsoft"), 
    new Term("field", "office")
};

Type	Name	Description
int	numOptional

Type	Description
int

A query that executes high-frequency terms in a optional sub-query to prevent slow queries due to "common" terms like stopwords. This query builds 2 queries off the Add(Term) added terms: low-frequency terms are added to a required boolean clause and high-frequency terms are added to an optional boolean clause. The optional clause is only executed if the required "low-frequency" clause matches. Scores produced by this query will be slightly different than plain Lucene.Net.Search.BooleanQuery scorer mainly due to differences in the Coord(int, int) number of leaf queries in the required boolean clause. In most cases, high-frequency terms are unlikely to significantly contribute to the document score unless at least one of the low-frequency terms are matched. This query can improve query execution times significantly if applicable.

CommonTermsQuery has several advantages over stopword filtering at index or query time since a term can be "classified" based on the actual document frequency in the index and can prevent slow queries even across domains without specialized stopword files.

Note: if the query only contains high-frequency terms the query is rewritten into a plain conjunction query ie. all high-frequency terms need to match in order to match a document.

Collection initializer note: To create and populate a CommonTermsQuery in a single statement, you can use the following example as a guide:

var query = new CommonTermsQuery() {
    new Term("field", "microsoft"), 
    new Term("field", "office")
};

Type	Name	Description
int	numOptional

Type	Description
int

A query that executes high-frequency terms in a optional sub-query to prevent slow queries due to "common" terms like stopwords. This query builds 2 queries off the Add(Term) added terms: low-frequency terms are added to a required boolean clause and high-frequency terms are added to an optional boolean clause. The optional clause is only executed if the required "low-frequency" clause matches. Scores produced by this query will be slightly different than plain Lucene.Net.Search.BooleanQuery scorer mainly due to differences in the Coord(int, int) number of leaf queries in the required boolean clause. In most cases, high-frequency terms are unlikely to significantly contribute to the document score unless at least one of the low-frequency terms are matched. This query can improve query execution times significantly if applicable.

CommonTermsQuery has several advantages over stopword filtering at index or query time since a term can be "classified" based on the actual document frequency in the index and can prevent slow queries even across domains without specialized stopword files.

Note: if the query only contains high-frequency terms the query is rewritten into a plain conjunction query ie. all high-frequency terms need to match in order to match a document.

Collection initializer note: To create and populate a CommonTermsQuery in a single statement, you can use the following example as a guide:

var query = new CommonTermsQuery() {
    new Term("field", "microsoft"), 
    new Term("field", "office")
};

Type	Name	Description
IndexReader	reader
IList<AtomicReaderContext>	leaves
TermContext[]	contextArray
Term[]	queryTerms

Determines whether the specified object is equal to the current object.

Type	Name	Description
object	obj	The object to compare with the current object.

Type	Description
bool	true if the specified object is equal to the current object; otherwise, false.

Expert: adds all terms occurring in this query to the terms set. Only works if this query is in its rewritten (Lucene.Net.Search.Query.Rewrite(Lucene.Net.Index.IndexReader)) form.

Type	Name	Description
ISet<Term>	terms

Type	Condition
InvalidOperationException	If this query is not yet rewritten

Returns an enumerator that iterates through the m_terms collection.

Type	Description
IEnumerator<Term>	An enumerator that can be used to iterate through the m_terms collection.

Serves as the default hash function.

Type	Description
int	A hash code for the current object.

Builds a new Lucene.Net.Search.TermQuery instance.

This is intended for subclasses that wish to customize the generated queries.

Type	Name	Description
Term	term	term
TermContext	context	the Lucene.Net.Index.TermContext to be used to create the low level term query. Can be `null`.

Type	Description
Query	new Lucene.Net.Search.TermQuery instance

Expert: called to re-write queries into primitive queries. For example, a Lucene.Net.Search.PrefixQuery will be rewritten into a Lucene.Net.Search.BooleanQuery that consists of Lucene.Net.Search.TermQuerys.

Type	Name	Description
IndexReader	reader

Type	Description
Query

Prints a query to a string, with field assumed to be the default field and omitted.

Inheritance

Implements

Inherited Members

Namespace: Lucene.Net.Queries

Assembly: Lucene.Net.Queries.dll

Syntax

Constructors

CommonTermsQuery(Occur, Occur, float)

Declaration

Parameters

Exceptions

CommonTermsQuery(Occur, Occur, float, bool)

Declaration

Parameters

Exceptions

Fields

m_disableCoord

Declaration

Field Value

m_highFreqBoost

Declaration

Field Value

m_highFreqMinNrShouldMatch

Declaration

Field Value

m_highFreqOccur

Declaration

Field Value

m_lowFreqBoost

Declaration

Field Value

m_lowFreqMinNrShouldMatch

Declaration

Field Value

m_lowFreqOccur

Declaration

Field Value

m_maxTermFrequency

Declaration

Field Value

m_terms

Declaration

Field Value

Properties

HighFreqMinimumNumberShouldMatch

Declaration

Property Value

IsCoordDisabled

Declaration

Property Value

LowFreqMinimumNumberShouldMatch

Declaration

Property Value

Methods

Add(Term)

Declaration

Parameters

BuildQuery(int, TermContext[], Term[])

Declaration

Parameters

Returns

CalcHighFreqMinimumNumberShouldMatch(int)

Declaration

Parameters

Returns

CalcLowFreqMinimumNumberShouldMatch(int)

Declaration

Parameters

Returns

CollectTermContext(IndexReader, IList<AtomicReaderContext>, TermContext[], Term[])

Declaration

Parameters

Equals(object)

Declaration

Parameters

Returns

Overrides

ExtractTerms(ISet<Term>)

Declaration