Skip to content

Better cross-field support #41106

@jimczi

Description

@jimczi

Lucene has a new query in the sandbox called BM25FQuery. It is similar to the BlendedTermQuery that we use for cross-fields search but the new query also merges the document statistics (freq + norm) in a way that preserve the benefits of using BM25 formula (term frequency saturates quickly).
There are some work to be done on the Lucene side to improve the integration of this query:
https://issues.apache.org/jira/browse/LUCENE-8710
https://issues.apache.org/jira/browse/LUCENE-8711
However we could already replace the BlendedTermQuery with BM25F since the main logic is in place and should already improve the ranking of documents when cross-fields mode is used in a query.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions