LUCENE-2959: Implementing State of the Art Ranking for Lucene

David Nemeskey

Abstract

Lucene employs the Vector Space Model (VSM) to rank documents, which compares unfavorably to state of the art algorithms, such as BM25. Moreover, the architecture is tailored specifically to VSM, which makes the addition of new ranking functions a non-trivial task. This project aims to bring state of the art ranking methods to Lucene and to implement a query architecture with pluggable ranking functions.

Additional Information

This feature will be implemented in the flexscoring branch, where preliminary work has already been done. See https://issues.apache.org/jira/browse/LUCENE-2392.

Code samples