The Ranking Evaluation API allows to evaluate the quality of ranked search results over a set of typical search queries. Given this set of queries and a list or manually rated documents, the _rank_eval
endpoint calculates and returns typical information retrieval metrics like mean reciprocal rank, precision or discounted cumulative gain.
For further on the API are available in the documentation
This demo is part of a longer blog post with more detailed steps about running it.
In order to run the demo, you need at least Elasticsearch 7.0 installed.
In case you are running an earlier 6.x version of Elasticsearch, there might be some older 6.x branches
with the correct syntax and data format for your version.
Next, run the setup.sh
script contained in this project. The script assumes you are running Elasticsearch locally on port 9200 and installs the demo data, including settings and mappings, to an index called enwiki_rank
.
After that you should be ready to run the examples from demo_rank_eval.txt
in the Kibana Console.
This demo is based on a small subset of documents from the English Wikipedia dump which is also available in an Elasticsearch bulk format. The relevance judgement data used for the evaluation is based on data collected in the Wikimedia labs Discernatron Project and is available for registered Wikimedia users for download separately.