Add RAG FiQA MRR optimization community notebook #176
base: main
Conversation
Adds rag_fiqa_mrr_optimization.ipynb from the AI Winter 2025 competition notebooks repo (RapidFireAI/ai-winter-2025-competition-notebooks).

Co-authored-by: Cursor <cursoragent@cursor.com>
Cursor Bugbot has reviewed your changes and found 2 potential issues.
| " recalls.append(recall)\n", | ||
| " f1_scores.append(f1)\n", | ||
| " ndcgs.append(compute_ndcg_at_k(retrieved_set, expected_set, k=5))\n", | ||
| " rrs.append(compute_rr(retrieved_set, expected_set))\n", |
Set conversion destroys ordering for rank-sensitive metrics
High Severity
The ordered list of retrieved documents (pred) is converted to a Python set via set(pred), which destroys the ranking order. This retrieved_set is then passed to compute_ndcg_at_k and compute_rr, both of which are rank-sensitive metrics that depend on document position. Iterating a set yields arbitrary order, so NDCG and MRR — the notebook's primary optimization target — produce meaningless, non-deterministic values.
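A minimal sketch of one possible fix (the helper names mirror the notebook's, but the exact signatures are assumed): keep `pred` as an ordered list for the rank-sensitive metrics, and use the set only for membership tests.

```python
def compute_rr(retrieved, expected_set):
    """Reciprocal rank over an ORDERED list of retrieved doc IDs."""
    for rank, doc_id in enumerate(retrieved, start=1):
        if doc_id in expected_set:
            return 1.0 / rank  # rank of the first relevant document
    return 0.0  # no relevant document retrieved

# At the call sites, pass the ordered list instead of set(pred);
# set(pred) remains fine for order-insensitive metrics like recall/F1.
# ndcgs.append(compute_ndcg_at_k(pred, expected_set, k=5))
# rrs.append(compute_rr(pred, expected_set))
```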
| " ideal_relevance = [3] * ideal_length + [0] * (k - ideal_length)\n", | ||
| " idcg = sum(rel / math.log2(i + 2) for i, rel in enumerate(ideal_relevance))\n", | ||
| "\n", | ||
| " return dcg / idcg if idcg > 0 else 0.0\n", |
NDCG uses mismatched relevance scales in DCG vs IDCG
Medium Severity
In compute_ndcg_at_k, the actual DCG is computed with binary relevance values (0 or 1), but the ideal DCG (idcg) uses a relevance value of 3 for each relevant document. This mismatch means the NDCG score is systematically scaled down by a factor of ~3, making the metric incorrect. Both DCG and IDCG need to use the same relevance scale.
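One way to fix this, assuming binary relevance throughout (the function's exact signature in the notebook is assumed): build the ideal relevance vector from 1s rather than 3s, so DCG and IDCG share a scale. This sketch also takes the ordered retrieval list, addressing the ordering issue above.

```python
import math

def compute_ndcg_at_k(retrieved, expected_set, k=5):
    """NDCG@k with the same binary (0/1) relevance scale in DCG and IDCG."""
    # Actual DCG: binary relevance over the ordered top-k retrieved docs.
    relevance = [1 if doc_id in expected_set else 0 for doc_id in retrieved[:k]]
    dcg = sum(rel / math.log2(i + 2) for i, rel in enumerate(relevance))
    # Ideal DCG: the same 0/1 scale, with all relevant docs ranked first.
    ideal_length = min(len(expected_set), k)
    ideal_relevance = [1] * ideal_length + [0] * (k - ideal_length)
    idcg = sum(rel / math.log2(i + 2) for i, rel in enumerate(ideal_relevance))
    return dcg / idcg if idcg > 0 else 0.0
```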


Summary

Adds rag_fiqa_mrr_optimization.ipynb to the community_notebooks/ folder.

Test plan
Made with Cursor
Note
Low Risk
Adds a standalone notebook only; no library/runtime code paths are modified, with risk limited to notebook execution/dependency assumptions.
Overview
Adds a new community Colab notebook, community_notebooks/rag_fiqa_mrr_optimization.ipynb, that runs a RapidFire AI multi-config RAG evaluation on the FiQA dataset. The notebook installs and initializes RapidFire AI, downsamples and filters the FiQA queries/corpus, grid-searches over RAG chunking and reranker top_n settings with a vLLM Qwen generator, computes retrieval metrics (including MRR), and outputs a results DataFrame plus simple metric plots and log-viewing helpers.
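For orientation, a minimal sketch of the shape of such a sweep in plain Python. Here run_rag_pipeline is a hypothetical stand-in for one notebook config run (not the RapidFire AI API), the grid values are illustrative, and compute_rr is the helper sketched above.

```python
from itertools import product
from statistics import mean

def run_rag_pipeline(query, chunk_size, top_n):
    """Hypothetical stand-in: returns an ordered list of retrieved doc IDs
    for one query under one (chunk_size, top_n) config."""
    raise NotImplementedError

def sweep_configs(queries, qrels, chunk_sizes=(256, 512), top_ns=(3, 5)):
    """Grid-search chunk_size x top_n and rank configs by mean MRR."""
    results = []
    for chunk_size, top_n in product(chunk_sizes, top_ns):
        rrs = []
        for query_id, query in queries.items():
            retrieved = run_rag_pipeline(query, chunk_size, top_n)
            rrs.append(compute_rr(retrieved, set(qrels[query_id])))
        results.append({"chunk_size": chunk_size, "top_n": top_n,
                        "mrr": mean(rrs)})
    # Best-performing config first.
    return sorted(results, key=lambda r: r["mrr"], reverse=True)
```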
Written by Cursor Bugbot for commit 4099c2b. This comment will update automatically on new commits.