Need higher level RAG metrics #2598

devinbost · 2024-06-14T14:35:20Z

Problem & Motivation

There is a huge wave of interest in high-accuracy Q&A, such as via Retrieval-Augmented Generation (RAG). RAG accuracy is largely driven by how well vector search retrieves the correct context for an LLM to answer questions. When evaluating embedding models, vector search retrieval metrics are helpful but insufficient, because they don't reveal how well the retrieved content actually answers the target questions.
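
To make the gap concrete, here is a minimal sketch in plain Python. Everything in it is an assumption for illustration: the function names `recall_at_k` and `token_f1`, the document IDs, and the answers are hypothetical, and token-overlap F1 is only a crude stand-in for an answer-quality score. It is not the API of ragulate, TruLens, or TorchMetrics.

```python
from collections import Counter


def recall_at_k(retrieved_ids, relevant_ids, k):
    """Fraction of relevant documents that appear in the top-k retrieved list."""
    if not relevant_ids:
        return 0.0
    top_k = set(retrieved_ids[:k])
    return len(top_k & set(relevant_ids)) / len(relevant_ids)


def token_f1(prediction, reference):
    """Token-overlap F1 between a generated answer and a reference answer."""
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    if not pred_tokens or not ref_tokens:
        # Both empty counts as a match; exactly one empty counts as a miss.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


# Hypothetical case: the relevant document is retrieved,
# yet the generated answer does not contain the expected content.
retrieved = ["doc_7", "doc_2", "doc_9"]
relevant = ["doc_7"]
generated_answer = "The document does not say"
reference_answer = "five years"

retrieval_score = recall_at_k(retrieved, relevant, k=3)
answer_score = token_f1(generated_answer, reference_answer)
print(f"recall@3 = {retrieval_score:.2f}, answer F1 = {answer_score:.2f}")
# prints "recall@3 = 1.00, answer F1 = 0.00"
```

In this made-up case the retrieval metric is perfect while the answer-level score is zero, which is the kind of failure a retrieval-only evaluation would not surface.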

Pitch

I'd love to see an integration with a tool like our new ragulate library (Apache 2 licensed) that would simplify model evaluation on RAG Q&A: https://github.com/epinzur/ragulate/tree/main

Additional context

I was going to suggest that you integrate with trulens, but then I discovered that we built ragulate to automate much of the process of using trulens, and we'd love feedback on it.

github-actions · 2024-06-14T14:35:42Z

Hi! Thanks for your contribution! Great first issue!

Borda · 2024-06-17T12:15:11Z

@devinbost, thank you for your suggestion. Indeed, it would be nice to have such metrics available in TM. That said, I would love to see your PR adding them as complete code, without referring to an external package... 🦩

devinbost added the enhancement (New feature or request) label Jun 14, 2024

stancld added the New metric label Jul 15, 2024

stancld added this to the future milestone Jul 15, 2024

Borda added the Important milestonish label Jul 24, 2024
