Merged
8 changes: 8 additions & 0 deletions README.md
@@ -22,6 +22,14 @@ We have several goals in open sourcing this:
- Showing how we evaluate each task
- Encouraging others to benchmark their solutions on these tasks (we are always looking for better ways of doing things!)

## Benchmarking Results

Read some of our blog articles on benchmarking results:

* Agent Tool Use: https://blog.langchain.dev/benchmarking-agent-tool-use/
* Query Analysis in High Cardinality Situations: https://blog.langchain.dev/high-cardinality/
* RAG on Tables: https://blog.langchain.dev/benchmarking-rag-on-tables/

## Installation

To install the packages, run the following command: