Skip to content

Commit 76df3fe

Browse files
authored
Update README.md
1 parent 61c4162 commit 76df3fe

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

README.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -59,6 +59,8 @@ Each LLM has an ELO score based on its results.
5959
| 13 | **together:meta-llama/Llama-3.2-90B-Vision-Instruct-Turbo:vision** | 1269.84 |
6060
| 14 | anthropic:claude-3-sonnet-20240229:text | 1029.31 |
6161

62+
*Note: In our experiments, Claude 3 Sonnet got a low score due to many refusal to fight and large API latencies.*
63+
6264
### Win rate matrix
6365

6466
![Win rate matrix](notebooks/result_matrix.png)

0 commit comments

Comments
 (0)