Pull requests: EleutherAI/lm-evaluation-harness
Fix error due in Collating queries with different continuation lengths (fixes #2984)
#2987
opened May 15, 2025 by
ameyagodbole
Added RuBLiMP, a Russian benchmark of linguistic minimal pairs
#2951
opened May 2, 2025 by
vmkhlv
feat: Add LIBRA benchmark for long-context evaluation
#2943
opened Apr 30, 2025 by
karimovaSvetlana
4 tasks done
Resolve the inconsistency between the description of output_path and the actual logic
#2928
opened Apr 24, 2025 by
rangehow
Added selection filter: take_last
#2923
opened Apr 18, 2025 by
JamesClarke7283
3 tasks done
Add Qwen/Qwen-Audio-Chat support for audio modality
#2906
opened Apr 14, 2025 by
artemorloff