Pull requests: huggingface/lighteval

Integrate alyah benchmark
#1117 opened Jan 12, 2026 by amztheorytii
[EVAL] SciCode new-task
#1086 opened Nov 27, 2025 by akshathmangudi
Evals on the hub
#1082 opened Nov 24, 2025 by NathanHB
Feature/tvd mi metric feature
#1080 opened Nov 22, 2025 by zrobertson466920
graceful shutdown of vllm async bug
#1064 opened Nov 17, 2025 by f14-bertolotti
Adds Profbench new-task
#1041 opened Nov 6, 2025 by NathanHB
Fix PERPLEXITY task
#1037 opened Nov 4, 2025 by ScottHoang
Legal NLP tasks on Swiss data
#1032 opened Oct 31, 2025 by rolshoven
Add support to vllm==0.11.0
#1027 opened Oct 22, 2025 by anmarques
Wrap vllm inputs to compatible with VLLM>=0.10.2
#1003 opened Oct 2, 2025 by JIElite
Fix caching logic
#994 opened Sep 25, 2025 by jxmorris12
Fix deberta overflow error bug
#990 opened Sep 24, 2025 by amstu2
run slow tests against vllm and transformers main
#985 opened Sep 23, 2025 by NathanHB
Add ChartQA new-task
#954 opened Sep 11, 2025 by 0xjunhao