Skip to content

Pull requests: EleutherAI/lm-evaluation-harness

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Adapting multimodal tasks for phi3.5 vision
#2909 opened Apr 14, 2025 by artemorloff Loading…
Audio modality: add openbmb/MiniCPM-o-2_6 model
#2908 opened Apr 14, 2025 by artemorloff Loading…
tasks README: fix dead link
#2899 opened Apr 11, 2025 by dtrifiro Loading…
Longbench bugfix
#2895 opened Apr 9, 2025 by baberabb Loading…
enable evaluation from yaml config file
#2893 opened Apr 8, 2025 by artemorloff Loading…
Added AIME Support
#2892 opened Apr 8, 2025 by Zephyr271828 Loading…
2 of 3 tasks
Added C4 Support
#2889 opened Apr 8, 2025 by Zephyr271828 Loading…
2 of 3 tasks
Fix GPQA CoT n shot
#2888 opened Apr 7, 2025 by anmarques Loading…
Support override_generation_config in vLLM
#2882 opened Apr 5, 2025 by yeqcharlotte Loading…
Add question suffix before the <|assistant|> tag
#2876 opened Apr 4, 2025 by TingchenFu Loading…
allow API key in GGUFLM
#2862 opened Mar 29, 2025 by Huge Loading…
Feature: Add Sambanova Integration
#2859 opened Mar 28, 2025 by luisfucros Loading…
update gguf backend to use Chat-completion API
#2856 opened Mar 28, 2025 by falkbene Loading…
Fix slow gguf tests
#2846 opened Mar 26, 2025 by For-rest2005 Loading…
Add support for quantization_config
#2842 opened Mar 25, 2025 by jerryzh168 Loading…
Add simple Dockerfile and instructions
#2837 opened Mar 24, 2025 by kiersten-stokes Loading…
feat: Numeric bench
#2835 opened Mar 24, 2025 by Gresham429 Loading…
E3 c v3 name entity recognition
#2812 opened Mar 18, 2025 by sfarzi Loading…
Add new task named e3c_v3_re
#2806 opened Mar 17, 2025 by sfarzi Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.