Skip to content

Pull requests: stanford-crfm/helm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix LocalWindowService to respect combined sequence budget
#4231 opened Apr 24, 2026 by Erotemic Contributor Loading…
Fix missing f-string on CodeInsights past-mistake example headers
#4219 opened Apr 20, 2026 by Chessing234 Contributor Loading…
feat(frontend): add predictions page search filter
#4216 opened Apr 20, 2026 by MukundaKatta Loading…
fix(medhelm): use quasi match for PubMedQA
#4214 opened Apr 20, 2026 by MukundaKatta Loading…
fix(frontend): keep instance metrics readable
#4212 opened Apr 19, 2026 by MukundaKatta Loading…
fix(frontend): honor instancesPage query param
#4210 opened Apr 19, 2026 by MukundaKatta Loading…
Fix CodeInsightsCorrectCodeScenario prompt conditional precedence
#4207 opened Apr 17, 2026 by Chessing234 Contributor Loading…
Fix AutobencherSafetyScenario parsing the file path string as JSON
#4206 opened Apr 17, 2026 by Chessing234 Contributor Loading…
Fix operator precedence skipping robustness metric group check
#4193 opened Apr 13, 2026 by Chessing234 Contributor Loading…
Fix unescaped '.' in final_number_exact_match regex
#4191 opened Apr 11, 2026 by Chessing234 Contributor Loading…
Fix duplicate entries in med_dialog dataset (#3746)
#4178 opened Apr 7, 2026 by Chessing234 Contributor Loading…
3 tasks
Benchmark Data Contamination Scenario
#4149 opened Apr 1, 2026 by IriedsonSouto Contributor Loading…
5 tasks done
Medhelm Epic MedHELM
#3787 opened Aug 4, 2025 by MiguelAFH Collaborator Loading…
Add basic dockerfile
#3740 opened Jul 14, 2025 by Erotemic Contributor Loading…
ProTip! Updated in the last three days: updated:>2026-04-26.