Issues: EleutherAI/lm-evaluation-harness

Issues list

How to load the local MMLU dataset
#2491 opened Nov 14, 2024 by DtYXs
Overwrite default tasks
#2487 opened Nov 13, 2024 by jonoillar
Cannot reproduce LLaMA 3 8B on hendrycks_math [validation]
#2479 opened Nov 11, 2024 by liuxiaozhu01
Why Different Versions Make a Big Difference in HellaSwag zero-shot [validation]
#2478 opened Nov 11, 2024 by cquxl
[QUESTION] hello, I has a problem…
#2477 opened Nov 10, 2024 by sofiaserkhir
task load return error
#2466 opened Nov 7, 2024 by pod2c
Why is using vLLM via lm-eval-harness slower than using vLLM directly? [asking questions]
#2445 opened Oct 30, 2024 by WuXnkris
Wrong format of the few-shot examples in mgsm_direct tasks [good first issue] [validation]
#2444 opened Oct 30, 2024 by zxcvuser
Improve preprocessing for paws-x and xnli tasks [feature request] [good first issue]
#2442 opened Oct 30, 2024 by zxcvuser
GPU with GGFU LLM
#2429 opened Oct 25, 2024 by Znbne
Llama3.1-8B-Instruct evaluation fails [asking questions]
#2428 opened Oct 25, 2024 by Isaaclgz
test speculative decode accuracy [asking questions]
#2424 opened Oct 24, 2024 by baoqianmagik
Question related to how to use the validation and training splits. [asking questions]
#2423 opened Oct 24, 2024 by sorobedio
bbh_zeroshot fails during to a custom filter issue. [bug]
#2422 opened Oct 23, 2024 by shamanez