CI refactor; allow setting args in config #2893

artemorloff · 2025-04-08T23:05:35Z

might be much more convenient to pass path to config file to run evaluation via lm-eval

PR allows a user to pass --config PATH_TO_YAML_FILE arg that contains the same args as the console evaluation script. With recent updates there are too much args to track them on console. It may be easier to write one yaml file (provide example in configs/ dir) and pass it into the script.

Also PR allows to override this config. If user provides --config PATH param along with other params, they prevail over the same params in config. So, user may define all params in yaml file and change only those that they need right now, others remain as they are in yaml file. To grant that all params from console will be treated with "high priority" added action to each param that just adds arg indicating that some param was explicitly stated by the user from console. Those params won't be changed by config file.

Also config is parsed right after reading to avoid using simple_parse_args_string and make the code more unified and simplified.

artemorloff · 2025-04-08T23:07:31Z

use case: CUDA_VISIBLE_DEVICES=0 lm_eval --config configs/default_config.yaml

or one can explicitly state: ``CUDA_VISIBLE_DEVICES=0 lm_eval --config configs/default_config.yaml --limit 10 --model hf

StellaAthena · 2025-04-09T13:45:01Z

This is a really good idea. I haven't looked at the implementation itself but I think that this functionality is going to be hugely valuable. I also strongly agree with the choice to allow both CLI arguments and yaml arguments and having the CLI take precidence over the yaml arguments.

artemorloff · 2025-04-14T16:53:59Z

@baberabb @StellaAthena i think this would be a great feature for all users - to keep configs in a dir and no need in writing the flags in terminal

baberabb · 2025-04-14T17:44:25Z

Hi @artemorloff!. This is generally great, especially as the number of arguments which need to pass keep in increasing. One thing: have you considered using a dataclass? Something like:

@dataclass
class EvalConfig:
    model: str = "hf"  
    tasks: Optional[str] = None
    ...
    
    @classmethod
    def from_yaml(cls, path):
        # load and convert yaml
        
    @classmethod
    def from_args(cls, args, yaml_path=None):
        ...

Think this would make the code easier to read, and more maintainable. Happy to sketch it out more if you think it's helpful.

artemorloff · 2025-04-16T21:52:50Z

@baberabb seems a good idea. will prepare the draft

artemorloff · 2025-04-22T21:55:03Z

@baberabb added prototype of Eval config class that is used to unify evaluation. Based on BaseModel so that it may later be reworked the way that params are validated and updated inside config class, not inside main.py funcs

baberabb · 2025-05-19T10:19:07Z

Hi @artemorloff! tysm for working on this. I'll take a closer look this week, but feel free to ping me in case I don't respond by Friday. Couple of initial points (mostly backward compatibility):

we don't want to modify the args of evaluate too much and keep it backward compatible as many people use it directly in their training code and libraries.
I like the addition of pydantic as well, but need to investigate that it doesn't create any dependency issues (esp. when using in training loop).

baberabb · 2025-06-23T11:19:43Z

Hi @artemorloff! thanks again for working on this, and sorry for the late review. The logic overall looks pretty great, but made some slight changes:

Use dataclass, rather than pydantic. We might want to reconsider this later, but lets not add the dependency just yet.
Still pass all the args to simple_evaluate explicitly as we normally do, so that's it easier when calling outside of cli.

Let me know what you think!

# Conflicts: # lm_eval/__main__.py

enable evaluation from yaml config file

b5d16d6

artemorloff requested review from baberabb and StellaAthena as code owners April 8, 2025 23:05

artemorloff added 3 commits April 22, 2025 14:40

Merge remote-tracking branch 'origin' into feature/eval_from_config

b2e1bfc

add separate eval_config class

c1e4339

pre-commit prettify

61fc5bf

update pyproject with pydantic dep

9c94fb2

baberabb mentioned this pull request May 6, 2025

model args as json_path #2939

Closed

use dataclass; don't pass config to simple_evaluate

d6b1405

baberabb added 13 commits June 23, 2025 16:20

remove pydantic dependency

d0884a9

Merge branch 'main' into feature/eval_from_config

601be34

nit

caab782

remove prints

d816f64

Merge branch 'main' into feature/eval_from_config

2ebef47

# Conflicts: # lm_eval/__main__.py

nit

82517de

modularize cli

30fa3c7

add subcommands

febdcc5

cleanup

9de9365

cleanup

b7d3f0d

pre-commit

be78dc7

pre-commit

c59d4e2

fix help

768f55b

baberabb added 8 commits July 4, 2025 03:44

cleanup

613d383

update docs

560905c

update docs

897ed70

fix logging

442ce51

improve logging

dbe4c39

add docs

f3cfff6

nit

b9ee592

add tests

15ce554

baberabb mentioned this pull request Jul 5, 2025

Refactor CLI to subcommand structure #3111

Open

baberabb changed the title ~~enable evaluation from yaml config file~~ CI refactor; allow setting args in config Jul 5, 2025

baberabb mentioned this pull request Jul 5, 2025

Streamlining lm-eval Architecture #3083

Open

9 tasks

baberabb added 2 commits July 10, 2025 22:37

Merge branch 'main' into feature/eval_from_config

84d02f7

fix: update default values and improve help text in configuration files

91e49e2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CI refactor; allow setting args in config #2893

CI refactor; allow setting args in config #2893

Uh oh!

artemorloff commented Apr 8, 2025

Uh oh!

artemorloff commented Apr 8, 2025

Uh oh!

StellaAthena commented Apr 9, 2025

Uh oh!

artemorloff commented Apr 14, 2025

Uh oh!

baberabb commented Apr 14, 2025

Uh oh!

artemorloff commented Apr 16, 2025

Uh oh!

artemorloff commented Apr 22, 2025

Uh oh!

baberabb commented May 19, 2025

Uh oh!

baberabb commented Jun 23, 2025

Uh oh!

Uh oh!

CI refactor; allow setting args in config #2893

Are you sure you want to change the base?

CI refactor; allow setting args in config #2893

Uh oh!

Conversation

artemorloff commented Apr 8, 2025

Uh oh!

artemorloff commented Apr 8, 2025

Uh oh!

StellaAthena commented Apr 9, 2025

Uh oh!

artemorloff commented Apr 14, 2025

Uh oh!

baberabb commented Apr 14, 2025

Uh oh!

artemorloff commented Apr 16, 2025

Uh oh!

artemorloff commented Apr 22, 2025

Uh oh!

baberabb commented May 19, 2025

Uh oh!

baberabb commented Jun 23, 2025

Uh oh!

Uh oh!