Clean configs documentation #1944

qgallouedec · 2024-08-18T13:55:38Z

This PR focuses on standardizing and enhancing the configuration documentation across the codebase. Consistency and uniformity make maintenance easier and improve readability. Future PRs will extend these improvements to other components, particularly the trainers.

Key Changes

Line Wrapping: Applied a consistent line wrap at column 120 to improve readability.
Definite Articles: Removed definite articles where possible to streamline language.
Type Annotations:
- Always include type definitions, indicating if a parameter is optional and specifying the default value.
- Note that Optional means that the value can be None, and *optional* means that it is not required for the user to pass a value. Eg: For values that can be None:
```
foo (`Optional[int]`, *optional*, defaults to `None`):
```
- For values that can't be None.
```
foo (`int`, *optional*, defaults to `4`):
```
String Defaults:
- Ensured that default string values are wrapped in double quotes:
```
defaults to `"foo"`
```
Dictionary Typing:
- Replaced generic Dict type hints with more explicit Dict[str, Any] to clarify expected key-value pairs.
Default Value Formatting:
- Consistently surrounded default values with backticks for improved formatting:
```
defaults to `4`
```
Consistency Across Configurations: Ensured that similar arguments across different configurations have consistent descriptions
Consistent main docstring:

Overall, this is the template:

@dataclass
class FOOConfig(TrainingArguments):
    r"""
    Configuration class for the [`FOOTrainer`].

    Using [`~transformers.HfArgumentParser`] we can turn this class into
    [argparse](https://docs.python.org/3/library/argparse#module-argparse) arguments that can be specified on the
    command line.

    Args:
        foo (`Optional[str]`, *optional*, defaults to `None`):
            Description of foo.
        bar (str, *optional*, defaults to `"barbar"`):
            Description of the bar. This description can be long, but make sure you break the line so that the maximum
            length is 220.
        baz (`Optional[Dict[str, Any]]`, *optional*, defaults to `None`):
            Description of the baz.
    """
    foo: Optional[str] = None
    bar: str = "barbar"
    baz: Optional[Dict[str, Any]] = None

Progress

Possibly breaking changes delayed for another PR

DDPOConfig and AlignProbConfignow inherits from TrainingArguments. It implies:

To avoid conflict with the super class, I've removed train_batch_size in favour of per_device_train_batch_size of the super class.
To avoid conflict with the super class, I've removed run_name which is already available in the super class.
output_dir is now a required argument

HuggingFaceDocBuilderDev · 2024-08-18T14:02:08Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

…o clean-config

trl/trainer/alignprop_config.py

…OTrainer

trl/trainer/dpo_config.py

qgallouedec · 2024-09-04T07:11:35Z

trl/trainer/dpo_config.py

@@ -125,14 +150,14 @@ class DPOConfig(TrainingArguments):
    truncation_mode: str = "keep_end"
    max_length: Optional[int] = None
    max_prompt_length: Optional[int] = None
-    max_target_length: Optional[int] = None
+    max_completion_length: Optional[int] = None


I realized this was not consistent with other trainers. Changing it shouldn't be breaking, right? Or maybe add a post-init warning?
@kashif @edbeeching @lewtun

I'll add a post-init warning.

…o clean-config

qgallouedec added 2 commits August 18, 2024 13:38

Clean BCO

c2d9a62

Optional[int]

e3083f1

kashif approved these changes Aug 19, 2024

View reviewed changes

qgallouedec and others added 25 commits August 19, 2024 10:01

fix sft config

c7b2fbc

Merge branch 'main' into clean-config

e7a80bb

alignprop config

50dbc86

Merge branch 'main' into clean-config

b718fba

upadte tempfile to work with output_dir

4a8aba6

Merge branch 'clean-config' of https://github.com/huggingface/trl int…

6ae94e9

…o clean-config

Merge branch 'main' into clean-config

3ed49fd

clean kto config

f847f56

intro docstring

69525f9

style

c73f43a

reward config

11f6e7e

orpo config

946e2e5

Merge branch 'main' into clean-config

21df122

warning in trainer, not in config

a1bff9c

cpo config

006a454

Merge branch 'main' into clean-config

c9264ee

ppo v2

01d8814

Merge branch 'clean-config' of https://github.com/huggingface/trl int…

5cd9eef

…o clean-config

model config

9bef508

ddpo and per_device_train_batch_size (instead of (train_batch_size)

0a49bca

Merge branch 'main' into clean-config

1c9bba7

rloo

216856a

Online config

7270936

tmp_dir in test_ddpo

05bacaf

style

451b4fc

qgallouedec commented Aug 27, 2024

View reviewed changes

trl/trainer/alignprop_config.py Show resolved Hide resolved

qgallouedec added 17 commits September 3, 2024 18:06

overview

92a2206

better latex

81d5147

is_encoder_decoder uniform

71c110a

proper ticks

e60c3b0

fix latex

a964090

uniform generate_during_eval

45d4f99

uniform truncation_mode

3bc2d30

ref_model_mixup_alpha

66a4861

ref_model_mixup_alpha and ref_model_sync_steps

e2d8f7f

Uniform model_init_kwargs and ref_model_init_kwargs

79347d9

rpo_alpha

9ba37a9

Update maximum length argument names in config files

52f69b1

Update loss_type descriptions in config files

0fabc42

Update max_target_length to max_completion_length in CPOConfig and CP…

e1abc3a

…OTrainer

Update padding value in config files

d618f0c

Update precompute_ref_log_probs flag documentation

594677c

Fix typos and update comments in dpo_config.py and sft_config.py

5dee9ab

qgallouedec marked this pull request as ready for review September 3, 2024 22:04

qgallouedec commented Sep 3, 2024

View reviewed changes

trl/trainer/dpo_config.py Outdated Show resolved Hide resolved

qgallouedec changed the title ~~[WIP] Clean configs documentation~~ Clean configs documentation Sep 3, 2024

Merge branch 'main' into clean-config

47431f8

qgallouedec commented Sep 4, 2024

View reviewed changes

kashif approved these changes Sep 4, 2024

View reviewed changes

kashif mentioned this pull request Sep 4, 2024

Improves formatting of docstring + newlines #2006

Merged

2 tasks

qgallouedec and others added 3 commits September 4, 2024 07:22

post init warning for max_target_length

19af1fa

Merge branch 'clean-config' of https://github.com/huggingface/trl int…

34b38b0

…o clean-config

Merge branch 'main' into clean-config

07c9cab

qgallouedec merged commit fc20db8 into main Sep 4, 2024
10 checks passed

qgallouedec deleted the clean-config branch September 4, 2024 08:07

qgallouedec mentioned this pull request Nov 15, 2024

🔀 Add MergeModelCallBack #2282

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clean configs documentation #1944

Clean configs documentation #1944

qgallouedec commented Aug 18, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 18, 2024

qgallouedec Sep 4, 2024

qgallouedec Sep 4, 2024

Clean configs documentation #1944

Clean configs documentation #1944

Conversation

qgallouedec commented Aug 18, 2024 • edited Loading

Key Changes

Progress

Possibly breaking changes delayed for another PR

HuggingFaceDocBuilderDev commented Aug 18, 2024

qgallouedec Sep 4, 2024

Choose a reason for hiding this comment

qgallouedec Sep 4, 2024

Choose a reason for hiding this comment

qgallouedec commented Aug 18, 2024 •

edited

Loading