Align EOS token ID between tokenizer and generation config #663

Merged: lewtun merged 2 commits into main from fix-eos on May 27, 2025
Conversation

lewtun (Member) commented on May 27, 2025

This PR addresses a nasty footgun in transformers, where a change to the base model's tokenizer must ALSO be propagated to the model's generation config. Without this change, the pipeline() function produces unbounded generations because it relies on the base model's EOS token (e.g. <|endoftext|>) instead of the one we set in the tokenizer config (e.g. <|im_end|>).

Note that this has no impact on vLLM, since there the EOS token is inferred from the tokenizer_config.json file.

cc @qgallouedec @kashif for visibility, as this might be necessary to include in TRL's SFT script too.

I also took the opportunity to clean up some of the "demo" recipes as I think it's better to have a single source of truth for well-tested recipes.

lewtun requested a review from edbeeching on May 27, 2025 at 09:54
- `grpo.py`: trains a model with GRPO on a given dataset.
- `sft.py`: performs a simple SFT of a model on a dataset.
- `evaluate.py`: evaluates a model on the R1 benchmarks.
lewtun (Member, Author):

No longer exists since we migrated the evals to lighteval natively

@@ -140,6 +140,9 @@ def make_conversation(example, prompt_column: str = script_args.dataset_prompt_c
# Save model and create model card
##################################
logger.info("*** Save model ***")
# Align the model's generation config with the tokenizer's eos token
# to avoid unbounded generation in the transformers `pipeline()` function
trainer.model.generation_config.eos_token_id = tokenizer.eos_token_id
lewtun (Member, Author):

Here is the key change to avoid the generation footgun

kashif (Collaborator) commented on May 27, 2025

ok checking

lewtun merged commit 33f84de into main on May 27, 2025
1 check passed
lewtun deleted the fix-eos branch on May 27, 2025 at 15:20