Conversation

@KyleMylonakisProtopia (Contributor)

What does this PR do?

GPT OSS models ship with a conversion script that transforms the distributed quantized weights into a Hugging Face model in bfloat16. The script also converts aspects of the tokenizer.

As a side note, it also fixes a minor bug in the TikToken tokenizer converter that caused a hard crash (iterating over a NoneType object) when converting the tokenizer.

Fixes huggingface/accelerate#3882 (comment): this PR resolves an open issue in the Hugging Face Accelerate repository by making the conversion script functional.
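
A quick sanity check on the converted output (a minimal sketch; the output directory name is hypothetical):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

save_dir = "gpt-oss-bf16"  # hypothetical output directory of the conversion script
tokenizer = AutoTokenizer.from_pretrained(save_dir)
model = AutoModelForCausalLM.from_pretrained(save_dir, torch_dtype=torch.bfloat16)
print(model.dtype)  # expect torch.bfloat16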

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
  • Did you write any new necessary tests? No

Who can review?

@ArthurZucker @Cyrilvallez

@Rocketknight1 (Member) left a comment


Conversion script updates look good! I'm a little wary about the change in the core convert_slow_tokenizer.py, though, so I might need a second opinion from an expert there.

print("Saving chat template...")
chat_template_path = os.path.join(save_dir, "chat_template.json")
with open(chat_template_path, "w") as f:
    json.dump({"chat_template": chat_template}, f, indent=2)
Member


We prefer raw chat_template.jinja in modern conversions!

Contributor Author


Would you like me to change the filename to chat_template.jinja?

Contributor Author


I changed the extension name as requested.
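
For reference, the updated save step presumably amounts to something like this (a sketch, assuming the same save_dir and chat_template variables as in the snippet above, not the exact diff):

print("Saving chat template...")
chat_template_path = os.path.join(save_dir, "chat_template.jinja")
with open(chat_template_path, "w") as f:
    f.write(chat_template)  # raw Jinja text, no JSON wrapper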

Comment on lines -1900 to +1903
-    tokenizer.add_special_tokens(
-        [AddedToken(token, normalized=False, special=True) for token in self.extra_special_tokens]
-    )
+    if self.extra_special_tokens is not None:
+        tokenizer.add_special_tokens(
+            [AddedToken(token, normalized=False, special=True) for token in self.extra_special_tokens]
+        )
Member


I'm not sure why we need changes in the core code! cc @itazap @ArthurZucker before I can approve this.

Contributor Author


It hard-crashes otherwise: self.extra_special_tokens can be None, and it is None in the conversion script.
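
The failure mode is plain Python (a standalone sketch, not the converter code itself):

extra_special_tokens = None  # stand-in for self.extra_special_tokens during conversion
try:
    tokens = [t for t in extra_special_tokens]  # iterating over None raises TypeError
except TypeError as err:
    print(err)  # 'NoneType' object is not iterable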

Contributor Author


Bumping for feedback from @itazap and @ArthurZucker.

Collaborator


LGTM 👍

Contributor Author


Great! How can we get this over the line? I would love to see this change in the Transformers 5.0.0 release.

@Cyrilvallez (Member)

cc @ArthurZucker

@github-actions (Contributor)

[For maintainers] Suggested jobs to run (before merge)

run-slow: gpt_oss

@github-actions (Contributor)

View the CircleCI Test Summary for this PR:

https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=42901&sha=e48a61
