Bump transformers and torch #117
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
# Create a list of CustomKVCache instances, one per layer
self.kv_cache = torch.nn.ModuleList()
for _ in range(config.num_hidden_layers):
what happened here? like config doesn't exist anymore?
It still exists; it just feels more idiomatic to iterate over the actual layers.
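For illustration, a minimal sketch of the layer-based iteration being discussed. CustomKVCache is the cache class from this diff, while the helper name and the model.model.layers attribute path are assumptions (Llama-style models), not the PR's actual code:

```python
from torch import nn

# Sketch only: build one cache entry per decoder layer by iterating over the
# layers themselves instead of range(config.num_hidden_layers). The cache
# class is passed in so the snippet stays self-contained.
def build_kv_caches(model, custom_kv_cache_cls, max_batch_size, max_context_length):
    caches = nn.ModuleList()
    for _ in model.model.layers:  # assumed Llama-style decoder layout
        caches.append(
            custom_kv_cache_cls(
                n_heads=model.config.num_key_value_heads,
                head_dim=model.config.head_dim,
                max_batch_size=max_batch_size,
                max_context_length=max_context_length,
            )
        )
    return caches
```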
This reverts commit 99805f8.
self._temp_dir = None

def __del__(self):
    """Clean up temporary files when the model instance is destroyed."""
shouldn't this already happen automatically?
Yeah, probably, but I added it just to be extra sure it's cleaned up between tests.
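A minimal, self-contained sketch of the cleanup pattern being described, assuming _temp_dir holds a directory path; the class name is illustrative and not the PR's actual class:

```python
import shutil
import tempfile

class TempDirOwner:
    """Illustrative only: shows the explicit-cleanup-in-__del__ pattern."""

    def __init__(self):
        # Assumption: the real model creates its temp dir lazily; here we
        # create one up front so the example is runnable.
        self._temp_dir = tempfile.mkdtemp()

    def __del__(self):
        """Clean up temporary files when the instance is destroyed."""
        temp_dir = getattr(self, "_temp_dir", None)
        if temp_dir is not None:
            shutil.rmtree(temp_dir, ignore_errors=True)
            self._temp_dir = None
```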
n_heads=self.num_key_value_heads,
head_dim=self.head_dim,
max_batch_size=layer.max_batch_size,
max_context_length=layer.max_cache_len,
wait, what is happening here? is this the same as sliding_window_len?
Yeah, they removed sliding_window_len; it's now just max_cache_len:
https://github.com/huggingface/transformers/blob/main/src/transformers/cache_utils.py#L357
https://github.com/huggingface/transformers/blob/main/src/transformers/cache_utils.py#L265
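For reference, a hedged sketch of what the per-layer lookup implies; the layers attribute and max_cache_len follow the linked cache_utils.py lines, but treat them as assumptions if your transformers version differs:

```python
# Hedged sketch: after the refactor, cache geometry lives on each cache layer
# (e.g. layer.max_cache_len) rather than on a top-level sliding_window_len.
def per_layer_context_lengths(cache):
    # 'cache.layers' and 'max_cache_len' are taken from the linked
    # cache_utils.py lines; they are assumptions, not verified API here.
    return [layer.max_cache_len for layer in cache.layers]
```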
Summary
Pin bumps
- torch pin bumped to 20250601
- transformers bumped to 4.54.1
Code changes
Includes changes to absorb the huggingface/transformers#39106 kv cache refactor introduced by the transformers upgrade, which specifies kv cache attributes per layer.
cache_config is also no longer a CacheConfig instance but a dict after this PR, so we switch to using .get().
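A minimal sketch of the resulting access pattern, assuming cache_config is now a plain dict; the key names below are illustrative, not necessarily the exact keys used in this repo:

```python
def read_cache_geometry(cache_config: dict):
    # cache_config used to be a CacheConfig object (attribute access); after
    # this PR it is a plain dict, so lookups become .get() with defaults.
    return (
        cache_config.get("max_batch_size", 1),  # assumed key name
        cache_config.get("max_cache_len"),      # assumed key name
    )
```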
Infra changes
Remove Mac tests; see #122 for more details. This also lets us iterate more quickly by cutting down unnecessary CI, since there's no real need to run export tests on Mac when the Linux tests already cover that. In exchange, Mac tests with larger runners are enabled for major LLM models in ExecuTorch in pytorch/executorch#13400.
Known failures