[mllama] fix loading and inference #38223

zucchini-nlp · 2025-05-20T09:56:07Z

What does this PR do?

Fixes #38220, it's a shame we couldn't see it earlier in CI. Probably because mllama isn't available in EU 🥲

We should not repeat keys before calling attn, otherwise it is repeated twice. And remove base_model_prefix so the model can load old state dicts by manual remapping

ArthurZucker

LGTM indeed eager attn forward does this again

ArthurZucker · 2025-05-20T10:00:37Z

Let's make sure the cache object has a correct shape to support GQA

HuggingFaceDocBuilderDev · 2025-05-20T10:09:08Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

fix loading

fix loading

cf3ef4e

zucchini-nlp requested a review from ArthurZucker May 20, 2025 09:56

ArthurZucker approved these changes May 20, 2025

View reviewed changes

zucchini-nlp merged commit 2edb0e4 into huggingface:main May 20, 2025
14 checks passed

faaany pushed a commit to faaany/transformers that referenced this pull request May 21, 2025

[mllama] fix loading and inference (huggingface#38223)

ec91c94

fix loading

xvyv99 pushed a commit to xvyv99/transformers that referenced this pull request May 21, 2025

[mllama] fix loading and inference (huggingface#38223)

7137c9c

fix loading

redmoe-moutain pushed a commit to redmoe-moutain/transformers that referenced this pull request Jun 10, 2025

[mllama] fix loading and inference (huggingface#38223)

70216c3

fix loading

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[mllama] fix loading and inference #38223

[mllama] fix loading and inference #38223

Uh oh!

zucchini-nlp commented May 20, 2025

Uh oh!

ArthurZucker left a comment

Uh oh!

ArthurZucker commented May 20, 2025

Uh oh!

HuggingFaceDocBuilderDev commented May 20, 2025

Uh oh!

Uh oh!

Uh oh!

[mllama] fix loading and inference #38223

[mllama] fix loading and inference #38223

Uh oh!

Conversation

zucchini-nlp commented May 20, 2025

What does this PR do?

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

ArthurZucker commented May 20, 2025

Uh oh!

HuggingFaceDocBuilderDev commented May 20, 2025

Uh oh!

Uh oh!

Uh oh!