[Bugfix] Fix default weight loading for scalars #7534

mgoin · 2024-08-14T22:57:24Z

It is pretty easy to run into issues when loading scalar weights from checkpoints into parameters, since they can have no shape in some cases, resulting in errors like

  File "/home/mgoin/code/vllm/vllm/model_executor/model_loader/weight_utils.py", line 525, in default_weight_loader
    assert param.size() == loaded_weight.size(), (
AssertionError: Attempted to load weight (torch.Size([1])) into parameter (torch.Size([]))

I think it makes sense to detect and allow "broadcasting" in this special case for ease of use.

github-actions · 2024-08-14T22:57:38Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which consists a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of default ones by unblocking the steps in your fast-check build on Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge).

To run full CI, you can do one of these:

Comment /ready on the PR
Add ready label to the PR
Enable auto-merge.

🚀

comaniac

LGTM. I've been faced to this several times...

Signed-off-by: Alvant <alvasian@yandex.ru>

Fix default weight loading for scalars

c247983

mgoin requested review from comaniac and youkaichao August 14, 2024 22:57

mgoin added 2 commits August 14, 2024 22:58

Update comment

43cb06b

Format

9182f3d

comaniac approved these changes Aug 14, 2024

View reviewed changes

mgoin added the ready ONLY add when PR is ready to merge/full CI is needed label Aug 15, 2024

Merge branch 'main' into fix-scalar-loading

16c1b63

mgoin mentioned this pull request Aug 15, 2024

Release v0.5.5 #7481

Closed

simon-mo merged commit 21313e0 into main Aug 15, 2024
49 of 51 checks passed

kylesayrs pushed a commit to neuralmagic/vllm that referenced this pull request Aug 17, 2024

[Bugfix] Fix default weight loading for scalars (vllm-project#7534)

463363f

zifeitong pushed a commit to zifeitong/vllm that referenced this pull request Aug 20, 2024

[Bugfix] Fix default weight loading for scalars (vllm-project#7534)

6ce6dbf

fialhocoelho pushed a commit to opendatahub-io/vllm that referenced this pull request Aug 22, 2024

[Bugfix] Fix default weight loading for scalars (vllm-project#7534)

e495e5b

omrishiv pushed a commit to omrishiv/vllm that referenced this pull request Aug 26, 2024

[Bugfix] Fix default weight loading for scalars (vllm-project#7534)

e233439

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024

[Bugfix] Fix default weight loading for scalars (vllm-project#7534)

08117a6

Signed-off-by: Alvant <alvasian@yandex.ru>

simon-mo deleted the fix-scalar-loading branch October 28, 2024 16:50

KuntaiDu pushed a commit to KuntaiDu/vllm that referenced this pull request Nov 20, 2024

[Bugfix] Fix default weight loading for scalars (vllm-project#7534)

4f4a217

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bugfix] Fix default weight loading for scalars #7534

[Bugfix] Fix default weight loading for scalars #7534

mgoin commented Aug 14, 2024

github-actions bot commented Aug 14, 2024

comaniac left a comment

[Bugfix] Fix default weight loading for scalars #7534

[Bugfix] Fix default weight loading for scalars #7534

Conversation

mgoin commented Aug 14, 2024

github-actions bot commented Aug 14, 2024

comaniac left a comment

Choose a reason for hiding this comment