Model load fix for qwen2 0.5b and 1.5b #2303

matvey-kolbasov-hs · 2024-07-25T08:14:13Z

Fix for Qwen2 model loading

This PR fixes a loading failure for the Qwen2 0.5B and Qwen2 1.5B models. These models use tied embeddings, so they require corresponding aliases for layer names.
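To illustrate the alias idea, here is a minimal sketch, not the code from this PR. It assumes the tied checkpoint stores only the input embedding tensor and that the loader asks for the output head by name. The tensor names follow the usual Hugging Face Qwen2 layout, and `ALIASES` and `resolve_tensor_name` are hypothetical helpers introduced only for this example.

```python
from typing import Dict, List, Mapping

# Hypothetical alias table: if the output head is missing from the checkpoint,
# fall back to the input embedding it is tied to.
ALIASES: Dict[str, List[str]] = {
    "lm_head.weight": ["model.embed_tokens.weight"],
}


def resolve_tensor_name(
    name: str,
    checkpoint: Mapping[str, object],
    aliases: Mapping[str, List[str]] = ALIASES,
) -> str:
    """Return the name under which a tensor is actually stored."""
    if name in checkpoint:
        return name
    # Try each registered alias in order.
    for alternative in aliases.get(name, []):
        if alternative in checkpoint:
            return alternative
    raise KeyError(f"tensor {name!r} not found, and no alias matched")


# A tied checkpoint stores the embedding matrix once.
tied_checkpoint = {"model.embed_tokens.weight": [[0.1, 0.2], [0.3, 0.4]]}
# An untied checkpoint stores a separate output head.
untied_checkpoint = {
    "model.embed_tokens.weight": [[0.1, 0.2], [0.3, 0.4]],
    "lm_head.weight": [[0.5, 0.6], [0.7, 0.8]],
}

resolved_tied = resolve_tensor_name("lm_head.weight", tied_checkpoint)
resolved_untied = resolve_tensor_name("lm_head.weight", untied_checkpoint)
```

With aliases, the loader does not need to know whether a model is tied: it simply uses whichever name is present in the checkpoint.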


danieldk

Thanks for reporting this and providing a PR!

Since the model configuration states whether the embeddings are tied or not, I think it's nicer to rely on the configuration rather than aliases.

See #2313.
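A configuration-driven approach might look like the sketch below. It is not the code from #2313. It assumes the model configuration exposes a boolean `tie_word_embeddings` field, as Hugging Face Qwen2 configs do, and that weights are addressed by prefixes such as `lm_head` and `model.embed_tokens`. The function name `lm_head_prefix` is hypothetical.

```python
from typing import Any, Dict


def lm_head_prefix(config: Dict[str, Any]) -> str:
    """Pick the weight prefix for the output head based on the config.

    Assumption: when embeddings are tied, the checkpoint holds only the
    input embedding, so the head must be loaded from that prefix.
    """
    if config.get("tie_word_embeddings", False):
        return "model.embed_tokens"
    return "lm_head"


# Illustrative configs; only the tie_word_embeddings field matters here.
tied_config = {"model_type": "qwen2", "tie_word_embeddings": True}
untied_config = {"model_type": "qwen2", "tie_word_embeddings": False}

tied_prefix = lm_head_prefix(tied_config)
untied_prefix = lm_head_prefix(untied_config)
```

Compared with aliases, this makes the choice explicit: the loader follows what the configuration declares instead of probing for whichever tensor names happen to exist.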

danieldk · 2024-07-26T12:58:16Z

Fixed by #2313. Thanks again for pointing out this issue and providing a fix!

matvey-kolbasov-hs added 4 commits July 24, 2024 11:59

tied embeddings for qwe2

f73f57c

logging

eabcb29

fix aliases

fbb683f

Merge branch 'huggingface:main' into model_load_fix_for_qwen2_1.5B

e582eed

omri-sap approved these changes Jul 25, 2024

danieldk self-assigned this Jul 25, 2024

danieldk reviewed Jul 26, 2024

danieldk closed this Jul 26, 2024

danieldk removed their assignment Jul 26, 2024
