-
-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix qwen-14b model #1173
fix qwen-14b model #1173
Conversation
28ac647
to
11fcd9d
Compare
I've tested using this code and qwen-14b still can not load |
What error did you encounter? |
Still encountered this error: RuntimeError: shape '[3, 32, 128]' is invalid for input of size 15360 |
I tested it again and it works. Please make sure the changes to your local vllm code have taken effect. |
Thanks! This works for me, turns out I need to restart the cluster instead of detach-and-reattach. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Tested with Qwen/Qwen-14B-Chat
and it works well. Thanks for your contribution!
Support both qwen-7b and qwen-14b.