-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add qwen2 #2495
Add qwen2 #2495
Conversation
Thank you for the contribution! Do you know where can I get the weights for |
Yes, it is not released yet. Would you mind joining our HF org to read our new models, which are temporarily private? https://huggingface.co/Qwen BTW you can contact me through email (junyang.ljy@alibaba-inc.com) or I can join your slack channel for further discussion (my slack email: justinlin930319@gmail.com). |
Thanks. Emailed. |
Where is the difference between qwen2 and Llama2 architecture? It looks like the same. If it's right, does it better to extend LlamaForCasualLM like |
The code for Qwen2 is adaptive to previous Qwen as well as the next generation Qwen2. In comparison with Llama or Mistral, we have qkv bias, and the mixture of sliding window attention and full attention, which is controlled by the argument |
Thanks for access. I'm testing it with
Can you add a Qwen2 config similar to the |
Did you git clone the latest transformers and install it by |
I saw the Huggingface PR. This is pretty tricky because we don't know when will Huggingface release their new version that includes Qwen2. Can you still include it vLLM so we can release it without people waiting for Huggingface nightly/latest release? |
Hi, HF transformers just released their new version with Qwen2 included. You can give a try and see if it works for you. |
verified. pushing a commit for transformers model pinning and will merge. |
Recently, I have pushed the new codes of Qwen2 to Hugging Face Transformers, and thus I also would love to contribute the new model to vLLM as well.
In this PR, I have provided the implementation of Qwen2 model and add some notes on it.