-
Notifications
You must be signed in to change notification settings - Fork 623
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FEAT: Support QwQ-32B #3005
FEAT: Support QwQ-32B #3005
Conversation
About MLX version, you could refer to codes below to make it more compact. inference/xinference/model/llm/llm_family.json Lines 1446 to 1456 in aeb1ccd
|
Modified. |
All tests failed, seems some places in your jsons have wrong syntax. |
uh, I use vscode instead of vim to look for json formatting issues, maybe the encoding is different |
OK, could you resolve it? |
Solved. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
修改容器之后 commit 成新镜像了吗?不然不是自动还原了? |
docker exec进入容器,修改代码后,docker restart容器,这样不行吗? |
Only ['qwen1.5-chat', 'qwen1.5-moe-chat', 'qwen2-instruct', 'qwen2-moe-instruct', 'qwen2.5-instruct', 'qwen2.5-coder-instruct', 'glm4-chat', 'glm4-chat-1m', 'llama-3.1-instruct', 'deepseek-r1-distill-qwen', 'deepseek-r1-distill-llama'] support tool calls。 |
No description provided.