FEAT: Support QwQ-32B #3005

Merged
merged 14 commits into from
Mar 6, 2025
Conversation

cyhasuka
Contributor

@cyhasuka cyhasuka commented Mar 6, 2025

No description provided.

@XprobeBot XprobeBot added this to the v1.x milestone Mar 6, 2025
@cyhasuka cyhasuka changed the title FEAT: support QwQ-32B FEAT: Support QwQ-32B Mar 6, 2025
@ruifenggong

@cyhasuka
Contributor Author

cyhasuka commented Mar 6, 2025

> Suggest adding https://www.modelscope.cn/models/Qwen/QwQ-32B-GGUF

Added.

@qinxuye
Contributor

qinxuye commented Mar 6, 2025

For the MLX version, you could refer to the code below to make it more compact.

{
  "model_format": "mlx",
  "model_size_in_billions": 70,
  "quantizations": [
    "3bit",
    "4bit",
    "6bit",
    "8bit",
    "fp16"
  ],
  "model_id": "mlx-community/Llama-3.3-70B-Instruct-{quantization}"
}
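The compactness comes from listing every quantization once and letting a single `model_id` template cover all of them. A minimal Python sketch of that idea follows, assuming the `{quantization}` placeholder is filled with ordinary string formatting; the `spec` dictionary and the `expand_model_ids` helper are hypothetical and only mirror the fragment above, they are not taken from the project's code.

```python
# Hypothetical spec mirroring the JSON fragment above; not loaded from the real registry.
spec = {
    "model_format": "mlx",
    "model_size_in_billions": 70,
    "quantizations": ["3bit", "4bit", "6bit", "8bit", "fp16"],
    "model_id": "mlx-community/Llama-3.3-70B-Instruct-{quantization}",
}


def expand_model_ids(spec: dict) -> dict:
    """Map each quantization to the ID obtained by filling the placeholder."""
    return {
        q: spec["model_id"].format(quantization=q)
        for q in spec["quantizations"]
    }


model_ids = expand_model_ids(spec)
for quantization, model_id in model_ids.items():
    print(f"{quantization}: {model_id}")
```

One entry with a template replaces five near-identical entries that would differ only in the suffix of `model_id`.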

@cyhasuka
Contributor Author

cyhasuka commented Mar 6, 2025

> For the MLX version, you could refer to the code below to make it more compact.
>
> {
>   "model_format": "mlx",
>   "model_size_in_billions": 70,
>   "quantizations": [
>     "3bit",
>     "4bit",
>     "6bit",
>     "8bit",
>     "fp16"
>   ],
>   "model_id": "mlx-community/Llama-3.3-70B-Instruct-{quantization}"
> }

Modified.

@qinxuye
Contributor

qinxuye commented Mar 6, 2025

All tests failed; it seems some places in your JSON files have syntax errors.
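A quick way to locate this kind of problem is to parse the text with Python's standard `json` module and read the position from the exception. The sketch below is only an illustration: `find_json_error` is a hypothetical helper, and the broken sample (a trailing comma) is made up, not the actual error in this PR.

```python
import json


def find_json_error(text: str):
    """Return None if text parses as JSON, else a (line, column, message) tuple."""
    try:
        json.loads(text)
    except json.JSONDecodeError as exc:
        return exc.lineno, exc.colno, exc.msg
    return None


# Made-up input: a trailing comma after the last list item is invalid JSON.
broken = '{\n  "quantizations": [\n    "4bit",\n  ]\n}'
valid = '{"model_format": "mlx"}'

broken_result = find_json_error(broken)
valid_result = find_json_error(valid)
print(broken_result)
print(valid_result)
```

The exact message and column depend on the Python version, so the output is best read as a pointer to the neighbourhood of the mistake.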

@qinxuye
Contributor

qinxuye commented Mar 6, 2025

image

I don't know what happened.

@cyhasuka
Contributor Author

cyhasuka commented Mar 6, 2025

Uh, I used VS Code instead of vim to check for JSON formatting issues; maybe the encoding is different.
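Whether encoding was really the cause is not established in this thread. If it is suspected, the raw bytes of a file can be checked before parsing. The following is a minimal sketch using only the standard library; `diagnose_json_bytes` and its return strings are hypothetical names chosen for illustration.

```python
import codecs
import json


def diagnose_json_bytes(raw: bytes) -> str:
    """Classify raw file bytes: BOM present, not UTF-8, invalid JSON, or ok."""
    if raw.startswith(codecs.BOM_UTF8):
        return "utf-8 BOM present"
    try:
        text = raw.decode("utf-8")
    except UnicodeDecodeError:
        return "not valid UTF-8"
    try:
        json.loads(text)
    except json.JSONDecodeError:
        return "invalid JSON"
    return "ok"


# Made-up byte strings standing in for file contents.
print(diagnose_json_bytes(codecs.BOM_UTF8 + b"{}"))
print(diagnose_json_bytes('{"a": "\u00e9"}'.encode("latin-1")))
print(diagnose_json_bytes(b'{"model_format": "mlx"}'))
```

For a real file, the bytes would come from something like `Path(name).read_bytes()`; the path is left out here because the thread does not say which file was affected.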

@qinxuye
Contributor

qinxuye commented Mar 6, 2025

> Uh, I used VS Code instead of vim to check for JSON formatting issues; maybe the encoding is different.

OK, could you resolve it?

@cyhasuka
Contributor Author

cyhasuka commented Mar 6, 2025

> OK, could you resolve it?

Solved.

Contributor

@qinxuye qinxuye left a comment

LGTM

@qinxuye qinxuye merged commit 1598c23 into xorbitsai:main Mar 6, 2025
11 of 13 checks passed
@ascacl

ascacl commented Mar 6, 2025

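Hello, I am running a containerized deployment. I entered the container, modified the code according to this PR, and then restarted the container. Why can't I select the model in the UI?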
image

@oubeichen

After modifying the container, did you commit it as a new image? Otherwise, wouldn't the changes be reverted automatically?

@ascacl

ascacl commented Mar 7, 2025

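> After modifying the container, did you commit it as a new image? Otherwise, wouldn't the changes be reverted automatically?

I used docker exec to enter the container, modified the code, and then ran docker restart on the container. Doesn't that work?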
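The thread does not establish why the model was missing from the UI. One thing that can be ruled out from inside the container is that the edited files belong to a different copy of the package than the one Python imports. The sketch below assumes the package is importable as `xinference`; `installed_package_dir` is a hypothetical helper built on the standard `importlib.util.find_spec`.

```python
import importlib.util
from pathlib import Path
from typing import Optional


def installed_package_dir(package: str) -> Optional[Path]:
    """Return the directory Python would import `package` from, or None if not found."""
    spec = importlib.util.find_spec(package)
    if spec is None or spec.origin is None:
        return None
    return Path(spec.origin).parent


# Assumption: the server package is importable as "xinference" in this environment.
# Prints None when the package is not installed where this runs.
print(installed_package_dir("xinference"))
```

If the printed directory differs from the one that was edited, the running server would not see the change regardless of how the container is restarted.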

@helojo

helojo commented Mar 10, 2025

Only ['qwen1.5-chat', 'qwen1.5-moe-chat', 'qwen2-instruct', 'qwen2-moe-instruct', 'qwen2.5-instruct', 'qwen2.5-coder-instruct', 'glm4-chat', 'glm4-chat-1m', 'llama-3.1-instruct', 'deepseek-r1-distill-qwen', 'deepseek-r1-distill-llama'] support tool calls.
Could QwQ-32B enable function calls?
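The quoted message reads like the result of an allow-list check on the model family name. The sketch below reproduces that behaviour under this assumption; the list is copied from the message, while `TOOL_CALL_FAMILIES`, `supports_tool_calls`, and the family name `"QwQ-32B"` are hypothetical, and the project's real check may be implemented differently.

```python
# Names copied from the message quoted above.
TOOL_CALL_FAMILIES = [
    "qwen1.5-chat", "qwen1.5-moe-chat", "qwen2-instruct", "qwen2-moe-instruct",
    "qwen2.5-instruct", "qwen2.5-coder-instruct", "glm4-chat", "glm4-chat-1m",
    "llama-3.1-instruct", "deepseek-r1-distill-qwen", "deepseek-r1-distill-llama",
]


def supports_tool_calls(model_family: str) -> bool:
    """Hypothetical allow-list check: True only for families named in the list."""
    return model_family in TOOL_CALL_FAMILIES


print(supports_tool_calls("qwen2.5-instruct"))  # True
print(supports_tool_calls("QwQ-32B"))           # False
```

Under this reading, enabling function calls for a new family would mean adding its name to the list and making sure its tool-call output format is handled.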

7 participants