FEAT: Support QwQ-32B #3005

cyhasuka · 2025-03-06T01:12:50Z

No description provided.

ruifenggong · 2025-03-06T03:31:28Z

建议加上 https://www.modelscope.cn/models/Qwen/QwQ-32B-GGUF

cyhasuka · 2025-03-06T03:39:46Z

建议加上 https://www.modelscope.cn/models/Qwen/QwQ-32B-GGUF

Added.

qinxuye · 2025-03-06T03:43:41Z

About MLX version, you could refer to codes below to make it more compact.

inference/xinference/model/llm/llm_family.json

Lines 1446 to 1456 in aeb1ccd

    
           { 
        
             "model_format": "mlx", 
        
             "model_size_in_billions": 70, 
        
             "quantizations": [ 
        
               "3bit", 
        
               "4bit", 
        
               "6bit", 
        
               "8bit", 
        
               "fp16" 
        
             ], 
        
             "model_id": "mlx-community/Llama-3.3-70B-Instruct-{quantization}"

cyhasuka · 2025-03-06T03:50:05Z

About MLX version, you could refer to codes below to make it more compact.

inference/xinference/model/llm/llm_family.json

Lines 1446 to 1456 in aeb1ccd

{

"model_format": "mlx",

"model_size_in_billions": 70,

"quantizations": [

"3bit",

"4bit",

"6bit",

"8bit",

"fp16"

],

"model_id": "mlx-community/Llama-3.3-70B-Instruct-{quantization}"

Modified.

qinxuye · 2025-03-06T04:15:42Z

All tests failed, seems some places in your jsons have wrong syntax.

xinference/model/llm/llm_family.json

qinxuye · 2025-03-06T07:15:33Z

Don't know what has happened.

cyhasuka · 2025-03-06T07:18:07Z

uh, I use vscode instead of vim to look for json formatting issues, maybe the encoding is different

qinxuye · 2025-03-06T07:25:14Z

uh, I use vscode instead of vim to look for json formatting issues, maybe the encoding is different

OK, could you resolve it?

cyhasuka · 2025-03-06T07:35:40Z

OK, could you resolve it?

Solved.

qinxuye

LGTM

ascacl · 2025-03-06T14:10:27Z

你好，我容器化部署的，我进入容器按照这个代码修改了代码，然后重启了容器，为什么界面上选不到呢？

oubeichen · 2025-03-07T02:43:16Z

修改容器之后 commit 成新镜像了吗？不然不是自动还原了？

ascacl · 2025-03-07T04:15:29Z

修改容器之后 commit 成新镜像了吗？不然不是自动还原了？

docker exec进入容器，修改代码后，docker restart容器，这样不行吗？

helojo · 2025-03-10T10:18:28Z

Only ['qwen1.5-chat', 'qwen1.5-moe-chat', 'qwen2-instruct', 'qwen2-moe-instruct', 'qwen2.5-instruct', 'qwen2.5-coder-instruct', 'glm4-chat', 'glm4-chat-1m', 'llama-3.1-instruct', 'deepseek-r1-distill-qwen', 'deepseek-r1-distill-llama'] support tool calls。
could QwQ 32B enable function calls？

cyhasuka added 5 commits March 6, 2025 09:02

Update installation.rst

0215569

Update llm_family.json

ec92226

Update llm_family_modelscope.json

006c989

Update core.py

a4460ea

Update core.py

4da2c6f

XprobeBot added the feature label Mar 6, 2025

XprobeBot added this to the v1.x milestone Mar 6, 2025

cyhasuka added 2 commits March 6, 2025 09:19

Update llm_family.json

22894b0

Update llm_family_modelscope.json

4eb97d5

cyhasuka changed the title ~~FEAT: support QwQ-32B~~ FEAT: Support QwQ-32B Mar 6, 2025

Update llm_family.json

5716555

Update llm_family.json

5d05b88

Update llm_family_modelscope.json

9dc3446

qinxuye reviewed Mar 6, 2025

View reviewed changes

xinference/model/llm/llm_family.json Outdated Show resolved Hide resolved

xinference/model/llm/llm_family.json Outdated Show resolved Hide resolved

cyhasuka added 2 commits March 6, 2025 14:38

Update llm_family.json

af5487e

Update llm_family_modelscope.json

31859a5

cyhasuka added 2 commits March 6, 2025 15:31

Update llm_family.json

2eb7b2a

Update llm_family.json

241861d

qinxuye approved these changes Mar 6, 2025

View reviewed changes

qinxuye merged commit 1598c23 into xorbitsai:main Mar 6, 2025
11 of 13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FEAT: Support QwQ-32B #3005

FEAT: Support QwQ-32B #3005

cyhasuka commented Mar 6, 2025

ruifenggong commented Mar 6, 2025

cyhasuka commented Mar 6, 2025

qinxuye commented Mar 6, 2025

cyhasuka commented Mar 6, 2025

qinxuye commented Mar 6, 2025

qinxuye commented Mar 6, 2025

cyhasuka commented Mar 6, 2025

qinxuye commented Mar 6, 2025

cyhasuka commented Mar 6, 2025

qinxuye left a comment

ascacl commented Mar 6, 2025 •

edited

Loading

oubeichen commented Mar 7, 2025

ascacl commented Mar 7, 2025

helojo commented Mar 10, 2025

FEAT: Support QwQ-32B #3005

FEAT: Support QwQ-32B #3005

Conversation

cyhasuka commented Mar 6, 2025

ruifenggong commented Mar 6, 2025

cyhasuka commented Mar 6, 2025

qinxuye commented Mar 6, 2025

cyhasuka commented Mar 6, 2025

qinxuye commented Mar 6, 2025

qinxuye commented Mar 6, 2025

cyhasuka commented Mar 6, 2025

qinxuye commented Mar 6, 2025

cyhasuka commented Mar 6, 2025

qinxuye left a comment

Choose a reason for hiding this comment

ascacl commented Mar 6, 2025 • edited Loading

oubeichen commented Mar 7, 2025

ascacl commented Mar 7, 2025

helojo commented Mar 10, 2025

ascacl commented Mar 6, 2025 •

edited

Loading