Support Pangu Pro MoE model #1204

Merged: 9 commits from pangu into vllm-project:main on Jun 20, 2025

Conversation

Angazenn (Contributor) commented Jun 13, 2025

What this PR does / why we need it?

Support Pangu Pro MoE model (https://arxiv.org/abs/2505.21411)

Does this PR introduce any user-facing change?

Yes, a new model is supported.

How was this patch tested?

Tested locally.

@Yikun Yikun changed the title from "[draft]support pangu" to "[draft]support new moe model" Jun 13, 2025
@shen-shanshan shen-shanshan self-assigned this Jun 13, 2025
shen-shanshan (Collaborator) commented Jun 13, 2025

@Angazenn Thanks for your contribution, please also update the model support doc: https://github.com/vllm-project/vllm-ascend/blob/main/docs/source/user_guide/supported_models.md.
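Such an update typically just adds one row for the new model; a hypothetical entry, assuming the doc keeps a table of model names and support status (the exact columns should follow whatever supported_models.md already uses):

| Pangu Pro MoE | ✅ |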

@Angazenn Angazenn force-pushed the pangu branch 2 times, most recently from d28cd5f to 2ad8d0b on June 20, 2025 at 10:09

shen-shanshan (Collaborator) commented:
@Angazenn Please modify the model name:

PanGuMoEModel -> PanguProMoEModel
PanGuMoEForCausalLM -> PanguProMoEForCausalLM

The class names must match the open-source config, which uses these names.
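For context, vLLM selects the model implementation by matching the architectures field in the checkpoint's config.json against registered names, so the class names have to match the open-source config exactly. A minimal sketch of what such an out-of-tree registration typically looks like (the module path below is hypothetical, not the actual layout of this PR):

# Hypothetical sketch: registering an out-of-tree model class with vLLM.
# The checkpoint's config.json must list the same architecture name:
#   "architectures": ["PanguProMoEForCausalLM"]
from vllm import ModelRegistry

def register_model():
    # Illustrative import path; the real PR defines its own module layout.
    from vllm_ascend.models.pangu_moe import PanguProMoEForCausalLM
    ModelRegistry.register_model("PanguProMoEForCausalLM",
                                 PanguProMoEForCausalLM)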

@Angazenn Angazenn force-pushed the pangu branch 2 times, most recently from ac299f2 to 99df622 on June 20, 2025 at 11:37
@Yikun Yikun mentioned this pull request Jun 20, 2025
@Angazenn Angazenn mentioned this pull request Jun 20, 2025
@Yikun Yikun changed the title from "[draft]support new moe model" to "Support Pangu Pro MoE model" Jun 20, 2025
Yikun (Collaborator) commented Jun 20, 2025

# Launch an OpenAI-compatible server for the local checkpoint (V1 engine,
# tensor parallelism across 4 devices, eager mode, remote code trusted).
VLLM_USE_V1=1 vllm serve /root/.cache/pangu-pro-moe-model \
    --tensor-parallel-size 4 \
    --swap-space 16 \
    --disable-log-stats --disable-log-requests \
    --trust-remote-code --enforce-eager

# Send a short greedy (temperature 0) completion request to the server.
curl http://localhost:8000/v1/completions \
    -H "Content-Type: application/json" \
    -d '{
        "model": "/root/.cache/pangu-pro-moe-model",
        "prompt": "The future of AI is",
        "max_tokens": 128,
        "temperature": 0
    }'

I ran an E2E test in my local env, and it works as expected.
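The same check can also be scripted with the OpenAI Python client (a minimal sketch, assuming the server was started with the command above; vllm serve does not require an API key by default, so "EMPTY" is just a placeholder):

# Minimal sketch: query the OpenAI-compatible server started above.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")
completion = client.completions.create(
    model="/root/.cache/pangu-pro-moe-model",
    prompt="The future of AI is",
    max_tokens=128,
    temperature=0,
)
print(completion.choices[0].text)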

cc @ganyi1996ppo @wangxiyuan

@Yikun Yikun merged commit 2f1266d into vllm-project:main Jun 20, 2025
20 checks passed
3 participants