Skip to content

Conversation

@MengqingCao
Copy link
Collaborator

@MengqingCao MengqingCao commented Oct 22, 2025

What this PR does / why we need it?

This is the step 1 of refactoring code to adapt with vllm main, and this pr aligned with vllm-project/vllm@17c540a

  1. refactor deepseek to the latest code arch as of vllm-project/vllm@17c540a

  2. bunches of fixes due to vllm changes

Does this PR introduce any user-facing change?

How was this patch tested?

CI passed with existing test.

@github-actions
Copy link

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

  • A PR should do only one thing, smaller PRs enable faster reviews.
  • Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
  • Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

@MengqingCao MengqingCao changed the title [Refactor] Refactor code to adapt with vllm main [1/N][Refactor] Refactor code to adapt with vllm main Oct 22, 2025
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request refactors the code to adapt with the vllm main branch. The changes include removing an unused pre-commit hook, adding version-dependent logic for scheduler initialization and compilation level, and updating attention mechanisms for compatibility with different vllm versions. The review focuses on identifying potential issues related to version compatibility and code maintainability, specifically targeting high and critical severity issues.

@MengqingCao MengqingCao added ready read for review ready-for-test start test by label for PR labels Oct 22, 2025
@github-actions
Copy link

This pull request has conflicts, please resolve those before we can evaluate the pull request.

@github-actions
Copy link

This pull request has conflicts, please resolve those before we can evaluate the pull request.

1 similar comment
@github-actions
Copy link

This pull request has conflicts, please resolve those before we can evaluate the pull request.

def version_check():
"""check if torch_npu version >= dev20250919"""
import re
import re # noqa
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

make sense in performance

@weijinqian0
Copy link
Collaborator

add fill_(0) in attenion for vllm-project/vllm#26680

@github-actions
Copy link

This pull request has conflicts, please resolve those before we can evaluate the pull request.

Copy link
Collaborator

@whx-sjtu whx-sjtu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Very hard work!

Copy link
Collaborator

@whx-sjtu whx-sjtu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@github-actions
Copy link

This pull request has conflicts, please resolve those before we can evaluate the pull request.

MengqingCao and others added 2 commits October 24, 2025 02:59
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: Icey <1790571317@qq.com>
Co-authored-by: Icey <1790571317@qq.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
  * fix torchair deepseekv2 modeling when q_lora_rank is None
  * add fill_(0) for attn output in dummy run
  * add comments for NPUWorker.device

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: MengqingCao <cmq0113@163.com>
@MengqingCao
Copy link
Collaborator Author

This pr or the updated vllm code maybe introduce some synchornize operations somewhere, which breaks aclgraph in mtp scenario. But I think this pr is more important, thus recommand to merge this firstly after ci passed. And I will fix the above issue later

@wangxiyuan wangxiyuan merged commit cea0755 into vllm-project:main Oct 24, 2025
28 checks passed
wangxiyuan pushed a commit that referenced this pull request Oct 25, 2025
…ion tests (#3729)

### What this PR does / why we need it?

Enable the unit tests that #3612 skipped.

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

Unit tests.

- vLLM main:
vllm-project/vllm@17c540a

Signed-off-by: gcanlin <canlinguosdu@gmail.com>
wangxiyuan pushed a commit that referenced this pull request Oct 30, 2025
### What this PR does / why we need it?
[UT] fix ut test for test_utils that
#3612 skipped.
### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?
vLLM version: v0.11.0rc3
vLLM main:
vllm-project/vllm@17c540a

- vLLM version: v0.11.0rc3
- vLLM main:
vllm-project/vllm@83f478b

---------

Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants