[Model] Consolidate Deepseek-MoE implementation with DeepSeek-v2 #28101

Isotr0py · 2025-11-05T05:46:13Z

Purpose

Fix [Feature]: Enable DeepSeek-OCR pluginization #28096
Since the difference between ds-moe and ds-v2 is only the MHA/MLA, we can consolidate them.

Test Plan

python examples/offline_inference/vision_language.py -m deepseek_ocr

python examples/offline_inference/basic/generate.py --model deepseek-ai/DeepSeek-V2-Lite-Chat

Test Result

Have confirmed CPU/XPU can run DeepSeek-COR now, and Deepseek-V2 isn't affected.

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>

…deepseek-ocr-cpu Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

mergify · 2025-11-05T15:23:19Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @Isotr0py.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

chatgpt-codex-connector

💡 Codex Review

vllm/vllm/model_executor/models/deepseek_v2.py

Lines 1277 to 1283 in 1ca29da

    
           class DeepseekV2ForCausalLM(nn.Module, SupportsPP, MixtureOfExperts, SupportsLoRA): 
        
               packed_modules_mapping = { 
        
                   "gate_up_proj": ["gate_proj", "up_proj"], 
        
               } 
        
               def __init__(self, *, vllm_config: VllmConfig, prefix: str = ""): 
        
                   super().__init__()

Reintroduce qkv packed mapping for Deepseek LoRA modules

The old DeepseekForCausalLM exposed a packed_modules_mapping entry for "qkv_proj": ["q_proj", "k_proj", "v_proj"] so LoRA and quantization code could translate user‑facing module names to the packed QKVParallelLinear weights. After consolidating the Deepseek implementation into deepseek_v2.py, the new DeepseekForCausalLM simply inherits DeepseekV2ForCausalLM, whose mapping only includes gate_up_proj (and optionally fused_qkv_a_proj). As a result, LoRA loaders now have no mapping for q_proj, k_proj, or v_proj when targeting Deepseek MoE models (see vllm/lora/models.py for how these mappings are consumed), so adapters that previously worked will fail to register or load. Consider restoring the qkv_proj entry when self.use_mha is true so LoRA and packed quantization continue to work for the original Deepseek models.

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

jikunshang

LGTM. thanks for refactoring!

mergify · 2025-11-07T02:22:30Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @Isotr0py.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

mergify · 2025-11-08T02:55:30Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @Isotr0py.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

gcanlin · 2025-11-10T03:29:55Z

LGTM. Thanks for the good work! I have validated Deepseek-OCR on Ascend NPU (910B) using the latest vllm-ascend main branch plus this PR, and the results are successful.

…m-project#28101) Signed-off-by: Kunshang Ji <kunshang.ji@intel.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>

…pSeek-v2 (vllm-project#28101) (vllm-project#14) Signed-off-by: Kunshang Ji <kunshang.ji@intel.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>

sixgod-666 · 2025-11-14T09:06:20Z

LGTM. Thanks for the good work! I have validated Deepseek-OCR on Ascend NPU (910B) using the latest vllm-ascend main branch plus this PR, and the results are successful.

How can I obtain a mirror environment that can successfully run DeepSeek-OCR? I tried the vllm-ascend-main image and found that its vllm version is v0.11.0.

…m-project#28101) Signed-off-by: Kunshang Ji <kunshang.ji@intel.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>

jikunshang and others added 5 commits November 3, 2025 01:22

align with deepseekV2MoE

2b38423

Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>

Merge remote-tracking branch 'jikunshang/kunshang/deepseek_ocr' into …

da7ad80

…deepseek-ocr-cpu Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

consolidate deepseekv2 mha attention

d232db1

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

fix

ac93e7b

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

minor

180f245

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

mergify bot added the deepseek Related to DeepSeek models label Nov 5, 2025

gcanlin mentioned this pull request Nov 5, 2025

[DO NOT MERGE][Model][Patch] Support for Deepseek-OCR vllm-project/vllm-ascend#3874

Draft

Isotr0py added 3 commits November 5, 2025 15:44

update deepseek-moe registry

4d5de8c

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

map deepseek-moe to deepseek-v2

0bb6d5d

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

code format

ee0624e

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

mergify bot added the new-model Requests to new models label Nov 5, 2025

Isotr0py added 3 commits November 5, 2025 16:53

remove deepseek.py

ebcbd4b

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

Merge branch 'main' into deepseek-ocr-cpu

1dbba64

conditional stacked_params_mapping

1ca29da

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

Isotr0py marked this pull request as ready for review November 5, 2025 15:22

Isotr0py requested review from DarkLight1337 and ywang96 as code owners November 5, 2025 15:22

mergify bot added the needs-rebase label Nov 5, 2025

chatgpt-codex-connector bot reviewed Nov 5, 2025

View reviewed changes

Isotr0py added 2 commits November 5, 2025 23:30

Merge remote-tracking branch 'upstream/main' into deepseek-ocr-cpu

266e772

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

codex

04a91dc

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

mergify bot removed the needs-rebase label Nov 5, 2025

jikunshang approved these changes Nov 7, 2025

View reviewed changes

mergify bot added the needs-rebase label Nov 7, 2025

Merge remote-tracking branch 'upstream/main' into deepseek-ocr-cpu

f8d4b3e

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

Isotr0py enabled auto-merge (squash) November 7, 2025 07:09

mergify bot removed the needs-rebase label Nov 7, 2025

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Nov 7, 2025

fix

51cfe4c

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

mergify bot added the needs-rebase label Nov 8, 2025

Merge remote-tracking branch 'upstream/main' into deepseek-ocr-cpu

e339fe3

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

mergify bot removed the needs-rebase label Nov 8, 2025

Isotr0py merged commit 934a9c3 into vllm-project:main Nov 8, 2025
55 checks passed

Isotr0py deleted the deepseek-ocr-cpu branch November 8, 2025 05:53

ywang96 added this to the v0.11.1 milestone Nov 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Model] Consolidate Deepseek-MoE implementation with DeepSeek-v2 #28101

[Model] Consolidate Deepseek-MoE implementation with DeepSeek-v2 #28101

Uh oh!

Isotr0py commented Nov 5, 2025 •

edited by github-actions bot

Loading

Uh oh!

mergify bot commented Nov 5, 2025

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

jikunshang left a comment

Uh oh!

mergify bot commented Nov 7, 2025

Uh oh!

mergify bot commented Nov 8, 2025

Uh oh!

Uh oh!

gcanlin commented Nov 10, 2025

Uh oh!

sixgod-666 commented Nov 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

	class DeepseekV2ForCausalLM(nn.Module, SupportsPP, MixtureOfExperts, SupportsLoRA):
	packed_modules_mapping = {
	"gate_up_proj": ["gate_proj", "up_proj"],
	}

	def __init__(self, *, vllm_config: VllmConfig, prefix: str = ""):
	super().__init__()

Uh oh!

[Model] Consolidate Deepseek-MoE implementation with DeepSeek-v2 #28101

[Model] Consolidate Deepseek-MoE implementation with DeepSeek-v2 #28101

Uh oh!

Conversation

Isotr0py commented Nov 5, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

mergify bot commented Nov 5, 2025

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

jikunshang left a comment

Choose a reason for hiding this comment

Uh oh!

mergify bot commented Nov 7, 2025

Uh oh!

mergify bot commented Nov 8, 2025

Uh oh!

Uh oh!

gcanlin commented Nov 10, 2025

Uh oh!

sixgod-666 commented Nov 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Isotr0py commented Nov 5, 2025 •

edited by github-actions bot

Loading