Tcc0403 (Collaborator) reviewed on Feb 27, 2026:

> @Mecoli1219 can you take a look?
Author: … bf16 test matching Qwen3 MoE tolerances

Author: Are we ready to merge @Tcc0403 @Mecoli1219?



Add Qwen3.5 MoE support to Liger Kernel
Summary
Adds Qwen3.5 MoE support to Liger Kernel (model types `qwen3_5_moe` / `qwen3_5_moe_text`), targeting Transformers v5+. The patch covers RMSNorm (`LigerRMSNormForQwen3Next`), fused SwiGLU experts (`LigerExperts`), and fused linear cross-entropy loss.

Changes
New file:

- `src/liger_kernel/transformers/model/qwen3_5_moe.py` — `lce_forward` for `Qwen3_5MoeForCausalLM`, based on the Qwen3 Next version with the `load_balancing_loss_func` import updated to point to Qwen3.5 MoE's local definition

Modified files:

- `src/liger_kernel/transformers/monkey_patch.py` — `apply_liger_kernel_to_qwen3_5_moe` function (RMSNorm, SwiGLU experts, fused LCE; RoPE disabled) with instance patching for norm layers, the shared expert, and routed experts; registered as `qwen3_5_moe` and `qwen3_5_moe_text` in `MODEL_TYPE_TO_APPLY_LIGER_FN`
- `src/liger_kernel/transformers/__init__.py` — export `apply_liger_kernel_to_qwen3_5_moe` in `TYPE_CHECKING`, `__getattr__`, and `__all__`
- `test/utils.py` — `revert_liger_kernel_to_qwen3_5_moe` for test cleanup
- `test/convergence/fp32/test_mini_models.py` — availability check, imports, and `MiniModelConfig` entry for `mini_qwen3_5_moe`
- `test/transformers/test_monkey_patch.py` — `is_qwen3_5_moe_available` helper and `test_apply_liger_kernel_to_instance_for_qwen3_5_moe` verifying all patches are applied correctly

Test plan
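The `MODEL_TYPE_TO_APPLY_LIGER_FN` registration described above is a model-type-to-patch-function dispatch. A minimal dependency-free sketch of that pattern (the function body and keyword names here are illustrative stand-ins, not the actual Liger Kernel implementation):

```python
# Toy sketch of the model_type -> patch-function dispatch used for registration.
# Names and return values are illustrative only.

def apply_liger_kernel_to_qwen3_5_moe(rms_norm=True, swiglu=True,
                                      fused_linear_cross_entropy=True):
    """Stand-in for the real patch function: records which patches were requested."""
    return {
        "rms_norm": rms_norm,
        "swiglu": swiglu,
        "fused_linear_cross_entropy": fused_linear_cross_entropy,
    }

# Both the composite model type and the text-only variant map to the same function.
MODEL_TYPE_TO_APPLY_LIGER_FN = {
    "qwen3_5_moe": apply_liger_kernel_to_qwen3_5_moe,
    "qwen3_5_moe_text": apply_liger_kernel_to_qwen3_5_moe,
}

def apply_liger_kernel(model_type, **kwargs):
    """Dispatch on the model's config.model_type string."""
    if model_type not in MODEL_TYPE_TO_APPLY_LIGER_FN:
        raise ValueError(f"Unsupported model type: {model_type}")
    return MODEL_TYPE_TO_APPLY_LIGER_FN[model_type](**kwargs)
```

Registering both `qwen3_5_moe` and `qwen3_5_moe_text` means the same patch function is found whether the loaded checkpoint exposes the composite or the text-only config.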
- `test_apply_liger_kernel_to_instance_for_qwen3_5_moe` passes (monkey-patch instance patching)
- `mini_qwen3_5_moe` convergence test passes (fp32 mini model)
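Instance patching, which the first test above exercises, swaps submodules on an already-constructed model rather than replacing classes before construction. A dependency-free sketch of the idea, with toy classes standing in for the real Qwen3.5 MoE and Liger modules:

```python
# Toy illustration of instance-level monkey patching: replace submodules on an
# already-instantiated model in place, reusing their existing parameters.
# All class names here are illustrative stand-ins, not real Liger Kernel APIs.

class RMSNorm:
    def __init__(self, weight):
        self.weight = weight

class LigerRMSNorm(RMSNorm):
    """Stand-in for the fused Liger norm module."""
    pass

class Layer:
    def __init__(self):
        self.input_layernorm = RMSNorm(weight=[1.0, 1.0])

class Model:
    def __init__(self, num_layers=2):
        self.layers = [Layer() for _ in range(num_layers)]

def patch_rms_norm_in_place(model):
    """Swap every RMSNorm instance for the Liger version, keeping weights."""
    for layer in model.layers:
        old = layer.input_layernorm
        layer.input_layernorm = LigerRMSNorm(weight=old.weight)

model = Model()
patch_rms_norm_in_place(model)
```

The instance test in `test_monkey_patch.py` presumably asserts the analogous condition on the real model: after patching, each norm, shared-expert, and routed-expert module is an instance of the corresponding Liger class.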