Moe bf16 ep #4144
base: main
Conversation
```python
# we don't need to read this, it would be passed to ray workers
# If Ray is launched from outside, it may fail to access the environment variables.
os.getenv('DEEPEP_MAX_BATCH_SIZE', None)
os.getenv('DEEPEP_MAX_TOKENS_PER_RANK', None)
```
Do we need to set those envs manually?
DLBlas reads these env vars when building its buffer:
https://github.com/DeepLink-org/DLBlas/blob/1710a860f654ddf50907251ec51670910368ee45/dlblas/layers/moe/token_dispatcher.py#L43
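For illustration, a minimal sketch of how a dispatcher could pick up these variables with fallback defaults (the helper name and default values are assumptions, not from DLBlas):

```python
import os

# Hypothetical helper: read the DeepEP buffer-sizing env vars, falling back
# to illustrative defaults when they are unset. Ray workers would only see
# these values if the variables are exported in their environment.
def read_deepep_env(default_batch: int = 128, default_tokens: int = 4096):
    max_batch = int(os.getenv('DEEPEP_MAX_BATCH_SIZE', default_batch))
    max_tokens = int(os.getenv('DEEPEP_MAX_TOKENS_PER_RANK', default_tokens))
    return max_batch, max_tokens
```

This is why merely calling `os.getenv(...)` in the launcher is not enough for externally launched Ray workers: each worker process must actually inherit or be passed the variables.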
```python
hidden_dim: int,
top_k: int,
layer_idx: int = 0,
chunk_size: Optional[int] = 32 * 1024,
```
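For context, a hedged sketch of a MoE module constructor taking the parameters shown in this hunk (the class name and attribute assignments are assumptions for illustration only):

```python
from typing import Optional

class FusedMoESketch:
    """Hypothetical MoE layer wrapper mirroring the diff's constructor signature."""

    def __init__(self,
                 hidden_dim: int,
                 top_k: int,
                 layer_idx: int = 0,
                 chunk_size: Optional[int] = 32 * 1024):
        self.hidden_dim = hidden_dim   # model hidden size
        self.top_k = top_k             # experts selected per token
        self.layer_idx = layer_idx     # position of this layer in the model
        # chunk_size would cap how many tokens are processed per step;
        # per the review below, the normal (non-chunked) path ignores it.
        self.chunk_size = chunk_size
```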
chunk_size is not used by FusedMoENormal
backends/moe.py and nn/moe.py have been refactored.
Reuse token dispatcher in DLBlas