[core][distributed] add ep group and all2all interface #18077
Conversation
class All2AllBase:

    def __init__(self, cpu_group, model):
For how to use cpu_group to initialize pplx, see ppl-ai/pplx-kernels#18 as an example.
Here we have access to the model instance, so we can also get the MoE configs here; we can assume all MoE layers share the same config, so no _all_to_all_cache is needed.
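For illustration, a minimal sketch of what such a constructor could do with cpu_group and the model instance (the FusedMoE class-name lookup and the stored attributes are assumptions for this example, not code from the PR):

import torch
import torch.distributed as dist

class All2AllBase:

    def __init__(self, cpu_group: dist.ProcessGroup, model: torch.nn.Module):
        self.cpu_group = cpu_group
        self.rank = dist.get_rank(group=cpu_group)
        self.world_size = dist.get_world_size(group=cpu_group)
        # All MoE layers are assumed to share one config, so it can be read
        # once from any layer instead of keeping a per-call _all_to_all_cache.
        moe_layers = [m for m in model.modules()
                      if type(m).__name__ == "FusedMoE"]  # hypothetical lookup
        self.moe_layer = moe_layers[0] if moe_layers else None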
From @nandor: the nvshmem handshake
uid = nvshmem_get_unique_id() if rank == 0 else nvshmem_alloc_empty_unique_id()
torch.distributed.broadcast(uid, src=0)
nvshmem_init(uid, rank, world_size)
is not necessary for intranode code.
We can tell whether we are on a single node with in_the_same_node_as (vllm/vllm/distributed/parallel_state.py, line 1127 at 79a1d25):
def in_the_same_node_as(pg: Union[ProcessGroup, StatelessProcessGroup],
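A minimal sketch of that suggestion, gating the handshake on in_the_same_node_as; the pplx_kernels import path and the helper name are assumptions:

import torch.distributed as dist
from vllm.distributed.parallel_state import in_the_same_node_as

def maybe_init_nvshmem(cpu_group: dist.ProcessGroup) -> None:
    # Every rank on one node: intranode kernels need no nvshmem handshake.
    if all(in_the_same_node_as(cpu_group, source_rank=0)):
        return
    # Import path is an assumption based on pplx-kernels.
    from pplx_kernels.nvshmem import (nvshmem_alloc_empty_unique_id,
                                      nvshmem_get_unique_id, nvshmem_init)
    rank = dist.get_rank()
    world_size = dist.get_world_size()
    uid = (nvshmem_get_unique_id()
           if rank == 0 else nvshmem_alloc_empty_unique_id())
    dist.broadcast(uid, src=0)
    nvshmem_init(uid, rank, world_size)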
Thanks @youkaichao - the abstractions generally look good. But the dispatch/combine calls for pplx take a very different set of inputs than the naive all2all implementation. I think we can cross that bridge when we add the pplx All2All implementation to the abstraction; just adding this as a note.
Agree, right now the interface just ports code from the naive implementation. We should definitely change it when we add the pplx All2All implementation to the abstraction.
    def dispatch(self, hidden_states: torch.Tensor,
                 router_logits: torch.Tensor):
        raise NotImplementedError

    def combine(self, hidden_states: torch.Tensor) -> torch.Tensor:
        raise NotImplementedError
Is there a function signature for dispatch and combine that will work across all kernels?
- The naive multicast implementation
- pplx-kernels
- DeepEP
I think it would need to look like:
def dispatch(self,
             hidden_states: torch.Tensor,
             hs_scales: torch.Tensor,    # quantized scales for hidden_states
             router_logits: torch.Tensor,
             topk_weights: torch.Tensor,
             topk_ids: torch.Tensor) -> Tuple[torch.Tensor, torch.Tensor]  # out_hidden_states and out_scales
(There are other parameters needed by the pplx-kernels, but I think they can be encapsulated in a PPLXAll2All wrapper class.)
Another consideration here is that the output formats of the different A2A implementations will differ (see the shape sketch below):
- For naive All2All, the output shape would be [total_tokens_across_dp, hidden_size].
- For pplx All2All, it will be [num_experts_per_rank, max_tokens_per_expert, hidden_size], with padding along axis 1.
- For DeepEP, I am not sure what the output format will look like.
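A shape-only sketch of that difference (the concrete sizes are made up for illustration):

import torch

hidden_size = 4096
total_tokens_across_dp = 1024       # sum of tokens over all DP ranks
num_experts_per_rank = 8
max_tokens_per_expert = 128         # per-expert capacity, padded along axis 1

# Naive All2All output: one flat token batch.
naive_out = torch.empty(total_tokens_across_dp, hidden_size)

# pplx All2All output: per-expert layout with padding.
pplx_out = torch.empty(num_experts_per_rank, max_tokens_per_expert, hidden_size)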
For a generic dispatch/combine interface across the different backends (pplx, naive, deepep), I think we can have:
def dispatch(self, tensors: List[torch.Tensor]) -> List[torch.Tensor]:
def combine(self, tensors: List[torch.Tensor]) -> List[torch.Tensor]:
The backend can decide what each tensor means, and the MoE layer can use assertions to make sure the list contains the data it needs.
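As a sketch of how a MoE layer might consume that list-based interface (the [hidden_states, router_logits] convention for the naive backend is an assumption):

from typing import List
import torch

class All2AllBase:

    def dispatch(self, tensors: List[torch.Tensor]) -> List[torch.Tensor]:
        raise NotImplementedError

    def combine(self, tensors: List[torch.Tensor]) -> List[torch.Tensor]:
        raise NotImplementedError

def moe_dispatch(a2a: All2AllBase, hidden_states: torch.Tensor,
                 router_logits: torch.Tensor):
    # Assumed naive-backend convention: [hidden_states, router_logits] in,
    # [hidden_states, router_logits] out.
    out = a2a.dispatch([hidden_states, router_logits])
    assert len(out) == 2, "backend must return hidden_states and router_logits"
    return out[0], out[1]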
What's the reason for having the base class in this case? It seems that the caller will have to know exactly which implementation it's using.
It's mainly to manage the lifecycle of the communication library and to separate the implementations.
It is true that the dispatch/combine APIs are quite different, but what matters in this PR is to unify the construction, backend selection, and destruction. As a byproduct, we need a generic dispatch/combine function.
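A rough sketch of that lifecycle idea, with the subclass names and the environment-variable switch as illustrative assumptions:

import os

class All2AllBase:

    def __init__(self, cpu_group, model):
        self.cpu_group = cpu_group
        self.model = model

    def destroy(self):
        # Default: nothing to tear down.
        pass

class NaiveAll2All(All2AllBase):
    pass  # plain torch.distributed collectives, no extra state to manage

class PPLXAll2All(All2AllBase):

    def __init__(self, cpu_group, model):
        super().__init__(cpu_group, model)
        # pplx/nvshmem initialization would live here, outside parallel_state.py.

    def destroy(self):
        # pplx/nvshmem finalization would live here as well.
        pass

def create_all2all(cpu_group, model) -> All2AllBase:
    backend = os.environ.get("VLLM_ALL2ALL_BACKEND", "naive")  # assumed switch
    return {"naive": NaiveAll2All, "pplx": PPLXAll2All}[backend](cpu_group, model)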
I agree with @tlrmchlsmth that it would be good if these interfaces matched the pplx AllToAll interfaces.
feel free to change the interface in the moe modularization pr!
try:
    # pytorch <= 2.6
    from torch.distributed.distributed_c10d import _shutdown_backend
    _shutdown_backend(pg)
except ImportError:
    # pytorch >= 2.7
    pg.shutdown()
I used this in the PPLX PR. I think this is a bit nicer than try/catch.
if is_torch_equal_or_newer("2.7"):
    pg.shutdown()
else:
    # Lazy import for non-CUDA backends.
    from torch.distributed.distributed_c10d import _shutdown_backend
    _shutdown_backend(pg)
feel free to change it in the moe modularization pr!
The all2all interface should be refactored in a follow-up from #15956.
This PR mainly adds the EP (expert parallel) group and an all2all base class, so that backend-dependent code like pplx's pplx_init does not leak into the common code path (vllm/distributed/parallel_state.py).