Conversation

@wukaixingxp

Implements a new transport layer using CUDA IPC (Inter-Process Communication) for direct GPU-to-GPU memory sharing within a single node.

This enables zero-copy transfers between processes by sharing GPU memory handles instead of copying data through CPU memory.

Key features:

  • Direct GPU memory access using CUDA IPC handles
  • Eliminates GPU->CPU->GPU copies for intra-node transfers
  • Leverages NVLink/PCIe P2P when available
  • Automatic fallback for non-CUDA tensors

Co-Authored-By: Claude Sonnet 4 <noreply@anthropic.com>
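
For readers unfamiliar with the mechanism, here is a minimal sketch of intra-node zero-copy sharing using `torch.multiprocessing`, which transmits CUDA tensors between processes as CUDA IPC handles rather than copies. This only illustrates the primitive such a transport builds on, not this PR's actual API; the tensor shape, queue-based handoff, and process layout are assumptions for illustration.

```python
import torch
import torch.multiprocessing as mp

def consumer(q):
    # The received tensor is rebuilt from a CUDA IPC handle: it aliases
    # the producer's GPU allocation, so no GPU->CPU->GPU copy occurs and
    # in-place writes are visible to the producer.
    t = q.get()
    t.add_(1)

if __name__ == "__main__":
    mp.set_start_method("spawn")         # CUDA tensors can't be shared across a fork
    src = torch.zeros(4, device="cuda")  # producer-owned GPU memory
    q = mp.Queue()
    p = mp.Process(target=consumer, args=(q,))
    p.start()
    q.put(src)  # shares the storage handle (cudaIpcGetMemHandle under the hood), not a copy
    p.join()    # producer must keep src alive while the consumer uses it
    print(src)  # tensor([1., 1., 1., 1.], device='cuda:0') -- the consumer's write
```

This also suggests why the fallback bullet exists: CUDA IPC handles only exist for device allocations within a single node, so CPU tensors (or cross-node peers) presumably have to take a copy-based path instead. Whether the mapped transfer actually rides NVLink or PCIe P2P depends on the GPU topology; the IPC mapping itself works the same either way.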
@meta-cla bot added the CLA Signed label Feb 6, 2026
@wukaixingxp marked this pull request as ready for review February 10, 2026 19:19
@amirafzali
Member

A general question: when do we find CUDA IPC useful? My understanding is it would only apply to single-host, synchronous weight-sync cases. Do you see this as a common use case?
