Insights: deepspeedai/DeepSpeed
Overview
- 7 Merged pull requests
- 1 Open pull request
- 0 Closed issues
- 6 New issues
7 Pull requests merged by 6 people
- Add index to HPU devices (#7497, merged Aug 19, 2025)
- Reduce performance impact of compiler.enable decorator (#7498, merged Aug 18, 2025)
- Fix DeepCompile for PyTorch v2.8 (#7496, merged Aug 18, 2025)
- Fix invalid f-strings (#7457, merged Aug 16, 2025)
- Add Zenflow code for Stage 1 & 2 (#7391, merged Aug 15, 2025)
- fix xpu device_id AttributeError issue (#7488, merged Aug 15, 2025)
- Enable forked PRs (#7486, merged Aug 14, 2025)
1 Pull request opened by 1 person
- DeepCompile ZeRO-3: robust allgather for uneven shards; fix profiling… (#7489, opened Aug 15, 2025)
6 Issues opened by 6 people
- [BUG]Deepspeed (v0.15.4 ~v0.16.9) Zero3 training performance is slow,compare than v0.13.1 (#7499, opened Aug 19, 2025)
- [REQUEST] Add automatic logging of parallelism and ZeRO config to WandbMonitor (#7494, opened Aug 16, 2025)
- [BUG] No backpropagation after micro-batch-id ≥ 3 with MPI backend on Jetson Orin AGX (#7492, opened Aug 15, 2025)
- Model saved from deepspeed and accelerate cannot be loaded or incomeplete (#7490, opened Aug 15, 2025)
- [BUG] Cuda failure 700 when use deepcompile with zero stage 3 (#7487, opened Aug 14, 2025)
7 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
- [AMD][ROCm] Improve support of AMD (#7448, commented on Aug 19, 2025; 1 new comment)
- Does the open-source code for FastPersist include the last two optimizations mentioned in the paper, "parallelizing checkpoint writes over DP ranks and pipelining checkpoint writes"? (#7475, commented on Aug 13, 2025; 0 new comments)
- nv-torch-nightly-v100 CI test failure (#7467, commented on Aug 20, 2025; 0 new comments)
- nv-nightly CI test failure (#7140, commented on Aug 20, 2025; 0 new comments)
- Support DeepSpeed offload and reload states with ZeRO1 and ZeRO2 (#7421, commented on Aug 19, 2025; 0 new comments)
- Support Muon Optimizer (#7454, commented on Aug 18, 2025; 0 new comments)
- Add world-size getter in Engine (#7479, commented on Aug 16, 2025; 0 new comments)