Skip to content

Latest commit

 

History

History
12 lines (10 loc) · 458 Bytes

2020-03-17-reduce-scatter.md

File metadata and controls

12 lines (10 loc) · 458 Bytes
title sneak_preview tags excerpt
ZeRO stage 1 with reduced communication
true
training ZeRO English
Partition-aware ZeRO with up to 2x reduction in communication time!
  • Partition-aware approach instead of initial implementation that used a global collective (all-reduce)
  • Total communication volume reduction 1.5x -> 1x of data parallelism
  • Up to 2x reduction in communication time compared to all-reduce

Further updates coming soon!