Skip to content

reduce all-to-all communication volume when both expert and non-expert are tensor-parallel#5626

Merged
loadams merged 9 commits intomicrosoft:masterfrom taozhiwei:myfeatureJul 23, 2024

Commits

Commits on Jun 25, 2024

Commits on Jun 27, 2024

Commits on Jun 30, 2024

Commits on Jul 2, 2024

Commits on Jul 9, 2024

Commits on Jul 12, 2024

Commits on Jul 16, 2024

Commits on Jul 20, 2024

Commits on Jul 22, 2024