Conversation

@fmassa fmassa commented Aug 8, 2025

IMO we should just add the loss in the model and let autoparallel parallelize it for us. But for now, let's follow how the other models are implemented
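For context, a minimal sketch (not from this PR) of what "adding the loss in the model" could look like, so the loss computation is part of the graph a parallelizer sees; the wrapper class and all names below are illustrative assumptions:

import torch
import torch.nn as nn
import torch.nn.functional as F

class ModelWithLoss(nn.Module):
    """Hypothetical wrapper: forward() returns the loss directly, so the
    loss lives inside the graph that gets traced and parallelized."""

    def __init__(self, model: nn.Module):
        super().__init__()
        self.model = model

    def forward(self, tokens: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
        logits = self.model(tokens)  # (batch, seq, vocab)
        # Cross-entropy computed inside forward(), alongside the rest of the model.
        return F.cross_entropy(logits.flatten(0, 1).float(), labels.flatten(0, 1))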

@fmassa fmassa requested a review from wconstab August 8, 2025 09:44
@meta-cla meta-cla bot added the CLA Signed label Aug 8, 2025
# in our case, but we can work around it by
# casting the output to a DTensor on a 1d device mesh.
# We should just use AutoParallel to do this for us, but
# it would require putting the loss inside the model as well.
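As a rough illustration of the workaround described in the comment above, here is a hedged sketch of casting the plain-tensor output to a DTensor on a 1d device mesh and then computing the loss under loss parallelism; `tp_degree`, `model`, `inputs`, and `labels` are placeholders, not names from this PR:

import torch
import torch.nn.functional as F
from torch.distributed.device_mesh import init_device_mesh
from torch.distributed.tensor import DTensor, Shard
from torch.distributed.tensor.parallel import loss_parallel

tp_degree = 8                                  # assumed tensor-parallel size
mesh = init_device_mesh("cuda", (tp_degree,))  # 1d device mesh

# `local_logits` is this rank's shard of the vocab dimension: a plain
# torch.Tensor of shape (batch, seq, vocab // tp_degree).
local_logits = model(inputs)

# Cast the output to a DTensor sharded on the last (vocab) dimension.
logits = DTensor.from_local(local_logits, device_mesh=mesh, placements=[Shard(-1)])

# With the output expressed as a DTensor, the loss can be computed without
# gathering the full vocab dimension on every rank.
with loss_parallel():
    loss = F.cross_entropy(logits.flatten(0, 1), labels.flatten(0, 1))
    loss.backward()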
I agree that overall we should just put the loss in the model, but I like the approach here for now because it's useful to be as structurally similar to torchtitan as possible for drop-in purposes

@fmassa fmassa merged commit 3f04d22 into autoparallel Aug 10, 2025
2 checks passed
@fmassa fmassa deleted the fmassa/enable_loss_parallel branch August 10, 2025 17:12