
Fix SDPA decomp problem #4851


Merged: 1 commit merged into main on Aug 22, 2024

Conversation

mcremon-meta (Contributor)

Summary:
As titled. The new `_safe_softmax` function is meant to avoid NaN issues, mostly during training. For inference we shouldn't need it, so we swap it with the regular softmax, which prevents the decomposition that introduces the unsupported ops (`eq`, `logical_not`, and `any`). See https://www.internalfb.com/code/fbsource/fbcode/caffe2/torch/_decomp/decompositions.py?lines=425.
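
For illustration, here is a minimal sketch (not the actual pass from this diff) of what the swap could look like on an exported aten-level graph, assuming `_safe_softmax` shows up as `torch.ops.aten._safe_softmax.default` in a `torch.fx.GraphModule`; the helper name `replace_safe_softmax` is hypothetical:

```python
import torch

def replace_safe_softmax(gm: torch.fx.GraphModule) -> torch.fx.GraphModule:
    """Hypothetical aten IR pass: swap _safe_softmax for the regular softmax.

    This is fine for inference, where the NaN-masking behavior of _safe_softmax
    (the source of the eq/logical_not/any decomposition) is not needed.
    """
    for node in gm.graph.nodes:
        if node.op == "call_function" and node.target == torch.ops.aten._safe_softmax.default:
            # Both ops take (self, dim, dtype=None), so the existing args carry over.
            node.target = torch.ops.aten.softmax.int
    gm.graph.lint()
    gm.recompile()
    return gm
```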

Note that this required some changes to `run_and_verify`, since we now need to apply some aten IR changes. I will fix that in another diff, where `run_and_verify` will use a no-op quantizer instead; that way the code path will be the same for fp32 and quantized. But let's make CI green first!
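
As a rough idea of what that could look like (hypothetical, not part of this diff), a no-op quantizer would simply annotate nothing, so the fp32 path goes through the same quantization machinery without changing the graph:

```python
import torch
from torch.ao.quantization.quantizer import Quantizer

class NopQuantizer(Quantizer):
    """Hypothetical no-op quantizer: it adds no quantization annotations,
    so prepare/convert leave the graph unchanged and fp32 follows the same
    code path as quantized models."""

    def annotate(self, model: torch.fx.GraphModule) -> torch.fx.GraphModule:
        # No nodes are annotated for quantization.
        return model

    def validate(self, model: torch.fx.GraphModule) -> None:
        pass
```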

We will also need to better formalize how to apply passes to the initial graph module (aten IR passes, as opposed to edge IR passes). It seems that lifted constants and similar things can create issues, but unless we see errors, let's wait until the IR changes from PT/ET land first.

Reviewed By: hsharma35

Differential Revision: D61639074


pytorch-bot bot commented Aug 22, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/4851

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 66c39b4 with merge base 87b38cf:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label on Aug 22, 2024
@facebook-github-bot (Contributor)

This pull request was exported from Phabricator. Differential Revision: D61639074


@facebook-github-bot merged commit d7c069f into main on Aug 22, 2024 (35 of 37 checks passed)
Labels: CLA Signed, fb-exported