Skip to content

Added benchmarking for new torchao low precision attention api#3865

Open
howardzhang-cv wants to merge 6 commits intogh/howardzhang-cv/17/basefrom
gh/howardzhang-cv/17/head
Open

Added benchmarking for new torchao low precision attention api#3865
howardzhang-cv wants to merge 6 commits intogh/howardzhang-cv/17/basefrom
gh/howardzhang-cv/17/head

Conversation

@howardzhang-cv
Copy link
Contributor

@howardzhang-cv howardzhang-cv commented Feb 12, 2026

Stack from ghstack (oldest at bottom):

Summary

  • Added new benchmark for new low precision attention API (FP8 FA3 backend)
  • Compares LPIPS and runtime against high-precision model
  • uses flux.1-schnell model, 4 inference steps, DrawBench prompts
  • has options to control number of prompts, torch.compile usage, warmup_iters, using debug prompts, number of inference steps
  • Following the guidelines of add performance and accuracy eval of flux-1.schnell #3502
  • Can configure the recipe of the run through changing config in top of file

Example Run

python benchmarks/attention/eval_flux_model.py

Results

Config LPIPS Runtime Speedup
w/out compile 0.224 1.04s 1.06x
w/ compile 0.226 0.77s 1.12x
  • Tested w/ and w/out torch.compile, using FP8 FA3 backend (no fused rope, no hadamard)
  • LPIPS and Speedup are compared to high-precision model that is either w/ or w/out torch.compile (to keep it fair)

[ghstack-poisoned]
@pytorch-bot
Copy link

pytorch-bot bot commented Feb 12, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3865

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit fe36423 with merge base d4c1ba3 (image):

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 12, 2026
howardzhang-cv added a commit that referenced this pull request Feb 12, 2026
Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:
ghstack-source-id: 66939a9
Pull-Request: #3865
@howardzhang-cv howardzhang-cv added benchmark module: not user facing Use this tag if you don't want this PR to show up in release notes labels Feb 12, 2026
[ghstack-poisoned]
howardzhang-cv added a commit that referenced this pull request Feb 12, 2026
Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:
ghstack-source-id: dd15b60
Pull-Request: #3865
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
howardzhang-cv added a commit that referenced this pull request Feb 13, 2026
Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:
ghstack-source-id: 4f64a25
Pull-Request: #3865
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

benchmark CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. module: not user facing Use this tag if you don't want this PR to show up in release notes

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant

Comments