Added benchmarking for new torchao low precision attention api by howardzhang-cv · Pull Request #3865 · pytorch/ao

howardzhang-cv · 2026-02-12T02:39:29Z

Stack from ghstack (oldest at bottom):

Summary

Added new benchmark for new low precision attention API (FP8 FA3 backend)
Compares LPIPS and runtime against high-precision model
uses flux.1-schnell model, 4 inference steps, DrawBench prompts
has options to control number of prompts, torch.compile usage, warmup_iters, using debug prompts, number of inference steps
Following the guidelines of add performance and accuracy eval of flux-1.schnell #3502
Can configure the recipe of the run through changing config in top of file

Example Run

python benchmarks/attention/eval_flux_model.py

Results

Config	LPIPS	Runtime	Speedup
w/out compile	0.224	1.04s	1.06x
w/ compile	0.226	0.77s	1.12x

Tested w/ and w/out torch.compile, using FP8 FA3 backend (no fused rope, no hadamard)
LPIPS and Speedup are compared to high-precision model that is either w/ or w/out torch.compile (to keep it fair)

[ghstack-poisoned]

pytorch-bot · 2026-02-12T02:39:33Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3865

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit fe36423 with merge base d4c1ba3 ():

NEW FAILURE - The following job has failed:

PR Label Check / Check PR Labels (gh)
Process completed with exit code 1.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 66939a9 Pull-Request: #3865

[ghstack-poisoned]

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: dd15b60 Pull-Request: #3865

[ghstack-poisoned]

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 4f64a25 Pull-Request: #3865

Update

eb083c0

[ghstack-poisoned]

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 12, 2026

howardzhang-cv added a commit that referenced this pull request Feb 12, 2026

Added benchmarking for new torchao low precision attention api

3664ef6

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 66939a9 Pull-Request: #3865

howardzhang-cv mentioned this pull request Feb 12, 2026

Added new API for low precision fp8 attention using FA3 #3857

Open

howardzhang-cv added benchmark module: not user facing Use this tag if you don't want this PR to show up in release notes labels Feb 12, 2026

Update

2380610

[ghstack-poisoned]

howardzhang-cv added a commit that referenced this pull request Feb 12, 2026

Added benchmarking for new torchao low precision attention api

29f2406

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: dd15b60 Pull-Request: #3865

Update

994a587

[ghstack-poisoned]

howardzhang-cv mentioned this pull request Feb 13, 2026

use helion instead of triton for low precision attention quantization kernels #3880

Closed

howardzhang-cv added 3 commits February 12, 2026 17:06

Update

b17b5ec

[ghstack-poisoned]

Update

9589d68

[ghstack-poisoned]

Update

fe36423

[ghstack-poisoned]

howardzhang-cv added a commit that referenced this pull request Feb 13, 2026

Added benchmarking for new torchao low precision attention api

66a2ddd

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 4f64a25 Pull-Request: #3865

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added benchmarking for new torchao low precision attention api#3865

Added benchmarking for new torchao low precision attention api#3865
howardzhang-cv wants to merge 6 commits intogh/howardzhang-cv/17/basefrom
gh/howardzhang-cv/17/head

howardzhang-cv commented Feb 12, 2026 •

edited

Loading

Uh oh!

pytorch-bot bot commented Feb 12, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

howardzhang-cv commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Example Run

Results

Uh oh!

pytorch-bot bot commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3865

❌ 1 New Failure

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

howardzhang-cv commented Feb 12, 2026 •

edited

Loading

pytorch-bot bot commented Feb 12, 2026 •

edited

Loading