Support QAT int4 v1 path for BC #2888

andrewor14 · 2025-08-27T14:20:49Z

Summary: `Int4WeightOnlyConfig` supports version 1 (targeting tinygemm) and version 2 (targeting fbgemm). However, version 2 requires a new dependency (fbgemm_gpu_genai >= 1.2.0), which is problematic for torchao integrations with other frameworks. For now, we should continue to support the v1 path for backward compatibility (BC).

Test Plan:

python test/quantization/test_qat.py -k test_infer_int4_weight_only_config
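
For integrations that cannot guarantee the newer dependency, one way to choose between the two versions is to check what is installed before building the config. The following is only a sketch under stated assumptions: the helper names (`parse_release`, `installed_fbgemm_version`, `pick_int4_version`) are hypothetical and not part of torchao, the distribution name passed to `importlib.metadata` is assumed to match the `fbgemm_gpu_genai` name in the summary, and the version parsing is deliberately simplified (pre-release suffixes are ignored).

```python
import re
from importlib import metadata

# Minimum version quoted in the PR summary: fbgemm_gpu_genai >= 1.2.0
MIN_FBGEMM = (1, 2, 0)


def parse_release(version_string):
    """Return the leading numeric release as a 3-tuple, e.g. '1.3.0+cu126' -> (1, 3, 0)."""
    numbers = []
    for piece in version_string.split("+")[0].split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        numbers.append(int(match.group()))
    # Pad or trim to three components so '1.2' compares equal to '1.2.0'.
    numbers = (numbers + [0, 0, 0])[:3]
    return tuple(numbers)


def installed_fbgemm_version(dist_name="fbgemm_gpu_genai"):
    """Return the installed version string, or None if the package is absent.

    The distribution name is an assumption taken from the PR summary.
    """
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def pick_int4_version(installed_version):
    """Hypothetical helper: use v2 only when the dependency is new enough, else v1."""
    if installed_version is None:
        return 1
    return 2 if parse_release(installed_version) >= MIN_FBGEMM else 1


if __name__ == "__main__":
    version = pick_int4_version(installed_fbgemm_version())
    print(f"would construct Int4WeightOnlyConfig(version={version})")
```

The selection logic takes the version string as an argument so it can be exercised without the dependency installed; the resulting integer would then be passed as `Int4WeightOnlyConfig(version=...)`.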

pytorch-bot · 2025-08-27T14:20:54Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2888

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There is 1 currently active SEV. If your PR is affected, please view it below:

Multiple CI trunk failures after landing https://github.com/pytorch/pytorch/pull/161002

✅ No Failures

As of commit 80ccdbc with merge base 6f035e8:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vkuzo · 2025-08-27T15:12:07Z

I think this PR should also test the int4 workflow with version 1, can we add that?


andrewor14 · 2025-08-27T19:33:43Z

> I think this PR should also test the int4 workflow with version 1, can we add that?

added a test

jerryzh168 · 2025-08-27T20:06:46Z

test/quantization/test_qat.py

+        """
+        self._test_quantize_api_against_ptq(
+            Int4WeightOnlyConfig(version=version),
+            target_prepare_sqnr=12,


Does this mean the prepare numerics do not match the convert numerics very well? Will this be an issue?
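
For context on the `target_prepare_sqnr=12` threshold, here is a minimal sketch of a signal-to-quantization-noise ratio in decibels. It assumes the common definition, 20 * log10(||reference|| / ||reference - approximation||); the function name `sqnr_db` is hypothetical, and the test helper in this PR may compute its metric differently.

```python
import math


def sqnr_db(reference, approximation):
    """SQNR in dB between two equal-length sequences of floats.

    Assumes the common definition 20 * log10(||ref|| / ||ref - approx||),
    which may differ from what the torchao test actually uses.
    """
    signal = math.sqrt(sum(r * r for r in reference))
    noise = math.sqrt(sum((r - a) ** 2 for r, a in zip(reference, approximation)))
    if noise == 0.0:
        return math.inf  # identical outputs: no quantization noise
    if signal == 0.0:
        return -math.inf  # all-zero reference with nonzero error
    return 20.0 * math.log10(signal / noise)


if __name__ == "__main__":
    # Signal norm 5.0, error norm 0.5, so the ratio is 10.
    print(sqnr_db([3.0, 4.0], [3.0, 4.5]))
```

Under this definition, 12 dB corresponds to a norm ratio of about 10^(12/20) ≈ 3.98, meaning the error norm is roughly 25% of the signal norm.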


meta-cla bot added the CLA Signed label Aug 27, 2025

andrewor14 requested review from jerryzh168 and vkuzo August 27, 2025 14:20

andrewor14 force-pushed the qat-int4-v1 branch from e17a0a7 to 084eaff Compare August 27, 2025 14:38

andrewor14 added the topic: improvement label Aug 27, 2025

andrewor14 force-pushed the qat-int4-v1 branch from 084eaff to 8bb6aac Compare August 27, 2025 17:51

andrewor14 force-pushed the qat-int4-v1 branch from 8bb6aac to 80ccdbc Compare August 27, 2025 18:40

jerryzh168 reviewed Aug 27, 2025


jerryzh168 approved these changes Aug 27, 2025


andrewor14 merged commit 6e9bf26 into main Aug 28, 2025
18 checks passed
