Support PLAIN_INT32 for AWQ on Intel GPU #3019

xiaowangintel · 2025-09-17T09:25:19Z

Summary:
Support PLAIN_INT32 for AWQ on Intel GPU
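
For readers unfamiliar with packed int4 storage, here is a minimal sketch of packing eight 4-bit values into one 32-bit word, which is what a name like PLAIN_INT32 suggests. The helper names and the nibble order (first value in the lowest 4 bits) are assumptions for illustration; the layout used by this PR and by the Intel GPU kernels may differ.

```python
# Hypothetical helpers, not torchao APIs.
# Assumption: the first value of each group of eight goes in the lowest 4 bits.

def pack_int4_to_int32(values):
    """Pack ints in [0, 15] into 32-bit words, eight values per word."""
    if len(values) % 8 != 0:
        raise ValueError("length must be a multiple of 8")
    words = []
    for i in range(0, len(values), 8):
        word = 0
        for j, v in enumerate(values[i:i + 8]):
            if not 0 <= v <= 15:
                raise ValueError(f"value {v} does not fit in 4 bits")
            word |= v << (4 * j)  # place value j at bit offset 4*j
        words.append(word)
    return words


def unpack_int32_to_int4(words):
    """Inverse of pack_int4_to_int32: recover eight 4-bit values per word."""
    out = []
    for word in words:
        for j in range(8):
            out.append((word >> (4 * j)) & 0xF)
    return out


vals = list(range(16))
packed = pack_int4_to_int32(vals)
assert unpack_int32_to_int4(packed) == vals  # round trip is lossless
```

Packing this way stores eight weights per 32-bit element, so a row of 128 int4 weights occupies 16 words.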

Test:

# task is gsm8k
python example.py --repo "microsoft/Phi-4-mini-instruct" --quant awq-int4wo-128 --calibration_limit 5 --max_seq_length 4096 --device xpu

# task is mmlu
python example.py --repo "Qwen/Qwen3-8B" --quant awq-int4wo-128 --calibration_limit 1 --max_seq_length 4096 --device xpu
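
The `awq-int4wo-128` value above appears to select int4 weight-only quantization with a group size of 128; that reading is an assumption based on the flag name. As a rough illustration of group-wise asymmetric int4 quantization in general, the plain-Python sketch below quantizes each group of a weight row with its own scale and zero point. All function names are hypothetical, and this is not the code path `example.py` runs.

```python
# Hypothetical sketch of group-wise asymmetric 4-bit quantization.
# Each group gets its own scale and zero point (here, the group minimum).

def quantize_group_int4(group):
    """Map floats in one group to integers in [0, 15]."""
    lo, hi = min(group), max(group)
    scale = (hi - lo) / 15 or 1.0  # fall back to 1.0 for a constant group
    q = [min(15, max(0, round((x - lo) / scale))) for x in group]
    return q, scale, lo


def dequantize_group_int4(q, scale, zero):
    """Reconstruct approximate floats from 4-bit codes."""
    return [v * scale + zero for v in q]


def quantize_row(row, group_size):
    """Quantize a row in consecutive groups of `group_size` elements."""
    return [
        quantize_group_int4(row[i:i + group_size])
        for i in range(0, len(row), group_size)
    ]


# Tiny example with group_size=4; the command above would correspond to 128.
row = [0.0, 0.5, 1.0, 1.5, -3.0, 0.0, 3.0, 1.0]
groups = quantize_row(row, group_size=4)
restored = []
for q, scale, zero in groups:
    restored.extend(dequantize_group_int4(q, scale, zero))
```

Smaller groups track local weight ranges more closely at the cost of storing more scales and zero points.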

Result:

Task	Model	calibration_limit	awq
gsm8k	Phi-4-mini-instruct	5	0.75815
mmlu	Qwen3-8B	1	0.7595

pytorch-bot · 2025-09-17T09:25:22Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3019

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 3a325f3 with merge base 067b273:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jerryzh168 · 2025-09-18T03:37:37Z

torchao/prototype/awq/example.py

Please add a test like the one in ao/test/quantization/quantize_/workflows/int4/test_int4_tensor.py (line 223 in 18dbe87):

def test_activation_prescaling(self):

@xiaowangintel let us add a unit test like the one for int4_tensor.
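
As background for the requested test, the sketch below illustrates the identity that AWQ-style activation pre-scaling relies on: multiplying each input channel of the weight by a scale and dividing the matching activation channel by the same scale leaves the linear output unchanged before quantization. It uses plain Python lists and hypothetical function names; it is not the referenced `test_activation_prescaling` and makes no claim about what that test asserts.

```python
# Hypothetical illustration, not torchao code.

def linear(x, weight):
    """Compute y[o] = sum_i x[i] * weight[o][i] for one input vector x."""
    return [sum(xi * wi for xi, wi in zip(x, row)) for row in weight]


def apply_awq_scales(x, weight, scales):
    """Divide activations and multiply weights by per-input-channel scales."""
    x_scaled = [xi / s for xi, s in zip(x, scales)]
    w_scaled = [[wi * s for wi, s in zip(row, scales)] for row in weight]
    return x_scaled, w_scaled


x = [1.0, -2.0, 0.5]
weight = [[0.2, 0.4, -1.0], [1.5, 0.0, 3.0]]  # 2 outputs, 3 input channels
scales = [2.0, 0.5, 4.0]                      # one scale per input channel

reference = linear(x, weight)
x_s, w_s = apply_awq_scales(x, weight, scales)
rescaled = linear(x_s, w_s)

# The two outputs agree up to floating-point rounding.
assert all(abs(a - b) < 1e-9 for a, b in zip(reference, rescaled))
```

A real unit test would additionally quantize the scaled weight on the target device and compare against the unquantized output within a tolerance.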

liangan1

LGTM, how about the CUDA accuracy on these two models?

xiaowangintel · 2025-09-19T01:42:13Z

> LGTM, how about the CUDA accuracy on these two models?

Please see #2400.

Support PLAIN_INT32 for AWQ on Intel GPU

99f6b3f

meta-cla bot added the CLA Signed label Sep 17, 2025

xiaowangintel changed the title ~~Support PLAIN_INT32 for AWQ on Intel GPU~~ [WIP]Support PLAIN_INT32 for AWQ on Intel GPU Sep 17, 2025

xiaowangintel requested review from Xia-Weiwen, jerryzh168 and liangan1 September 18, 2025 03:30

xiaowangintel changed the title ~~[WIP]Support PLAIN_INT32 for AWQ on Intel GPU~~ Support PLAIN_INT32 for AWQ on Intel GPU Sep 18, 2025

Support PLAIN_INT32 for AWQ on Intel GPU

fabebb2

jerryzh168 reviewed Sep 18, 2025

liangan1 reviewed Sep 18, 2025

liangan1 added the topic: improvement label Sep 18, 2025

Support PLAIN_INT32 for AWQ on Intel GPU

3a325f3

liangan1 requested review from jerryzh168 and liangan1 September 19, 2025 01:50

liangan1 approved these changes Sep 19, 2025

liangan1 mentioned this pull request Sep 19, 2025

Filling some Int4 tensor feature gaps after tensor subclass migration #3013

Open

6 tasks

jerryzh168 approved these changes Sep 19, 2025

jerryzh168 merged commit cfa39c8 into pytorch:main Sep 19, 2025
18 checks passed
