-
Notifications
You must be signed in to change notification settings - Fork 349
Support PLAIN_INT32 for AWQ on Intel GPU #3019
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3019
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit 3a325f3 with merge base 067b273 ( This comment was automatically generated by Dr. CI and updates every 15 minutes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
add a test like
def test_activation_prescaling(self): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@xiaowangintel let us add the UT as int4_tensor.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, how about the CUDA accuracy on these two models?
Please visit #2400 |
Summary:
Support PLAIN_INT32 for AWQ on Intel GPU
Test:
Result: