[SYCL]fix ggml_sycl_mul_mat_id() to match the change of api #7436
Some minor things which I think should be changed.
I have also tested this branch on an Arc A770 and an A100 GPU and can confirm the MUL_MAT_ID implementation is working.
@arthw could you rebase this PR? I believe it will resolve the failing CI checks.
yes, rebase it.
It was added in #6505 as a workaround to avoid crashing SYCL with MoE models, but if mul_mat_id is fixed here, it should be removed.
Fix ggml_sycl_mul_mat_id(), which was broken by the API parameter changes in #6505.
Now only one mul_mat_id unit test case fails: type_a=iq4_nl. This is a known issue.
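For readers unfamiliar with the operation, here is a rough CPU sketch of what a mul_mat_id-style op computes for MoE models: each token carries a list of selected expert ids, and its input vector is multiplied by each selected expert's weight matrix. This is not the SYCL code from this PR and not the ggml API. The function name `mul_mat_id_ref`, the flat row-major layout, and the assumption that one input vector per token is shared by all of its selected experts are all illustrative.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical reference routine for illustration only (not part of ggml).
// experts: n_expert matrices of shape rows x cols, row-major, stacked back to back.
// input:   n_tokens vectors of length cols (assumed shared by all selected experts).
// ids:     n_tokens * n_used expert indices, n_used per token.
// Returns  n_tokens * n_used output vectors of length rows, in (token, slot) order.
std::vector<float> mul_mat_id_ref(const std::vector<float>& experts,
                                  const std::vector<float>& input,
                                  const std::vector<int>& ids,
                                  std::size_t n_expert, std::size_t rows,
                                  std::size_t cols, std::size_t n_tokens,
                                  std::size_t n_used) {
    assert(experts.size() == n_expert * rows * cols);
    assert(input.size() == n_tokens * cols);
    assert(ids.size() == n_tokens * n_used);

    std::vector<float> out(n_tokens * n_used * rows, 0.0f);
    for (std::size_t t = 0; t < n_tokens; ++t) {
        const float* x = input.data() + t * cols;
        for (std::size_t k = 0; k < n_used; ++k) {
            // Look up which expert this (token, slot) pair routes to.
            const int e = ids[t * n_used + k];
            assert(e >= 0 && static_cast<std::size_t>(e) < n_expert);
            const float* w = experts.data() + static_cast<std::size_t>(e) * rows * cols;
            float* y = out.data() + (t * n_used + k) * rows;
            // Plain matrix-vector product with the selected expert.
            for (std::size_t r = 0; r < rows; ++r) {
                float acc = 0.0f;
                for (std::size_t c = 0; c < cols; ++c) {
                    acc += w[r * cols + c] * x[c];
                }
                y[r] = acc;
            }
        }
    }
    return out;
}
```

With two 2x2 experts (identity and 2x identity), one token `{3, 4}`, and ids `{1, 0}`, the routine writes the result from expert 1 into the first output slot and the result from expert 0 into the second.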