CLBlast support for Mixtral #4451
Comments
I managed to launch it all right but could not enter any input. |
Yeah, the BLAS prompt processing doesn't seem to work like it should; hopefully we'll get a fix for that pretty soon. |
I discovered that all models are broken with CLBlast and git bisected the bad commit, see #4453 Fix: |
Is this still a problem after 55e87c3? |
Somehow it works, but performance is not that great. It makes inference just a bit faster: CPU
OpenCL
(these numbers are for offloading only 14 layers) |
Mixtral support with OpenCL requires some extra work |
It would be nice to have a full to-do list specifically for this issue. I see OpenCL-related remarks in ggml.c in |
Input still does not work. The input bar is blinking and stops blinking when I type something, but no text actually shows up, and pressing Enter does not do anything either. Thanks to all the people working on this never-ending project; it feels like several models are released each day that break things or need support. Weird: I was about to shut down my PC when I realized I hadn't closed the program, and it had actually worked, but it was extremely context-focused until it started to ramble about pubs with itself. Guess it did get thirsty. It seems to completely fail to read and understand the Bob transcript (which worked for Mistral 7B). |
Yeah, for me cuBLAS or any other BLAS gives basically the same evaluation speed as the CPU (maybe 1 tk/s faster), making Mixtral completely unusable. |
Shouldn't this be marked as a bug, not "enhancement"? |
Sadly, there is no news anywhere regarding Mixtral and CLBlast, and it isn't mentioned on the roadmap either. |
This issue is stale because it has been open for 30 days with no activity. |
This issue was closed because it has been inactive for 14 days since being marked as stale. |
The Mixtral PR has been merged: #4428
Testing the latest commit 948ff13 with CLBlast and Mixtral, I see some assertion errors:
I assume that the CLBlast implementation has not yet been updated to support Mixtral, so filing this issue to request this improvement. Thanks!