-
Notifications
You must be signed in to change notification settings - Fork 2
Add Adreno GPU support #17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
3e74c27
to
e9ae5a1
Compare
Add new OpenCL backend to support Adreno GPUs --------- Co-authored-by: Skyler Szot <quic_sszot@quicinc.com> Co-authored-by: Shangqing Gu <quic_shawngu@quicinc.com> Co-authored-by: Alexander Angus <quic_aangus@quicinc.com> Co-authored-by: Hongqiang Wang <quic_wangh@quicinc.com> Co-authored-by: Max Krasnyansky <quic_maxk@quicinc.com>
* `ggml-base`, `..` and `.` are added by `ggml_add_backend_library`
Still getting the warning about clCreateCmdQueue being obsolete. Will fix that separately.
Also for unsupported GPUs.
c8f46be
to
9697d07
Compare
@lhez It looks a nice feature for qualcomm SoCs.
So, tested with the following change.
2nd (runtime error)
|
I have rebased the this branch onto the current llama.cpp master. Everthing works for on the T14s Gen6 Qualcom under linux. If anytests are required please ping me. I hope this gets merged soon |
@contra-bit Thank you for testing it out. This has already been merged upstream. |
@sparkleholic Sorry for missing this. I didn't seem to receive notification about this. Did you use a commercial device or a development board (or a testing device)? |
Merged upstream. |
@lhez <
Hence I've check the OpenCL info on QCS8550.
I suspect the following hard-coded size might be the cause.
Because of the following resource limitations.
I've just verified it's working well by reducing the allocation size not exceeding 256MB. |
issue: CodeLinaro#17 (comment) This patch fixes the memory allocation size not exceeding the maximum size of the OpenCL device.
issue: CodeLinaro#17 (comment) This patch fixes the memory allocation size not exceeding the maximum size of the OpenCL device.
issue: CodeLinaro/llama.cpp#17 (comment) This patch fixes the memory allocation size not exceeding the maximum size of the OpenCL device.
issue: CodeLinaro/llama.cpp#17 (comment) This patch fixes the memory allocation size not exceeding the maximum size of the OpenCL device.
issue: CodeLinaro/llama.cpp#17 (comment) This patch fixes the memory allocation size not exceeding the maximum size of the OpenCL device.
issue: CodeLinaro/llama.cpp#17 (comment) This patch fixes the memory allocation size not exceeding the maximum size of the OpenCL device.
issue: CodeLinaro/llama.cpp#17 (comment) This patch fixes the memory allocation size not exceeding the maximum size of the OpenCL device.
Add new OpenCL backend to support Adreno GPUs