You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The WIP implementation in that PR might be a bit outdated by now, so one can either attempt to update it or implement it from scratch on top of the current code base.
The text was updated successfully, but these errors were encountered:
@goerch I just synced the unit tests from llama.cpp as you proposed in #317
Will close the issue as completed now.
Maybe in the future we can more similar simplifications for other ops that have quantization branches (e.g. ggml_cpy())
This task is described well in ggerganov/llama.cpp#1237
The WIP implementation in that PR might be a bit outdated by now, so one can either attempt to update it or implement it from scratch on top of the current code base.
The text was updated successfully, but these errors were encountered: