ggml : deprecate GGML_TASK_FINALIZE
#284
ggerganov added the enhancement, good first issue, refactoring, and performance labels and removed the enhancement label on Jun 25, 2023
ggerganov changed the title to "ggml : deprecate GGML_TASK_FINALIZE" on Jun 25, 2023
goerch added a commit to goerch/ggml that referenced this issue on Jun 25, 2023
goerch added a commit to goerch/ggml that referenced this issue on Jun 26, 2023
Hi, I took a look at this and found that GGML_TASK_FINALIZE is still used at https://github.com/ggerganov/llama.cpp/blob/178b1850ebd21b349cebbee887950e435c5aa2d3/ggml.c#L15453. Please advise.
INIT and FINALIZE are now optional and disabled for the majority of ops (see lines 4080 to 4116 in 79ea7a4).
This is good enough for now.
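For readers following along, here is a rough sketch of what "optional and disabled for the majority of ops" could look like as a per-op lookup table. It is not the code at the referenced lines: the `demo_*` names, the two example ops, and the flag values are all made up for illustration.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical op list for this sketch only; not ggml's real op enum. */
enum demo_op { DEMO_OP_ADD, DEMO_OP_MUL_MAT, DEMO_OP_COUNT };

/* Which optional stages an op asks for. COMPUTE is always run. */
struct demo_stage_flags {
    bool init;
    bool finalize;
};

/* Assumed values: which ops need INIT is invented for the example. */
static const struct demo_stage_flags demo_stages[DEMO_OP_COUNT] = {
    [DEMO_OP_ADD]     = { false, false }, /* COMPUTE only */
    [DEMO_OP_MUL_MAT] = { true,  false }, /* INIT + COMPUTE */
};

/* Number of stages the scheduler would actually dispatch for one op. */
static int demo_stages_run(enum demo_op op) {
    assert(op >= 0 && op < DEMO_OP_COUNT);
    int n = 1; /* COMPUTE */
    if (demo_stages[op].init)     n++;
    if (demo_stages[op].finalize) n++;
    return n;
}
```

With a table like this, an op that leaves both flags off is dispatched once, and the scheduler never has to enter the stages it does not use.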
The initial design of the compute tasks in ggml was for each one to have 3 separate stages:
GGML_TASK_INIT
GGML_TASK_COMPUTE
GGML_TASK_FINALIZE
So far, the GGML_TASK_FINALIZE step has been left completely unused, and it seems unlikely to find any application in the future. Therefore, it is best to remove it altogether. This will simplify the code a little and should also improve performance, since there will be fewer thread synchronization points during graph computation.
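To make the synchronization argument concrete, here is a small back-of-the-envelope sketch. It assumes the worker threads meet at one barrier after every stage that a node runs; that is a simplification for illustration, not a description of ggml's actual scheduler. The `sketch_*` names are hypothetical and only mirror the GGML_TASK_* stages.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stage names; they mirror, but are not, GGML_TASK_*. */
enum sketch_task_type {
    SKETCH_TASK_INIT,
    SKETCH_TASK_COMPUTE,
    SKETCH_TASK_FINALIZE,
};

/* Assumption: one barrier per stage that a graph node actually runs. */
static int sketch_barriers_per_node(bool has_init, bool has_finalize) {
    int n = 1; /* COMPUTE always runs */
    if (has_init)     n++;
    if (has_finalize) n++;
    return n;
}

/* Total synchronization points for a graph of n_nodes identical nodes. */
static int sketch_barriers_per_graph(int n_nodes, bool has_init, bool has_finalize) {
    assert(n_nodes >= 0);
    return n_nodes * sketch_barriers_per_node(has_init, has_finalize);
}
```

Under that assumption, a 100-node graph goes from 300 synchronization points with all three stages to 200 once FINALIZE is gone, even though FINALIZE never did any work.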