Question about the design philosophy: Manual Graph Construction vs. DL Compilers #17674
Unanswered
kimminsu38oo
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi, I’m relatively new to this project, so please forgive me if I’m asking something obvious.
From what I understand,
llama.cppworks by manually constructing the computation graph (e.g., viabuild_graphinggml) and invoking operators one by one. This seems different from the approach used by Deep Learning Compilers like TVM or XLA, which automate graph optimization and operator fusion.If my understanding is correct, I have a few questions regarding the limitations of this "manual" approach:
I’m curious why llama.cpp chose this manual implementation strategy over a compiler-based approach. Are there any plans to introduce compiler-like optimizations in the future?
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions