
Conversation

ggerganov (Member)

fix #14835

Since in Metal we encode the graph in parallel using multiple MTLComputeCommandEncoders, we have to prevent fusing ops that are split across different encoders.

github-actions bot added labels ggml (changes relating to the ggml tensor library for machine learning) and Apple Metal on Jul 24, 2025
@ggerganov ggerganov merged commit 065908c into master Jul 24, 2025
55 checks passed
@ggerganov ggerganov deleted the gg/metal-fix-fusion branch July 24, 2025 07:24
taronaeo pushed a commit to taronaeo/llama.cpp-s390x that referenced this pull request Jul 25, 2025
* metal : fix fusion across different encoders

ggml-ci

* cont : add assertion

ggml-ci
gabe-l-hart added a commit to gabe-l-hart/llama.cpp that referenced this pull request Jul 25, 2025
* origin/master:
docs : update HOWTO-add-model.md for ModelBase and new model classes (ggml-org#14874)
ggml : remove invalid portPos specifiers from dot files (ggml-org#14838)
context : restore preemptive sched reset when LLAMA_SET_ROWS=0 (ggml-org#14870)
mtmd : fix 32-bit narrowing issue in export-lora and mtmd clip (ggml-org#14503)
rpc : check for null buffers in get/set/copy tensor endpoints (ggml-org#14868)
sched : fix multiple evaluations of the same graph with pipeline parallelism (ggml-org#14855)
musa: upgrade musa sdk to rc4.2.0 (ggml-org#14498)
sync : ggml
cmake : fix usage issues (ggml/1257)
ggml-cpu : remove stdlib include from repack.cpp (ggml/1276)
context : perform output reorder lazily upon access after sync (ggml-org#14853)
chat : fix kimi-k2 chat template (ggml-org#14852)
sycl: fixed semantics of block offset calculation (ggml-org#14814)
llama : fix MiniCPM inference after Granite Four changes (ggml-org#14850)
docs: add libcurl-dev install hint for Linux distros (ggml-org#14801)
metal : fix fusion across different encoders (ggml-org#14849)
sycl: fix undefined variable in work group size check (ggml-org#14843)
convert : text-only support for GLM-4.1V-9B-Thinking (ggml-org#14823)
CUDA: fix overflow in FA, tune performance (ggml-org#14840)
CUDA: fix compilation with GGML_CUDA_F16 (ggml-org#14837)
Development

Successfully merging this pull request may close these issues.

Eval bug: gemma3 generates infinite "and" output after commit bf9087f