mtmd : remove libllava, remove clip-quantize-cli (⚠️ breaking change) #13460

ngxson · 2025-05-11T22:33:06Z

In this PR:

Remove libllava - it contains too many redundant and unsafe code - the libmtmd already covers all use cases with a better API
Remove clip-quantize-cli because it's already broken a long time ago - it will be replaced soon ; In the meantime, if you need to quantize vision models, use convert_hf_to_gguf.py --outtype, minimum type supported is q8_0
Move all conversion scripts to mtmd/legacy-models ; new models can be converted using convert_hf_to_gguf.py --mmproj

NOTE: in the next PR, many APIs will be removed from clip.h, as we will convert clip.cpp to be used internally by libmtmd

The kv cache hierarchy was squashed so that now all of the llama-kv-cache-* implementations inherit directly from llama_memory_i and there is no intermediary llama_kv_cache base class. ggml-org/llama.cpp#14006 The llava.* tool files were migrated to mtmd.* files ggml-org/llama.cpp#13460 Branch: GraniteFour Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

The llava.* tool files were migrated to mtmd.* files ggml-org/llama.cpp#13460 Branch: GraniteFour Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

ngxson added 2 commits May 12, 2025 00:26

mtmd : remove libllava, remove clip-quantize-cli

ab209aa

rm clip_model_quantize

3795423

ngxson added the breaking change Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility. label May 11, 2025

github-actions bot added examples python python script changes labels May 11, 2025

ngxson marked this pull request as ready for review May 12, 2025 15:04

ngxson requested a review from ggerganov May 12, 2025 15:04

ggerganov approved these changes May 12, 2025

View reviewed changes

ngxson merged commit b472634 into ggml-org:master May 13, 2025
46 checks passed

ngxson mentioned this pull request May 13, 2025

clip : clip.h become private API (⚠️ breaking change) #13510

Merged

gabe-l-hart added a commit to gabe-l-hart/ollama that referenced this pull request Jun 25, 2025

fix: Remove llava.*

be6cbbb

The llava.* tool files were migrated to mtmd.* files ggml-org/llama.cpp#13460 Branch: GraniteFour Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

mtmd : remove libllava, remove clip-quantize-cli (⚠️ breaking change) #13460

mtmd : remove libllava, remove clip-quantize-cli (⚠️ breaking change) #13460

Uh oh!

ngxson commented May 11, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

mtmd : remove libllava, remove clip-quantize-cli (⚠️ breaking change) #13460

mtmd : remove libllava, remove clip-quantize-cli (⚠️ breaking change) #13460

Uh oh!

Conversation

ngxson commented May 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ngxson commented May 11, 2025 •

edited

Loading