Skip to content

mtmd : remove libllava, remove clip-quantize-cli (⚠️ breaking change) #13460

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
May 13, 2025

Conversation

ngxson
Copy link
Collaborator

@ngxson ngxson commented May 11, 2025

In this PR:

  • Remove libllava - it contains too many redundant and unsafe code - the libmtmd already covers all use cases with a better API
  • Remove clip-quantize-cli because it's already broken a long time ago - it will be replaced soon ; In the meantime, if you need to quantize vision models, use convert_hf_to_gguf.py --outtype, minimum type supported is q8_0
  • Move all conversion scripts to mtmd/legacy-models ; new models can be converted using convert_hf_to_gguf.py --mmproj

NOTE: in the next PR, many APIs will be removed from clip.h, as we will convert clip.cpp to be used internally by libmtmd

@ngxson ngxson added the breaking change Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility. label May 11, 2025
@github-actions github-actions bot added examples python python script changes labels May 11, 2025
@ngxson ngxson marked this pull request as ready for review May 12, 2025 15:04
@ngxson ngxson requested a review from ggerganov May 12, 2025 15:04
@ngxson ngxson merged commit b472634 into ggml-org:master May 13, 2025
46 checks passed
gabe-l-hart added a commit to gabe-l-hart/ollama that referenced this pull request Jun 24, 2025
The kv cache hierarchy was squashed so that now all of the llama-kv-cache-*
implementations inherit directly from llama_memory_i and there is no
intermediary llama_kv_cache base class.

ggml-org/llama.cpp#14006

The llava.* tool files were migrated to mtmd.* files

ggml-org/llama.cpp#13460

Branch: GraniteFour

Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>
gabe-l-hart added a commit to gabe-l-hart/ollama that referenced this pull request Jun 25, 2025
The llava.* tool files were migrated to mtmd.* files

ggml-org/llama.cpp#13460

Branch: GraniteFour

Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
breaking change Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility. examples python python script changes
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants