Replies: 2 comments
-
Hey community! Here's a good explanation to help you run the Python version locally.
-
This is newly relevant with yesterday's release of Gemini Diffusion: https://simonwillison.net/2025/May/21/gemini-diffusion/ I might be interested in exploring this. I'm wrapping up a quantization PR for Iwan's fork (ikawrakow/ik_llama.cpp#441) and am curious what it would take to add support for a whole new architecture to llama.cpp (although I can't guarantee immediate bandwidth).
Original question:
Mercury Coder is a diffusion-based LLM whose creators claim huge efficiency improvements over standard autoregressive LLMs. Is there any possibility that llama.cpp will add support for it? LLaDA, linked below, is an open diffusion LLM in the same vein:
https://huggingface.co/spaces/multimodalart/LLaDA
https://ml-gsai.github.io/LLaDA-demo/
https://github.com/ML-GSAI/LLaDA
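For context on how these models decode (and why supporting them in llama.cpp would be non-trivial), here is a minimal, illustrative sketch of the masked-diffusion sampling loop described in the LLaDA paper: start from a fully masked sequence and iteratively reveal the model's most confident predictions. This is a toy under stated assumptions, not LLaDA's or Mercury Coder's actual code; `model` (a callable returning per-position logits), `mask_id`, and the linear unmasking schedule are all assumptions made for illustration.

```python
import torch

def diffusion_generate(model, seq_len, mask_id, steps=8):
    """Toy LLaDA-style sampler: iteratively unmask a fully masked sequence."""
    # Start from a sequence that is entirely [MASK] tokens.
    x = torch.full((seq_len,), mask_id, dtype=torch.long)
    for step in range(steps):
        masked = x == mask_id
        if not masked.any():
            break
        logits = model(x)                        # (seq_len, vocab_size)
        probs = torch.softmax(logits, dim=-1)
        conf, pred = probs.max(dim=-1)           # best guess per position
        # Only masked positions compete for unmasking this step.
        conf = conf.masked_fill(~masked, float("-inf"))
        # Reveal a growing fraction of the masks each step, keeping the
        # model's most confident predictions and re-masking the rest.
        n_unmask = max(1, int(masked.sum().item() * (step + 1) / steps))
        idx = conf.topk(n_unmask).indices
        x[idx] = pred[idx]
    return x
```

The key difference from autoregressive decoding is that many positions are filled in per forward pass rather than one token at a time, which is roughly where the claimed efficiency comes from; it is also why the usual sequential, KV-cache-driven decode loop in llama.cpp would likely not map onto such a model directly.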