Open
Description
Hi,
Ternary quantization has become popular and has demonstrated computational speedups and power reductions, as demonstrated in works like llama.cpp and bitnet.cpp. We trained the first ternary DiT network, DiT is a popular structure nowadays for text to image generation. We would like to know if we can be assisted in realizing the deployment of it on stable-diffusion.cpp.
We asked llama.cpp for help and they advised me to come here for guidance link.
Metadata
Metadata
Assignees
Labels
No labels