
How to convert SDXL model to int8 #21

@LuXuxue

Description


I want to use an Illustrious model on gfx1650. gfx1650 has bugs in fp16 and can only use fp32, which is too slow; int8 might run faster.
I converted and tested the model on my 780M (gfx1103).
I ran `convert_to_quant --int8 --block_size 128 --comfy_quant --simple --nerf_large -i oneObsession_v19.safetensors` and loaded the result with Load Checkpoint (Quantized), which fails with `Weight scale shape mismatch: scale.shape=torch.Size([]), expected (10, 2)`. If I use ComfyUI-FeatherOps with the built-in Load Checkpoint, it generates a black image. If I convert only the diffusion model and load it with the built-in Load Diffusion Model, it generates a corrupted image like this.

(attached image: corrupted generation output)
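For context on the shape mismatch: a scale of `torch.Size([])` is a single per-tensor scalar, while the loader expects one scale per (row, block) pair, i.e. shape `(rows, in_features / block_size)`. A weight of shape `(10, 256)` with `block_size=128` would need a `(10, 2)` scale tensor, matching the error message. The sketch below is a minimal illustration of block-wise symmetric int8 quantization in NumPy; it is not the `convert_to_quant` implementation, and the function names are hypothetical.

```python
import numpy as np

def quantize_int8_blockwise(weight, block_size=128):
    """Quantize a 2-D float weight to int8 with one scale per (row, block).

    Returns (q, scale) where q has weight's shape and dtype int8,
    and scale has shape (rows, cols // block_size).
    """
    rows, cols = weight.shape
    assert cols % block_size == 0, "cols must be divisible by block_size"
    blocks = weight.reshape(rows, cols // block_size, block_size)
    # Symmetric per-block scale: the largest |value| in a block maps to 127.
    scale = np.abs(blocks).max(axis=-1, keepdims=True) / 127.0
    scale = np.maximum(scale, 1e-12)  # guard against all-zero blocks
    q = np.clip(np.round(blocks / scale), -128, 127).astype(np.int8)
    return q.reshape(rows, cols), scale.squeeze(-1)

def dequantize_int8_blockwise(q, scale, block_size=128):
    """Reconstruct an approximate float weight from int8 values and scales."""
    rows, cols = q.shape
    blocks = q.reshape(rows, cols // block_size, block_size).astype(np.float32)
    return (blocks * scale[..., None].astype(np.float32)).reshape(rows, cols)

if __name__ == "__main__":
    w = np.random.default_rng(0).standard_normal((10, 256)).astype(np.float32)
    q, s = quantize_int8_blockwise(w)
    print(s.shape)  # (10, 2) -- the shape the loader expects in the error above
```

A per-tensor scheme (one scalar scale for the whole weight) produces the `torch.Size([])` shape seen in the error, which suggests the converter wrote per-tensor scales while the loader expects block-wise ones for these flags.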
