Clarification about `DiT.safetensors` and `text_encoder` checkpoints in inference scripts

Hi, thanks for releasing the inference code for SmartPhotoCrafter.

I am trying to reproduce the demo inference following the README. The example command uses:

```bash
--model_path "ckpt/Qwen-Image-Edit-2509" \
--dit_path "ckpt/DiT.safetensors" \
--vlm_path "ckpt/text_encoder"
```
I understand that Qwen-Image-Edit-2509 is the base model. However, I am not sure what the expected contents of DiT.safetensors and text_encoder are.



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clarification about `DiT.safetensors` and `text_encoder` checkpoints in inference scripts #2

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Clarification about DiT.safetensors and text_encoder checkpoints in inference scripts #2

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Clarification about `DiT.safetensors` and `text_encoder` checkpoints in inference scripts #2