Description
During training I accidentally found that with the default `shift` timestep sampling method the model learns nothing even after thousands of steps: the sample images stay identical. This is very strange, because with `nextdit_shift` it takes only ~200 steps before noticeable changes show up in the sample images.
Lumina uses t=0 as the noise and t=1 as the image, which is reversed compared to other models (t=1 as the noise and t=0 as the image).
I found this causes some functions to misbehave, since they expect the usual timestep convention.
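For illustration, here is a minimal sketch of the two conventions (the rectified-flow interpolation below is my assumption for clarity, not code taken from the repository):

```python
import torch

# Usual convention (e.g. SD3 / FLUX): t=1 is pure noise, t=0 is the image.
def interpolate_usual(image, noise, t):
    return (1.0 - t) * image + t * noise

# Lumina convention: t=0 is pure noise, t=1 is the image.
def interpolate_lumina(image, noise, t):
    return t * image + (1.0 - t) * noise

image = torch.randn(1, 4, 8, 8)
noise = torch.randn_like(image)
t = torch.full((1, 1, 1, 1), 0.9)

# The same t=0.9 is almost pure noise in the usual convention,
# but almost the clean image in Lumina's convention.
print((interpolate_usual(image, noise, t) - noise).abs().mean())   # small
print((interpolate_lumina(image, noise, t) - image).abs().mean())  # small
```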
In `get_noisy_model_input_and_timesteps()` (sd-scripts/library/lumina_train_util.py, line 806 at f5d44fd), `nextdit_shift` reverses the timesteps in `time_shift()`. The other timestep sampling methods don't work properly: they still shift toward t=1.
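A hypothetical sketch (not the actual `time_shift()` implementation) of why a shift written for the usual convention pushes Lumina's samples toward the wrong end, and what the reversal fixes, assuming the common discrete-flow shift formula:

```python
import torch

def shift_toward_one(t, shift=3.0):
    # Common discrete-flow shift: pushes samples toward t=1,
    # which is the noise end under the usual convention.
    return shift * t / (1.0 + (shift - 1.0) * t)

def shift_for_reversed_convention(t, shift=3.0):
    # Flip, shift, flip back, so samples are pushed toward t=0,
    # the noise end under Lumina's reversed convention.
    return 1.0 - shift_toward_one(1.0 - t, shift)

t = torch.rand(100_000)
print(shift_toward_one(t).mean())               # > 0.5: biased toward Lumina's image end
print(shift_for_reversed_convention(t).mean())  # < 0.5: biased toward Lumina's noise end
```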
`--min_snr_gamma` is also affected, and possibly other options as well.
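As a rough illustration of why SNR-based weighting breaks under a reversed convention (this assumes a rectified-flow SNR and the standard min-SNR-gamma weight, not the exact sd-scripts computation):

```python
import torch

def min_snr_weight(snr, gamma=5.0):
    # Standard min-SNR-gamma loss weight: min(SNR, gamma) / SNR.
    return torch.clamp(snr, max=gamma) / snr

t = torch.tensor([0.1, 0.5, 0.9])
snr_usual  = ((1.0 - t) / t) ** 2   # usual convention: t=1 is noise
snr_lumina = (t / (1.0 - t)) ** 2   # Lumina convention: t=0 is noise

# Applying the usual formula to Lumina's reversed timesteps gives the
# low-noise weight to high-noise samples and vice versa.
print(min_snr_weight(snr_usual))
print(min_snr_weight(snr_lumina))
```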
I also found that diffusers keeps the usual timestep convention throughout its API and only reverses the timesteps right before passing them to the network. Maybe we should do the same here (see the quoted line and the sketch below)?
In sd-scripts/lumina_train_network.py, line 271 at f5d44fd:

`t=timesteps / 1000,  # timesteps need to be divided by 1000 to match what the model expects`
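A hypothetical sketch of the diffusers-style approach (names and signature are illustrative, not the actual sd-scripts API): sample and weight timesteps in the usual convention everywhere, and convert to Lumina's reversed convention only at the model call.

```python
import torch

def call_lumina_model(model, noisy_latents, timesteps, **model_kwargs):
    # `timesteps` follow the usual 0..1000 convention (t=1000 ~ pure noise),
    # so the samplers and weighting above need no special casing.
    # Flip and rescale only here, right before the network sees them.
    t_for_model = 1.0 - timesteps.float() / 1000.0  # Lumina expects t=0 as noise
    return model(noisy_latents, t=t_for_model, **model_kwargs)
```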