Add stochastic sampling to FlowMatchEulerDiscreteScheduler #11369


Merged: 7 commits into main on Apr 22, 2025

Conversation

@apolinario (Collaborator) commented Apr 19, 2025

What does this PR do?

This PR adds stochastic sampling to FlowMatchEulerDiscreteScheduler based on Lightricks/LTX-Video@b1aeddd ltx_video/schedulers/rf.py, which was added with the release of 0.9.6-distilled. I decoupled the next and current sigma to try to get closer to the rf.py implementation of the stochastic sampling, but a second pair of eyes on this would be great.
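Conceptually, the stochastic branch replaces the deterministic Euler update with a denoise-then-renoise step. A minimal sketch of the idea (names are illustrative, not the exact diff; it assumes the flow-matching convention sample = (1 - sigma) * x0 + sigma * noise with the model predicting the velocity noise - x0):

import torch

def stochastic_step(sample, model_output, sigma, sigma_next, generator=None):
    # Recover the denoised estimate from the predicted velocity:
    # sample = (1 - sigma) * x0 + sigma * noise and v = noise - x0
    # imply x0 = sample - sigma * v.
    x0 = sample - sigma * model_output
    # Instead of the deterministic Euler step
    #   prev_sample = sample + (sigma_next - sigma) * model_output,
    # re-noise the estimate to the next sigma level with fresh noise.
    noise = torch.randn(
        sample.shape, generator=generator, device=sample.device, dtype=sample.dtype
    )
    return (1.0 - sigma_next) * x0 + sigma_next * noise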

To try it:

import torch
from diffusers import LTXVideoTransformer3DModel, FlowMatchEulerDiscreteScheduler, LTXPipeline
from diffusers.utils import export_to_video

transformer = LTXVideoTransformer3DModel.from_pretrained(
    "multimodalart/ltxv-2b-0.9.6-distilled",
    subfolder="transformer",
    torch_dtype=torch.bfloat16,
    variant="bf16"
)

scheduler = FlowMatchEulerDiscreteScheduler.from_pretrained(
    "multimodalart/ltxv-2b-0.9.6-distilled",
    subfolder="scheduler"
)

pipe = LTXPipeline.from_pretrained(
    "Lightricks/LTX-Video-0.9.5",
    transformer=transformer,
    scheduler=scheduler,  # add or remove the scheduler to see the difference
    torch_dtype=torch.bfloat16,
)
pipe.to("cuda")

prompt = "A woman eating a burger"
negative_prompt = "worst quality, inconsistent motion, blurry, jittery, distorted"
generator = torch.Generator(device="cuda").manual_seed(42)
video = pipe(
    prompt=prompt,
    negative_prompt=negative_prompt,
    width=1216,
    height=704,
    num_frames=121,
    num_inference_steps=8,
    guidance_scale=1,
    generator=generator
).frames[0]

export_to_video(video, "distilled_scheduler.mp4", fps=24)

Who can review?

@yiyixuxu

@apolinario apolinario requested a review from yiyixuxu April 19, 2025 17:54

@apolinario (Collaborator, Author)

@bot /style

(Contributor)

Style fixes have been applied.

@yiyixuxu (Collaborator) left a comment

thanks @apolinario

dt = sigma_next - sigma

prev_sample = sample + dt * model_output
# Determine whether to use stochastic sampling for this step
use_stochastic = stochastic_sampling if stochastic_sampling is not None else self.config.stochastic_sampling
@yiyixuxu (Collaborator)

I think just having this in the config is enough, no?
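For context, with the flag in the config a user would enable it at load time, roughly like this (a sketch; stochastic_sampling is the config name referenced in the snippet above):

scheduler = FlowMatchEulerDiscreteScheduler.from_pretrained(
    "multimodalart/ltxv-2b-0.9.6-distilled",
    subfolder="scheduler",
    stochastic_sampling=True,  # config flag introduced by this PR
)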

@apolinario (Collaborator, Author)

@bot /style

(Contributor)

Style fixes have been applied.


current_sigma = per_token_sigmas[..., None]
next_sigma = lower_sigmas[..., None]
dt = next_sigma - current_sigma # Equivalent to sigma_next - sigma
@yiyixuxu (Collaborator)

@apolinario
here it seems to be reversed, no?
before:
dt = (per_token_sigmas - lower_sigmas)[..., None]

now:
dt = lower_sigmas - per_token_sigmas
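For intuition, with illustrative numbers (sigmas decrease as sampling proceeds), the two orderings differ by a sign and must match the direction of the update:

sigma, sigma_next = 0.8, 0.6      # current and next noise levels
dt = sigma_next - sigma           # -0.2, pairs with: prev = sample + dt * v
dt_flipped = sigma - sigma_next   # +0.2, would need:  prev = sample - dt * v
# Mixing the two conventions steps in the wrong direction.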

@apolinario (Collaborator, Author)

good catch!

@nitinmukesh

Quick question: should it be LTXPipeline or LTXConditionPipeline?
0.9.5 support was added in LTXConditionPipeline.

@yiyixuxu merged commit 6ab62c7 into main on Apr 22, 2025; 15 checks passed.
@yiyixuxu (Collaborator)

thanks @apolinario!

@yiyixuxu (Collaborator)

@nitinmukesh I think it's probably better in LTXConditionPipeline; can you try it out?

@nitinmukesh

@apolinario

Thank you for adding the sampling.

Could you please share a few of the sample outputs you created? I am not getting good results, so I want to compare in case something is wrong in my code.
#11359

@Ednaordinary (Contributor) commented May 8, 2025

Quick question: should it be LTXPipeline or LTXConditionPipeline? 0.9.5 support was added in LTXConditionPipeline.

LTXPipeline. The model here is 0.9.6-distilled (the only one that uses the stochastic sampling as of now). "0.9.5" is used because the 0.9.6 transformer and scheduler are inserted into it, which is fine because nothing else in the pipeline differs from 0.9.5, and there's currently nothing in the Lightricks 0.9.6 repos. 0.9.6-distilled is guidance-distilled, so it does not work in the condition pipeline, while 0.9.6 does.

@Ednaordinary (Contributor) commented May 8, 2025

Also, I find that increasing the scheduler's shift while using the distilled model helps boost coherence. This is in line with FastVideo (PCM distillation), which says to set the shift to 17. The 1.0 in https://huggingface.co/multimodalart/ltxv-2b-0.9.6-distilled/blob/main/scheduler/scheduler_config.json doesn't seem like a good default. I'm unsure what it is in the original LTX repo.
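For reference, the shift rescales the sigma schedule (a sketch of the standard flow-match time shift on the non-dynamic path):

def shifted_sigma(sigma: float, shift: float) -> float:
    # Larger shift pushes the schedule toward higher noise levels,
    # spending more of the step budget early in sampling.
    return shift * sigma / (1 + (shift - 1) * sigma)

# e.g. sigma = 0.5: shift=1.0 -> 0.50, shift=16.0 -> ~0.94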

Some different shifts:

0.25: 0.25.mp4
0.5: 0.5.mp4
1.0: 1.mp4
2.0: 2.mp4
4.0: 4.mp4
8.0: 8.mp4
16.0: 16.mp4
32: 32.mp4
64: 64.mp4

16 seems like a good default

(tested by adding pipe.scheduler._shift = 16.0 somewhere between pipe init and pipe call)
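A compact way to try this (the _shift attribute is taken from the comment above; overriding shift via from_pretrained kwargs should work too, since shift is part of the scheduler config):

from diffusers import FlowMatchEulerDiscreteScheduler

scheduler = FlowMatchEulerDiscreteScheduler.from_pretrained(
    "multimodalart/ltxv-2b-0.9.6-distilled",
    subfolder="scheduler",
    shift=16.0,  # override the config default of 1.0
)
# or, between pipeline init and the pipeline call:
# pipe.scheduler._shift = 16.0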

@nitinmukesh

Thank you @Ednaordinary

The information you provided is very helpful; I'm getting better results than before.

distilled_scheduler1.mp4

@nitinmukesh

249 frames:

distilled_scheduler2.mp4

@Ednaordinary (Contributor)

Looks great! What I've noticed so far is that the background is often repetitive in a weird way, as in your 249-frame example. Sometimes this can be solved by increasing the shift to an extremely large value (think in the 200s), but that also incurs everything else that comes with running the shift that high (eventually, everything just turns into blobs).

@nitinmukesh

Sure, I will try that, thank you. Next I'm going to try whether the distilled model supports image-to-video (LTXImageToVideoPipeline).
The speed of the model is out of this world: all of these were generated on 8 GB VRAM + 16 GB RAM, taking only 3 minutes for 249 frames.

@nitinmukesh commented May 10, 2025

I2V is working well.

distilled_scheduler6.mp4

Result from 0.9.1 using the same image:
newgenai79/sd-diffuser-webui#11

@nitinmukesh

@Ednaordinary

What do you suggest for 0.9.6 dev, LTXPipeline or the condition pipeline, in case you've tried it?

@Ednaordinary (Contributor)

0.9.6 should be the condition pipeline, I'm pretty sure.
