-
Notifications
You must be signed in to change notification settings - Fork 10
Open
Description
Hi TSDSR team,
I noticed that the SD3-based pipeline feeds the flow-matching predictor v directly into the score-based loss. But in my understanding, v isn’t usually interchangeable with the standard diffusion noise prediction. Could you briefly explain the rationale behind this choice and whether any extra scaling or conversion is applied? I might be missing something obvious, so any pointers to the relevant lines in the code or paper would be greatly appreciated. Thanks a lot for your great work and for clarifying!
Metadata
Metadata
Assignees
Labels
No labels