for a more stable generator. To speed up inference, you can look at LoRA and LCM, combined with [TensorRT](https://github.com/NVIDIA/TensorRT/blob/release/10.5/demo/Diffusion/flux_pipeline.py) (the link is the Flux pipeline from NVIDIA's TensorRT diffusion demo).
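
For the LoRA + LCM part, here is a rough sketch of what few-step inference could look like with the `diffusers` library. Everything specific in it is an assumption for illustration, not something from this thread: the SDXL base checkpoint, the `latent-consistency/lcm-lora-sdxl` weights, and the helper name `build_fast_pipeline` are placeholders, so swap in whatever model you are actually using. The TensorRT step is not shown here; the linked demo covers it.

```python
# Sketch only: LCM-LoRA few-step sampling with diffusers.
# The model IDs below are assumptions for illustration; replace with your own.

# Few steps and guidance near 1.0 are the settings usually used with LCM-LoRA.
FAST_KWARGS = {"num_inference_steps": 4, "guidance_scale": 1.0}


def build_fast_pipeline(
    base_model: str = "stabilityai/stable-diffusion-xl-base-1.0",  # assumed base model
    lcm_lora: str = "latent-consistency/lcm-lora-sdxl",            # assumed LCM-LoRA weights
    device: str = "cuda",
):
    """Hypothetical helper: load a pipeline, switch to the LCM scheduler, attach LCM-LoRA."""
    # Imported lazily so the module can be inspected without torch/diffusers installed.
    import torch
    from diffusers import DiffusionPipeline, LCMScheduler

    pipe = DiffusionPipeline.from_pretrained(
        base_model, torch_dtype=torch.float16, variant="fp16"
    )
    # LCM needs its own scheduler; reuse the existing scheduler config.
    pipe.scheduler = LCMScheduler.from_config(pipe.scheduler.config)
    # Load the distilled LoRA and fuse it into the base weights to avoid adapter overhead.
    pipe.load_lora_weights(lcm_lora)
    pipe.fuse_lora()
    return pipe.to(device)


if __name__ == "__main__":
    pipe = build_fast_pipeline()
    image = pipe("a photo of a lighthouse at dusk", **FAST_KWARGS).images[0]
    image.save("out.png")
```

The speedup comes from cutting the number of denoising steps (4 here instead of the usual 25 to 50), so it should stack with a TensorRT-compiled model, which instead makes each step cheaper.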