feat: support cache for LongCat-Image by e1ijah1 · Pull Request #602 · vipshop/cache-dit

e1ijah1 · 2025-12-22T14:47:29Z

#587

Signed-off-by: elijah <f1renze.142857@gmail.com>

README.md

DefTruth · 2025-12-22T14:54:13Z

@e1ijah1 Hi~ Can you share the image results w/ or w/o cache?

DefTruth · 2025-12-22T14:54:54Z

Thanks for your contribution!

Signed-off-by: elijah <f1renze.142857@gmail.com>

e1ijah1 · 2025-12-22T15:03:40Z

@e1ijah1 Hi~ Can you share the image results w/ or w/o cache?

Below is the longcat_image_edit.1024x1024.C0_Q0_NONE.png generated by python3 generate.py generate longcat_image_edit

Summary:

INFO 12-22 06:37:00 [base.py:557] ----------------------------------------------------------------------------------------------------
INFO 12-22 06:37:00 [base.py:342] 🤖 Example Init Config Summary:
INFO 12-22 06:37:00 [base.py:360] - Model: /data/LongCat-Image-Edit/ + meituan-longcat/LongCat-Image-Edit
INFO 12-22 06:37:00 [base.py:360] - Task Type: IE2I - Image Editing to Image
INFO 12-22 06:37:00 [base.py:360] - Torch Dtype: torch.bfloat16
INFO 12-22 06:37:00 [base.py:360] - LoRA Weights: None
INFO 12-22 06:37:00 [base.py:196] 🤖 Example Input Summary:
INFO 12-22 06:37:00 [base.py:196] - prompt: Turn the cat into a dog
INFO 12-22 06:37:00 [base.py:196] - negative_prompt:
INFO 12-22 06:37:00 [base.py:196] - guidance_scale: 4.5
INFO 12-22 06:37:00 [base.py:196] - num_inference_steps: 50
INFO 12-22 06:37:00 [base.py:196] - image: Single Image (1024x1024)
INFO 12-22 06:37:00 [base.py:196] - generator: device cpu, seed 0
INFO 12-22 06:37:00 [base.py:259] 🤖 Example Output Summary:
INFO 12-22 06:37:00 [base.py:270] - Model: longcat_image_edit
INFO 12-22 06:37:00 [base.py:270] - Optimization: C0_Q0_NONE
INFO 12-22 06:37:00 [base.py:270] - Load Time: 0.99s
INFO 12-22 06:37:00 [base.py:270] - Warmup Time: 64.54s
INFO 12-22 06:37:00 [base.py:270] - Inference Time: 54.15s
INFO 12-22 06:37:01 [base.py:227] Image saved to longcat_image_edit.1024x1024.C0_Q0_NONE.png
INFO 12-22 06:37:01 [base.py:568] ----------------------------------------------------------------------------------------------------

Below is the longcat_image_edit.1024x1024.C0_Q0_DBCache_F1B0_W8I1M0MC3_R0.24_CFG1_T0O0_S32.png generated by python3 generate.py generate longcat_image_edit --cache

Summary:

INFO 12-22 06:43:32 [base.py:557] ----------------------------------------------------------------------------------------------------
INFO 12-22 06:43:32 [base.py:342] 🤖 Example Init Config Summary:
INFO 12-22 06:43:32 [base.py:360] - Model: /data/LongCat-Image-Edit/ + meituan-longcat/LongCat-Image-Edit
INFO 12-22 06:43:32 [base.py:360] - Task Type: IE2I - Image Editing to Image
INFO 12-22 06:43:32 [base.py:360] - Torch Dtype: torch.bfloat16
INFO 12-22 06:43:32 [base.py:360] - LoRA Weights: None
INFO 12-22 06:43:32 [base.py:196] 🤖 Example Input Summary:
INFO 12-22 06:43:32 [base.py:196] - prompt: Turn the cat into a dog
INFO 12-22 06:43:32 [base.py:196] - negative_prompt:
INFO 12-22 06:43:32 [base.py:196] - guidance_scale: 4.5
INFO 12-22 06:43:32 [base.py:196] - num_inference_steps: 50
INFO 12-22 06:43:32 [base.py:196] - image: Single Image (1024x1024)
INFO 12-22 06:43:32 [base.py:196] - generator: device cpu, seed 0
INFO 12-22 06:43:32 [base.py:259] 🤖 Example Output Summary:
INFO 12-22 06:43:32 [base.py:270] - Model: longcat_image_edit
INFO 12-22 06:43:32 [base.py:270] - Optimization: C0_Q0_DBCache_F1B0_W8I1M0MC3_R0.24_CFG1_T0O0_S32
INFO 12-22 06:43:32 [base.py:270] - Load Time: 0.99s
INFO 12-22 06:43:32 [base.py:270] - Warmup Time: 35.16s
INFO 12-22 06:43:32 [base.py:270] - Inference Time: 22.47s
INFO 12-22 06:43:32 [base.py:227] Image saved to longcat_image_edit.1024x1024.C0_Q0_DBCache_F1B0_W8I1M0MC3_R0.24_CFG1_T0O0_S32.png
INFO 12-22 06:43:32 [base.py:568] ----------------------------------------------------------------------------------------------------

BTW, I'm still downloading the LongCat-Image weights. I'll post the generated images here later.

e1ijah1 · 2025-12-23T02:54:45Z

@e1ijah1 Hi~ Can you share the image results w/ or w/o cache?

The image generated with out cache:

Summary:

INFO 12-22 18:40:26 [base.py:557] ----------------------------------------------------------------------------------------------------
INFO 12-22 18:40:26 [base.py:342] 🤖 Example Init Config Summary:
INFO 12-22 18:40:26 [base.py:360] - Model: /data/LongCat-Image + meituan-longcat/LongCat-Image
INFO 12-22 18:40:26 [base.py:360] - Task Type: T2I - Text to Image
INFO 12-22 18:40:26 [base.py:360] - Torch Dtype: torch.bfloat16
INFO 12-22 18:40:26 [base.py:360] - LoRA Weights: None
INFO 12-22 18:40:26 [base.py:196] 🤖 Example Input Summary:
INFO 12-22 18:40:26 [base.py:196] - prompt: A young Asian woman wearing a yellow knit sweater paired with a white necklace. Her hands rest on her knees, with a serene expression. The background features a rough brick wall, with warm afternoon sunlight casting upon her, creating a tranquil and cozy atmosphere. The shot uses a medium-distance perspective, highlighting her demeanor and the details of her attire. Soft lighting illuminates her face, emphasizing her facial features and the texture of her accessories, adding depth and warmth to the image. The overall composition is simple and elegant, with the brick wall's texture complementing the interplay of sunlight and shadows, showcasing the character's grace and composure.
INFO 12-22 18:40:26 [base.py:196] - height: 1024
INFO 12-22 18:40:26 [base.py:196] - width: 1024
INFO 12-22 18:40:26 [base.py:196] - guidance_scale: 4.5
INFO 12-22 18:40:26 [base.py:196] - num_inference_steps: 50
INFO 12-22 18:40:26 [base.py:196] - generator: device cpu, seed 0
INFO 12-22 18:40:26 [base.py:259] 🤖 Example Output Summary:
INFO 12-22 18:40:26 [base.py:270] - Model: longcat_image
INFO 12-22 18:40:26 [base.py:270] - Optimization: C0_Q0_NONE
INFO 12-22 18:40:26 [base.py:270] - Load Time: 0.98s
INFO 12-22 18:40:26 [base.py:270] - Warmup Time: 37.35s
INFO 12-22 18:40:26 [base.py:270] - Inference Time: 26.83s
INFO 12-22 18:40:26 [base.py:227] Image saved to longcat_image.1024x1024.C0_Q0_NONE.png
INFO 12-22 18:40:26 [base.py:568] ----------------------------------------------------------------------------------------------------

The image generated with cache:

longcat_image 1024x1024 C0_Q0_DBCache_F1B0_W8I1M0MC3_R0 24_CFG1_T0O0_S31

Summary:

INFO 12-22 18:45:37 [base.py:557] ----------------------------------------------------------------------------------------------------
INFO 12-22 18:45:37 [base.py:342] 🤖 Example Init Config Summary:
INFO 12-22 18:45:37 [base.py:360] - Model: /data/LongCat-Image + meituan-longcat/LongCat-Image
INFO 12-22 18:45:37 [base.py:360] - Task Type: T2I - Text to Image
INFO 12-22 18:45:37 [base.py:360] - Torch Dtype: torch.bfloat16
INFO 12-22 18:45:37 [base.py:360] - LoRA Weights: None
INFO 12-22 18:45:37 [base.py:196] 🤖 Example Input Summary:
INFO 12-22 18:45:37 [base.py:196] - prompt: A young Asian woman wearing a yellow knit sweater paired with a white necklace. Her hands rest on her knees, with a serene expression. The background features a rough brick wall, with warm afternoon sunlight casting upon her, creating a tranquil and cozy atmosphere. The shot uses a medium-distance perspective, highlighting her demeanor and the details of her attire. Soft lighting illuminates her face, emphasizing her facial features and the texture of her accessories, adding depth and warmth to the image. The overall composition is simple and elegant, with the brick wall's texture complementing the interplay of sunlight and shadows, showcasing the character's grace and composure.
INFO 12-22 18:45:37 [base.py:196] - height: 1024
INFO 12-22 18:45:37 [base.py:196] - width: 1024
INFO 12-22 18:45:37 [base.py:196] - guidance_scale: 4.5
INFO 12-22 18:45:37 [base.py:196] - num_inference_steps: 50
INFO 12-22 18:45:37 [base.py:196] - generator: device cpu, seed 0
INFO 12-22 18:45:37 [base.py:259] 🤖 Example Output Summary:
INFO 12-22 18:45:37 [base.py:270] - Model: longcat_image
INFO 12-22 18:45:37 [base.py:270] - Optimization: C0_Q0_DBCache_F1B0_W8I1M0MC3_R0.24_CFG1_T0O0_S31
INFO 12-22 18:45:37 [base.py:270] - Load Time: 0.98s
INFO 12-22 18:45:37 [base.py:270] - Warmup Time: 24.92s
INFO 12-22 18:45:37 [base.py:270] - Inference Time: 13.51s
INFO 12-22 18:45:37 [base.py:227] Image saved to longcat_image.1024x1024.C0_Q0_DBCache_F1B0_W8I1M0MC3_R0.24_CFG1_T0O0_S31.png
INFO 12-22 18:45:37 [base.py:568] ----------------------------------------------------------------------------------------------------

DefTruth

LGTM~ Thanks for your contribution!

e1ijah1 added 8 commits December 22, 2025 21:46

feat: support cache for LongCat-Image & LongCat-Image-Edit

67e47ca

Signed-off-by: elijah <f1renze.142857@gmail.com>

docs: add LongCat-Image example

9202d5f

Signed-off-by: elijah <f1renze.142857@gmail.com>

docs: add LongCat-Image example

a2c0f9f

Signed-off-by: elijah <f1renze.142857@gmail.com>

fix typo

40e397f

Signed-off-by: elijah <f1renze.142857@gmail.com>

add example for LongCat-Image-Edit

10a4e1b

Signed-off-by: elijah <f1renze.142857@gmail.com>

add example for LongCat-Image-Edit

af457b3

Signed-off-by: elijah <f1renze.142857@gmail.com>

fix bugs

db41f0e

Signed-off-by: elijah <f1renze.142857@gmail.com>

merge main

3fd7ef4

Signed-off-by: elijah <f1renze.142857@gmail.com>

DefTruth requested changes Dec 22, 2025

View reviewed changes

README.md Outdated Show resolved Hide resolved

fix doc

9272d07

Signed-off-by: elijah <f1renze.142857@gmail.com>

update env_path_mapping

ff204d1

DefTruth approved these changes Dec 23, 2025

View reviewed changes

Fix comment formatting in registers.py

26ff89f

DefTruth changed the title ~~feat: support cache for LongCat-Image & LongCat-Image-Edit~~ feat: support cache for LongCat-Image Dec 23, 2025

DefTruth merged commit ec65b19 into vipshop:main Dec 23, 2025
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support cache for LongCat-Image#602

feat: support cache for LongCat-Image#602
DefTruth merged 11 commits intovipshop:mainfrom
e1ijah1:feat/longcat-image

e1ijah1 commented Dec 22, 2025

Uh oh!

Uh oh!

DefTruth commented Dec 22, 2025

Uh oh!

DefTruth commented Dec 22, 2025

Uh oh!

e1ijah1 commented Dec 22, 2025 •

edited

Loading

Uh oh!

e1ijah1 commented Dec 23, 2025

Uh oh!

DefTruth left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

e1ijah1 commented Dec 22, 2025

Uh oh!

Uh oh!

DefTruth commented Dec 22, 2025

Uh oh!

DefTruth commented Dec 22, 2025

Uh oh!

e1ijah1 commented Dec 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

e1ijah1 commented Dec 23, 2025

Uh oh!

DefTruth left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

e1ijah1 commented Dec 22, 2025 •

edited

Loading