Proposal T2V: Save videos as videos instead of numpy arrays #2459

Akshat-Tripathi · 2026-01-21T11:20:47Z

Currently the reference implementation saves the model output directly. This is a 81x720x1280x3 sized fp32 numpy array, which means each video takes up 895795200 bytes or 896MB.

With 247 videos this would require 221GB to store the result of an accuracy run.

However, if we first encode the video to mp4 and save the mp4 bytes, the required storage will drastically shrink to ~900MB.

To ensure fairness I propose that all submitters use the same implementation to save their video, namely diffusers.utils.export_to_video

github-actions · 2026-01-21T11:20:56Z

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

Akshat-Tripathi · 2026-01-21T17:45:12Z

recheck

Akshat-Tripathi · 2026-01-22T12:21:37Z

recheck

…than numpy arrays, drastically reducing storage requirement

Akshat-Tripathi requested a review from a team as a code owner January 21, 2026 11:20

pgmpablo157321 force-pushed the t2v_storage_optimisation branch from de74cef to 8351d5e Compare January 26, 2026 16:44

pgmpablo157321 and others added 4 commits January 26, 2026 12:52

Updated t2v reference implementation to save videos as videos rather …

3a2f371

…than numpy arrays, drastically reducing storage requirement

Corrected fps

bd2250b

Minor bugfix

b25766a

Add configuration + enforce count

f2d04b6

pgmpablo157321 force-pushed the t2v_storage_optimisation branch from 18fdc70 to f2d04b6 Compare January 26, 2026 17:56

pgmpablo157321 approved these changes Jan 26, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proposal T2V: Save videos as videos instead of numpy arrays #2459

Proposal T2V: Save videos as videos instead of numpy arrays #2459

Uh oh!

Akshat-Tripathi commented Jan 21, 2026

Uh oh!

github-actions bot commented Jan 21, 2026 •

edited

Loading

Uh oh!

Akshat-Tripathi commented Jan 21, 2026

Uh oh!

Akshat-Tripathi commented Jan 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Proposal T2V: Save videos as videos instead of numpy arrays #2459

Are you sure you want to change the base?

Proposal T2V: Save videos as videos instead of numpy arrays #2459

Uh oh!

Conversation

Akshat-Tripathi commented Jan 21, 2026

Uh oh!

github-actions bot commented Jan 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Akshat-Tripathi commented Jan 21, 2026

Uh oh!

Akshat-Tripathi commented Jan 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

github-actions bot commented Jan 21, 2026 •

edited

Loading