Distributed inference example for llava_next #3179

VladOS95-cyber · 2024-10-20T16:30:17Z

Add distributed inference example for LLaVA-NeXT-Video-7B-hf

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case. Link: [Community Contributions] examples on distributed inference using 🤗 Accelerate #3078
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@sayakpaul @a-r-r-o-w @muellerzr

VladOS95-cyber · 2024-10-20T18:02:51Z

Hi @sayakpaul @a-r-r-o-w! This PR is ready for review; please take a look.

sayakpaul

Thanks for working on this so quickly. Left some comments.

sayakpaul · 2024-10-21T04:53:49Z

examples/inference/distributed/llava_next_video.py

+    indices = np.arange(0, total_frames, total_frames / 8).astype(int)
+    video = read_video_pyav(container, indices)
+
+    conversations = [


Maybe we could repurpose this to just yield captions?

Hey @sayakpaul! What do you mean by repurpose here? The idea of this example was to provide user prompts (questions) for certain videos and process them. If we want just captions, what exactly would we do with them?

Train text-to-video models, for one

@sayakpaul, please take a look at the recent changes. What do you think?

Hey @sayakpaul! Do you have any comments on the recent changes? I added loading of a video dataset and splitting it into batches; then, for every video batch, we distribute the prepared prompts to generate answers.
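For readers following the thread, here is a minimal sketch of the flow described above: sample a fixed number of frames per video, group videos into batches, and give each process its own share of the prompts. It is not the code from this PR. The helper names (`sample_frame_indices`, `batched`, `shard_for_process`), the file names, and the prompts are hypothetical, and the sharding rule is an assumption written in plain Python so the sketch runs without any GPU or library.

```python
from typing import Iterator, List, Sequence, TypeVar

T = TypeVar("T")


def sample_frame_indices(total_frames: int, num_samples: int = 8) -> List[int]:
    """Pick `num_samples` roughly evenly spaced frame indices.

    Approximates the np.arange(0, total_frames, total_frames / 8).astype(int)
    line from the diff, using plain Python instead of NumPy.
    """
    step = total_frames / num_samples
    return [int(i * step) for i in range(num_samples)]


def batched(items: Sequence[T], batch_size: int) -> Iterator[Sequence[T]]:
    """Yield consecutive slices of at most `batch_size` items."""
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


def shard_for_process(
    items: Sequence[T], process_index: int, num_processes: int
) -> Sequence[T]:
    """Return the contiguous slice of `items` assigned to one process.

    Assumption: when the items do not divide evenly, the lower-ranked
    processes each receive one extra item.
    """
    base, extra = divmod(len(items), num_processes)
    start = process_index * base + min(process_index, extra)
    end = start + base + (1 if process_index < extra else 0)
    return items[start:end]


if __name__ == "__main__":
    # Hypothetical inputs, for illustration only.
    videos = ["clip_a.mp4", "clip_b.mp4", "clip_c.mp4"]
    prompts = ["What happens in this video?", "Describe the setting."]
    num_processes = 2

    for video_batch in batched(videos, batch_size=2):
        for rank in range(num_processes):
            # In a real launch each process would only see its own rank.
            my_prompts = shard_for_process(prompts, rank, num_processes)
            print(rank, list(video_batch), list(my_prompts))
```

In an actual Accelerate script, the sharding step is normally handled by the `PartialState.split_between_processes` context manager rather than a hand-written helper; this sketch makes no claim that its partitioning matches that method exactly.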


HuggingFaceDocBuilderDev · 2024-10-21T04:56:21Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul reviewed Oct 21, 2024


VladOS95-cyber added 4 commits October 24, 2024 16:16

add distributed inference example for llava_next

61ec52d

some fixes and refactoring

1c23c91

add videos dataset for example, refactor logic

8cd9376

small fix for batch size

0a58faa

VladOS95-cyber force-pushed the add-video-capture-example-on-distributed-inference branch from 91ee412 to 0a58faa on October 24, 2024 14:16

Review thread comments, in order: sayakpaul (Oct 21, 2024), VladOS95-cyber (Oct 22, 2024), sayakpaul (Oct 22, 2024), VladOS95-cyber (Oct 22, 2024, edited), VladOS95-cyber (Oct 27, 2024, edited)
