Skip to content

A new video-text dataset which may help #362

Open
@mutonix

Description

Vript is a fine-grained video-text dataset with 12K annotated high-resolution videos (~400k clips), where each clip has a detailed caption of ~145 words.

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions