[WIP] UCF101 prototype with utilities for video loading #4838
base: main
Conversation
💊 CI failures summary and remediations

As of commit f1a69e0 (more details on the Dr. CI page): 💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI.
Thanks a lot @bjuncek. I have some comments inline about the general infrastructure. I can't really comment on the validity of the video utility datapipes you added, because I have too little experience with videos. I'll leave that up to other reviewers.
Co-authored-by: Philip Meier <github.pmeier@posteo.de>
…k/vision into bkorbar/prototypes/ucf101
Ok, so I've tried doing a pass on this, trying to fix the decoder inconsistency we've been talking about offline. I don't understand datapipes well enough to see why popping from a dict would fail, or why I'd need to annotate variables in a datapipe.
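For orientation, here is a hypothetical skeleton of the datapipe that the `__iter__` hunk below belongs to. The class name and constructor are inferred from the attributes the method uses (`self.datapipe`, `self.num_clips_per_video`, `self.num_frames_per_clip`, `self._unfold`) and from the assumption that `_unfold` builds step-1 windows; none of this is taken verbatim from the PR:

```python
from typing import Any, Dict, Iterator

import torch
from torchdata.datapipes.iter import IterDataPipe


class RandomClipSamplerDataPipe(IterDataPipe):
    # Hypothetical skeleton; names inferred from the __iter__ body below.
    def __init__(
        self,
        datapipe: IterDataPipe,
        *,
        num_clips_per_video: int,
        num_frames_per_clip: int,
    ) -> None:
        self.datapipe = datapipe
        self.num_clips_per_video = num_clips_per_video
        self.num_frames_per_clip = num_frames_per_clip

    def _unfold(self, ptss: torch.Tensor) -> torch.Tensor:
        # Every window of num_frames_per_clip consecutive pts, step 1 (assumed).
        return ptss.unfold(0, self.num_frames_per_clip, 1)
```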
```python
def __iter__(self) -> Iterator[Dict[str, Any]]:
    for video_d in self.datapipe:
        buffer = video_d["file"]
        with av.open(buffer, metadata_errors="ignore") as container:
            stream = container.streams.video[0]
            time_base = stream.time_base

            # duration is given in time_base units as int
            duration = stream.duration

            # get video stream timestamps,
            # with a tolerance for pyav imprecision
            _ptss = torch.arange(duration - 7)
            _ptss = self._unfold(_ptss)
            # shuffle the clips
            perm = torch.randperm(_ptss.size(0))
            idx = perm[: self.num_clips_per_video]
            samples = _ptss[idx]

            for clip_pts in samples:
                start_pts = clip_pts[0].item()
                end_pts = clip_pts[-1].item()
                # video_timebase is the default time_base; convert pts to seconds
                start_pts, end_pts, pts_unit = _video_opt._convert_to_sec(start_pts, end_pts, "pts", time_base)
                video_frames = video._read_from_stream(
                    container,
                    float(start_pts),
                    float(end_pts),
                    pts_unit,
                    stream,
                    {"video": 0},
                )

                vframes_list = [frame.to_ndarray(format="rgb24") for frame in video_frames]

                if vframes_list:
                    vframes = torch.as_tensor(np.stack(vframes_list))
                    # account for rounding errors in conversion
                    # FIXME: fix this in the code
                    vframes = vframes[: self.num_frames_per_clip, ...]
                else:
                    vframes = torch.empty((0, 1, 1, 3), dtype=torch.uint8)
                    print("FAIL")

                # [N, H, W, C] to [N, C, H, W]
                vframes = vframes.permute(0, 3, 1, 2)
                assert vframes.size(0) == self.num_frames_per_clip

                # TODO: support sampling rates (FPS change)
                # TODO: optimization (read all and select)

                yield {
                    "clip": vframes,
                    "pts": clip_pts,
                    "range": (start_pts, end_pts),
                    "video_meta": {
                        "time_base": float(stream.time_base),
                        "guessed_fps": float(stream.guessed_rate),
                    },
                    "path": video_d["path"],
                    "target": video_d["target"],
                }
```
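And a hypothetical usage sketch of what a consumer of these samples sees; the class name carries over from the skeleton above, and `file_dp` stands in for an upstream datapipe yielding dicts with `"file"` (a seekable buffer), `"path"`, and `"target"` keys, as the loop expects:

```python
# Hypothetical usage; file_dp is not constructed here.
clip_dp = RandomClipSamplerDataPipe(file_dp, num_clips_per_video=5, num_frames_per_clip=16)

for sample in clip_dp:
    clip = sample["clip"]                      # uint8 tensor, [num_frames_per_clip, C, H, W]
    start_sec, end_sec = sample["range"]       # clip boundaries in seconds
    fps = sample["video_meta"]["guessed_fps"]  # float, estimated by pyav
    label = sample["target"]                   # carried through from file_dp
```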
Why not just do the following:

- sample `m` start positions
- for every start position, read `k` frames
- yield the `k` frames at once, `m` times
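For concreteness, a minimal self-contained sketch of that flow over a grid of pts values; the helper names (`sample_starts`, `iter_clips`) are hypothetical and not part of this PR:

```python
import torch

def sample_starts(num_windows: int, m: int) -> torch.Tensor:
    # Draw m random start indices among the valid window starts.
    return torch.randperm(num_windows)[:m]

def iter_clips(ptss: torch.Tensor, m: int, k: int):
    num_windows = ptss.numel() - k + 1
    for start in sample_starts(num_windows, m).tolist():  # sample m start positions
        yield ptss[start : start + k]                     # read/yield k consecutive pts per clip

# 5 clips of 16 pts each from a 300-pts stream
for clip_pts in iter_clips(torch.arange(300), m=5, k=16):
    print(clip_pts.shape)  # torch.Size([16])
```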
Unless I'm missing something, this is exactly what I do:

- sample starting positions (line 132)
- for every start position (line 134), read `k` frames (line 140)
- yield the frames as a sample (line 168)

Are you suggesting to take the yield outside of the loop? If so, is there any benefit to this?
```python
import av
import numpy as np
import torch
from torchdata.datapipes.iter import IterDataPipe
from torchvision.io import video, _video_opt
```
I'm not sure if I would use `_video_opt` in here.
Sure. Any particular reason why not?
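For reference, a minimal sketch of what the pts-to-seconds conversion looks like with plain `fractions` arithmetic instead of the private `_video_opt._convert_to_sec` helper; the time base and pts values below are illustrative:

```python
from fractions import Fraction

# In pyav, stream.time_base is a fractions.Fraction; seconds = pts * time_base.
time_base = Fraction(1, 30000)       # e.g. a 29.97 fps stream (illustrative)
start_pts, end_pts = 1001, 481481    # pts in time_base units (illustrative)

start_sec = float(start_pts * time_base)
end_sec = float(end_pts * time_base)
print(start_sec, end_sec)            # ~0.0334, ~16.0494
```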
A simple `pyav`-based set of utilities with a POC implementation for the UCF101 dataset.

cc @pmeier @bjuncek