Extend `RandomShortestSize` to support Video specific flavour of the augmentation #6770

datumbox · 2022-10-14T10:49:27Z

Our RandomShortestSize implementation on the references is developed with Object Detection in mind. Videos require a slight variation (see reference implementation for detection vs video).

This PR extends the transform in a BC-way so that it can support both. In particular, we make the max_size optional and this allows us to reparameterize the transform for videos as such:

from torchvision.prototype.transforms import RandomShortestSize
import math
import torch

x = torch.randn(7, 11, 3, 450, 800)
t = RandomShortestSize(list(range(256, 320+1)))
z = t(x)
print(z.shape)

size = min(z.shape[-2:])

_, t, c, h, w = x.shape
if w < h:
    new_h = int(math.floor((float(h) / w) * size))
    new_w = size
else:
    new_h = size
    new_w = int(math.floor((float(w) / h) * size))

print(new_h, new_w)

assert (new_h, new_w) == tuple(z.shape[-2:])

Though the names of min_size and max_size are confusing (better names would have been shortside_min_size_range and longside_max_size), their semantics align with the arguments that F.resize() has for size and and max_size. On the latter the default value of max_size (which again applies to the longest edge) is None, so this transform uses the same semantics and default values as in other places of the API.

…gmentation

vfdev-5

OK to me, thanks @datumbox !

test/test_prototype_transforms.py

…r of the augmentation (#6770) Summary: * Extend RandomShortestSize to support Video specific flavour of the augmentation * Adding a test. * Apply changes from code review Reviewed By: NicolasHug Differential Revision: D40427454 fbshipit-source-id: ecd2ec17b047449c043b4c2f45b762c722cc5e04

Extend RandomShortestSize to support Video specific flavour of the au…

8304f2d

…gmentation

facebook-github-bot added the cla signed label Oct 14, 2022

datumbox added enhancement module: transforms prototype and removed cla signed labels Oct 14, 2022

Merge branch 'main' into prototype/extend_random_shortest

a66089a

facebook-github-bot added the cla signed label Oct 14, 2022

datumbox requested review from pmeier and vfdev-5 October 14, 2022 10:50

vfdev-5 approved these changes Oct 14, 2022

View reviewed changes

Adding a test.

8ca29dd

vfdev-5 reviewed Oct 14, 2022

View reviewed changes

test/test_prototype_transforms.py Outdated Show resolved Hide resolved

Apply changes from code review

431dd76

datumbox merged commit 88b6b93 into pytorch:main Oct 14, 2022

datumbox deleted the prototype/extend_random_shortest branch October 14, 2022 11:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Extend `RandomShortestSize` to support Video specific flavour of the augmentation #6770

Extend `RandomShortestSize` to support Video specific flavour of the augmentation #6770

Uh oh!

datumbox commented Oct 14, 2022 •

edited

Loading

Uh oh!

vfdev-5 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Extend RandomShortestSize to support Video specific flavour of the augmentation #6770

Extend RandomShortestSize to support Video specific flavour of the augmentation #6770

Uh oh!

Conversation

datumbox commented Oct 14, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vfdev-5 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Extend `RandomShortestSize` to support Video specific flavour of the augmentation #6770

Extend `RandomShortestSize` to support Video specific flavour of the augmentation #6770

datumbox commented Oct 14, 2022 •

edited

Loading