Inconsistent input video tensor shapes across different Transforms #235

Open
MichaelRipa opened this issue Feb 24, 2023 · 0 comments

Hi,

I was looking through the Transforms API docs and noticed an inconsistency with video input shapes across different transforms. Here are a few examples to illustrate my confusion:

  • AugMix takes an input video tensor of shape (T, C, H, W)
  • CutMix takes in a batch of videos of shape (B, C, T, H, W)
  • Div255 takes in an input video tensor of shape (C, T, H, W)

Is there a reason why the channel (C) and temporal (T) dimensions are swapped between some transforms? I looked for an answer but couldn't find one, so I hope this is the right place to ask.
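
In case it helps frame the question, here is a minimal sketch of the kind of bookkeeping this seems to require when chaining transforms, assuming the documented shapes above are accurate. The helpers `layout_permutation` and `permute_shape` are hypothetical names of my own, not part of the library; the resulting axis order is what one would pass to something like `torch.Tensor.permute`.

```python
from typing import Sequence, Tuple


def layout_permutation(src: str, dst: str) -> Tuple[int, ...]:
    """Return the axis order that rearranges layout `src` into layout `dst`.

    Layouts are strings of single-letter axis names, e.g. "TCHW" or "CTHW".
    (Hypothetical helper for illustration only.)
    """
    if len(set(src)) != len(src) or sorted(src) != sorted(dst):
        raise ValueError(f"incompatible layouts: {src!r} -> {dst!r}")
    # Output axis i is taken from the input axis that carries the same name.
    return tuple(src.index(axis) for axis in dst)


def permute_shape(shape: Sequence[int], perm: Sequence[int]) -> Tuple[int, ...]:
    """Apply an axis order to a shape tuple, mirroring what permute does."""
    return tuple(shape[i] for i in perm)


# AugMix-style (T, C, H, W) -> Div255-style (C, T, H, W)
perm = layout_permutation("TCHW", "CTHW")
shape = permute_shape((8, 3, 224, 224), perm)
print(perm, shape)
```

With a real tensor, `video.permute(*perm)` would perform the same reordering, but having to do this between transforms is exactly what prompted the question.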

Thanks! 🙂
