S3D feature request #6402

### 🚀 The feature

S3D model for video classification ([paper](https://arxiv.org/abs/1712.04851), [paperswithcode](https://paperswithcode.com/paper/rethinking-spatiotemporal-feature-learning), [implementation](https://github.com/facebookresearch/multimodal/blob/main/examples/mugen/retrieval/s3d.py) in torchmultimodal).

### Motivation, pitch

I've onboarded a [model](https://github.com/facebookresearch/multimodal/blob/main/examples/mugen/retrieval/video_clip.py) to torchmultimodal that depends on S3D. The S3D implementation currently lives in torchmultimodal, but as a purely vision model (and a fairly popular one), it would fit better in torchvision. [This PR in torchmultimodal](https://github.com/facebookresearch/multimodal/pull/135) has some discussion of the changes needed to onboard S3D to torchvision.

### Alternatives

Right now, S3D is implemented in torchmultimodal.

### Additional context

I started working on the [suggested refactoring](https://github.com/facebookresearch/multimodal/pull/135) to move S3D from torchmultimodal to torchvision. However, onboarding myself to the torchvision library and implementing the shared base with the Inception model will likely take more time than I have, and my torchvision point of contact is on PTO. Are there any steps I can take within the next week (my internship is ending), or could the torchvision team onboard the model?
