Description
The new dataset API is now stable enough to start porting more datasets from the old API. For the 0.13.0
release planned for 2022H2 we want to achieve at least feature parity for the new API. If you want to help out, please comment on the respective issue so we can assign it to you.
The process of adding a dataset to the new API is described here. In addition, we already ported some datasets that you could use as reference. In any case, if you are blocked by something feel free to send a partial PR and ping me there so I can help.
The following datasets need to be ported:
Image classification
Image classification datasets are good starting point if you are not familiar with the dataset or the new API since they these datsets tend to be the easiest.
- Country211 #5337
- EuroSAT #5338
- Flickr8k #5339 1
- Flickr30k #5340 1
- Flowers102 #5341
- Food101 #5342
- INaturalist #5343
- LFWPeople #5344 2
- LSUN #5345
- Omniglot #5346
- Places365 #5347
- RenderedSST2 #5348
- SBU #5349 1
- STL10 #5350
- SUN397 #5351
- USPS #5352
Image detection or segmentation
Image detection or segmentation datasets tend to be a little harder since one needs to merge more infomation into one sample compared to classification. My suggestion is to only pick one of these if you are either familiar with the dataset or the new API so you don't have two manage two things at once.
- Cityscapes #5353 (@vfdev-5 )
- FGVCAircraft #5354 3
- Kitti #5355 3 2
- StanfordCars #5356 3
- WIDERFace #5367
Image pairs
We are still designing how exactly image pair datasets should be implemented. I list them here for completeness, but I suggest not picking up any of them until the design is finished.
Video classification
We are still designing how exactly video datasets should be implemented. I list them here for completeness, but I suggest not picking up any of them until the design is finished.
Optical flow
We are still designing how exactly optical flow datasets should be implemented. I list them here for completeness, but I suggest not picking up any of them until the design is finished.
Footnotes
-
These datasets do not provide public download links for the data so they might be harder to work on. ↩ ↩2 ↩3
-
Maybe we should have
lfw/people
,kitti/object
, andkitti/flow
datasets to cleanly separate the different variants. This also applies tococo
as discussed in https://github.com/pytorch/vision/pull/5326#discussion_r796813705 ↩ ↩2 ↩3 -
These datasets are implemented as classification datasets in the old API, but provide extra annotations for detection or segmentation. ↩ ↩2 ↩3