-
Notifications
You must be signed in to change notification settings - Fork 7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[RFC] New datasets to torchvision #3562
Comments
Hi, This is exactly our current idea, thanks for bringing it up. I agree with all the aforementioned proposals. One thing to mention as well is that there is an ongoing effort to provide new dataset abstractions in PyTorch via DataPipes pytorch/pytorch#49440. While this doesn't block us providing new datasets, it is good to keep in mind that we might in the future revisit the way we implement datasets. |
related to this issue, it can also be useful if pytorch can store this datasets on their storage and provide link to download them. |
Yes, it is and thus
will never happen. Also see this section in our README
|
as per observation from "torchvision/datasets/" below datasets need to be added,please update the pitch LFW Labeled Faces in Wild |
Hey @harishsdev, not sure what you mean. From the original pitch only KITTI was added, which is correctly marked. In your list you left out CUB-200-2011, which is not supported yet. We do feature the |
Hi @harishsdev, I have created a pr for LFW Dataset, can you guide me about any further changes. |
The link provided for VGGFace2 is not correct; That link points to the first VGGFace dataset (which is available from this page). |
Actually the tar.gz is down for many months. Don't know what happened to VGG Face https://www.robots.ox.ac.uk/~vgg/data/vgg_face2 Probably this is the link https://www.robots.ox.ac.uk/~vgg/data/vgg_face/vgg_face_dataset.tar.gz |
Respectfully, that is the wrong url. The link you've provided is for the first version of VGGFace. The original pitch asked for VGGFace2, which will not be possible to provide at this time. |
@oke-aditya can we add the |
@oke-aditya Should we add the FGVC-Aircraft dataset (as implemented in this PR)? |
@yassineAlouini We already have FGVC-Aircraft in the current API
as well as #5354 to track progress for porting it to the prototype one. |
🚀 Feature
This is a proposal to add more highly cited datasets. Thanks to papers with code datasets which made this search easy.
Motivation
These datasets are used quite frequently and would provide benefits to both researchers as well as people who work in computer vision. I'm not sure of the citation metric, but we can verify the count of papers once.
Pitch
The following datasets can be considered. Papers are reported as per the last 5 years count on papers with code. They can be inaccurate, feel free to edit. I'm also adding previously approved or proposed ones
See #5108
Probably, we should think and add these, one by one. Also support downloading, not just loading of the dataset.
Additional context
Please feel free to discuss about datasets before opening PRs!
cc @pmeier
The text was updated successfully, but these errors were encountered: