Skip to content

[Datasets] Missing/Incorrect client data mapping in OpenImage Detection dataset #238

Open
@justinpan0

Description

What happened + What you expected to happen

The official data downloaded from The PASCAL Visual Object Classes Challenge 2007 has the raw data format like this:

$ ls VOCdevkit/VOC2007/JPEGImages/ -U | head -n 10
002588.jpg
001540.jpg
005556.jpg
008047.jpg
000188.jpg
002007.jpg
003111.jpg
000421.jpg
005766.jpg
005575.jpg

This looks different from the ones in client_data_mapping downloaded from umich server https://fedscale.eecs.umich.edu/dataset/openimage_detection.tar.gz:

$ cat detection/client_data_mapping/trainval.csv | head -n 10
client_id,data_path,label_name,label_id
0,4fa8054781a4c382.jpg,,-1
1,b37f763ae67d0888.jpg,,-1
2,7e8584b0f487cb9e.jpg,,-1
3,249086e72671397d.jpg,,-1
4,3c2fad96113ca97e.jpg,,-1
5,c9e409f562248252.jpg,,-1
6,4ce684871c147d4c.jpg,,-1
7,8d0cbf68276262bb.jpg,,-1
8,51c5d8d5d9cd87ca.jpg,,-1

Versions / Dependencies

N/A

Reproduction script

See above.

Issue Severity

Low: It annoys or frustrates me.

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions