Skip to content

Trainingdata-datamarket/Selfie-and-ID-Dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 

Repository files navigation

Selfie and ID Dataset

  1. About the dataset
  2. Distributions
  3. Content
  4. Get the Dataset
  5. Links

About the dataset

We introduce a large image dataset called Selfie and ID Dataset for training a neural network to solve reidentification tasks. The dataset consists of selfie photos and personal document photos of people. The list of documents, from which photos are taken: passports, foreign passports, driving licenses, medical certificates, membership/bank/transport cards, certificates, diplomas, etc.

"Selfie and ID Dataset" can be utilized in several applications, including facial recognition algorithms, identity verification systems, forensic analysis, and deep learning research. By providing both personal and official photographs, this dataset offers a comprehensive resource for evaluating and improving the performance of algorithms in various real-world scenarios.

The dataset consists of 5,591 sets of images (83,865 images in total) from 4,200+ unique people from 55 countries. The data for the dataset is still gathering, so the number of photos is getting bigger!

The dataset includes 2 different types of images:

  • Selfies - 13 selfies of a person from a mobile phone, the person is depicted alone on it, the face is clearly visible.
  • ID Photos - ID photos of the person from 2 different documents.

Frame 15

Data in the dataset

  • People from 18 to 76 years old are presented in the dataset.
  • For each person in the dataset age, country and gender is presented.
  • The data was mostly collected indoor, however there are also selfies made outdoors.
  • The lighting is artificial, natural daily lightning, evening outdoor lighting and dark indoor lighting.
  • People provided selfies where the head takes up at least 1/2 of the frame.
  • Distance from the camera is approximately 20-30 centimeters.

People in the dataset

Frame 14

Distributions

Gender of people in the dataset

image

Ethnicity of people in the dataset

image

Age of people in the dataset

image

Content

The folder "Selfies ID Images dataset" includes 29 folders (18 sets from caucasians and 11 from hispanics):

  • corresponding to each person in the sample
  • containing of 13 selfies and 2 photos from the documents of the individual

File with the extension .csv

includes the following information for each media file:

  • SetId: the identifier of the set,
  • UserRace: race of the person,
  • Age: the age of the person,
  • Name: name of the person,
  • FName: name of the file,
  • URL: the link to access the media file

Get the Dataset

This is just an example of the data. If you need access to the entire dataset, contact us via sales@trainingdata.pro or leave a request on trainingdata.pro/data-market

ezgif com-gif-maker

Links

Resource Link
TrainingData.pro trainingdata.pro/data-market
Kaggle https://www.kaggle.com/trainingdatapro
HuggingFace https://huggingface.co/TrainingDataPro

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published