Commercial

Selfie with ID Dataset

Selfie with ID Dataset contains high-quality selfie images and ID documents for robust facial recognition and identity verification tasks in KYC applications.

  • Photos
    65,000+
  • People
    5,000+
  • Countries
    40+
  • Re-identification
  • Facial Recognition
  • Computer Vision
  • Security
  • iBeta

Dataset Info

Characteristic: Data
Description: Photos of individuals and their identification documents for facial recognition tasks
Data types: Image
Tasks: Face recognition, computer vision, biometric verification
Total number of images: 65,000
Total number of people: 5,000
Number of files in a set: 15 (13 selfies and 2 document photos)
Labeling: Only technical characteristics and metadata (age, gender, ethnicity)
Gender: Male, Female
Ethnicity: Caucasian (90%), African (10%)
Number of countries: 40
Types of document: Passports, international passports, driver's licenses, student cards, health certificates, membership/bank/transport cards, certificates, etc.

Statistics

[Charts: distribution by gender, by country, and by age]
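The distributions charted in this section could be recomputed from the per-person metadata (age, gender, ethnicity) that the dataset is labeled with. The following is a minimal sketch, not the vendor's tooling: the record layout (a list of dictionaries with `gender`, `country`, and `age` keys) and the 10-year age buckets are assumptions made for illustration.

```python
from collections import Counter


def distribution(records, key):
    """Return the percentage share of each value of `key` across records.

    `records` is assumed to be a list of dicts, one per person; this
    layout is hypothetical and not taken from the dataset documentation.
    """
    counts = Counter(record[key] for record in records)
    total = sum(counts.values())
    return {value: round(100 * n / total, 1) for value, n in counts.items()}


def age_bucket(age, width=10):
    """Map an age to a label such as '20-29' (bucket width is an assumption)."""
    low = (age // width) * width
    return f"{low}-{low + width - 1}"


def age_distribution(records, width=10):
    """Percentage share of people per age bucket."""
    bucketed = [{"age_bucket": age_bucket(r["age"], width)} for r in records]
    return distribution(bucketed, "age_bucket")
```

Calling `distribution(records, "gender")` or `distribution(records, "country")` on the loaded metadata would give the shares behind the corresponding chart.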

Technical Characteristics

Characteristic: Data
Image extensions: JPG, JPEG, HEIC
Devices: Xiaomi Redmi Note 10S, Infinix Smart 8, Samsung, iPhone 11, Xiaomi Redmi 14C, iPhone X, Redmi, etc.
Source and collection methodology: Data was collected via crowdsourcing platforms.
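
Given the stated set structure (15 files per person: 13 selfies and 2 document photos) and the listed image extensions, a delivered set could be sanity-checked along the following lines. This is only a sketch: the `selfie_` and `doc_` filename prefixes are a hypothetical naming convention, since the page does not describe how files are named or organized.

```python
from collections import Counter
from pathlib import PurePosixPath

# Extensions listed under Technical Characteristics.
ALLOWED_EXTENSIONS = {".jpg", ".jpeg", ".heic"}
# Files per set as stated in Dataset Info.
EXPECTED_COUNTS = {"selfie": 13, "document": 2}


def check_set(filenames):
    """Return a list of problems found in one person's file set."""
    problems = []
    counts = Counter()
    for name in filenames:
        path = PurePosixPath(name)
        if path.suffix.lower() not in ALLOWED_EXTENSIONS:
            problems.append(f"unexpected extension: {name}")
            continue
        # Hypothetical naming convention: 'selfie_*' or 'doc_*'.
        if path.name.startswith("selfie_"):
            counts["selfie"] += 1
        elif path.name.startswith("doc_"):
            counts["document"] += 1
        else:
            problems.append(f"unrecognized file role: {name}")
    for role, expected in EXPECTED_COUNTS.items():
        if counts[role] != expected:
            problems.append(
                f"expected {expected} {role} files, found {counts[role]}"
            )
    return problems
```

An empty return value means the set matches the documented structure; any other result lists what differs.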

What is included in Selfie with ID Dataset?
The dataset consists of selfie images paired with photos of ID documents: each set contains 13 selfies and 2 document photos of one person. It covers varied demographics (age, gender, ethnicity, and 40 countries), supporting document verification and the development of face recognition models.
What are the sources of data for Unidata datasets?
The Selfie with ID Dataset is collected from legally permissible sources, including crowdsourcing platforms and in-house data capture teams. All entries are validated to ensure high-quality facial images and ID documents for reliable testing of recognition technology.
What types of annotations are provided?
The dataset includes annotations for facial landmarks, ID document metadata, and identity verification attributes. These annotations support facial recognition, verification systems, and biometric data analysis, including benchmarking of recognition technology.
Can I request a sample of the dataset before purchasing?
Yes. Unidata provides a sample of the Selfie with ID Dataset so you can evaluate data quality and the coverage of selfie photos and ID documents. This lets you test its suitability for document verification and facial recognition before purchasing.

Dataset Use Cases

  • Financial Services

    KYC and Remote Onboarding

The Selfie with ID Dataset helps banks and fintech platforms improve identity verification during digital onboarding. Because it contains paired selfie photos and ID documents, recognition systems can be trained and tested on confirming that a customer matches the submitted document. This helps reduce fraud, strengthens verification systems, and supports compliance with KYC requirements for secure financial transactions.



  • Telecommunications

    Subscriber Verification and SIM Registration

Telecom companies can use the Selfie with ID Dataset to build systems that validate ID cards during new SIM activation and subscription management. By matching faces in selfie photos against official ID photos, providers improve document verification, prevent fraudulent accounts, and protect personal information while meeting regulatory requirements.



  • E-Government Services

    Digital Identity Authentication

The Selfie with ID Dataset helps public agencies strengthen identity verification on e-government platforms. Because it consists of paired facial images and ID documents, it can be used to train systems that authenticate citizens securely. This safeguards access to public services, reduces impersonation risks, and supports safe handling of sensitive personal information.



  • Technology and Biometrics Industry

    Training and Benchmarking Models

For technology companies and research groups, the Selfie with ID Dataset provides high-quality biometric data for developing recognition technology. It serves as training and testing data for models that compare facial features in selfie images with ID photos, supporting the development and evaluation of verification systems.
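
The selfie-to-ID comparison that these use cases rely on can be sketched as a similarity check between two face embeddings, followed by a simple error-rate evaluation over labeled pairs. This is an illustrative sketch under stated assumptions, not a method described by Unidata: the embeddings are assumed to come from some face-embedding model of your choice, and the `0.6` threshold is an arbitrary placeholder that would need tuning on real data.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


def is_same_person(selfie_embedding, id_embedding, threshold=0.6):
    """Accept the pair if similarity reaches the (assumed) threshold."""
    return cosine_similarity(selfie_embedding, id_embedding) >= threshold


def error_rates(genuine_scores, impostor_scores, threshold=0.6):
    """Return (false accept rate, false reject rate) at a threshold.

    genuine_scores: similarities for selfie/ID pairs of the same person.
    impostor_scores: similarities for pairs of different people.
    """
    false_accepts = sum(1 for s in impostor_scores if s >= threshold)
    false_rejects = sum(1 for s in genuine_scores if s < threshold)
    return (
        false_accepts / len(impostor_scores),
        false_rejects / len(genuine_scores),
    )
```

Sweeping the threshold over a held-out portion of the paired sets and recomputing `error_rates` is one common way to choose an operating point for a given application.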



What our clients are saying

UniData

4 3 Reviews

Paul 2025-02-21

Very Positive Experience!

The team was very responsive when requesting a specific dataset, and was able to work with us on what data we specifically needed and custom pricing for our use case. Overall a great experience, and would recommend them to others!

Thorsten 2025-01-09

Very good experience

We got in touch with UniData to buy several datasets from them. Communication was very cooperative, quick, and friendly. We were able to find contract conditions that suited both parties well. I also appreciate the team's dedication to understand and address the needs of the customer. And the datasets we bought from UniData matched with our expectations.

Max Crous 2024-10-08

Data purchase

Our team got in touch with UniData for purchasing video data. The team at UniData was transparent, timely, and pleasant to communicate and negotiate with. Their samples and descriptions aligned well with the data we received. We will certainly reach out to UniData again if we're in search of 3rd party video data.

Abhijeet Zilpelwar 2025-02-26

Data is well organized and easy to…

Data is well organized and easy to consume. We could download and use it for training within few hours of receiving the data links.

Why Choose Us

Unidata offers unparalleled expertise in AI data solutions, delivering superior data quality and optimized workflows

Expertise

Our team consists of industry-leading experts in AI data solutions

Quality

We ensure superior data quality to maximize your AI project's potential

Efficiency

Our optimized workflows accelerate your model training processes

Proven Results

Our track record of case studies demonstrates our ability to deliver outstanding outcomes

Customization

Our track record of case studies demonstrates our ability to deliver outstanding outcomes

Support

We provide ongoing support and consultation to ensure continuous success
1,000+ full-time assessors
