Commercial

Face Re-identification Image Dataset

It is a large-scale face identification dataset containing over 670,000 annotated images of 23,000+ individuals captured in varied angles, lighting, and environments, designed for training face recognition, face detection, and person re-identification systems in surveillance applications, with rich metadata and diverse facial expressions, poses, and backgrounds.

  • People: 23,110
  • Images: 670,000+
  • Facial Recognition
  • Computer Vision
  • Machine learning
  • Security

Dataset Info

Characteristic | Data
Description | Images of people featuring various angles, backgrounds, and attributes for facial expression recognition
Data types | Image
Tasks | Face Re-identification, Computer Vision
Total number of images | 670,190
Number of people | 23,110
Number of files in a set | 28 images and 1 ID photo for each person
Labeling | Metadata (ID, nationality, gender, age, emotion, collecting scene)
Gender | Male, Female
Ethnicity | Asian, Mexican, Caucasian, African, Indian
Collecting scene | Indoor and outdoor scenes
Age | Teenagers, young adults, middle-aged, elderly
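
The set size and the totals listed under Dataset Info can be cross-checked with simple arithmetic: 23,110 people with 28 images and 1 ID photo each gives 670,190 images. The sketch below does that check and shows one possible per-image metadata record. The class name `ImageRecord` and all of its field names and types are assumptions made for illustration; the page lists the metadata categories but does not publish a schema.

```python
from dataclasses import dataclass

# Figures taken from the Dataset Info section.
NUM_PEOPLE = 23_110
PHOTOS_PER_PERSON = 28
ID_PHOTOS_PER_PERSON = 1
TOTAL_IMAGES = 670_190


def expected_total(people: int, photos: int, id_photos: int) -> int:
    """Number of images implied by a fixed per-person set size."""
    return people * (photos + id_photos)


@dataclass(frozen=True)
class ImageRecord:
    """Hypothetical metadata record; field names and types are assumptions."""
    person_id: str
    nationality: str
    gender: str
    age: str          # assumed to be a group label; the real format is not stated
    emotion: str
    scene: str        # assumed values: "indoor" or "outdoor"
    is_id_photo: bool


# Check that the per-person set size matches the published total.
print(expected_total(NUM_PEOPLE, PHOTOS_PER_PERSON, ID_PHOTOS_PER_PERSON) == TOTAL_IMAGES)  # prints True
```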

Statistics

[Charts not reproduced: distribution by gender; distribution by ethnicity]

Technical Characteristics

Characteristic | Data
Image extension | JPG, PNG
Accuracy | Labels of face pose are more than 97% accurate
Device | Phone
Source and collection methodology | Data was collected by a partner of Unidata.

Dataset Use Cases

  • Security & Surveillance

    Enhancing Person Re-Identification in Video Surveillance Systems

    This dataset supports surveillance applications by providing high-quality face images and video sequences captured under various conditions. It enables recognition systems to accurately match unique identities across different cameras, improving facial recognition accuracy in public safety and security monitoring solutions.

  • AI Research & Model Benchmarking

    Evaluating Deep Learning Models for Face Recognition

    Researchers use this face re-identification dataset to benchmark recognition technology and test deep neural networks in facial recognition tasks. The dataset consists of thousands of annotated face photographs from different people, providing training data ideal for improving re-identification systems and studying face detection performance.
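
    When benchmarking re-identification, a common practice is to split by person rather than by image, so that test identities are never seen during training. The sketch below is one minimal way to do that; the function name `split_by_identity`, the `(person_id, image_path)` input format, and the 20% default are assumptions for illustration, not part of the dataset's tooling.

```python
import random
from collections import defaultdict


def split_by_identity(records, test_fraction=0.2, seed=0):
    """Split (person_id, image_path) pairs so no person appears in both sets."""
    by_person = defaultdict(list)
    for person_id, path in records:
        by_person[person_id].append(path)

    # Shuffle the identities (not the images) with a fixed seed for repeatability.
    person_ids = sorted(by_person)
    random.Random(seed).shuffle(person_ids)

    # Hold out at least one identity for testing.
    n_test = max(1, round(len(person_ids) * test_fraction))
    test_ids = set(person_ids[:n_test])

    train = [(pid, p) for pid in person_ids if pid not in test_ids for p in by_person[pid]]
    test = [(pid, p) for pid in person_ids if pid in test_ids for p in by_person[pid]]
    return train, test


# Toy usage: 10 made-up people with 3 images each.
toy = [(f"person_{i}", f"person_{i}/img_{j}.jpg") for i in range(10) for j in range(3)]
train, test = split_by_identity(toy)
print(len(train), len(test))  # prints 24 6
```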

  • Anti-Spoofing and Fraud Prevention

    Detecting Fake or Manipulated Faces in Digital Systems

    This dataset also supports the development of face anti-spoofing models, crucial for detecting fake faces or identity fraud attempts. By including real and synthetic variations of facial images, it enables recognition algorithms to distinguish between authentic users and spoofed identities, strengthening security systems and biometric verification tools in financial and governmental sectors.

  • Identity Verification & Access Control

    Training Models for Face Detection and Verification

    This face identification dataset helps develop face verification systems used in authentication platforms, smart access control, and digital ID validation. Containing human faces with varied poses, expressions, and lighting conditions, it enhances recognition algorithms.
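
    Because each person has one ID photo alongside 28 other images, a typical verification setup compares an embedding of the ID photo with an embedding of a probe image. The sketch below assumes the embeddings already exist as plain lists of floats produced by some face model that is not shown here; the function names and the 0.5 threshold are placeholders for illustration and would need tuning on held-out data.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


def is_same_person(id_embedding, probe_embedding, threshold=0.5):
    """Accept the probe if it is similar enough to the ID photo embedding.

    The threshold is an arbitrary placeholder, not a recommended value.
    """
    return cosine_similarity(id_embedding, probe_embedding) >= threshold


# Toy vectors standing in for real embeddings.
print(is_same_person([1.0, 0.0], [2.0, 0.0]))  # prints True
print(is_same_person([1.0, 0.0], [0.0, 1.0]))  # prints False
```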

FAQs

What is included in this dataset?
The dataset includes 670,190 high-resolution facial images representing 23,110 individuals. Each subject has 28 facial photos and one ID-style image, labeled with metadata such as gender, age, nationality, emotion, and capture scene.

Is this a real-world dataset or synthetic data?
This is a real-world dataset, consisting of authentic images of individuals captured in everyday environments. The dataset provides natural variations in facial expressions, head poses, and lighting.

Can I request a sample of the dataset before purchasing or downloading it?
Yes. Unidata offers free dataset samples so you can assess the image quality, annotation accuracy, and format before purchase. These samples help you determine whether the dataset meets the needs of your AI research, computer vision, or facial recognition projects.

What are the sources of data for Unidata datasets?
The Face Re-identification Image Dataset was collected by a partner of Unidata using smartphone cameras in diverse indoor and outdoor environments. All data sources are legally verified and adhere to ethical and privacy standards to ensure compliance with data protection laws.

How are Unidata datasets licensed?
Unidata datasets follow a dual-licensing model: free samples are provided for evaluation and testing, while full datasets are available exclusively through purchase. This ensures users can validate data quality before acquiring the full dataset for research or commercial use.

How are Unidata datasets stored?
All datasets are stored on AWS cloud infrastructure, offering secure, scalable, and high-availability data access. Unidata’s storage practices comply with ISO 27001 and ISO 27701 standards, ensuring the confidentiality and integrity of sensitive data such as facial images.

How long does it take to receive the dataset?
After you submit a dataset request, Unidata will contact you to review your requirements and finalize documentation. Upon contract signing and payment, delivery is completed within 3–10 business days through a secure download link.

Why Companies Trust Unidata’s Services for ML/AI

Share your project requirements and we handle the rest. Every service is tailored, executed, and compliance-ready, so you can focus on strategy and growth, not operations.

1,100+ Labelers & AI Experts

  • 920K files labeled daily across text, audio, video, and image data
  • 20+ expert-level data labeling tools for quality & speed

19+ Industries & Diverse Data Types

  • Finance, IT, Retail, Healthcare & more
  • Standard and specialized formats (DICOM, LiDAR)

Turnkey ML/AI Services

  • From data collection to validation
  • Multi-type annotation tailored to you

100% Legal & Secure

  • Legally sourced & stored data
  • AWS ISO 27001/27701

Smooth Collaboration

  • Dedicated PM & SLA guarantee
  • Europe-timezone communication

What our clients are saying

Paul 2025-02-21

Very Positive Experience!

The team was very responsive when requesting a specific dataset, and was able to work with us on what data we specifically needed and custom pricing for our use case. Overall a great experience, and would recommend them to others!

Thorsten 2025-01-09

Very good experience

We got in touch with UniData to buy several datasets from them. Communication was very cooperative, quick, and friendly. We were able to find contract conditions that suited both parties well. I also appreciate the team's dedication to understand and address the needs of the customer. And the datasets we bought from UniData matched with our expectations.

Max Crous 2024-10-08

Data purchase

Our team got in touch with UniData for purchasing video data. The team at UniData was transparent, timely, and pleasant to communicate and negotiate with. Their samples and descriptions aligned well with the data we received. We will certainly reach out to UniData again if we're in search of 3rd party video data.

Abhijeet Zilpelwar 2025-02-26

Data is well organized and easy to…

Data is well organized and easy to consume. We could download and use it for training within few hours of receiving the data links.

Our Clients Love Us

Enterprise Document Automation

Document AI Lead

The dataset gave us strong value for both pilot and early-stage testing. We plan to broaden coverage as deployment scales.

Identity Verification Lab

Deputy Director

The data was good. We passed PAD level 1 from iBeta.
