Commercial

Synthetic Printed UK Passports Dataset

Synthetic Printed UK Passports Dataset offers 5,000 realistic, AI-generated images of British passports, reflecting authentic United Kingdom passport layouts, textures, and photo styles. Designed for machine learning, OCR, and biometric identification, this dataset enhances models used in document verification, immigration systems, and digital identity research.

  • Images
    5 000
  • PII
  • Data generation
  • Security
  • Anti-spoofing
  • Computer Vision


Dataset Info

Characteristic: Data
Description: Printed synthetic passport images for training ML models in PII extraction
Data types: Image
Tasks: OCR, Computer Vision
Total number of files: 5,000
Number of files in a set: 96 (3 angles × 4 lighting conditions × 4 backgrounds × 2 distances)
Angles: 0°, 25°, 45°
Lighting: Natural-daylight, Office-LED, Warm-indoor, Dim-light
Backgrounds: Neutral wall, Textured desk, Outdoor pavement, Docs-on-docs
Distance: Close (80-90% of frame), Medium (50-60% of frame)
Labeling: Metadata (Passport ID, Sample ID, Class, Country, Gender, Age Group, Angle, Distance, Category, Resolution, Camera, Light Condition, Background, Timestamp)
Gender: Male, Female
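
The set size of 96 follows from combining the listed capture conditions (3 × 4 × 4 × 2). A minimal sketch that enumerates those combinations is shown here; the constant and function names are illustrative assumptions, not part of any Unidata tooling, and the distance labels are shortened to "Close" and "Medium".

```python
from itertools import product

# Capture conditions as listed in the Dataset Info table.
# The variable names below are hypothetical, chosen for this example only.
ANGLES = ["0°", "25°", "45°"]
LIGHTING = ["Natural-daylight", "Office-LED", "Warm-indoor", "Dim-light"]
BACKGROUNDS = ["Neutral wall", "Textured desk", "Outdoor pavement", "Docs-on-docs"]
DISTANCES = ["Close", "Medium"]


def capture_conditions():
    """Return every (angle, lighting, background, distance) combination."""
    return list(product(ANGLES, LIGHTING, BACKGROUNDS, DISTANCES))


if __name__ == "__main__":
    conditions = capture_conditions()
    print(len(conditions))  # 3 * 4 * 4 * 2 = 96
```

A list like this can serve as a checklist when verifying that a delivered set covers every listed condition.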

Statistics

Distribution by gender

Technical Characteristics

Characteristic: Data
Image Extensions: HEIC, JPG
Data Type: Generated
Source and collection methodology: Data was AI-generated.
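
Because the images come in two formats, a loading pipeline may want to separate them first. The sketch below groups files by extension using only the Python standard library. The directory layout is an assumption (the page does not describe how files are organized), and `group_images_by_extension` is a hypothetical helper name.

```python
from pathlib import Path

# Extensions listed on this page; matching is case-insensitive.
IMAGE_EXTENSIONS = {".heic", ".jpg"}


def group_images_by_extension(root):
    """Walk `root` recursively and group image paths by lowercase extension."""
    groups = {ext: [] for ext in sorted(IMAGE_EXTENSIONS)}
    for path in sorted(Path(root).rglob("*")):
        ext = path.suffix.lower()
        if path.is_file() and ext in IMAGE_EXTENSIONS:
            groups[ext].append(path)
    return groups
```

Note that some common image libraries do not decode HEIC out of the box (Pillow, for example, needs a plugin such as pillow-heif), so it can be useful to know early how many files fall into each group.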

Dataset Use Cases

  • Identity Verification & Security Systems

    Developing AI Models for Passport Recognition

    The Synthetic Printed UK Passports Dataset supports research and development in identity verification and document recognition systems. Its diverse images of British passports can be used to train facial recognition and document authentication algorithms, helping improve the recognition technology used by government services and immigration databases.

  • Artificial Intelligence & Machine Learning

    Training Deep Learning Models for Document Classification

    This UK passport dataset serves as training data for machine learning and deep learning models that identify travel documents and identity cards. It consists of synthetic passport photos and structured personal details, so developers can test recognition algorithms without using sensitive personal data.

  • Border Control & Immigration Systems

    Testing Recognition Accuracy in Biometric Verification Tools

    The dataset provides a controlled environment for testing biometric and document verification methods used in immigration systems. Because it simulates real UK passports, it can help border control authorities and teams maintaining government databases improve recognition accuracy, passport validation, and identity checks for foreign nationals.

  • Research & Educational Applications

    Developing and Evaluating Synthetic Identity Datasets

    The dataset suits academic research on recognition systems, biometric passports, and national identity documents. It contains printed passport samples that replicate authentic document layouts while preserving data privacy. Researchers can explore recognition workflows and identity verification models using ethically sourced, non-sensitive information.

FAQs

What is included in this dataset?
The dataset contains 5,000 synthetic passport images, each representing a unique printed document. Every sample is provided in HEIC or JPG format, featuring varied angles (0°, 25°, 45°), lighting conditions, and backgrounds, allowing robust model training and validation.
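
When images are divided into training and validation subsets, one common precaution is to keep all images that share a Passport ID on the same side of the split, so the same document never appears in both. The sketch below does this with a deterministic hash. It assumes that several images can share a Passport ID (suggested by the 96-file sets, but not stated explicitly here); `assign_split` and the ID format are hypothetical.

```python
import hashlib


def assign_split(passport_id, val_fraction=0.2):
    """Deterministically map a passport ID to 'train' or 'val'.

    Every image with the same passport ID gets the same answer, which
    keeps one document from leaking across the split.
    """
    digest = hashlib.sha256(passport_id.encode("utf-8")).digest()
    # Reduce the hash to an integer bucket in the range 0..9999.
    bucket = int.from_bytes(digest[:8], "big") % 10_000
    return "val" if bucket < int(val_fraction * 10_000) else "train"


if __name__ == "__main__":
    # "P-0001" is a made-up placeholder, not a real ID from the dataset.
    print(assign_split("P-0001"))
```

Hashing the ID rather than shuffling a list means the assignment stays stable if more files are added later.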
What types of annotations are provided?
Each synthetic passport image includes detailed metadata annotations. These labels describe passport ID, sample ID, gender, country, age group, lighting type, background, angle, resolution, and timestamp, providing structured reference data for training and benchmarking OCR or ID recognition systems.
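
As a rough illustration of how such metadata might be used, the sketch below models a subset of the listed fields as a record and filters records by capture condition. The field names, value spellings, and storage format are assumptions (the page does not specify whether metadata ships as CSV, JSON, or something else), and the sample values are made-up placeholders.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SampleMetadata:
    # Hypothetical field names based on the labels listed on this page.
    passport_id: str
    sample_id: str
    gender: str
    age_group: str
    angle: str
    light_condition: str
    background: str


def select(records, **criteria):
    """Return the records whose fields equal every given criterion."""
    return [
        r for r in records
        if all(getattr(r, field) == value for field, value in criteria.items())
    ]


if __name__ == "__main__":
    # Placeholder records for illustration only.
    records = [
        SampleMetadata("P-0001", "S-01", "Female", "25-34", "0°", "Dim-light", "Neutral wall"),
        SampleMetadata("P-0001", "S-02", "Female", "25-34", "45°", "Office-LED", "Textured desk"),
    ]
    print(len(select(records, light_condition="Dim-light")))
```

Filtering like this makes it straightforward to benchmark an OCR model on one condition at a time, for example only dim-light captures at 45°.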
Can I request a sample of the dataset before purchasing or downloading it?
Yes. Unidata provides free sample files that allow you to test and evaluate the dataset before purchase.
Is it possible to request a custom dataset?
Yes. You can request a custom synthetic passport dataset tailored to your project needs, such as different lighting, document angles, or background conditions. Unidata’s team can generate AI-based passport images for specific countries or formats to support advanced machine learning and document recognition research.
How was the data collected?
Synthetic Printed UK Passports Dataset was generated using AI-based synthetic data generation techniques. The images replicate real-world passport printing and scanning scenarios under multiple conditions - including diverse lighting, textures, and camera setups - to create realistic and balanced training data.
How are Unidata datasets licensed?
Unidata datasets follow a dual-licensing model: free samples are provided for evaluation and testing, while full datasets are available exclusively through purchase. This ensures transparent access for both academic research and commercial development.
Do Unidata datasets follow GDPR or other data privacy regulations?
Yes. All Unidata datasets comply with GDPR and relevant data privacy regulations. Since the UK passport dataset is entirely synthetic, it does not include real personal or biometric information, ensuring lawful and ethical use for AI research and product development.
Is this a real-world dataset or synthetic data?
This is a synthetic dataset, created using AI-based data generation technology. The dataset contains artificially produced UK passport images that mimic real-world documents while excluding any real or identifiable personal data, making it safe for open research and commercial AI training.

What our clients are saying

UniData

4 3 Reviews


Paul 2025-02-21

Very Positive Experience!

The team was very responsive when requesting a specific dataset, and was able to work with us on what data we specifically needed and custom pricing for our use case. Overall a great experience, and would recommend them to others!


Thorsten 2025-01-09

Very good experience

We got in touch with UniData to buy several datasets from them. Communication was very cooperative, quick, and friendly. We were able to find contract conditions that suited both parties well. I also appreciate the team's dedication to understand and address the needs of the customer. And the datasets we bought from UniData matched with our expectations.

Max Crous 2024-10-08

Data purchase

Our team got in touch with UniData for purchasing video data. The team at UniData was transparent, timely, and pleasant to communicate and negotiate with. Their samples and descriptions aligned well with the data we received. We will certainly reach out to UniData again if we're in search of 3rd party video data.

Abhijeet Zilpelwar 2025-02-26

Data is well organized and easy to…

Data is well organized and easy to consume. We could download and use it for training within few hours of receiving the data links.

Why Choose Us

Unidata offers unparalleled expertise in AI data solutions, delivering superior data quality and optimized workflows

Expertise

Our team consists of industry-leading experts in AI data solutions

Quality

We ensure superior data quality to maximize your AI project's potential

Efficiency

Our optimized workflows accelerate your model training processes

Proven Results

Our track record of case studies demonstrates our ability to deliver outstanding outcomes

Customization

Our track record of case studies demonstrates our ability to deliver outstanding outcomes

Support

We provide ongoing support and consultation to ensure continuous success
1000+ full-time assessors

Ready to get started?

Tell us what you need — we’ll reply within 24h with a free estimate
