License Plate Annotation for Vehicle Recognition System

Image

What happens when a computer vision pipeline depends on complex license plate data — and the data doesn’t exist yet? Our team had 14 days to annotate 100,000 images across regions, languages, and formats, down to every digit and symbol. Here's how we built a high-accuracy dataset under pressure — and why synthetic data couldn't compete.

Timeline 2 weeks
Data 100,000 images with detailed license plate markup (bounding boxes, digits, regional symbols)
Image
Timeline 2 weeks
Data 100,000 images with detailed license plate markup (bounding boxes, digits, regional symbols)

Task:

A long-term client approached us with a task critical for their computer vision pipeline: to accurately annotate vehicle license plates across various regions for use in traffic camera systems. Unlike previous projects, this time the complexity was significantly higher. They needed not only bounding boxes for entire license plates, but also precise annotations for each individual digit, the region code, and specific regional symbols — including those in Arabic script.

Speed and accuracy were paramount. The client planned to train a neural network on this data to automate plate recognition in real-world conditions, and the annotated dataset had to reflect real diversity across regions, styles, and lighting conditions.

Solution:

  • 01

    Task Design & Guidelines Development:

    • Domain Research and Client Interviews:
      We began by exploring the structure of license plates and regional variations across the target geography. This involved consultations with the client to clarify all edge cases and potential ambiguities.
    • Detailed Annotation Guidelines:
      We developed a comprehensive annotation manual covering every object class: full license plate, individual digits, region codes, and national/regional symbols.
      The guide included visual examples, rules for difficult cases (e.g., partially occluded plates), and standardized labeling logic.
    • Edge Case Library & Reference Materials:
      A shared library of rare or borderline cases was created to ensure consistency when annotators encountered unexpected formats or damaged plates.
    • Pilot Run & Calibration:
      A small batch of images was annotated and reviewed collaboratively with the client to lock in quality standards before full-scale production.
  • 02

    Scalable Team Setup & Training:

    • Specialist Team Assembly:
      We onboarded a dedicated team of over 30 trained annotators with prior experience in computer vision tasks, ensuring minimal ramp-up time.
    • Rapid Onboarding Process:
      Annotators went through structured training sessions, completed a qualification test, and participated in guided walkthroughs of the annotation platform.
    • Live Helpdesk Support:
      A real-time support system was launched to provide answers to annotators’ questions on edge cases and tool usage, ensuring uninterrupted productivity.
    • Monitoring Infrastructure:
      A custom dashboard tracked real-time annotation rates, validator load, and individual accuracy metrics to maintain speed without sacrificing quality.
    • Balanced Workflows:
      The project was carefully scheduled to avoid annotator fatigue and ensure consistent throughput, with task assignments adjusted dynamically based on performance.
  • 03

    Iterative Delivery & Quality Assurance Loop:

    • Client Feedback Integration:
      Early deliveries were used to fine-tune both annotation rules and internal QA processes. Feedback from the client was immediately incorporated into updated guidelines.
    • Layered QA Process:
      A two-stage validation approach was used: manual reviews of random samples and logic-based consistency checks across labels (e.g., size coherence between digits and the plate bounding box).
    • Versioning & Rule Updates:
      As new plate formats and exceptions were discovered, the guidelines were updated, and annotators were briefed accordingly. Change logs were maintained and version-controlled.
    • Reporting & Transparency:
      Weekly reports summarized quality metrics, identified bottlenecks, and included recommendations for further refinements or automation opportunities.

Result

  • We delivered 100,000 images annotated with multilayered structure: bounding boxes for full plates, digits, region codes.

  • Delivered on time thanks to early team onboarding, pilot-phase efficiency testing, and a fallback plan involving ML-assisted pre-labeling.

  • >97.5% accuracy verified through manual QA and automated checks

  • No revisions required — all batches approved on first delivery

  • Improved model performance for the client’s plate recognition pipeline, outperforming datasets built with synthetic data

Similar Cases

  • Image
    Data Collection

    Audio Data Collection for Emotion-Sensitive Voice Systems

    Unidata collected 750+ unique audio samples of children’s emotional expressions — enabling emotion recognition in family-focused apps.

    Lean more
  • Image

    Aerial Image Annotation for Urban Planning

    We annotated 132,000+ objects in 11,000 aerial images—streamlining urban planning data with scalable workflows and tailored class logic.

    Lean more
  • Image
    Image Annotation

    Image Annotation for Construction and Heavy Machinery

    We successfully completed a project annotating construction equipment, labeling approximately 5,000 images using object detection methods. Our approach ensured high accuracy and fast turnaround, fully meeting the client’s requirements.

    Lean more
  • Image
    Image Annotation

    Image Annotation for Ore Detection

    We helped a mining company quickly train a model to detect ore granularity and oversized fragments directly on the conveyor belt—cutting processing delays and freeing up internal resources.

    Lean more
  • Image
    Data Collection

    Image Data Collection for Hair Loss Classification Task

    With clear guidelines and a sharp execution strategy, we delivered a high-quality dataset tailored for hair loss classification tasks.

    Lean more

Ready to get started?

Tell us what you need — we’ll reply within 24h with a free estimate

    What service are you looking for? *
    What service are you looking for?
    Data Labeling
    Data Collection
    Ready-made Datasets
    Human Moderation
    Medicine
    Other (please describe below)
    What's your budget range? *
    What's your budget range?
    < $1,000
    $1,000 – $5,000
    $5,000 – $10,000
    $10,000 – $50,000
    $50,000+
    Not sure yet
    Where did you hear about Unidata? *
    Where did you hear about Unidata?
    Head of Client Success
    Andrew
    Head of Client Success

    — I'll guide you through every step, from your first
    message to full project delivery

    Thank you for your
    message

    It has been successfully sent!

    We use cookies to enhance your experience, personalize content, ads, and analyze traffic. By clicking 'Accept All', you agree to our Cookie Policy.