License Plate Annotation for Vehicle Recognition System

Image

What happens when a computer vision pipeline depends on complex license plate data — and the data doesn’t exist yet? Our team had 14 days to annotate 100,000 images across regions, languages, and formats, down to every digit and symbol. Here's how we built a high-accuracy dataset under pressure — and why synthetic data couldn't compete.

Timeline 2 weeks
Data 100,000 images with detailed license plate markup (bounding boxes, digits, regional symbols)
Image
Timeline 2 weeks
Data 100,000 images with detailed license plate markup (bounding boxes, digits, regional symbols)

Task:

A long-term client approached us with a task critical for their computer vision pipeline: to accurately annotate vehicle license plates across various regions for use in traffic camera systems. Unlike previous projects, this time the complexity was significantly higher. They needed not only bounding boxes for entire license plates, but also precise annotations for each individual digit, the region code, and specific regional symbols — including those in Arabic script.

Speed and accuracy were paramount. The client planned to train a neural network on this data to automate plate recognition in real-world conditions, and the annotated dataset had to reflect real diversity across regions, styles, and lighting conditions.

Solution:

  • 01

    Task Design & Guidelines Development:

    • Domain Research and Client Interviews:
      We began by exploring the structure of license plates and regional variations across the target geography. This involved consultations with the client to clarify all edge cases and potential ambiguities.
    • Detailed Annotation Guidelines:
      We developed a comprehensive annotation manual covering every object class: full license plate, individual digits, region codes, and national/regional symbols.
      The guide included visual examples, rules for difficult cases (e.g., partially occluded plates), and standardized labeling logic.
    • Edge Case Library & Reference Materials:
      A shared library of rare or borderline cases was created to ensure consistency when annotators encountered unexpected formats or damaged plates.
    • Pilot Run & Calibration:
      A small batch of images was annotated and reviewed collaboratively with the client to lock in quality standards before full-scale production.
  • 02

    Scalable Team Setup & Training:

    • Specialist Team Assembly:
      We onboarded a dedicated team of over 30 trained annotators with prior experience in computer vision tasks, ensuring minimal ramp-up time.
    • Rapid Onboarding Process:
      Annotators went through structured training sessions, completed a qualification test, and participated in guided walkthroughs of the annotation platform.
    • Live Helpdesk Support:
      A real-time support system was launched to provide answers to annotators’ questions on edge cases and tool usage, ensuring uninterrupted productivity.
    • Monitoring Infrastructure:
      A custom dashboard tracked real-time annotation rates, validator load, and individual accuracy metrics to maintain speed without sacrificing quality.
    • Balanced Workflows:
      The project was carefully scheduled to avoid annotator fatigue and ensure consistent throughput, with task assignments adjusted dynamically based on performance.
  • 03

    Iterative Delivery & Quality Assurance Loop:

    • Client Feedback Integration:
      Early deliveries were used to fine-tune both annotation rules and internal QA processes. Feedback from the client was immediately incorporated into updated guidelines.
    • Layered QA Process:
      A two-stage validation approach was used: manual reviews of random samples and logic-based consistency checks across labels (e.g., size coherence between digits and the plate bounding box).
    • Versioning & Rule Updates:
      As new plate formats and exceptions were discovered, the guidelines were updated, and annotators were briefed accordingly. Change logs were maintained and version-controlled.
    • Reporting & Transparency:
      Weekly reports summarized quality metrics, identified bottlenecks, and included recommendations for further refinements or automation opportunities.

Result

  • We delivered 100,000 images annotated with multilayered structure: bounding boxes for full plates, digits, region codes.

  • Delivered on time thanks to early team onboarding, pilot-phase efficiency testing, and a fallback plan involving ML-assisted pre-labeling.

  • >97.5% accuracy verified through manual QA and automated checks

  • No revisions required — all batches approved on first delivery

  • Improved model performance for the client’s plate recognition pipeline, outperforming datasets built with synthetic data

Similar Cases

  • Image
    Image Annotation

    Semantic Segmentation for Interior Design: A Complex Multiclass Annotation Project

    How do you segment every single object in a cluttered interior photo — 30+ classes per image? We designed a multi-step annotation pipeline to handle complexity without losing precision.

    Lean more
  • Image
    Data Collection

    Fight Detection for a Video Analytics System

    From scenario planning to annotation, we supported a full-cycle dataset build for a CV model trained to detect physical aggression in public spaces.

    Lean more
  • Image
    Audio Labeling services for ml Audio Transcription

    Speaker-Segmented Audio Annotation

    We handled complex, real-world audio by combining automation with expert oversight — capturing every voice, pause, and interruption.

    Lean more
  • Image
    Image Annotation

    Pose Estimation for Proctoring

    How do you teach AI to recognize when a student is cheating during an exam? By accurately annotating 6000 images of real exam scenarios — and that’s exactly what we did.

    Lean more
  • Image
    Image Annotation

    Ore Annotation for a Mining Company

    We helped a mining company quickly train a model to detect ore granularity and oversized fragments directly on the conveyor belt—cutting processing delays and freeing up internal resources.

    Lean more

Ready to get started?

Tell us what you need — we’ll reply within 24h with a free estimate

    What service are you looking for? *
    What service are you looking for?
    Data Labeling
    Data Collection
    Ready-made Datasets
    Human Moderation
    Medicine
    Other (please describe below)
    What's your budget range? *
    What's your budget range?
    < $1,000
    $1,000 – $5,000
    $5,000 – $10,000
    $10,000 – $50,000
    $50,000+
    Not sure yet
    Where did you hear about Unidata? *
    Where did you hear about Unidata?
    Head of Client Success
    Andrew
    Head of Client Success

    — I'll guide you through every step, from your first
    message to full project delivery

    Thank you for your
    message

    It has been successfully sent!

    We use cookies to enhance your experience, personalize content, ads, and analyze traffic. By clicking 'Accept All', you agree to our Cookie Policy.