Home Case Studies Product Grouping for E-commerce

Product Grouping for E-commerce

Thousands of listings. Different sellers. Endless naming variations.

Helping buyers navigate this chaos was the challenge facing one of the top classifieds platforms. To group similar offers under clean, easy-to-browse product cards, they needed a model trained on real, structured data. That’s where Unidata came in — providing expert annotation that cut through the clutter and made model identification not only possible, but scalable.

: Industry Major online classifieds platform

: Timeline 2 months

: Data 20,000 listings with triple annotation overlap

: Industry Major online classifieds platform

: Timeline 2 months

: Data 20,000 listings with triple annotation overlap

Table of Content

Client Request
Our Approach
Results

Client Request

The client aimed to enhance user experience by implementing a system that could automatically and accurately identify the product model mentioned in listing titles and descriptions. The ultimate goal was to help buyers quickly compare relevant offers by grouping similar listings under unified product cards.

To achieve this, the platform engaged Unidata for high-quality data annotation — and we got to work.

Our Approach

01

Technical Scope and Pilot Phase
The client provided detailed guidelines for identifying product models from listing text.
Our team reviewed the instructions and proposed refinements, including:
- How to handle product attributes (e.g., color or storage capacity) when they appeared in titles but weren’t part of the actual model
- How to treat variations in naming conventions across different product categories
During the pilot phase, key challenges included:
- Multilingual listings
- Numerous abbreviations and non-standard formatting
- Ambiguities requiring client clarification
02

Annotation and Review Process
Over the course of two months, our team annotated 20,000 listings, focusing on precise model identification. Key challenges we addressed included:
- Identifying relevant model keywords in long and often cluttered titles
- Extracting model names from product descriptions, especially in categories like fashion, where listings often contained attributes (e.g., sleeve length, material, color) irrelevant to the model itself
- Standardizing model names across similar listings
To ensure consistency across annotators, we:
- Developed a set of internal rules and examples
- Conducted training sessions to reduce subjective variation
- Implemented continuous review and feedback throughout the annotation phase
03

Validation Workflow
All annotations underwent a thorough validation process to ensure accuracy.

Because model identification involved subjective judgment, we took the following steps:
- Held regular sync meetings to align interpretations
- Updated annotation guidelines based on team feedback and corner cases
- Provided ongoing training and clarification sessions for the team
Validator performance was monitored using analytics to:
- Identify outliers or inconsistencies
- Optimize review efficiency
- Improve overall data quality

Results

The model trained on the annotated dataset was successfully deployed into the production system. Listings are now automatically grouped into product cards based on the identified model
The grouping logic correctly handles edge cases and non-standard listings
Real-user testing conducted by the client confirmed the effectiveness of the model even on complex or ambiguous examples

Similar Cases

Image Annotation

Pose Estimation for Proctoring

How do you teach AI to recognize when a student is cheating during an exam? By accurately annotating 6000 images of real exam scenarios — and that’s exactly what we did.
Lean more
Image Annotation

Image Annotation for Retail Product Classification

How do you annotate shelves packed with thousands of ever-changing products? We built a high-speed pipeline to handle real-time updates and ensure merchandising insights stay current.
Lean more
Image Annotation

Image Annotation for Strawberry Ripeness Detection

Our custom dataset powered the transition from manual picking to AI-assisted harvesting — optimizing yield through data-driven ripeness detection.
Lean more
Image Annotation

Image Annotation for Construction and Heavy Machinery

We successfully completed a project annotating construction equipment, labeling approximately 5,000 images using object detection methods. Our approach ensured high accuracy and fast turnaround, fully meeting the client’s requirements.
Lean more
Audio Labeling services for ml Audio Transcription

Multi-Speaker Audio Annotation for Banking

We handled complex, real-world audio by combining automation with expert oversight — capturing every voice, pause, and interruption.
Lean more

Ready to get started?

Tell us what you need — we’ll reply within 24h with a free estimate

What service are you looking for? *

What service are you looking for?

Data Labeling

Data Collection

Ready-made Datasets

Human Moderation

Medicine

Other (please describe below)

What's your budget range? *

What's your budget range?

< $1,000

$1,000 – $5,000

$5,000 – $10,000

$10,000 – $50,000

$50,000+

Not sure yet

Оставьте это поле пустым.

Where did you hear about Unidata? *

Where did you hear about Unidata?

Google LinkedIn Kaggle / Hugging Face / Github Referral (colleague, partner, client) G2 ChatGPT / AI assistant Other

I agree to the Terms of Service and Privacy Policy. By submitting my contact information, I consent to receive emails, messages, and calls from Unidata and its affiliates.

Andrew: Head of Client Success

— I'll guide you through every step, from your first
message to full project delivery

Thank you for your
message

It has been successfully sent!

We use cookies to enhance your experience, personalize content, ads, and analyze traffic. By clicking 'Accept All', you agree to our Cookie Policy.

Product Grouping for E-commerce

Client Request

Our Approach

Technical Scope and Pilot Phase

Annotation and Review Process

Validation Workflow

Results

Similar Cases

Pose Estimation for Proctoring

Image Annotation for Retail Product Classification

Image Annotation for Strawberry Ripeness Detection

Image Annotation for Construction and Heavy Machinery

Multi-Speaker Audio Annotation for Banking

Ready to get started?

Thank you for your message

Ready to get started?

Thank you for your
message