The Task
A design-tech company approached us with an ambitious project. Their model needed pixel-perfect semantic segmentation of interior photos—for every object in view.
We had previously collaborated on a simpler task involving just floor and wall segmentation. This new project, however, scaled in complexity. Now, each photo had to be fully segmented by class, covering over 30 categories—furniture, fixtures, decor, appliances, and more.
The accuracy bar was set high. Precise labeling and consistent separation of foreground/background layers were critical. There was no room for ambiguity or missed elements.
The Solution
-
- 01
-
Preparation and Annotator Training
- Custom Entry Test: We developed a screening test to ensure annotators could differentiate between visually similar classes like armchairs vs. sofas, or shelves vs. cabinets.
- Structured Training Program: Annotators went through video-based training with examples of correct and incorrect segmentation.
- Error Feedback Loop: We set up a fast feedback system with video reviews of common errors, helping annotators improve rapidly.
-
- 02
-
Annotation Workflow
- Per-Object Segmentation: Every object on each photo was segmented individually and labeled according to the class hierarchy.
- Foreground–Background Separation: We created specific guidelines for annotating layered objects (e.g., a vase on a table in front of a wall).
- Consistency Protocols: The team followed a shared labeling structure to maintain consistency across the dataset.
-
- 03
-
Quality Assurance and Validation
- Layered Review: Each annotated image underwent a two-step validation—initial QA check, followed by expert review.
- Systemic Feedback: Repeated errors were flagged and addressed through daily syncs between annotators and validators.
- Hierarchy Tweaks: During the mid-project review, the client requested minor adjustments to layer priorities (foreground/background). We implemented changes without delaying delivery.
The Result
100% of data delivered on time with all requested objects correctly segmented and classified.
Consistent precision across the dataset—especially in challenging areas like overlapping objects and shadow boundaries.
Positive client feedback highlighted the clarity of object boundaries and the low correction rate.