Advanced Message Filtering for Platform Safety

Image

When user trust is at stake, platforms can't afford to let harmful messages slip through.

A major classifieds company needed a reliable way to protect conversations on their platform — without slowing them down. To make that happen, Unidata provided high-precision annotation and validation for a smart filtering and classification system that now helps keep millions of daily interactions safe and respectful.

Industry E-commerce and Retail
Timeline 3 months
Data Dialogues exchanged between users
Image
Industry E-commerce and Retail
Timeline 3 months
Data Dialogues exchanged between users

Client Request

The client—a major player in the classifieds space—sought to develop a message filtering system that could:

  • Prevent the spread of inappropriate or restricted content
  • Improve overall conversation quality on the platform
  • Protect users from violations such as:
    • Offensive or abusive language
    • Personal data disclosure
    • Negative or harmful speech

To achieve this, Unidata was brought in to annotate and validate the dataset, providing the foundation for a model that could reliably detect and categorize sensitive content.

Our Approach

  • 01

    Technical Requirements and Pilot Phase

    The client provided a detailed technical brief outlining classification requirements. Our team proposed additional refinements to ensure a more precise and layered annotation process.

    During the pilot phase, we collaborated closely with the client to:

    • Clarify classification rules for key categories, including:
      • Insults and abusive language
      • Mentions of personal information
      • References to meeting arrangements
      • Negative sentiment directed at the platform
    • Address complex edge cases, such as:
      • Implicit mentions of meeting locations (e.g., vague geographic references without full addresses)
  • 02

    Annotation and Quality Control Process

    Our annotation team at Unidata handled classification by carefully considering:

    • Platform-specific communication patterns
    • Informal language use typical in peer-to-peer messaging
    • The context of each message, not just isolated phrases

    Messages were annotated across several primary categories:

    • Use of profanities or slurs
    • Disclosure of personal or sensitive information
    • Various forms of direct and indirect insults
    • Mentions of meeting points or negotiation outside the platform
  • 03

    Data Validation

    To ensure the highest level of annotation accuracy, we implemented a robust validation workflow:

    • Involved experienced validators to review annotated samples
    • Introduced an interactive error analysis process, which included:
      • Team discussions of edge cases
      • Targeted surveys to refine judgment on difficult categories

    We also conducted training and testing sessions with annotators focused on:

    • Eliminating errors in high-complexity cases
    • Aligning the team on annotation logic and edge-case handling
    • Ensuring consistent interpretation of classification criteria

Results

  • The model trained on our annotated data was successfully tested and deployed on the client’s platform. Internal testing involved evaluating model performance against randomly selected user messages

  • The initial testing phase showed promising results:

    • The model accurately blocked inappropriate or restricted content
    • Responses remained contextually appropriate across various scenarios

Similar Cases

  • Image
    Data Collection

    Fight Detection for a Video Analytics System

    From scenario planning to annotation, we supported a full-cycle dataset build for a CV model trained to detect physical aggression in public spaces.

    Lean more
  • Image
    Image Annotation

    Optimizing Harvest Efficiency

    Our custom dataset powered the transition from manual picking to AI-assisted harvesting — optimizing yield through data-driven ripeness detection.

    Lean more
  • Image

    Aerial Image Annotation for Urban Planning Tools

    We annotated 132,000+ objects in 11,000 aerial images—streamlining urban planning data with scalable workflows and tailored class logic.

    Lean more
  • Image
    NLP Annotation services

    Intent Annotation for a Classified Platform

    In marketplaces, speed and clarity drive conversions — and buyers expect instant answers.
    To meet this demand, one of the top classified platforms set out to build an AI assistant capable of handling frequent questions with precision. Unidata provided the annotated intent data that became the foundation for smart, context-aware responses — helping users get what they need, faster.

    Lean more
  • Image
    Image Annotation

    Product Classification and Shelf Image Annotation for a Retail Client

    How do you annotate shelves packed with thousands of ever-changing products? We built a high-speed pipeline to handle real-time updates and ensure merchandising insights stay current.

    Lean more

Ready to get started?

Tell us what you need — we’ll reply within 24h with a free estimate

    What service are you looking for? *
    What service are you looking for?
    Data Labeling
    Data Collection
    Ready-made Datasets
    Human Moderation
    Medicine
    Other (please describe below)
    What's your budget range? *
    What's your budget range?
    < $1,000
    $1,000 – $5,000
    $5,000 – $10,000
    $10,000 – $50,000
    $50,000+
    Not sure yet
    Where did you hear about Unidata? *
    Where did you hear about Unidata?
    Head of Client Success
    Andrew
    Head of Client Success

    — I'll guide you through every step, from your first
    message to full project delivery

    Thank you for your
    message

    It has been successfully sent!

    We use cookies to enhance your experience, personalize content, ads, and analyze traffic. By clicking 'Accept All', you agree to our Cookie Policy.