Commercial

Robotic Household Activities Dataset

This droid dataset comprises 1,000 hours of multimodal recordings of cleaning, laundry folding, and dishwashing activities, combining head-mounted video with detailed robotic data streams from seven 9-axis IMU units positioned on the chest, upper arms, forearms, and wrists. The robot dataset captures complete robotic data through 3-axis accelerometer (linear acceleration), 3-axis gyroscope (angular velocity), and 3-axis magnetometer (magnetic field orientation) sensors, using onboard sensor-fusion to generate orientation quaternions rather than raw motion signals, enabling precise motion tracking and real-world robotic learning.

Get in touch Download sample
  • Hours
    1,000
robotic dataset
  • Computer Vision
  • Machine Learning
  • Human Activity Recognition
  • Robot Learning
  • Motion Analysis

This droid dataset comprises 1,000 hours of multimodal recordings of cleaning, laundry folding, and dishwashing activities, combining head-mounted video with detailed robotic data streams from seven 9-axis IMU units positioned on the chest, upper arms, forearms, and wrists. The robot dataset captures complete robotic data through 3-axis accelerometer (linear acceleration), 3-axis gyroscope (angular velocity), and 3-axis magnetometer (magnetic field orientation) sensors, using onboard sensor-fusion to generate orientation quaternions rather than raw motion signals, enabling precise motion tracking and real-world robotic learning.

Get in touch Download sample
  • Computer Vision
  • Machine Learning
  • Human Activity Recognition
  • Robot Learning
  • Motion Analysis
  • Hours
    1,000

Dataset Info

Characteristic Data
Description 1,000 hours of multimodal recordings of cleaning, laundry folding, and dishwashing activities
Data types Video (head-mounted), 9-axis IMU streams
Tasks Human Activity Recognition, Motion Analysis, Multimodal Learning, Action Segmentation
Hours of recordings 1,000
IMU placements Chest, Upper arms, Forearms, Wrists
Sensor outputs 3-axis accelerometer, 3-axis gyroscope, 3-axis magnetometer, Orientation quaternions
Labeling Metadata (counter, duration, task, date, video size, sensor size, recording version)
Download sample

Technical
Characteristics

Characteristic Data
Video extension MP4
Video source Head-mounted camera
Extension of labeling file JSON
IMU specification 7 units, 9-axis each
IMU data types Linear acceleration, Angular velocity, Magnetic field orientation
Orientation format Quaternions (from onboard sensor-fusion)
Source and collection methodology. Data was collected by a partner of Unidata.

Dataset Use Cases

  • Robotics & Automation

    Learning Household Manipulation Skills

    This dataset supports robotic learning for everyday manipulation tasks such as cleaning, laundry folding, and dishwashing. With 1,000 hours of synchronized head-mounted video and 9-axis IMU sensor data, it enables model training on real-world motions, trajectories, and object interactions, helping robots learn complex sequences in realistic home environments.

  • Artificial Intelligence Research

    Training Foundation Models with Multimodal Data

    The robot activities dataset is well suited for training foundation models that combine vision, sensor data, and human demonstrations. By capturing detailed sensor data from real robots and humans, researchers can improve model performance in action segmentation, motion understanding, and multimodal learning methods across large-scale robot datasets.

  • Humanoid Robotics

    Human Motion Imitation and Skill Transfer

    For humanoid robotics, this dataset provides valuable training data to study how human movements translate to robot arms and full-body systems. The inclusion of precise orientation quaternions and IMU streams helps models learn smooth motions, grasp strategies, and coordinated actions needed for human-like task execution.

  • Robotic Systems Engineering

    Benchmarking Real-World Robotic Performance

    Engineers can use this robot dataset to benchmark robotic systems under realistic scenarios involving varied objects and tasks. The structured metadata, high-quality video, and sensor recordings allow consistent evaluation of robotic manipulation, making it useful for comparing learning methods and validating performance in real-world applications.

FAQs

What is included in this robotic dataset?
The dataset includes 1,000 hours of multimodal recordings featuring cleaning, laundry folding, and dishwashing tasks. It consists of head-mounted video and synchronized sensor data streams collected from real humans performing household activities.
What types of annotations are provided in the dataset?
Annotations include structured metadata such as task type, recording duration, timestamps, recording version, and sensor file sizes. These labels enable precise alignment between video, sensor data, and manipulation tasks for model training.
Can I request a sample of the dataset before purchasing or downloading it?
Yes. Unidata provides free samples so you can evaluate the video quality, sensor data structure, and labeling format. This allows researchers to assess whether the robotic data supports their robotic systems and learning methods before full access.
What are the sources of data for Unidata robotic datasets?
This robot activities dataset was collected by a verified partner of Unidata. Data collection followed controlled protocols to ensure consistent, high-quality robot and human demonstration recordings.
What sensor data does the dataset contain?
The dataset contains data from seven 9-axis IMU units placed on the chest, upper arms, forearms, and wrists. Each IMU provides linear acceleration, angular velocity, magnetic field orientation, and orientation quaternions generated through onboard sensor fusion.
How are Unidata datasets licensed?
Unidata follows a dual-licensing model. Free samples are available for testing and evaluation, while full robotic datasets are provided exclusively through purchase.
Do Unidata datasets comply with GDPR and data privacy regulations?
Yes. All datasets are curated in compliance with GDPR and applicable data protection laws. Data is collected from legally permissible sources to ensure ethical use in robotics and machine learning research.
How are Unidata datasets stored and managed?
Unidata stores all datasets securely on AWS cloud infrastructure with compliance to ISO 27001 and ISO 27701 standards. This ensures high availability, scalability, and secure handling of robotic data and sensor streams.
How long does it take to receive the dataset after purchase?
Once you submit a request, Unidata will contact you to confirm requirements and complete documentation. After signing and payment, the dataset is delivered within 3–10 days.
Still have questions about using Unidata datasets? Read our user-guides

Unidata Cases

Digital Tree Passport Annotation for Forest Mapping

  • Forestry Monitoring & GIS
  • 2 months
  • 200,000 trees, 10 species classes
Learn more

License Plate Annotation for Vehicle Recognition System

  • 100,000 images with detailed license plate markup (bounding boxes, digits, regional symbols)
  • 2 weeks
Learn more

Sentiment Annotation for Brand Monitoring

  • Marketing & Consumer Insights
  • 12,000 text samples, 3 sentiment classes (positive, negative, neutral)
  • 3 weeks
Learn more

Surveillance Video Annotation for Entrance Monitoring

  • Surveillance & Security
  • 90 minutes of video from three cameras, approximately 50-60 thousand frames
  • 2 week
Learn more

Similar Datasets

  • Image Image Commercial
    • Materials Science
    • Chemistry
    • Physics
    • Data Science
    • Machine Learning

    Materials Platform for Data Science (MPDS) Dataset

    Materials Platform for Data Science Dataset (Project Pauling Files) is a highly curated inorganic materials dataset with a 30-year track record, based on about half a million peer-reviewed scientific publications and powering several successful commercial products from the world's leading publishers. It integrates materials data from 405,100 publications, linking 139,005 phase diagrams, 409,771 crystalline nanostructures, and 1,075,676 physical property sets into 189,682 materials phases within a large-scale materials graph of ~5 million nodes and ~150 million edges, making it a comprehensive materials platform for data science and a cornerstone among modern material science datasets and materials project database resources.

    405,100 Publications
    139,005 Phase diagrams
    409,771 Crystalline nanostructures
    1,075,676 Property sets
    189,682 Material phases

  • Fermatix SWE-Bench dataset fermatix-swe-bench-dataset Commercial
    • Programming languages
    • Machine Learning
    • Automated Code Review
    • Bug Fixing

    SWE-Bench Coding Tasks Dataset

    SWE-Bench Coding Tasks Dataset  is an extended programming languages dataset that builds on the original SWE-Bench benchmark with broader language coverage, golden/test patches, and real-world coding tasks like bug fixing, code completion, and automated code review. It supports coding agents, language models, and developer tools with verified benchmark scores and multi-language test sets.

    8,712 files
    6 programming languages

  • Image Image Commercial
    • Machine Learning
    • Audio Processing
    • ASR
    • Voice Recognition

    Sound Effects Dataset

    Sound Effects Dataset contains 500,000 professionally recorded and curated sound effect audio files, designed for a wide range of creative, research, and machine learning applications. This high-quality audio dataset includes real-world and synthetic audio events, rich metadata, and stereo recordings, enabling audio classification, speech recognition, and scene analysis across diverse acoustic environments.

    500,000 Audio

  • Image Image Commercial
    • Machine Learning
    • Audio Processing
    • ASR
    • Voice Recognition

    Audio Dataset: Various Music Genres

    This music genres dataset contains 500,000 studio-grade music tracks in lossless FLAC format, designed for music genre classification and detection tasks. It provides rich music metadata, including detailed genre labels, instruments, and artist information, making it ideal training data for machine learning and deep learning models in audio analysis.

    500,000 Audio

Why Companies Trust Unidata's Datasets

Share your project requirements, we handle the rest. Every service is tailored, executed, and compliance-ready, so you can focus on strategy and growth, not operations.

70+ Datasets

  • Finance, IT, E-commerce, Retail, Healthcare and 14+ Industries
  • Multiple supported formats
01

Unique & Diverse Data

  • Diversity in ethnicity, age, country, gender, and more
  • Exclusively collected data, not available from open sources
02

Custom Dataset Solutions

  • No manual collection needed from your side; we handle everything
  • Up to 70% cheaper than in-house
03

100% Legal, Secure & Compliant

  • Curated and legally sourced
  • AWS ISO 27001/27701
04

Smooth Collaboration & Fast Delivery

  • 87% of datasets delivered in 3–10 days
  • Dedicated PM, Europe-timezone communication
05

Need Proof?

See the results we've delivered for leading tech companies and startups.

Explore datasets

What our clients are saying

UniData

4 3 Reviews

PA

Paul 2025-02-21

Very Positive Experience!

The team was very responsive when requesting a specific dataset, and was able to work with us on what data we specifically needed and custom pricing for our use case. Overall a great experience, and would recommend them to others!

TH

Thorsten 2025-01-09

Very good experience

We got in touch with UniData to buy several datasets from them. Communication was very cooperative, quick, and friendly. We were able to find contract conditions that suited both parties well. I also appreciate the team's dedication to understand and address the needs of the customer. And the datasets we bought from UniData matched with our expectations.

Max Crous 2024-10-08

Data purchase

Our team got in touch with UniData for purchasing video data. The team at UniData was transparent, timely, and pleasant to communicate and negotiate with. Their samples and descriptions aligned well with the data we received. We will certainly reach out to UniData again if we're in search of 3rd party video data.

Abhijeet Zilpelwar 2025-02-26

Data is well organized and easy to…

Data is well organized and easy to consume. We could download and use it for training within few hours of receiving the data links.

Trusted by the world's biggest brands

Our Clients Love Us

Enterprise Document Automation

Document AI Lead

The dataset gave us strong value for both pilot and early-stage testing. We plan to broaden coverage as deployment scales.

Identity Verification Lab

Deputy Director

The data was good. We passed PAD level 1 from iBeta.

Ready to get started?

Tell us what you need — we’ll reply within 24h with a free estimate

    What service are you looking for? *
    What service are you looking for?
    Data Labeling
    Data Collection
    Ready-made Datasets
    Human Moderation
    Medicine
    Other
    What's your budget range? *
    What's your budget range?
    < $1,000
    $1,000 – $5,000
    $5,000 – $10,000
    $10,000 – $50,000
    $50,000+
    Not sure yet
    Where did you hear about Unidata? *
    Where did you hear about Unidata?
    Head of Client Success
    Andrew
    Head of Client Success

    — I'll guide you through every step, from your first
    message to full project delivery

    Thank you for your
    message

    It has been successfully sent!

    We use cookies to enhance your experience, personalize content, ads, and analyze traffic. By clicking 'Accept All', you agree to our Cookie Policy.