Commercial

Egocentric Video Dataset

This egocentric dataset contains 239.3 hours of first-person videos capturing daily activities in home environments, designed for Physical AI, robotic systems, manipulation tasks, and egocentric vision research. The videos were captured via Pico VR headset and 4 Zed cameras across six subsets: 33 hours at natural speed, 84 hours slow-motion with both hands always in frame, 20 hours of real-speed hand-object interactions, and 102 hours of scripted object transfer tasks using a dual Zed + Pico + tracker setup. Quaternion-based orientation from onboard sensor fusion supports 3D pose estimations and egocentric tracking across first-person perspectives.

Get in touch Download sample
  • Hours
    239.3
egocentric dataset
  • Computer Vision
  • Human Activity Recognition
  • Robot Learning
  • Motion Analysis
  • Egocentric Vision

This egocentric dataset contains 239.3 hours of first-person videos capturing daily activities in home environments, designed for Physical AI, robotic systems, manipulation tasks, and egocentric vision research. The videos were captured via Pico VR headset and 4 Zed cameras across six subsets: 33 hours at natural speed, 84 hours slow-motion with both hands always in frame, 20 hours of real-speed hand-object interactions, and 102 hours of scripted object transfer tasks using a dual Zed + Pico + tracker setup. Quaternion-based orientation from onboard sensor fusion supports 3D pose estimations and egocentric tracking across first-person perspectives.

Get in touch Download sample
  • Computer Vision
  • Human Activity Recognition
  • Robot Learning
  • Motion Analysis
  • Egocentric Vision
  • Hours
    239.3

Dataset Info

Characteristic Data
Description Egocentric video recordings of daily activities in home environments
Data types Video
Tasks Hand Activity Recognition, Egocentric Action Recognition, Hand-Object Interaction
Hours of recordings 239
Number of segments 6
Subset 1 33 hours - Natural speed, hands appear only when necessary (Pico + Trackers)
Subset 2 84 hours - Slow motion, both hands always in frame, detailed kinematics (Pico + Trackers)
Subset 3 20 hours - Real speed, hands as needed, object transferring between surfaces (Pico + Trackers)
Subset 4-6 102.3 hours - Object transferring tasks across 3 scripted scenarios (Zed + Pico + Trackers)
Environments Kitchen, bathroom, living room, etc.
Activities Daily household actions, object transferring
Download sample

Technical
Characteristics

Characteristic Data
Video source Head-mounted camera (Pico VR headset) + 4 Zed cameras
Recording speed Real-time (subsets 1 & 3) / Slow-motion (subset 2)
Hand visibility As needed (subsets 1 & 3) / Both hands always visible (subset 2)
Extension of labeling file txt
Orientation format Quaternions (from onboard sensor-fusion)
Source and collection methodology: Data was captured using two different hardware configurations (Pico + motion trackers and Zed + Pico + motion trackers) while participants performed daily household activities in home environments.

Statistics

Hardware configuration distribution

Dataset Use Cases

  • Robotics & Physical AI

    Teaching Robotic Systems Real-World Manipulation

    First-person video paired with quaternion orientation data gives robotic systems the spatial context they need for manipulation tasks. The slow-motion subset captures detailed hand kinematics, making it practical training data for robotic arms learning object transfers. Six subsets across varied conditions cover the range of daily household actions that real-world robotics actually encounters.

  • Computer Vision & Egocentric Vision Research

    Benchmarking First-Person Action Recognition

    Researchers building egocentric action recognition models get six subsets recorded across varied speeds and hand visibility conditions — something most existing datasets don’t offer. Multimodal data covering hand-object interactions and 3D pose estimations support more accurate activity recognition. The large-scale egocentric footage spans dynamic scenes across kitchens, bathrooms, and living rooms.

  • Healthcare & Rehabilitation

    Analyzing Human Motion for Clinical Applications

    Detailed hand motion data across scripted and naturalistic scenarios supports performance analysis in occupational therapy and motor rehabilitation. Pose estimations and egocentric tracking of hand-object interactions provide measurable visual data for assessing patient progress. The slow-motion subset makes fine-grained human motion visible in ways standard recording speeds miss.

  • Augmented & Virtual Reality

    Improving Hand Tracking in Immersive Environments

    VR and AR developers can use this egocentric data to train hand tracking algorithms that hold up in real home environments. IMU signals and motion capture data from Pico VR headsets reflect how hands actually move during daily activity, covering the first-person perspectives immersive virtual applications need to feel responsive and accurate.

FAQ

What is included in Egocentric Video Dataset?
The dataset contains 239 hours of first-person videos across six subsets, recorded in a home environment, including kitchen, bathroom, and living room. Each recording includes egocentric video from Pico VR headsets and Zed cameras, motion tracker data, and quaternion-based orientation from onboard sensor fusion.
How was the data collected?
Data was captured using two hardware configurations: Pico VR headset with motion trackers, and Zed + Pico + motion trackers, while participants performed daily household activities. Recording conditions vary across subsets, covering real-time speed, slow-motion kinematics, and scripted object transfer scenarios.
Is this dataset unique?
Is this dataset unique? Yes. The combination of slow-motion kinematic recording, dual-camera setup (Zed + Pico), and scripted object transfer scenarios across six structured subsets is not replicated in existing datasets. The multimodal data, such as egocentric video, IMU signals, and quaternion orientation, make it distinct for robotic and Physical AI research.
Is this real-world or synthetic data?
This is entirely real-world data. Participants performed genuine daily household actions, including object transfers across surfaces, giving the dataset the natural human motion variability that synthetic datasets typically lack.
How are Unidata datasets licensed?
Unidata datasets follow a dual-licensing model: free samples are provided for trial and testing, while full datasets are available exclusively through purchase. Licensing terms are confirmed during the request and documentation process.
Do Unidata datasets comply with GDPR?
Yes. All datasets are curated in compliance with GDPR and applicable data protection laws. Data is collected from legally permissible sources to ensure ethical and lawful usage across research and commercial applications.
How are Unidata datasets stored?
All datasets are stored securely on AWS cloud infrastructure, aligned with ISO 27001 and ISO 27701 standards for information security and privacy management. This ensures high availability, scalability, and a privacy-focused environment for handling sensitive visual data.
How long does delivery take?
Once you submit a request, Unidata will contact you to review the details and complete the necessary documents. After signing and payment, the dataset is delivered within 3–10 days.
Still have questions about using Unidata datasets? Read our user-guides

Unidata Cases

Digital Tree Passport Annotation for Forest Mapping

  • Forestry Monitoring & GIS
  • 200,000 trees, 10 species classes
  • 2 months
Learn more

License Plate Annotation for Vehicle Recognition System

  • 100,000 images with detailed license plate markup (bounding boxes, digits, regional symbols)
  • 2 weeks
Learn more

Sentiment Annotation for Brand Monitoring

  • Marketing & Consumer Insights
  • 12,000 text samples, 3 sentiment classes (positive, negative, neutral)
  • 3 weeks
Learn more

Surveillance Video Annotation for Entrance Monitoring

  • Surveillance & Security
  • 90 minutes of video from three cameras, approximately 50-60 thousand frames
  • 2 week
Learn more

Similar Datasets

  • Car Accident Video Dataset Image Commercial
    • Auto
    • Object Detection
    • Machine Learning
    • Computer Vision

    Car Accident Video Dataset

    Car Accident Video Dataset contains 5,000 annotated video clips capturing 3 seconds before, during, and after traffic accidents, providing structured data for car accident detection and prediction tasks. This video dataset supports training models for traffic detection, road accident analysis, and crash recognition, offering consistent MP4 footage for developing reliable accident detection systems.

    5 000 Videos

  • scene scanning dataset scene scanning dataset Commercial
    • Machine learning
    • Human Behavior
    • Pose Estimation
    • Action Recognition
    • Computer Vision

    Scene Scanning Video Dataset

    Scene Scanning Video Dataset includes 10,000 high-resolution (1920×1080, 30 FPS) MP4 videos capturing people looking around and scanning their environment in controlled settings. This video dataset contains annotated metadata labels—such as environment type, camera motion, participant count, gender, duration, and resolution—providing structured training data for computer vision, face detection, face recognition, and object detection.

    10,000 Videos

  • fall detection dataset fall detection dataset Commercial
    • Computer Vision
    • Machine Learning
    • Public Safety
    • Object Detection
    • Action Recognition

    Fall Detection Dataset

    Fall Detection Dataset contains 10,000 high-resolution videos of staged human fall events recorded across indoor and outdoor settings using static and moving cameras. This camera dataset supports fall detection computer vision research by providing well-structured metadata, realistic fall scenarios, and consistent 1080p video quality for training and evaluating detection systems.

    10,000 Videos

  • Image Image Commercial
    • Computer Vision
    • Machine learning
    • Crime
    • Public Safety
    • Object Detection

    Weapon Detection Dataset

    Weapon Detection Dataset contains 10,000 annotated images of staged individuals with visible weapons, collected from public CCTV footage and internet sources for training weapon detection models. This surveillance dataset includes bounding box annotations and rich metadata, supporting accurate weapons detection, small object detection, and real-time security systems across CCTV and camera-based monitoring scenarios.

    10,000 Images

  • Image Image Commercial
    • Computer Vision
    • Machine Learning
    • Crime
    • Public Safety
    • Action Recognition

    Fight Detection Video Dataset

    This fight detection dataset contains 1,000 high-quality videos of simulated physical altercations recorded in controlled environments, captured from static and moving surveillance camera views at up to 1920×1080 resolution and 30 FPS. Designed for violence detection, action recognition, and public safety systems, this surveillance dataset includes rich metadata annotations enabling accurate camera fight analysis and training violence detection models.

    1,000 Videos

  • robot dataset manipulations dataset Commercial

    Lerobot SO-101 Manipulations Dataset

    Lerobot SO-101 Manipulations Dataset is a compact, high-quality set of 20 teleoperated robotic arm recordings (~30 FPS) captured on the calibrated robot studio SO-101 arm using multi-view cameras (front, wrist, top) in the Lerobot framework. Designed for imitation learning and model training on the Lerobot platform, it demonstrates pick-and-place and basic manipulation tasks with structured LeRobot v2.1 annotations. 

    This dataset is intended as a limited reference set and contains 20 episodes; however, full custom data collection in Lerobot format can be provided on request.

    20 Episodes
    ~30 FPS

  • Image Image Commercial
    • Materials Science
    • Chemistry
    • Physics
    • Data Science
    • Machine Learning

    Materials Platform for Data Science (MPDS) Dataset

    Materials Platform for Data Science Dataset (Project Pauling Files) is a highly curated inorganic materials dataset with a 30-year track record, based on about half a million peer-reviewed scientific publications and powering several successful commercial products from the world's leading publishers. It integrates materials data from 405,100 publications, linking 139,005 phase diagrams, 409,771 crystalline nanostructures, and 1,075,676 physical property sets into 189,682 materials phases within a large-scale materials graph of ~5 million nodes and ~150 million edges, making it a comprehensive materials platform for data science and a cornerstone among modern material science datasets and materials project database resources.

    405,100 Publications
    139,005 Phase diagrams
    409,771 Crystalline nanostructures
    1,075,676 Property sets
    189,682 Material phases

  • robotic dataset robotic dataset Commercial
    • Computer Vision
    • Machine Learning
    • Human Activity Recognition
    • Robot Learning
    • Motion Analysis

    Robotic Household Activities Dataset

    This droid dataset comprises 1,000 hours of multimodal recordings of cleaning, laundry folding, and dishwashing activities, combining head-mounted video with detailed robotic data streams from seven 9-axis IMU units positioned on the chest, upper arms, forearms, and wrists. The robot dataset captures complete robotic data through 3-axis accelerometer (linear acceleration), 3-axis gyroscope (angular velocity), and 3-axis magnetometer (magnetic field orientation) sensors, using onboard sensor-fusion to generate orientation quaternions rather than raw motion signals, enabling precise motion tracking and real-world robotic learning.

    1,000 Hours

  • Serbia road traffic example 1 Serbia road traffic images Commercial
    • Auto
    • Detection
    • Data annotation
    • Machine Learning
    • Computer Vision

    Road Traffic in Serbia, Videos and Images Dataset

    This Serbian road traffic dataset contains 5,000 high-resolution (≥1080p) videos and annotated frame images with JSON bounding box annotations for vehicle detection and classification tasks. Collected from bridge vantage points, it provides detailed data on traffic flows, vehicle types (cars and minivans), and traffic density, supporting AI development for traffic prediction, road safety analysis, and computer vision model training.

    5,000 Video

  • Fermatix SWE-Bench dataset fermatix-swe-bench-dataset Commercial
    • Programming languages
    • Machine Learning
    • Automated Code Review
    • Bug Fixing

    SWE-Bench Coding Tasks Dataset

    SWE-Bench Coding Tasks Dataset  is an extended programming languages dataset that builds on the original SWE-Bench benchmark with broader language coverage, golden/test patches, and real-world coding tasks like bug fixing, code completion, and automated code review. It supports coding agents, language models, and developer tools with verified benchmark scores and multi-language test sets.

    8,712 files
    6 programming languages

Why Companies Trust Unidata's Datasets

Share your project requirements, we handle the rest. Every service is tailored, executed, and compliance-ready, so you can focus on strategy and growth, not operations.

70+ Datasets

  • Finance, IT, E-commerce, Retail, Healthcare and 14+ Industries
  • Multiple supported formats
01

Unique & Diverse Data

  • Diversity in ethnicity, age, country, gender, and more
  • Exclusively collected data, not available from open sources
02

Custom Dataset Solutions

  • No manual collection needed from your side; we handle everything
  • Up to 70% cheaper than in-house
03

100% Legal, Secure & Compliant

  • Curated and legally sourced
  • AWS ISO 27001/27701
04

Smooth Collaboration & Fast Delivery

  • 87% of datasets delivered in 3–10 days
  • Dedicated PM, Europe-timezone communication
05

Need Proof?

See the results we've delivered for leading tech companies and startups.

Explore datasets

What our clients are saying

UniData

4 3 Reviews

PA

Paul 2025-02-21

Very Positive Experience!

The team was very responsive when requesting a specific dataset, and was able to work with us on what data we specifically needed and custom pricing for our use case. Overall a great experience, and would recommend them to others!

TH

Thorsten 2025-01-09

Very good experience

We got in touch with UniData to buy several datasets from them. Communication was very cooperative, quick, and friendly. We were able to find contract conditions that suited both parties well. I also appreciate the team's dedication to understand and address the needs of the customer. And the datasets we bought from UniData matched with our expectations.

Max Crous 2024-10-08

Data purchase

Our team got in touch with UniData for purchasing video data. The team at UniData was transparent, timely, and pleasant to communicate and negotiate with. Their samples and descriptions aligned well with the data we received. We will certainly reach out to UniData again if we're in search of 3rd party video data.

Abhijeet Zilpelwar 2025-02-26

Data is well organized and easy to…

Data is well organized and easy to consume. We could download and use it for training within few hours of receiving the data links.

Trusted by the world's biggest brands

Our Clients Love Us

Enterprise Document Automation

Document AI Lead

The dataset gave us strong value for both pilot and early-stage testing. We plan to broaden coverage as deployment scales.

Identity Verification Lab

Deputy Director

The data was good. We passed PAD level 1 from iBeta.

Ready to get started?

Tell us what you need — we’ll reply within 24h with a free estimate

    What service are you looking for? *
    What service are you looking for?
    Data Labeling
    Data Collection
    Ready-made Datasets
    Human Moderation
    Medicine
    Other
    What's your budget range? *
    What's your budget range?
    < $1,000
    $1,000 – $5,000
    $5,000 – $10,000
    $10,000 – $50,000
    $50,000+
    Not sure yet
    Where did you hear about Unidata? *
    Where did you hear about Unidata?
    Head of Client Success
    Andrew
    Head of Client Success

    — I'll guide you through every step, from your first
    message to full project delivery

    Thank you for your
    message

    It has been successfully sent!

    We use cookies to enhance your experience, personalize content, ads, and analyze traffic. By clicking 'Accept All', you agree to our Cookie Policy.