Commercial
American Speech Recognition Dataset
The dataset contains 1,136 hours of annotated telephone dialogues from 1,416 native speakers across the United States. It provides audio recordings, transcriptions, and speaker metadata for speech recognition systems, NLP tasks, and machine learning models that need diverse American English speech data.
- Hours: 1,100+
- Speakers: 1,400+
- Sentence Accuracy Rate: 95%
- NLP
- LLM
- Machine Learning
- Audio Processing
- ASR
- Voice Recognition
Dataset Info
Characteristic | Data |
Description | Audio of telephone dialogues in American English for training NLP models in real-world conversational scenarios. |
Data types | Audio |
Tasks | Speech recognition, NLP |
Country | United States (USA) |
Hours of telephone dialogue | 1,136 |
Number of speakers | 1,416 |
Labeling | Annotation (text content, speaker's ID, gender, age and other attributes) |
Gender | Male (45%), Female (55%) |
Recording device | Android smartphone, iPhone |
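As a rough illustration of what the figures in this table imply, the sketch below derives the average amount of audio per speaker and approximate speaker counts by gender. It assumes the hours are spread evenly across speakers and that the percentages are rounded, neither of which the page states.

```python
# Figures taken from the Dataset Info table.
TOTAL_HOURS = 1136
SPEAKERS = 1416
GENDER_SHARE = {"male": 0.45, "female": 0.55}

# Average audio per speaker, assuming an even spread (an assumption).
avg_minutes_per_speaker = TOTAL_HOURS * 60 / SPEAKERS

# Approximate head counts implied by the rounded percentages.
approx_counts = {
    gender: round(SPEAKERS * share) for gender, share in GENDER_SHARE.items()
}

print(f"{avg_minutes_per_speaker:.1f} minutes of audio per speaker")
print(approx_counts)
```

Under these assumptions the dataset averages roughly 48 minutes of dialogue per speaker, with about 637 male and 779 female speakers.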
Statistics
- Distribution by gender
Technical Characteristics
Characteristic | Data |
Audio Format | WAV |
Sampling Rate | 16 kHz |
Number of Channels | Mono |
Bit Depth | 16-bit |
Recording condition | Low background noise (indoor) |
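The characteristics listed here can be checked programmatically once files are delivered. The following is a minimal sketch using Python's standard `wave` module; the function names `matches_spec` and `uncompressed_bytes` are illustrative, and the size estimate assumes uncompressed PCM audio, which the page does not explicitly confirm.

```python
import wave

# Expected values from the Technical Characteristics table.
EXPECTED_RATE_HZ = 16_000
EXPECTED_CHANNELS = 1            # mono
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16-bit

def matches_spec(path):
    """Return True if the WAV header at `path` matches the listed format."""
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getnchannels() == EXPECTED_CHANNELS
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
        )

def uncompressed_bytes(hours):
    """Estimate raw PCM size in bytes for `hours` of audio in this format."""
    seconds = hours * 3600
    bytes_per_second = (
        EXPECTED_RATE_HZ * EXPECTED_CHANNELS * EXPECTED_SAMPLE_WIDTH_BYTES
    )
    return int(seconds * bytes_per_second)
```

For the full 1,136 hours, `uncompressed_bytes(1136)` gives 130,867,200,000 bytes, or about 131 GB of raw audio before file headers and annotations, which is useful when planning storage and transfer.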
Dataset Use Cases
What is included in the American Speech Recognition Dataset?
This dataset consists of 1,136 hours of telephone dialogues recorded by 1,416 speakers across the United States. The audio files are provided in WAV format with annotations including transcriptions, speaker ID, gender, and age.
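To show how such annotations might be handled in code, here is a hypothetical sketch. The page lists the annotated attributes but not the delivery schema, so the `Utterance` class, its field names, and the `select` helper are all assumptions made for illustration.

```python
from dataclasses import dataclass

@dataclass
class Utterance:
    # Hypothetical field names; the actual annotation schema is not
    # described on the page.
    audio_path: str
    transcription: str
    speaker_id: str
    gender: str
    age: int

def select(records, gender=None, min_age=None, max_age=None):
    """Filter records by optional gender and inclusive age bounds."""
    return [
        r for r in records
        if (gender is None or r.gender == gender)
        and (min_age is None or r.age >= min_age)
        and (max_age is None or r.age <= max_age)
    ]
```

A call such as `select(records, gender="female", min_age=30)` would return only the matching records, which is one way to build demographic subsets for training or evaluation.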
What is this dataset used for?
The dataset is designed for training automatic speech recognition systems, NLP models, and voice assistants. It can also be applied in customer service automation, emotion recognition, and voice command technologies.
How was the data collected?
The recordings were collected in a structured process on mobile devices (Android smartphones and iPhones), indoors with low background noise. These conditions yield clean audio suitable for speech recognition and voice analysis.
Is it possible to request a custom dataset?
Yes, Unidata supports requests for custom datasets. You can specify requirements such as speaker demographics, recording conditions, or annotation formats, making it easier to train more precise recognition systems and voice technology models.

UniData
Why Choose Us
Unidata offers unparalleled expertise in AI data solutions, delivering superior data quality and optimized workflows.
- Expertise: Our team consists of industry-leading experts in AI data solutions.
- Quality: We ensure superior data quality to maximize your AI project's potential.
- Efficiency: Our optimized workflows accelerate your model training processes.
- Proven Results: Our track record of case studies demonstrates our ability to deliver outstanding outcomes.
- Customization: Our track record of case studies demonstrates our ability to deliver outstanding outcomes.
- Support: We provide ongoing support and consultation to ensure continuous success.
- 1,000+ full-time assessors