Best Audio Datasets & Databases

Easily explore, compare & preview top Audio Datasets via Datarade.
47 Results
4.8(1)

TagX Data collection for AI/ ML training | LLM data | Data collection for AI development & model finetuning | Text, image, audio, and document data

by TagX
Whether you require image, audio, or text data, we have the expertise and resources to collect and deliver ... We provide In-field data collection for speech, image, text, and survey data.
Available for 249 countries
10K images/document
99% %
Starts at
$1,000 / month

Nexdata | Multilingual Speech Synthesis Data | 400 Hours | TTS Data | Audio Data | AI Training Data

by Nexdata
, 800TB of image/video data, about 2 billion pieces of NLP data. ... The AI Training Data is recorded by native speaker, with authentic accent and sweet sound.
Available for 42 countries
400 hours
5 years of historical data
95% sentence accuracy
Starts at
$5,000 / purchase
Free sample preview
5.0(1)

WebAutomation Off the Shelf Datasets | Audio Data for AI & ML Training | 600+ Hours of Recording | Speech Recognition, Natural Language Processing

We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings ... Key Features of Our Audio Data Datasets: Vast Collection: Our repository consists of over 600 hours
Available for 64 countries
600 Hours of Recording
Pricing available upon request
5.0(1)

Shaip - Multilingual Conversational AI Training Data (Text & Audio)

by ShAIp
We offered audio data collection and transcription services based on their requirements while fully customizing ... We offered audio data collection and transcription services based on their requirements while fully customizing
Available for 215 countries
20K Hours of Audio
95% Match Rate
Available Pricing:
One-off purchase

Multi-lingual audio recognition service dataset

We can ingest various types of audio content (including speech, video) and generate text output (STT) ... With our system, we can assess audio, video, speech, dialogue and more and output STT text with annotations
Available for 157 countries
Pricing available upon request

Bulgarian audio dataset for speech recognition 10 hours (4/4)

the data. ... High-quality transcriptions come with the data in JSON format.
Available for 1 countries
10 hours
Starts at
€1,250 / purchase

AI-Machine Learning Sound / Audio / Snippet Recordings Database

Other audio-based use cases ... Snippets database has sound / audio / sonic recordings across all kinds of venues (restaurants, bars,
Available for 249 countries
2 years of historical data
Pricing available upon request

Deeply Korean Read Speech Corpus - Audio AI & ML Training Data

by Deeply
The Read Speech dataset consists of 289.9 hours of audio clips of reading the scripts with 3 text sentiments
Available for 1 countries
190K records
99% Validity
Pricing available upon request

US Public Companies Earning Calls Audio and Video Database - FactSquared Transcribe

FactSquared Transcribe provides automated, full-text, searchable, indexed feeds of audio and video content ... FactSquared Transcribe provides automated, full-text, searchable, indexed feeds of audio and video content
Available for 1 countries
Pricing available upon request

Data Collection by EPIC Translations: Copywriting, Text & Audio Data Data for AI & ML Training

Text Data Collection Audio Data Collection Chatbot Training Data Copywriting Crowdsourcing ... oil pressure rates based on various factors for use in oil production purposes. d) The collection of audio
Available for 215 countries
50K sentences
12 weeks of historical data
100% match rate
Pricing available upon request
10% Datarade discount
10% revenue share

Monetize data on Datarade Marketplace

List your data on our global B2B marketplace to reach 100k monthly buyers

More Audio Data Products

Discover related audio data products.

35K Hours
98% sentence/word
60 countries covered
Nexdata has off-the-shelf 35,000 hours Machine Learning (ML) Data of 16kHz conversational speech, covering 100+ countries including English, German, French, ...
65K Hours
98% sentence/word
94 countries covered
Off-the-shelf read speech data cover 100+ languages. All the Machine Learning (ML) Data are collected from native speakers, with signed authorization agreeme...
50K Hours
98% sentence/word
21 countries covered
The recorded text is a mixture multi-language sentences, covering general scenes and human-computer interaction scenes. The audio data is rich in content and...
1K hour per month
99.5% word accuracy
136 countries covered
Nexdata provides multi-language, multi-timbre, multi-domain and multi-style speech synthesis data collection servicesfor Deep Learning Data.
15K Hours
98% sentence/word
61 countries covered
The Natural Language Processing (NLP) Data of in-car speech covers 20+ languages, including read, wake-up word, commend word, code-swithing, multimodal and n...
100K hours per month
99.5% word accuracy
136 countries covered
Nexdata provides high-quality Speech Data services for speech cleaning, speech transcription, phoneme annotation etc, with word accuracy of 99.5% and phoneme...
95% match rate
213 countries covered
10 years of historical data
Custom Data Collection Services by ShAIp - Any subject. Any scenario be it Text, Audio, Image or Video.
20K Hours of Audio
95% Match Rate
215 countries covered
We help the client source, curate, & transcribe the right set of data required to train AI/ML model, with utmost precision. We offered audio data collection ...
249 countries covered
15 years of historical data
Primary-sourced, publicly available corporate event data covering 9,000 equities worldwide including: Earnings calendar, DateBreaks (earnings date revisions)...
50K Hours
98% sentence/word
21 countries covered
The recorded text is a mixture multi-language sentences, covering general scenes and human-computer interaction scenes. The audio data is rich in content and...
20K voice memos
240 countries covered
We help clients source, curate, and transcribe data for AI and machine learning models. Our services include customized audio data collection and transcripti...
249 countries covered
Kieli is a professional data analytics company that provides data labelling and data annotation for hundreds of use cases, including NLP of Arabic texts.
10 hours
Bulgaria covered
Fourth dataset of 10 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-quali...
20 hours
Bulgaria covered
The third dataset of 20 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-qu...
20 hours
Bulgaria covered
The second dataset of 20 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-q...
20 hours
Bulgaria covered
The first dataset of 20 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-qu...
10 hours
Lithuania covered
The fifth dataset consisting 10 hours of Lithuanian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise an...
20 hours
Lithuania covered
Fourth dataset of 20 hours of Lithuanian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-qual...

Users also searched for