Best Audio Datasets & Databases
Easily explore, compare & preview top Audio Datasets via Datarade.
49 Results
All Podcast Audio - Metadata for 3.5m podcasts & 176m episodes worldwide
by
Listen Notes
Access the most up-to-date, comprehensive podcast audio database, with 35+ rich data attributes for podcasts ... direct playable audio urls)
* Features 35+ data fields , such as basic metadata, global rank, RSS feed
Available for 250 countries
177M episodes
20 years of historical data
Available Pricing:
One-off purchase
Yearly License
Free sample preview
Mixed Speech Data |5,000 Hours |Code-switching|Audio Data| Speech Recognition Data| AI Datasets
by
Nexdata
Audio Data and 800TB of Annotated Imagery Data. ... The audio data is rich in content and accurate in transcription.
1.
Available for 27 countries
50K Hours
5 years of historical data
98% sentence/word
Starts at
$20,000 / purchase
Free sample preview
TagX Data collection for AI/ ML training | LLM data | Data collection for AI development & model finetuning | Text, image, audio, and document data
by
TagX
Whether you need raw data or a processed dataset, we can deliver the data in your preferred format, including ... We provide In-field data collection for speech, image, text, and survey data.
Available for 249 countries
10K images/document
99% %
Starts at
$1,000 / month
Shaip - Multilingual Conversational AI Training Data (Text & Audio)
by
ShAIp
We offered audio data collection and transcription services based on their requirements while fully customizing ... We offered audio data collection and transcription services based on their requirements while fully customizing
Available for 215 countries
20K Hours of Audio
95% Match Rate
Available Pricing:
One-off purchase
Norwegian audio dataset for speech recognition 20 hours (1/5)
by
StageZero
resell the data. ... - Maximum four hours of speech per person in the dataset.
Available for 1 countries
20 hours
Starts at
€2,500 / purchase
Multi-lingual audio recognition service dataset
by
Overtone
Overtone's APIs allow customers to use our state of the art machine learning and large language models ... We can ingest various types of audio content (including speech, video) and generate text output (STT)
Available for 157 countries
Pricing available upon request
AI-Machine Learning Sound / Audio / Snippet Recordings Database
by
SoundPrint
Snippets database has sound / audio / sonic recordings across all kinds of venues (restaurants, bars, ... Snippets database has sound / audio / sonic recordings across all kinds of venues (restaurants, bars,
Available for 249 countries
2 years of historical data
Pricing available upon request
Deeply Korean Read Speech Corpus - Audio AI & ML Training Data
by
Deeply
The Read Speech dataset consists of 289.9 hours of audio clips of reading the scripts with 3 text sentiments ... The dataset also includes metadata such as a script(speech-to-text aligned), speaker, age, sex, noise
Available for 1 countries
190K records
99% Validity
Pricing available upon request
Data Collection by EPIC Translations: Copywriting, Text & Audio Data Data for AI & ML Training
Text Data Collection
6. Audio Data Collection
7. Chatbot Training Data
8. Copywriting
9. ... Data Entry
11. Data Mining
12.
Available for 215 countries
50K sentences
12 weeks of historical data
100% match rate
Pricing available upon request
10% Datarade discount
10% revenue share
Podcast Database - Complete Podcast Metadata, All Countries & Languages
by
Listen Notes
Access the most up-to-date, comprehensive podcast database, complete with 35+ rich data attributes. ... Attributes ==
See the full list of data attributes on this page: https://www.listennotes.com/podcast-datasets
Available for 250 countries
3.5M podcasts
20 years of historical data
Available Pricing:
One-off purchase
Yearly License
Free sample preview
Can't find the data you're looking for?
Let data providers come to you by posting your request
Post your request
More Audio Data Products
Discover related audio data products.
15K Hours
98% sentence/word
84 countries covered
Nexdata has off-the-shelf 15,000 hours Machine Learning (ML) Data of 8kHz conversational speech, covering 100+ countries including English, German, French, S...
50K Hours
98% sentence/word
27 countries covered
The recorded text is a mixture multi-language sentences, covering general scenes and human-computer interaction scenes. The audio data is rich in content and...
160 Records
236 countries covered
1 years of historical data
The world’s largest noise complaint dataset with over 160K reports including labeled noise sources. Ideal for AI training in acoustic event detection and urb...
177M episodes
250 countries covered
20 years of historical data
Access the most up-to-date, comprehensive podcast audio database, with 35+ rich data attributes for podcasts and all audio urls with over 24,000 years! Our m...
15K Hours
98% sentence/word
82 countries covered
The Natural Language Processing (NLP) Data of in-car speech covers 20+ languages, including read, wake-up word, commend word, code-swithing, multimodal and n...
40K Hours
98% sentence/word
58 countries covered
The speech data is collected from native English speakers in 40 countries,covering a varity of pronunciation habits and characteristics. The script is design...
10M Measurements
95% Precision
237 countries covered
Street noise-level data from any city. Analyze noise exposure across 200+ countries for risk modeling, real estate, AI-training and health studies. Real meas...
20 hours
Norway covered
The fourth part of 20 hours of Norwegian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-qual...
20K Hours of Audio
95% Match Rate
215 countries covered
We help the client source, curate, & transcribe the right set of data required to train AI/ML model, with utmost precision. We offered audio data collection ...
95% match rate
213 countries covered
10 years of historical data
Custom Data Collection Services by ShAIp - Any subject. Any scenario be it Text, Audio, Image or Video.
10K images/document
99% %
249 countries covered
TagX specializes in data collection for Artificial intelligence, data analytics, and other software solutions. We provide In-field data collection for speech...
249 countries covered
2 years of historical data
Snippets database has sound / audio / sonic recordings across all kinds of venues (restaurants, bars, arenas, churches, movie theaters, retail and many more)...
177M episodes
250 countries covered
20 years of historical data
Access the most up-to-date, comprehensive podcast audio database, with 35+ rich data attributes for podcasts and all audio urls with over 24,000 years! Our m...
177M episodes
100% uptime
250 countries covered
Developer-friendly and Enterprise-grade Podcast API. Structured, relevant, real-time.
Search the meta data of 3,500,000+ podcasts and 177,000,000+ episode...
3.5M podcasts
250 countries covered
20 years of historical data
Access the most up-to-date, comprehensive podcast database, complete with 35+ rich data attributes. Our meticulously curated dataset guarantees top-tier qual...
10M Measurements
95% Precision
237 countries covered
Street noise-level data from any city. Analyze noise exposure across 200+ countries for risk modeling, real estate, AI-training and health studies. Real meas...
10M Hours
95% Precision
236 countries covered
Starter dataset for AI teams with sampled noise (from 10M+ hours of measurements), mobility, and POI data. Ideal for rapid prototyping and AI research. CSV o...
35B Data Points
95% Precision
236 countries covered
Combines 10M+ hours of noise data with mobility and POI visitation data. Ideal for AI models combining environmental, mobility, and behavioral signals. CSV o...