Best Speech Datasets & Databases
Easily explore, compare & preview top Speech Datasets via Datarade.
Refine your data search
Refine your data search
Recommended Speech Data Products
34 Results
Mixed Speech Data |5,000 Hours |Code-switching|Audio Data| Speech Recognition Data| AI Datasets
by
Nexdata
Audio Data and 800TB of Annotated Imagery Data. ... The audio data is rich in content and accurate in transcription.
1.
Available for 28 countries
50K Hours
5 years of historical data
98% sentence/word
Starts at
$20,000 / purchase
Free sample preview
Way With Words' isiZulu Speech Collection Dataset
by
WayWithWords
Thank you for your interest in Way With Words' off-the-shelf Speech Collection Dataset in South African ... This dataset is equally split across four domains: Insurance, Retail, Debt Collection, and Travel.
Available for 1 countries
50 Hours
99% Accurate
Available Pricing:
One-off purchase
Usage-based
Free sample preview
WebAutomation Off the Shelf Datasets | Audio Data for AI & ML Training | 600+ Hours of Recording | Speech Recognition, Natural Language Processing
We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings ... High-Quality Recordings: We prioritize the quality of our audio data, ensuring clear and professional
Available for 64 countries
600 Hours of Recording
Pricing available upon request
FactSquared Stock Sentiment Speech Analytics Data USA
by
FactSquared
FactSquared Analyze offers unique data-driven insights into what public figures are -- and aren’t -- ... FactSquared Analyze offers unique data-driven insights into what public figures are -- and aren’t --
Available for 1 countries
Pricing available upon request
Bulgarian audio dataset for speech recognition 20 hours (1/4)
by
StageZero
resell the data. ... - Maximum four hours of speech per person in the dataset.
Available for 1 countries
20 hours
Starts at
$2,500 / purchase
Deeply Korean Read Speech Corpus - Audio AI & ML Training Data
by
Deeply
The dataset also includes metadata such as a script(speech-to-text aligned), speaker, age, sex, noise ... The Read Speech dataset consists of 289.9 hours of audio clips of reading the scripts with 3 text sentiments
Available for 1 countries
190K records
99% Validity
Pricing available upon request
8kHz Conversational Speech Data | 15,000 Hours | Audio Data | Speech Recognition Data| Machine Learning (ML) Data
by
Nexdata
Audio Data and 800TB of Annotated Imagery Data. ... Nexdata has off-the-shelf 15,000 hours Machine Learning (ML) Data of 8kHz conversational speech, covering
Available for 86 countries
15K Hours
5 years of historical data
98% sentence/word
Starts at
$20,000 / purchase
Free sample preview
Way With Words' seSotho Speech Collection Dataset
by
WayWithWords
Thank you for your interest in Way With Words' off-the-shelf Speech Collection Dataset in South African ... This dataset is equally split across four domains: Insurance, Retail, Debt Collection, and Travel.
Available for 1 countries
50 Hours
99% Accurate
Available Pricing:
One-off purchase
Usage-based
Free sample preview
Lithuanian audio dataset for speech recognition 10 hours (5/5)
by
StageZero
resell the data. ... - Maximum four hours of speech per person in the dataset.
Available for 1 countries
10 hours
Starts at
€1,500 / purchase
Speech Recognition Data Collection Services | 100+ Languages Resources |Audio Data | Speech Recognition Data | Machine Learning (ML) Data
by
Nexdata
(ML) Data.
1. ... regions, and provide various types of speech recognition data collection services for Machine Learning
Available for 117 countries
100K hours per month
5 years of historical data
99.5% word accuracy
Starts at
$20,000 / purchase
Free sample preview
Can't find the data you're looking for?
Let data providers come to you by posting your request
Post your request
More Speech Data Products
Discover related speech data products.
400 hours
95% sentence accuracy
60 countries covered
Speech Synthesis speech data is recorded by native speaker, with authentic accent and sweet sound. The phoneme coverage is balanced. Professional phonetician...
35K Hours
98% sentence/word
80 countries covered
Nexdata has off-the-shelf 35,000 hours Machine Learning (ML) Data of 16kHz conversational speech, covering 100+ countries including English, German, French, ...
50 Hours
99% Accurate
South Africa covered
50 hours of simulated, unscripted agent-caller dialogue.
Domains include: Insurance, Retail, Debt Collection, Travel.
63 participants from all South Africa...
100K hours per month
99.5% word accuracy
117 countries covered
Nexdata is equipped with professional recording equipment and has resources pool of 70+ countries and regions, and provide various types of speech recognitio...
50K Hours
98% sentence/word
28 countries covered
The recorded text is a mixture multi-language sentences, covering general scenes and human-computer interaction scenes. The audio data is rich in content and...
1K hour per month
99.5% word accuracy
114 countries covered
Nexdata provides multi-language, multi-timbre, multi-domain and multi-style speech synthesis data collection servicesfor Deep Learning Data.
1K hour per month
99.5% word accuracy
114 countries covered
Nexdata provides multi-language, multi-timbre, multi-domain and multi-style speech synthesis data collection servicesfor Deep Learning Data.
20K Hours
98% accuracy
71 countries covered
Off-the-shelf 20,000 hours Unscripted Call Center Telephony Speech Data, covering 30+ languages including English, German, French, Spanish, Italian, Portugue...
190K records
99% Validity
South Korea covered
Pairs of Korean speakers reading a script with 3 distinct text sentiments, with 3 distinct voice sentiments, are recorded. The recordings took place in 3 dif...
USA covered
FactSquared Analyze offers unique data-driven insights into what public figures are -- and aren’t -- saying in their public comments on market-moving topics.
600 Hours of Recording
64 countries covered
We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings. Our audio datasets are meticulously curated and de...
50 Hours
99% Accurate
South Africa covered
50 hours of simulated, unscripted agent-caller dialogue.
Domains include: Insurance, Retail, Debt Collection, Travel.
50+ participants from KwaZulu-Natal, ...
20K Hours
98% accuracy
71 countries covered
Off-the-shelf 20,000 hours Unscripted Call Center Telephony Speech Data, covering 30+ languages including English, German, French, Spanish, Italian, Portugue...
1M hours
95% Accuracy
47 countries covered
Off-the-shelf 1 million hours of Unsupervised speech dataset, covering 10+ languages(English, French, German, Japanese, Arabic, Mandarin and etc. , 100,000 h...
20K hours
98% Word Accuracy Rate
41 countries covered
Off-the-shelf 20,000 hours of Real-world Casual Conversation Speech data, covering 30+ languages. Covering diverse domains like self-media, conversations, li...
10 hours
Bulgaria covered
Fourth dataset of 10 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-quali...
20 hours
Bulgaria covered
The third dataset of 20 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-qu...
20 hours
Bulgaria covered
The second dataset of 20 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-q...