33 Results

Nexdata | Multilingual Code-switching Speech Data | 5,000 Hours | Audio Data | Speech Recognition Data | AI Training Data

by Nexdata
, 1 million hours of Audio Data and 800TB of Annotated Imagery Data. ... mono channel Recording environment : quiet indoor environment, without echo Recording content (read speech
Available for 28 countries
50K Hours
5 years of historical data
98% sentence/word
Starts at
$5,000 / purchase
Free sample preview
5.0(1)

WebAutomation Off the Shelf Datasets | Audio Data for AI & ML Training | 600+ Hours of Recording | Speech Recognition, Natural Language Processing

recordings that capture the intricacies of human speech. ... We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings
Available for 64 countries
600 Hours of Recording
Pricing available upon request
4.4(2)

Way With Words' Afrikaans Speech Collection Dataset

Thank you for your interest in Way With Words’ off-the-shelf Speech Collection Dataset in South African
Available for 1 country
50 Hours
99% Accurate
Available Pricing:
One-off purchase
Usage-based
Free sample preview

FactSquared Stock Sentiment Speech Analytics Data USA

and more than 250 other factors, indexed to speaker’s historical speech patterns. ... FactSquared Analyze offers unique data-driven insights into what public figures are – and aren’t – saying
Available for 1 country
Pricing available upon request

Deeply Korean Read Speech Corpus - Audio AI & ML Training Data

by Deeply
The dataset also includes metadata such as a script (speech-to-text aligned), speaker, age, sex, noise ... The Read Speech dataset consists of 289.9 hours of audio clips of reading the scripts with 3 text sentiments
Available for 1 country
190K records
99% Validity
Pricing available upon request

Bulgarian audio dataset for speech recognition 10 hours (4/4)

the data. ... Speech is recorded and transcribed on separate tracks.
Available for 1 country
10 hours
Starts at
€1,250 / purchase

Nexdata | Speech Recognition Data Collection Services | 100+ Languages Resources | Audio Data | Speech Recognition Data | Machine Learning (ML) Data

by Nexdata
recognition data collection services for Machine Learning (ML) Data. ... Please visit us at https://www.nexdata.ai/service/speech-recognition?source=Datarade
Available for 117 countries
100K hours per month
5 years of historical data
99.5% word accuracy
Starts at
$5,000 / purchase
Free sample preview
4.4(2)

Way With Words' seSotho Speech Collection Dataset

Thank you for your interest in Way With Words’ off-the-shelf Speech Collection Dataset in South African
Available for 1 country
50 Hours
99% Accurate
Available Pricing:
One-off purchase
Usage-based
Free sample preview

Bulgarian audio dataset for speech recognition 20 hours (3/4)

the data. ... Speech is recorded and transcribed on separate tracks.
Available for 1 country
20 hours
Starts at
€2,500 / purchase

Nexdata | Multilingual Read Speech Data | 65,000 Hours | Generative AI Audio Data | Speech Recognition Data | Machine Learning (ML) Data

by Nexdata
Off-the-shelf read speech data cover 100+ languages. ... of Audio Data and 800TB of Annotated Imagery Data.
Available for 102 countries
65K Hours
5 years of historical data
98% sentence/word
Starts at
$5,000 / purchase
Free sample preview

More Speech Data Products

Discover related speech data products.
40K Hours
98% sentence/word
54 countries covered
The speech data is collected from native English speakers in 40 countries, covering a variety of pronunciation habits and characteristics. The script is design...
50 Hours
99% Accurate
South Africa covered
50 hours of simulated, unscripted agent-caller dialogue. Domains include: Insurance, Retail, Debt Collection, Travel. 49 participants from Limpopo, North-W...
1K hours per month
99.5% word accuracy
114 countries covered
Nexdata provides multi-language, multi-timbre, multi-domain and multi-style speech synthesis data collection services for Deep Learning Data.
65K Hours
98% sentence/word
102 countries covered
Off-the-shelf read speech data cover 100+ languages. All the Machine Learning (ML) Data are collected from native speakers, with signed authorization agreeme...
35K Hours
98% sentence/word
81 countries covered
Nexdata has off-the-shelf 35,000 hours Machine Learning (ML) Data of 16kHz conversational speech, covering 100+ countries including English, German, French, ...
50 Hours
99% Accurate
South Africa covered
50 hours of simulated, unscripted agent-caller dialogue. Domains include: Insurance, Retail, Debt Collection, Travel. 50+ participants from KwaZulu-Natal, ...
600 Hours of Recording
64 countries covered
We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings. Our audio datasets are meticulously curated and de...
400 hours
95% sentence accuracy
60 countries covered
The AI Training Data is recorded by native speakers with authentic accents and a sweet sound. The phoneme coverage is balanced. Professional phonetician partici...
1M hours
95% Accuracy
74 countries covered
Off-the-shelf 1 million hours of unsupervised speech data, covering 10+ languages (English, French, German, Japanese, Arabic, Mandarin, etc., 100,000 h...
USA covered
FactSquared Analyze offers unique data-driven insights into what public figures are -- and aren’t -- saying in their public comments on market-moving topics.
190K records
99% Validity
South Korea covered
Pairs of Korean speakers reading a script with 3 distinct text sentiments, with 3 distinct voice sentiments, are recorded. The recordings took place in 3 dif...
20K hours
98% Word Accuracy Rate
41 countries covered
Off-the-shelf 20,000 hours of Casual Conversation Speech data, covering 30+ languages. Covering diverse domains like self-media, conversations, live streams,...
10 hours
Bulgaria covered
Fourth dataset of 10 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-quali...
20 hours
Bulgaria covered
The third dataset of 20 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-qu...
20 hours
Bulgaria covered
The second dataset of 20 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-q...
20 hours
Bulgaria covered
The first dataset of 20 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-qu...