
Unscripted Call Center Telephony Speech Data | 20,000 Hours | Speech Recognition Data | Speech AI Datasets

Nexdata
No reviews yet
Verified Data Provider
Volume: 20K hours
Data quality: 98% accuracy
Available formats: .bin, .json, and .xml files
Coverage: 71 countries
History: 5 years

Description

Off-the-shelf 20,000 hours of unscripted call center telephony speech data, covering 30+ languages including English, German, French, Spanish, Italian, Portuguese, Korean, Japanese, Hindi, and Arabic. It covers multiple domains such as finance, real estate, sales, health, insurance, and telecom.
1. Overview
Format: 8 kHz, 16-bit, WAV, mono channel
Recording condition: Phone recording system with low background noise (call center scenario)
Recording content: Spontaneous inbound and outbound calls in typical domains such as finance, real estate, sales, health, insurance, and telecom
Languages: English, German, French, Spanish, Italian, Portuguese, Korean, Japanese, Hindi, Arabic, Dutch, Swedish, Norwegian, and others
Annotation features: Transcription text, timestamps, speaker ID, gender, noise, PII redacted
Accuracy: Word Accuracy Rate (WAR) of 98%

2. About Nexdata
Nexdata owns off-the-shelf PB-level Large Language Model (LLM) data, 1 million hours of audio data, and 800 TB of annotated imagery data. This ready-to-go Machine Learning (ML) data supports instant delivery and can quickly improve the accuracy of AI models. For more details, visit https://www.nexdata.ai/datasets/speechrecog?source=Datarade
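As a rough way to check a delivered sample against the format stated above (8 kHz, 16-bit, mono WAV), the following sketch uses Python's standard `wave` module. The function names are hypothetical, and the sketch assumes the files are uncompressed PCM WAV, which the listing implies but does not state outright; `wave` raises `wave.Error` for encodings it cannot read.

```python
import wave

# Expected values taken from the listing's "Format" line.
EXPECTED_RATE_HZ = 8000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16-bit samples
EXPECTED_CHANNELS = 1  # mono


def check_wav_format(path):
    """Return a list of mismatches between a WAV file and the advertised format."""
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_RATE_HZ:
            problems.append(f"sample rate is {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH_BYTES:
            problems.append(f"sample width is {wav.getsampwidth() * 8} bit")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"channel count is {wav.getnchannels()}")
    return problems


def pcm_size_bytes(hours):
    """Uncompressed PCM payload size for the advertised format, ignoring file headers."""
    bytes_per_second = (
        EXPECTED_RATE_HZ * EXPECTED_SAMPLE_WIDTH_BYTES * EXPECTED_CHANNELS
    )
    return int(hours * 3600 * bytes_per_second)
```

Under these assumptions, `pcm_size_bytes(20000)` gives 1,152,000,000,000 bytes, so the full 20,000 hours would be roughly 1.15 TB of raw audio before transcripts and metadata.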

Country Coverage

Africa (6)
Algeria
Egypt
Libya
Morocco
South Africa
Tunisia
Asia (25)
Afghanistan
Bahrain
Hong Kong
India
Indonesia
Iran (Islamic Republic of)
Iraq
Japan
Jordan
Korea (Republic of)
Kuwait
Lebanon
Macao
Malaysia
Myanmar
Pakistan
Philippines
Qatar
Saudi Arabia
Singapore
Taiwan
Thailand
Turkey
United Arab Emirates
Vietnam
Europe (21)
Austria
Belgium
Czech Republic
Denmark
Finland
France
Germany
Greece
Hungary
Italy
Luxembourg
Netherlands
Norway
Poland
Portugal
Romania
Russian Federation
Spain
Sweden
Switzerland
United Kingdom
North America (6)
Canada
Costa Rica
El Salvador
Mexico
Panama
United States of America
Oceania (2)
Australia
New Zealand
South America (11)
Argentina
Bolivia (Plurinational State of)
Brazil
Chile
Colombia
Cuba
Dominican Republic
Ecuador
Peru
Uruguay
Venezuela (Bolivarian Republic of)

History

5 years of historical data

Volume

20,000 Hours

Pricing

Free sample available
License: Starts at
One-off purchase: $20,000 / purchase
Monthly license: Not available
Yearly license: Not available
Usage-based: Not available

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Quality

Self-reported by the provider
98%
accuracy
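
Because the accuracy figure is self-reported, a buyer may want to spot-check it on a sample. The sketch below assumes the common definition of word accuracy as 1 minus the word error rate (word-level edit distance divided by the number of reference words); the listing does not say how the provider scores it, and the function names are hypothetical. Here the reference would be an independently verified transcript and the hypothesis the delivered one.

```python
def word_edit_distance(reference, hypothesis):
    """Minimum number of word substitutions, deletions, and insertions."""
    ref = reference.split()
    hyp = hypothesis.split()
    # previous[j] holds the distance between the processed reference
    # prefix and the first j hypothesis words.
    previous = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = previous[j - 1] + (ref_word != hyp_word)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


def word_accuracy_rate(reference, hypothesis):
    """Word accuracy as 1 - WER, relative to the reference length."""
    ref_len = len(reference.split())
    if ref_len == 0:
        raise ValueError("reference transcript is empty")
    return 1.0 - word_edit_distance(reference, hypothesis) / ref_len
```

This version does no text normalization (casing, punctuation, number formats), so real transcripts would usually need to be normalized the same way on both sides before scoring.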

Delivery

Methods
S3 Bucket
SFTP
Email
UI Export
REST API
SOAP API
Streaming API
Feed API
Frequency
secondly
minutely
hourly
daily
weekly
monthly
quarterly
yearly
real-time
on-demand
Format
.bin
.json
.xml
.csv
.xls
.sql
.txt

Use Cases

Artificial Intelligence (AI)
Machine Learning (ML)
Data Cleansing
Data Labeling

Frequently asked questions

What is Unscripted Call Center Telephony Speech Data 20,000 Hours Speech Recognition Data Speech AI Datasets?

Off-the-shelf 20,000 hours of unscripted call center telephony speech data, covering 30+ languages including English, German, French, Spanish, Italian, Portuguese, Korean, Japanese, Hindi, and Arabic. It covers multiple domains such as finance, real estate, sales, health, insurance, and telecom.

What is Unscripted Call Center Telephony Speech Data 20,000 Hours Speech Recognition Data Speech AI Datasets used for?

This product has 4 key use cases. Nexdata recommends using the data for Artificial Intelligence (AI), Machine Learning (ML), Data Cleansing, and Data Labeling. Global businesses and organizations buy Natural Language Processing (NLP) Data from Nexdata to fuel their analytics and enrichment.

Who can use Unscripted Call Center Telephony Speech Data 20,000 Hours Speech Recognition Data Speech AI Datasets?

This product is best suited if you’re a Medium-sized Business or Enterprise looking for Natural Language Processing (NLP) Data. Get in touch with Nexdata to see what their data can do for your business and find out which integrations they provide.

How far back does the data in Unscripted Call Center Telephony Speech Data 20,000 Hours Speech Recognition Data Speech AI Datasets go?

This product has 5 years of historical coverage. It can be delivered on a secondly, minutely, hourly, daily, weekly, monthly, quarterly, yearly, real-time, and on-demand basis.

Which countries does Unscripted Call Center Telephony Speech Data 20,000 Hours Speech Recognition Data Speech AI Datasets cover?

This product includes data covering 71 countries, including the USA, Japan, Germany, India, and the UK. Nexdata is headquartered in the United States of America.

How much does Unscripted Call Center Telephony Speech Data 20,000 Hours Speech Recognition Data Speech AI Datasets cost?

Pricing for Unscripted Call Center Telephony Speech Data 20,000 Hours Speech Recognition Data Speech AI Datasets starts at USD 20,000 per purchase. Connect with Nexdata to get a quote and arrange custom pricing models based on your data requirements.

How can I get Unscripted Call Center Telephony Speech Data 20,000 Hours Speech Recognition Data Speech AI Datasets?

Businesses can buy Natural Language Processing (NLP) Data from Nexdata and get the data via S3 Bucket, SFTP, Email, UI Export, REST API, SOAP API, Streaming API, and Feed API. Depending on your data requirements and subscription budget, Nexdata can deliver this product in .bin, .json, .xml, .csv, .xls, .sql, and .txt formats.

What is the data quality of Unscripted Call Center Telephony Speech Data 20,000 Hours Speech Recognition Data Speech AI Datasets?

Nexdata has reported that this product has the following quality and accuracy assurances: 98% accuracy. You can compare and assess the data quality of Nexdata using Datarade’s data marketplace.

What are similar products to Unscripted Call Center Telephony Speech Data 20,000 Hours Speech Recognition Data Speech AI Datasets?

This product has 3 related products. These alternatives include Speech Synthesis Data 400 Hours TTS Data Audio Data AI Training Data AI Datasets, Machine Learning (ML) Data 800M+ B2B Profiles AI-Ready for Deep Learning (DL), NLP & LLM Training, and Coresignal Clean Data Company Data AI-Enriched Datasets Global / 35M+ Records / Updated Weekly. You can compare the best Natural Language Processing (NLP) Data providers and products via Datarade’s data marketplace and get the right data for your use case.

Nexdata

Sharpen Your AI with Better Data

Verified Provider
6h Avg. response time
100% Response rate