Nexdata |Multilingual Conversational Speech Data | 8kHz Telephone| 15,000 Hours | Audio Data | Speech Recognition Data| Machine Learning (ML) Data product image in hero

Nexdata |Multilingual Conversational Speech Data | 8kHz Telephone| 15,000 Hours | Audio Data | Speech Recognition Data| Machine Learning (ML) Data

Nexdata
No reviews yetBadge iconVerified Data Provider
#
Dataset Name
Format
Link
1 xxxxxxxxxx Xxxxxxxxx xxxxxx xxxxxxxxxx
2 Xxxxx Xxxxxx Xxxxxxxxxx Xxxxxx
3 Xxxxxxxxx Xxxxxxxxxx xxxxxxxxx Xxxxxxxxx
4 xxxxxxxxx Xxxxxxx xxxxxx Xxxxx
5 xxxxxxxxxx xxxxxx Xxxxxxxxxx xxxxxx
6 Xxxxx Xxxxxx xxxxx xxxxxxxx
7 xxxxxxx Xxxxx Xxxxxxxx xxxxxxxxxx
8 xxxxxx Xxxxxxxxx xxxxxx Xxxxxxxxx
9 Xxxxxxxxx xxxxxxxxxx Xxxxxx Xxxxx
10 xxxxxx xxxxxxx xxxxxxx Xxxxx
... xxxxxx Xxxxxxxxxx xxxxxxxx xxxxxx
Sign In To Preview Data
#
Dataset Name
Format
Link
1 xxxxxxxxxx Xxxxxxxxx xxxxxx xxxxxxxxxx
2 Xxxxx Xxxxxx Xxxxxxxxxx Xxxxxx
3 Xxxxxxxxx Xxxxxxxxxx xxxxxxxxx Xxxxxxxxx
4 xxxxxxxxx Xxxxxxx xxxxxx Xxxxx
5 xxxxxxxxxx xxxxxx Xxxxxxxxxx xxxxxx
6 Xxxxx Xxxxxx xxxxx xxxxxxxx
7 xxxxxxx Xxxxx Xxxxxxxx xxxxxxxxxx
8 xxxxxx Xxxxxxxxx xxxxxx Xxxxxxxxx
9 Xxxxxxxxx xxxxxxxxxx Xxxxxx Xxxxx
10 xxxxxx xxxxxxx xxxxxxx Xxxxx
... xxxxxx Xxxxxxxxxx xxxxxxxx xxxxxx
Sign In To Preview Data
Volume
15K
Hours
Data Quality
98%
sentence/word
Avail. Formats
.bin, .json, and .xml
File
Coverage
77
Countries
History
5
years

Data Dictionary

[Sample] Nexdata-8k Multilingual Conversational Speech Data.csv
Attribute Type Example Mapping
Dataset Name
String 500 Hours – German Conversational Speech Data by Telephone
String German Language Name
Format
String 8kHz
Link
[Sample] Nexdata-8k Multilingual Conversational Speech Data.csv
Attribute Type Example Mapping
Dataset Name
String 500 Hours Spanish Conversational Speech Data by Telephone
String Spanish Language Name
Format
String 8kHz
Link
String https://www.nexdata.ai/dataset/1234?source=Datarade
Product Attributes
Attribute Type Example Mapping
Product Name
String Volume
8k Multilingual Conversational Speech Data
String 15,000 hours

Description

Nexdata has off-the-shelf 15,000 hours Machine Learning (ML) Data of 8kHz conversational speech, covering 100+ countries including English, German, French, Spanish, Italian, Portuguese, Korean, Japanese, Hindi, Russia and etc.
1. Specifications Format : 8kHz, 8bit, u-law/a-law pcm, mono channel; Environment : quiet indoor environment, without echo; Recording content : No preset linguistic data,dozens of topics are specified, and the speakers make dialogue under those topics while the recording is performed; Demographics : Speakers are evenly distributed across all age groups, covering children, teenagers, middle-aged, elderly, etc. Annotation : annotating for the transcription text, speaker identification, gender and noise symbols; Device : Telephony recording system; Language : 100+ Languages; Application scenarios : speech recognition; voiceprint recognition; Accuracy rate : the word accuracy rate is not less than 98% 2. About Nexdata Nexdata owns off-the-shelf 1,000,000 hours of speech recognition data, 800TB of Annotated Imagery Data, about 2 billion pieces of Natural Language Processing (NLP) Data. These ready-to-go Machine Learning (ML) Data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/speechrecog?source=Datarade

Country Coverage

Africa (7)
Algeria
Egypt
Kenya
Libya
Morocco
Tanzania, United Republic of
Tunisia
Asia (29)
Afghanistan
Bangladesh
China
Georgia
Hong Kong
India
Indonesia
Iran (Islamic Republic of)
Iraq
Israel
Japan
Jordan
Korea (Republic of)
Kuwait
Macao
Malaysia
Myanmar
Oman
Pakistan
Philippines
Qatar
Saudi Arabia
Singapore
Sri Lanka
Taiwan
Thailand
Turkey
United Arab Emirates
Vietnam
Europe (27)
Austria
Belgium
Bulgaria
Czech Republic
Denmark
Finland
France
Germany
Greece
Hungary
Ireland
Italy
Luxembourg
Netherlands
Norway
Poland
Portugal
Romania
Russian Federation
Serbia
Slovakia
Slovenia
Spain
Sweden
Switzerland
Ukraine
United Kingdom
North America (5)
Canada
Costa Rica
El Salvador
Mexico
United States of America
Oceania (2)
Australia
New Zealand
South America (7)
Argentina
Brazil
Chile
Colombia
Dominican Republic
Ecuador
Puerto Rico

History

5 years of historical data

Volume

15,000 Hours

Pricing

Free sample available
License Starts at
One-off purchase
$5,000 / purchase
Monthly License Not available
Yearly License Not available
Usage-based Not available

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Quality

Self-reported by the provider
98%
sentence/word

Delivery

Methods
S3 Bucket
SFTP
Email
UI Export
REST API
SOAP API
Streaming API
Feed API
Frequency
secondly
minutely
hourly
daily
weekly
monthly
quarterly
yearly
real-time
on-demand
Format
.bin
.json
.xml
.csv
.xls
.sql
.txt

Use Cases

Artificial Intelligence (AI)
Machine Learning (ML)
Speech Recognition
ASR
Call Center

Categories

Related Searches

Related Products

15K Hours
98% sentence/word
83 countries covered
The Natural Language Processing (NLP) Data of in-car speech covers 20+ languages, including read, wake-up word, commend word, code-swithing, multimodal and n...
20K voice memos
240 countries covered
We help clients source, curate, and transcribe data for AI and machine learning models. Our services include customized audio data collection and transcripti...
55 languages
99.95% SLA
250 countries covered
Track specific events that influence the market you operate in. NewsCatcher scans news articles from over 90,000 outlets worldwide, including hyper-local ...
5K Videos
100% Quality
249 countries covered
We offer face anti-spoofing dataset designed to combat deceptive attacks on facial recognition systems, such as deepfakes and imprinted images. Our dataset i...

Frequently asked questions

What is Nexdata Multilingual Conversational Speech Data 8kHz Telephone 15,000 Hours Audio Data Speech Recognition Data Machine Learning (ML) Data?

Nexdata has off-the-shelf 15,000 hours Machine Learning (ML) Data of 8kHz conversational speech, covering 100+ countries including English, German, French, Spanish, Italian, Portuguese, Korean, Japanese, Hindi, Russia and etc.

What is Nexdata Multilingual Conversational Speech Data 8kHz Telephone 15,000 Hours Audio Data Speech Recognition Data Machine Learning (ML) Data used for?

This product has 5 key use cases. Nexdata recommends using the data for Artificial Intelligence (AI), Machine Learning (ML), Speech Recognition, ASR, and Call Center. Global businesses and organizations buy Natural Language Processing (NLP) Data from Nexdata to fuel their analytics and enrichment.

Who can use Nexdata Multilingual Conversational Speech Data 8kHz Telephone 15,000 Hours Audio Data Speech Recognition Data Machine Learning (ML) Data?

This product is best suited if you’re a Medium-sized Business or Enterprise looking for Natural Language Processing (NLP) Data. Get in touch with Nexdata to see what their data can do for your business and find out which integrations they provide.

How far back does the data in Nexdata Multilingual Conversational Speech Data 8kHz Telephone 15,000 Hours Audio Data Speech Recognition Data Machine Learning (ML) Data go?

This product has 5 years of historical coverage. It can be delivered on a secondly, minutely, hourly, daily, weekly, monthly, quarterly, yearly, real-time, and on-demand basis.

Which countries does Nexdata Multilingual Conversational Speech Data 8kHz Telephone 15,000 Hours Audio Data Speech Recognition Data Machine Learning (ML) Data cover?

This product includes data covering 77 countries like USA, China, Japan, Germany, and India. Nexdata is headquartered in United States of America.

How much does Nexdata Multilingual Conversational Speech Data 8kHz Telephone 15,000 Hours Audio Data Speech Recognition Data Machine Learning (ML) Data cost?

Pricing for Nexdata Multilingual Conversational Speech Data 8kHz Telephone 15,000 Hours Audio Data Speech Recognition Data Machine Learning (ML) Data starts at USD5,000 per purchase. Connect with Nexdata to get a quote and arrange custom pricing models based on your data requirements.

How can I get Nexdata Multilingual Conversational Speech Data 8kHz Telephone 15,000 Hours Audio Data Speech Recognition Data Machine Learning (ML) Data?

Businesses can buy Natural Language Processing (NLP) Data from Nexdata and get the data via S3 Bucket, SFTP, Email, UI Export, REST API, SOAP API, Streaming API, and Feed API. Depending on your data requirements and subscription budget, Nexdata can deliver this product in .bin, .json, .xml, .csv, .xls, .sql, and .txt format.

What is the data quality of Nexdata Multilingual Conversational Speech Data 8kHz Telephone 15,000 Hours Audio Data Speech Recognition Data Machine Learning (ML) Data?

Nexdata has reported that this product has the following quality and accuracy assurances: 98% sentence/word. You can compare and assess the data quality of Nexdata using Datarade’s data marketplace.

What are similar products to Nexdata Multilingual Conversational Speech Data 8kHz Telephone 15,000 Hours Audio Data Speech Recognition Data Machine Learning (ML) Data?

This product has 3 related products. These alternatives include Nexdata In-Cabin Speech Data 15,000 Hours AI Training Data Speech Recognition Data Audio Data Natural Language Processing (NLP) Data, FileMarket 20,000 Voice Memos Multilingual Training Data for Conversational AI Machine Learning (ML) Data, and Textual Data NLP-enriched Data Transcription Data Entity Extraction & Disambiguation Ready-to-use. You can compare the best Natural Language Processing (NLP) Data providers and products via Datarade’s data marketplace and get the right data for your use case.

Starts at
$5,000 / purchase
License Starts at
One-off purchase
$5,000 / purchase
Monthly License Not available
Yearly License Not available
Usage-based Not available

Nexdata

Sharpen Your AI with Better Data

Verified provider icon Verified Provider
4h Avg. response time
100% Response rate