Multilingual Full Duplex Conversational Speech Data | 2 Million Hours | Audio AI & ML Training Data product image in hero

Multilingual Full Duplex Conversational Speech Data | 2 Million Hours | Audio AI & ML Training Data

Nexdata
No reviews yetBadge iconVerified Data Provider
Product Name
8k Multilingual Conversational Speech Data
xxxxxxxxxx Xxxxxxxxx
xxxxxx xxxxxxxxxx
Xxxxx Xxxxxx
Xxxxxxxxxx Xxxxxx
Xxxxxxxxx Xxxxxxxxxx
xxxxxxxxx Xxxxxxxxx
xxxxxxxxx Xxxxxxx
xxxxxx Xxxxx
xxxxxxxxxx xxxxxx
Xxxxxxxxxx xxxxxx
Volume
2M
Hours
Data Quality
98%
word accuracy rate
Avail. Formats
.bin, .json, and .xml
File
Coverage
73
Countries
History
5
years

Data Dictionary

Product Attributes
Attribute Type Example Mapping
Product Name
String Volume
8k Multilingual Conversational Speech Data
String 15,000 hours

Description

Nexdata owns 2 million hours of unlabeled full duplex speech data and 10,000 hours of human-labeled full duplex speech data.These full duplex data could enhance the performance of ASR&TTS in full duplex scenarios such as large model interaction, call center and etc.
Format: 8kHz/16kHz/24kHz/48kHz,speaker channel separation Content category: Recorders in free conversation without a set topic; Recording condition: Low background noise (indoor); Recording device: mobile phone, telephone, microphone Language: English, Japanese, Korean, Tagalog, Urdu, Swedish, Thai, Arabic, Mandarin Features of annotation:Transcription text, timestamp, speaker ID, gender. Accuracy Rate: Word Accuracy Rate (WAR) 98% 2. About Nexdata Nexdata owns off-the-shelf PB-level Large Language Model(LLM) Data, 3 million hours of Audio Data and 800TB of computer vision data. These ready-to-go Machine Learning (ML) Data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/speechrecog?source=Datarade

Country Coverage

Africa (5)
Algeria
Egypt
Morocco
South Africa
Tunisia
Asia (24)
Afghanistan
Bahrain
Bangladesh
Georgia
Hong Kong
India
Indonesia
Iran (Islamic Republic of)
Iraq
Israel
Japan
Jordan
Korea (Republic of)
Kuwait
Malaysia
Myanmar
Oman
Philippines
Qatar
Saudi Arabia
Singapore
Thailand
United Arab Emirates
Vietnam
Europe (27)
Austria
Belgium
Bulgaria
Czech Republic
Denmark
Finland
France
Germany
Greece
Hungary
Ireland
Italy
Luxembourg
Netherlands
Norway
Poland
Portugal
Romania
Russian Federation
Serbia
Slovakia
Slovenia
Spain
Sweden
Switzerland
Ukraine
United Kingdom
North America (8)
Canada
Costa Rica
El Salvador
Guatemala
Mexico
Nicaragua
Panama
United States of America
Oceania (2)
Australia
New Zealand
South America (7)
Argentina
Brazil
Chile
Colombia
Dominican Republic
Ecuador
Puerto Rico

History

5 years of historical data

Volume

2 million Hours

Pricing

Free sample available
License Starts at
One-off purchase
$20,000 / purchase
Monthly License Not available
Yearly License Not available
Usage-based Not available

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Quality

Self-reported by the provider
98%
word accuracy rate

Delivery

Methods
SOAP API
Streaming API
Email
S3 Bucket
SFTP
UI Export
Feed API
REST API
Frequency
secondly
minutely
hourly
daily
weekly
monthly
quarterly
yearly
real-time
on-demand
Format
.bin
.json
.xml
.csv
.xls
.sql
.txt

Use Cases

Categories

Related Searches

Related Products

Frequently asked questions

What is Multilingual Full Duplex Conversational Speech Data 2 Million Hours Audio AI & ML Training Data?

Nexdata owns 2 million hours of unlabeled full duplex speech data and 10,000 hours of human-labeled full duplex speech data.These full duplex data could enhance the performance of ASR&TTS in full duplex scenarios such as large model interaction, call center and etc.

What is Multilingual Full Duplex Conversational Speech Data 2 Million Hours Audio AI & ML Training Data used for?

This product has 4 key use cases. Nexdata recommends using the data for Artificial Intelligence (AI), Machine Learning (ML), Speech Recognition, and LLM Training. Global businesses and organizations buy Natural Language Processing (NLP) Data from Nexdata to fuel their analytics and enrichment.

Who can use Multilingual Full Duplex Conversational Speech Data 2 Million Hours Audio AI & ML Training Data?

This product is best suited if you’re a Medium-sized Business or Enterprise looking for Natural Language Processing (NLP) Data. Get in touch with Nexdata to see what their data can do for your business and find out which integrations they provide.

How far back does the data in Multilingual Full Duplex Conversational Speech Data 2 Million Hours Audio AI & ML Training Data go?

This product has 5 years of historical coverage. It can be delivered on a secondly, minutely, hourly, daily, weekly, monthly, quarterly, yearly, real-time, and on-demand basis.

Which countries does Multilingual Full Duplex Conversational Speech Data 2 Million Hours Audio AI & ML Training Data cover?

This product includes data covering 73 countries like USA, Japan, Germany, India, and UK. Nexdata is headquartered in United States of America.

How much does Multilingual Full Duplex Conversational Speech Data 2 Million Hours Audio AI & ML Training Data cost?

Pricing for Multilingual Full Duplex Conversational Speech Data 2 Million Hours Audio AI & ML Training Data starts at USD20,000 per purchase. Connect with Nexdata to get a quote and arrange custom pricing models based on your data requirements.

How can I get Multilingual Full Duplex Conversational Speech Data 2 Million Hours Audio AI & ML Training Data?

Businesses can buy Natural Language Processing (NLP) Data from Nexdata and get the data via SOAP API, Streaming API, Email, S3 Bucket, SFTP, UI Export, Feed API, and REST API. Depending on your data requirements and subscription budget, Nexdata can deliver this product in .bin, .json, .xml, .csv, .xls, .sql, and .txt format.

What is the data quality of Multilingual Full Duplex Conversational Speech Data 2 Million Hours Audio AI & ML Training Data?

Nexdata has reported that this product has the following quality and accuracy assurances: 98% word accuracy rate. You can compare and assess the data quality of Nexdata using Datarade’s data marketplace.

What are similar products to Multilingual Full Duplex Conversational Speech Data 2 Million Hours Audio AI & ML Training Data?

This product has 3 related products. These alternatives include 8kHz Conversational Speech Data 15,000 Hours Audio Data Speech Recognition Data Machine Learning (ML) Data, Machine Learning (ML) Data 800M+ B2B Profiles AI-Ready for Deep Learning (DL), NLP & LLM Training, and Audio ML/ DL - Noise Level Data 180+ Countries Coverage CCPA, GDPR Compliant 35 B + Data Points 100% Traceable Consent. You can compare the best Natural Language Processing (NLP) Data providers and products via Datarade’s data marketplace and get the right data for your use case.

Starts at
$20,000 / purchase
License Starts at
One-off purchase
$20,000 / purchase
Monthly License Not available
Yearly License Not available
Usage-based Not available

Nexdata

Sharpen Your AI with Better Data

Verified provider icon Verified Provider
5h Avg. response time
100% Response rate
Promoted

Sync this data product to your data warehouse - no code

Monda makes it easy to access data products from any source and sync them to your preferred data warehouse.