Nexdata | Multilingual Speech Synthesis Data | 400 Hours | TTS Data | Audio Data | AI Training Data product image in hero

Nexdata | Multilingual Speech Synthesis Data | 400 Hours | TTS Data | Audio Data | AI Training Data

Nexdata
No reviews yetBadge iconVerified Data Provider
#
Dataset Name
Format
Samples
1 xxxxxxxxxx Xxxxxxxxx xxxxxx xxxxxxxxxx
2 Xxxxx Xxxxxx Xxxxxxxxxx Xxxxxx
3 Xxxxxxxxx Xxxxxxxxxx xxxxxxxxx Xxxxxxxxx
4 xxxxxxxxx Xxxxxxx xxxxxx Xxxxx
5 xxxxxxxxxx xxxxxx Xxxxxxxxxx xxxxxx
6 Xxxxx Xxxxxx xxxxx xxxxxxxx
7 xxxxxxx Xxxxx Xxxxxxxx xxxxxxxxxx
8 xxxxxx Xxxxxxxxx xxxxxx Xxxxxxxxx
9 Xxxxxxxxx xxxxxxxxxx Xxxxxx Xxxxx
10 xxxxxx xxxxxxx xxxxxxx Xxxxx
... xxxxxx Xxxxxxxxxx xxxxxxxx xxxxxx
Sign In To Preview Data
Volume
400
hours
Data Quality
95%
sentence accuracy
Avail. Formats
.bin, .json, and .xml
File
Coverage
61
Countries
History
5
years

Data Dictionary

[Sample] Nexdata-Multilingual Speech Synthesis Data.csv
Attribute Type Example Mapping
Dataset Name
String 20 Hours - American English Speech Synthesis Corpus-Male
String American English Language Name
Format
String 44,100Hz, 16bit
Samples
String https://www.nexdata.ai/dataset/1159?source=Datarade
Product Attributes
Attribute Type Example Mapping
Product Name
String Volume
Multilingual Speech Synthesis Data
String 400 hours

Description

The AI Training Data is recorded by native speaker, with authentic accent and sweet sound. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
1. Specifications Format : 44.1 kHz/48 kHz, 16bit/24bit, uncompressed wav, mono channel. Recording environment : professional recording studio. Recording content : general narrative sentences, interrogative sentences, etc. Speaker : native speaker Annotation Feature : word transcription, part-of-speech, phoneme boundary, four-level accents, four-level prosodic boundary. Device : Microphone Language : American English, British English, Japanese, French, Dutch,Mandarin Chinese, Catonese, Canadian French,Australian English, Italian, New Zealand English, Spanish, Mexican Spanish Application scenarios : speech synthesis Accuracy rate: Word transcription: the sentences accuracy rate is not less than 99%. Part-of-speech annotation: the sentences accuracy rate is not less than 98%. Phoneme annotation: the sentences accuracy rate is not less than 98% (the error rate of voiced and swallowed phonemes is not included, because the labelling is more subjective). Accent annotation: the word accuracy rate is not less than 95%. Prosodic boundary annotation: the sentences accuracy rate is not less than 97% Phoneme boundary annotation: the phoneme accuracy rate is not less than 95% (the error range of boundary is within 5%) 2. About Nexdata Nexdata owns off-the-shelf 1,000,000 hours of speech recognition data, 800TB of image/video data, about 2 billion pieces of NLP data. These ready-to-go AI & ML Training Data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/tts?source=Datarade

Country Coverage

Africa (6)
Algeria
Egypt
Libya
Morocco
Tanzania, United Republic of
Tunisia
Asia (24)
Bangladesh
China
Hong Kong
India
Indonesia
Iraq
Japan
Jordan
Korea (Republic of)
Kuwait
Macao
Malaysia
Oman
Pakistan
Philippines
Qatar
Saudi Arabia
Singapore
Syrian Arab Republic
Taiwan
Thailand
Turkey
United Arab Emirates
Vietnam
Europe (21)
Austria
Belgium
Bulgaria
Denmark
Finland
France
Germany
Greece
Hungary
Ireland
Italy
Netherlands
Norway
Poland
Portugal
Romania
Russian Federation
Spain
Sweden
Switzerland
United Kingdom
North America (3)
Canada
Mexico
United States of America
Oceania (2)
Australia
New Zealand
South America (5)
Argentina
Brazil
Colombia
Dominican Republic
Venezuela (Bolivarian Republic of)

History

5 years of historical data

Volume

400 hours

Pricing

Free sample available
License Starts at
One-off purchase
$5,000 / purchase
Monthly License Not available
Yearly License Not available
Usage-based Not available

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Quality

Self-reported by the provider
95%
sentence accuracy

Delivery

Methods
S3 Bucket
SFTP
Email
UI Export
REST API
SOAP API
Streaming API
Feed API
Frequency
real-time
on-demand
Format
.bin
.json
.xml
.csv
.xls
.sql
.txt

Use Cases

Categories

Related Searches

Related Products

15K Hours
98% sentence/word
83 countries covered
The Natural Language Processing (NLP) Data of in-car speech covers 20+ languages, including read, wake-up word, commend word, code-swithing, multimodal and n...
20K voice memos
240 countries covered
We help clients source, curate, and transcribe data for AI and machine learning models. Our services include customized audio data collection and transcripti...
350K calls per month
63 countries covered
1 years of historical data
Access a vast collection of transcribed customer call records tailored to your needs. Ideal for in-depth analysis of customer interactions and behavior trend...
420M MAU
95% Match rate
248 countries covered
We provide POI Data, which can be used to train AI & ML Models on14M physical locations globally, and unlock wide range of use cases, from marketing to publi...

Frequently asked questions

What is Nexdata Multilingual Speech Synthesis Data 400 Hours TTS Data Audio Data AI Training Data?

The AI Training Data is recorded by native speaker, with authentic accent and sweet sound. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

What is Nexdata Multilingual Speech Synthesis Data 400 Hours TTS Data Audio Data AI Training Data used for?

This product has 3 key use cases. Nexdata recommends using the data for Artificial Intelligence (AI), Machine Learning (ML), and Deep Learning. Global businesses and organizations buy Natural Language Processing (NLP) Data from Nexdata to fuel their analytics and enrichment.

Who can use Nexdata Multilingual Speech Synthesis Data 400 Hours TTS Data Audio Data AI Training Data?

This product is best suited if you’re a Medium-sized Business or Enterprise looking for Natural Language Processing (NLP) Data. Get in touch with Nexdata to see what their data can do for your business and find out which integrations they provide.

How far back does the data in Nexdata Multilingual Speech Synthesis Data 400 Hours TTS Data Audio Data AI Training Data go?

This product has 5 years of historical coverage. It can be delivered on a real-time and on-demand basis.

Which countries does Nexdata Multilingual Speech Synthesis Data 400 Hours TTS Data Audio Data AI Training Data cover?

This product includes data covering 61 countries like USA, China, Japan, Germany, and India. Nexdata is headquartered in United States of America.

How much does Nexdata Multilingual Speech Synthesis Data 400 Hours TTS Data Audio Data AI Training Data cost?

Pricing for Nexdata Multilingual Speech Synthesis Data 400 Hours TTS Data Audio Data AI Training Data starts at USD5,000 per purchase. Connect with Nexdata to get a quote and arrange custom pricing models based on your data requirements.

How can I get Nexdata Multilingual Speech Synthesis Data 400 Hours TTS Data Audio Data AI Training Data?

Businesses can buy Natural Language Processing (NLP) Data from Nexdata and get the data via S3 Bucket, SFTP, Email, UI Export, REST API, SOAP API, Streaming API, and Feed API. Depending on your data requirements and subscription budget, Nexdata can deliver this product in .bin, .json, .xml, .csv, .xls, .sql, and .txt format.

What is the data quality of Nexdata Multilingual Speech Synthesis Data 400 Hours TTS Data Audio Data AI Training Data?

Nexdata has reported that this product has the following quality and accuracy assurances: 95% sentence accuracy. You can compare and assess the data quality of Nexdata using Datarade’s data marketplace.

What are similar products to Nexdata Multilingual Speech Synthesis Data 400 Hours TTS Data Audio Data AI Training Data?

This product has 3 related products. These alternatives include Nexdata In-Cabin Speech Data 15,000 Hours AI Training Data Speech Recognition Data Audio Data Natural Language Processing (NLP) Data, FileMarket 20,000 Voice Memos Multilingual Training Data for Conversational AI Machine Learning (ML) Data, and AI Training Data US Transcription Data Unique Consumer Sentiment Data: Transcription of the calls to the companies. You can compare the best Natural Language Processing (NLP) Data providers and products via Datarade’s data marketplace and get the right data for your use case.

Starts at
$5,000 / purchase
License Starts at
One-off purchase
$5,000 / purchase
Monthly License Not available
Yearly License Not available
Usage-based Not available

Nexdata

Sharpen Your AI with Better Data

Verified provider icon Verified Provider
100% Response rate