
Deeply Korean Read Speech Corpus - Audio AI & ML Training Data

Deeply
No reviews yet · Verified Data Provider
Volume
190K
records
Data Quality
99%
Validity
Coverage
1
Country

Description

Pairs of Korean speakers were recorded reading scripts with 3 distinct text sentiments and 3 distinct voice sentiments. The recordings took place in 3 different places with differing levels of reverberation, and every experiment was recorded at 3 distinct distances with 2 types of smartphone.
□ Recording contents: A pair of adults reading scripts containing 3 distinct text sentiments (negative, neutral, positive) with 3 distinct voice sentiments (negative, neutral, positive). (Scripts: movie reviews for positive and negative, everyday conversation for neutral.)
□ Recording environments: Anechoic chamber (no reverb), studio apartment (moderate reverb), dance studio (high reverb)
□ Devices: iPhone X (iOS), Samsung Galaxy S7 (Android)
□ Distance from the source: 0.4 m, 2.0 m, 4.0 m
□ Volume: ~290 hours, ~190,000 utterances, ~107 GB
□ Format: wav (44100 Hz, 16-bit, mono) or h5 (16000 Hz, 16-bit, mono)
□ Language: Korean
□ Demographics: 34 Korean adults (26% male, 74% female); 47% in their 20s, 20.5% in their 30s, 17.5% in their 40s, 6% in their 50s, 9% in their 60s
The Read Speech dataset consists of 289.9 hours of audio clips of scripts read with 3 text sentiments and 3 voice sentiments, recorded in 3 distinct places using 2 smartphones running different operating systems. Participants were encouraged to record repeatedly in all 3 types of place (anechoic chamber, studio apartment, dance studio), and every recording was conducted systematically at 3 ordinal distances (0.4 m, 2.0 m, 4.0 m) with 2 types of device (iPhone X and Galaxy S7). The text and voice sentiments are categorized as follows:
‘Negative text sentiment’, ‘neutral text sentiment’, and ‘positive text sentiment’ indicate that the content being vocalized is negative, neutral, or positive, respectively. For the negative and positive text sentiments, negative and positive movie reviews containing disparaging remarks, criticisms, or compliments were used; for the neutral text sentiment, everyday conversations without typical emotions were used. ‘Negative voice sentiment’ indicates that the speaker vocalized the script in a negative tone of voice; for the sake of consistency, speakers were instructed to vocalize as if they were angry. ‘Neutral voice sentiment’ indicates that the speaker vocalized the script in a neutral tone of voice, without any emotion involved. Finally, ‘positive voice sentiment’ indicates that the speaker vocalized the script in a positive tone of voice, specifically as if they were happy. Each voice sentiment was vocalized regardless of the script’s content (text sentiment); for example, speakers were asked to vocalize a script positively even when its content was negative. The dataset also includes metadata such as the script (speech-to-text aligned), speaker, age, sex, noise, type of place, distance, and device. The impulse responses of each type of place are available upon request.
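To illustrate the recording design described above, the following Python sketch enumerates the full condition grid (text sentiment × voice sentiment × environment × distance × device). The condition values come from the listing; the code itself is purely illustrative and is not an official loader or part of the dataset.

```python
from itertools import product

# Condition axes as described in the dataset listing.
TEXT_SENTIMENTS = ["negative", "neutral", "positive"]
VOICE_SENTIMENTS = ["negative", "neutral", "positive"]
ENVIRONMENTS = ["anechoic_chamber", "studio_apartment", "dance_studio"]
DISTANCES_M = [0.4, 2.0, 4.0]
DEVICES = ["iPhone X", "Galaxy S7"]

# Every utterance in the corpus belongs to one cell of this grid.
conditions = list(product(TEXT_SENTIMENTS, VOICE_SENTIMENTS,
                          ENVIRONMENTS, DISTANCES_M, DEVICES))
print(len(conditions))  # 3 * 3 * 3 * 3 * 2 = 162 distinct recording conditions
```

Note that voice sentiment varies independently of text sentiment, so all nine sentiment pairings occur, including mismatched ones such as a negative script read in a positive voice.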

Country Coverage

Asia (1)
Korea (Republic of)

Volume

190,000 records
290 Hours of audio
107 GB

Pricing

Free sample available
Deeply has not published pricing information for this product yet. You can request detailed pricing information below.

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Quality

Self-reported by the provider
99%
Validity

Use Cases

Artificial Intelligence (AI)
Machine Learning (ML)
Sentiment Analysis
Automatic Speech Recognition
Room Acoustics

Categories

Related Products

10K images
9 countries covered
10 years of historical data
Collection of 10,000+ images of traffic scene from low view that are ready to use for optimizing the accuracy of computer vision models.
1B Monthly records
USA covered
Website visit data with URLs, categories, timestamps, and anonymized unique identifiers.
600 Hours of Recording
64 countries covered
We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings. Our audio datasets are meticulously curated and de...
5B records
98% accuracy
USA covered
CrawlBee ML datasets are specially curated and cleansed to provide the highest quality training data for those looking to provide real-world answers to big p...

Frequently asked questions

What is Deeply Korean Read Speech Corpus - Audio AI & ML Training Data?

Pairs of Korean speakers were recorded reading scripts with 3 distinct text sentiments and 3 distinct voice sentiments. The recordings took place in 3 different places with differing levels of reverberation, and every experiment was recorded at 3 distinct distances with 2 types of smartphone.

What is Deeply Korean Read Speech Corpus - Audio AI & ML Training Data used for?

This product has 5 key use cases. Deeply recommends using the data for Artificial Intelligence (AI), Machine Learning (ML), Sentiment Analysis, Automatic Speech Recognition, and Room Acoustics. Global businesses and organizations buy AI Training Data from Deeply to fuel their analytics and enrichment.

Who can use Deeply Korean Read Speech Corpus - Audio AI & ML Training Data?

This product is best suited if you’re a Small Business, Medium-sized Business, or Enterprise looking for AI Training Data. Get in touch with Deeply to see what their data can do for your business and find out which integrations they provide.

Which countries does Deeply Korean Read Speech Corpus - Audio AI & ML Training Data cover?

This product includes data covering 1 country: Korea (Republic of). Deeply is headquartered in Korea (Republic of).

How much does Deeply Korean Read Speech Corpus - Audio AI & ML Training Data cost?

Pricing information for Deeply Korean Read Speech Corpus - Audio AI & ML Training Data is available by getting in contact with Deeply. Connect with Deeply to get a quote and arrange custom pricing models based on your data requirements.

What is the data quality of Deeply Korean Read Speech Corpus - Audio AI & ML Training Data?

Deeply has reported that this product has the following quality and accuracy assurances: 99% Validity. You can compare and assess the data quality of Deeply using Datarade’s data marketplace.

What are similar products to Deeply Korean Read Speech Corpus - Audio AI & ML Training Data?

This product has 3 related products. These alternatives include Pixta AI Imagery Data Global 10,000 Stock Images Annotation and Labelling Services Provided Traffic scenes from low view for AI & ML, BIGDBM Website Visits Data With Industry/Context Categorization - Training Set for ML and AI, and WebAutomation Off the Shelf Datasets Audio Data for AI & ML Training 600+ Hours of Recording Speech Recognition, Natural Language Processing. You can compare the best AI Training Data providers and products via Datarade’s data marketplace and get the right data for your use case.
