Deeply Vocal Characterizer Dataset - AI & ML Training Data, South Korea product image in hero

Deeply Vocal Characterizer Dataset - AI & ML Training Data, South Korea

Deeply
No reviews yetBadge iconVerified Data Provider
#
xxxxxxxxxx
Xxxxxxxxx
xxxxxx
xxxxxxxxxx
Xxxxx
Xxxxxx
Xxxxxxxxxx
Xxxxxx
1 xxxxxxxxxx Xxxxxxxxx xxxxxx xxxxxxxxxx Xxxxx Xxxxxx Xxxxxxxxxx Xxxxxx
2 Xxxxxxxxx Xxxxxxxxxx xxxxxxxxx Xxxxxxxxx xxxxxxxxx Xxxxxxx xxxxxx Xxxxx
3 xxxxxxxxxx xxxxxx Xxxxxxxxxx xxxxxx Xxxxx Xxxxxx xxxxx xxxxxxxx
4 xxxxxxx Xxxxx Xxxxxxxx xxxxxxxxxx xxxxxx Xxxxxxxxx xxxxxx Xxxxxxxxx
5 Xxxxxxxxx xxxxxxxxxx Xxxxxx Xxxxx xxxxxx xxxxxxx xxxxxxx Xxxxx
6 xxxxxx Xxxxxxxxxx xxxxxxxx xxxxxx Xxxxx Xxxxxxx xxxxxx Xxxxxxxx
7 Xxxxxxx Xxxxx xxxxxx xxxxxxxxxx Xxxxx xxxxxxxxxx xxxxxxxxx Xxxxxxx
8 xxxxxxxx xxxxxxxx Xxxxxxxxxx Xxxxxxxx Xxxxxxxx xxxxxxxxx Xxxxxxxxxx Xxxxxx
9 Xxxxxxxxx xxxxx xxxxxxx xxxxxxxxx Xxxxxx Xxxxxxx Xxxxxxxxx xxxxxxxxx
10 xxxxxxxxx Xxxxx xxxxxxxx Xxxxxxx xxxxxxxxx Xxxxxxx xxxxx Xxxxxxx
... xxxxxxx Xxxxx xxxxxxxxxx Xxxxxxx Xxxxx xxxxxxxxxx Xxxxxx xxxxxx
Request Data Sample
Volume
70K
Records
Data Quality
100%
Validity
Coverage
1
Country

Description

The Vocal Characterizer Dataset is a human nonverbal vocal sound dataset consisting of 56.7 hours of short clips from 1419 speakers. 16 different types of nonverbal human sound and the metadata such as age, sex of the speaker, level of authenticity, and noise are human-labeled to each utterance.
The Vocal Characterizer Dataset is a human nonverbal vocal sound dataset consisting of 56.7 hours of short clips from 1419 speakers, crowdsourced by the general public in South Korea and validated by the AI data platform. Also, the dataset includes metadata such as age, sex, noise level, and quality of utterance. 16 classes of Included human nonverbal sound contain ‘teeth-chattering’, ‘teeth-grinding’, ‘tongue-clicking’, ‘nose-blowing’, ‘coughing’, ‘yawning’, ‘throat-clearing’, ‘sighing’, ‘lip-popping’, ‘lip-smacking’, ‘panting’, ’crying’, ‘laughing’, ‘sneezing’, ‘moaning’, and ‘screaming’. The dataset is the first dataset to the world due to its large volume, various types of nonverbal vocal cues, and various participants. We expect that the utilization of this dataset would bring precise detection of the nonverbal vocal cues, and a better understanding of the human conversation. We're ready to deliver further information, statistics, or samples upon request. Don't hesitate to reach out! *The dataset can be delivered as either original wav files(44,100Hz, 16-bit PCM, 1-channel) or a single compressed h5 file(resampled to 16,000Hz).

Country Coverage

Asia (1)
Korea (Republic of)

Volume

57 Hours of Audio
70,000 Records

Pricing

Free sample available
Deeply has not published pricing information for this product yet. You can request detailed pricing information below.

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Quality

Self-reported by the provider
100%
Validity

Delivery

Methods
Email

Use Cases

Artificial Intelligence (AI)
Machine Learning (ML)
Sentiment Analysis
Automatic Speech Recognition

Categories

Related Products

10K images
9 countries covered
10 years of historical data
Collection of 10,000+ images of traffic scene from low view that are ready to use for optimizing the accuracy of computer vision models.
1B Monthly records
USA covered
Website visit data with URLs, categories, timestamps, and anonymized unique identifiers.
5B records
98% accuracy
USA covered
CrawlBee ML datasets are specially curated and cleansed to provide the highest quality training data for those looking to provide real-world answers to big p...
600 Hours of Recording
64 countries covered
We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings. Our audio datasets are meticulously curated and de...

Frequently asked questions

What is Deeply Vocal Characterizer Dataset - AI & ML Training Data, South Korea?

The Vocal Characterizer Dataset is a human nonverbal vocal sound dataset consisting of 56.7 hours of short clips from 1419 speakers. 16 different types of nonverbal human sound and the metadata such as age, sex of the speaker, level of authenticity, and noise are human-labeled to each utterance.

What is Deeply Vocal Characterizer Dataset - AI & ML Training Data, South Korea used for?

This product has 4 key use cases. Deeply recommends using the data for Artificial Intelligence (AI), Machine Learning (ML), Sentiment Analysis, and Automatic Speech Recognition. Global businesses and organizations buy AI Training Data from Deeply to fuel their analytics and enrichment.

Who can use Deeply Vocal Characterizer Dataset - AI & ML Training Data, South Korea?

This product is best suited if you’re a Small Business, Medium-sized Business, or Enterprise looking for AI Training Data. Get in touch with Deeply to see what their data can do for your business and find out which integrations they provide.

Which countries does Deeply Vocal Characterizer Dataset - AI & ML Training Data, South Korea cover?

This product includes data covering 1 country like South Korea. Deeply is headquartered in Korea (Republic of).

How much does Deeply Vocal Characterizer Dataset - AI & ML Training Data, South Korea cost?

Pricing information for Deeply Vocal Characterizer Dataset - AI & ML Training Data, South Korea is available by getting in contact with Deeply. Connect with Deeply to get a quote and arrange custom pricing models based on your data requirements.

How can I get Deeply Vocal Characterizer Dataset - AI & ML Training Data, South Korea?

Businesses can buy AI Training Data from Deeply and get the data via Email.

What is the data quality of Deeply Vocal Characterizer Dataset - AI & ML Training Data, South Korea?

Deeply has reported that this product has the following quality and accuracy assurances: 100% Validity. You can compare and assess the data quality of Deeply using Datarade’s data marketplace.

What are similar products to Deeply Vocal Characterizer Dataset - AI & ML Training Data, South Korea?

This product has 3 related products. These alternatives include Pixta AI Imagery Data Global 10,000 Stock Images Annotation and Labelling Services Provided Traffic scenes from low view for AI & ML, BIGDBM Website Visits Data With Industry/Context Categorization - Training Set for ML and AI, and CrawlBee ML Training Data LLM Data Generative AI Data Code Base Training Data Healthcare Training Data. You can compare the best AI Training Data providers and products via Datarade’s data marketplace and get the right data for your use case.

Pricing available upon request