
Deeply Vocal Characterizer Dataset
A dataset by Deeply
Pricing available upon request
Get a Quote
Column | Sample | Another one | Attribute | |
---|---|---|---|---|
1 | Fashion | Sports, Health | 25-49 | Germany |
2 | Just Another | Sample | Another | Row |
Volume
57 | Hours of Audio |
70,000 | Records |
Use Cases
Geography
Asia
(1)
Korea (Republic of)
Categories
Product Description
The Vocal Characterizer Dataset is a human nonverbal vocal sound dataset consisting of 56.7 hours of short clips from 1419 speakers, crowdsourced by the general public in South Korea and validated by the AI data platform. Also, the dataset includes metadata such as age, sex, noise level, and quality of utterance. 16 classes of Included human nonverbal sound contain ‘teeth-chattering’, ‘teeth-grinding’, ‘tongue-clicking’, ‘nose-blowing’, ‘coughing’, ‘yawning’, ‘throat-clearing’, ‘sighing’, ‘lip-popping’, ‘lip-smacking’, ‘panting’, ’crying’, ‘laughing’, ‘sneezing’, ‘moaning’, and ‘screaming’.
The dataset is the first dataset to the world due to its large volume, various types of nonverbal vocal cues, and various participants.
We expect that the utilization of this dataset would bring precise detection of the nonverbal vocal cues, and a better understanding of the human conversation.
We're ready to deliver further information, statistics, or samples upon request. Don't hesitate to reach out!
*The dataset can be delivered as either original wav files(44,100Hz, 16-bit PCM, 1-channel) or a single compressed h5 file(resampled to 16,000Hz).
Suitable Company Sizes
Small Business
Medium-sized Business
Enterprise
Pricing
Free sample available
Deeply has not published pricing information for this product yet.
You can request detailed pricing information below.
Quality
Self-reported by the provider
Delivery
Methods
S3 Bucket
SFTP
Email
UI Export
REST API
SOAP API
Streaming API
Feed API
Related Products
Deeply Parent-Child Vocal Interaction Dataset
by Deeply
The parent-child vocal interaction, such as reading fairy tales, singing children’s songs, conversing, is recorded. Recorded at 3 types of places, of which the level of reverberation differs. Also...
Volume | 360K records, 232 Hours of Audio |
---|---|
Quality | 95% Accuracy |
Country | South Korea |
Use Case | Room Acoustics, Dereverberation + 3 more |
Deeply Korean Read Speech Corpus
by Deeply
Pairs of Korean speakers reading a script with 3 distinct text sentiments, with 3 distinct voice sentiments, are recorded. The recordings took place in 3 different places, of which the level of rev...
Volume | 190K records, 290 Hours of audio |
---|---|
Quality | 99% Validity |
Country | South Korea |
Use Case | Room Acoustics, Sentiment Analysis + 3 more |
Dental CBCT dataset
by Automaton AI
Dataset of the CBCT dental images.
Volume | 50 Patient Data |
---|---|
Country | India |
Use Case | Panoramic Radiography and CBCT, Diagnosis of Kerato Cysts + 2 more |
Exercise / Functional Training / Outdoor Exercise Dataset
by Automaton AI
Exercise / Functional Training / Outdoor Exercise Dataset
Volume | 62.8K Images |
---|---|
Country | India |
Use Case | Smart Mirror, Outdoor Workout + 3 more |
Technicals Score
by Danel Capital
A predictive analytics equity rating score based on Machine Learning with possible values from 1 to 10 (from worst to best), which only takes into consideration Technical data points (over 700). Th...