Bulgarian audio dataset for speech recognition 20 hours (2/4) product image in hero

Bulgarian audio dataset for speech recognition 20 hours (2/4)

StageZero
No reviews yetBadge iconVerified Data Provider
#
Speaker 1
Speaker 2
Transcribed text
1 xxxxxxxxxx Xxxxxxxxx xxxxxx
2 xxxxxxxxxx Xxxxx Xxxxxx
3 Xxxxxxxxxx Xxxxxx Xxxxxxxxx
4 Xxxxxxxxxx xxxxxxxxx Xxxxxxxxx
5 xxxxxxxxx Xxxxxxx xxxxxx
6 Xxxxx xxxxxxxxxx xxxxxx
7 Xxxxxxxxxx xxxxxx Xxxxx
8 Xxxxxx xxxxx xxxxxxxx
9 xxxxxxx Xxxxx Xxxxxxxx
10 xxxxxxxxxx xxxxxx Xxxxxxxxx
... xxxxxx Xxxxxxxxx Xxxxxxxxx
Request Data Sample
Volume
20
hours
Avail. Format
.json
File
Coverage
1
Country

Data Dictionary

Product Attributes
Attribute Type Example Mapping
Speaker 1
Speaker 2
Transcribed text

Description

The second dataset of 20 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-quality transcriptions.
Specifications: - Each user has a unique ID across the entire dataset. - Maximum four hours of speech per person in the dataset. - Speech is recorded and transcribed on separate tracks. - High-quality transcriptions come with the data in JSON format. - No noise and high-quality recordings with both male and female speakers. - Metadata includes: gender, age, and location. - License terms: you pay once and you can use the data commercially in your products, but you cannot resell the data.

Country Coverage

Europe (1)
Bulgaria

Volume

20 hours

Pricing

License Starts at
One-off purchase
€2,500 / purchase
Monthly License Not available
Yearly License Not available
Usage-based Not available

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Delivery

Methods
Email
Format
.json

Use Cases

Machine Learning (ML)
Deep Learning Speech Recognition

Categories

Related Products

15K Hours
98% sentence/word
83 countries covered
The Natural Language Processing (NLP) Data of in-car speech covers 20+ languages, including read, wake-up word, commend word, code-swithing, multimodal and n...
20 hours
Bulgaria covered
The first dataset of 20 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-qu...
5K Videos
100% Quality
249 countries covered
We offer face anti-spoofing dataset designed to combat deceptive attacks on facial recognition systems, such as deepfakes and imprinted images. Our dataset i...
10K recordings
95% accuracy
64 countries covered
Authentic and spoofed faces recorded with different mobile phone cameras, showcasing both men and women, with and without glasses, under indoor and outdoor l...

Frequently asked questions

What is Bulgarian audio dataset for speech recognition 20 hours (2/4)?

The second dataset of 20 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-quality transcriptions.

What is Bulgarian audio dataset for speech recognition 20 hours (2/4) used for?

This product has 3 key use cases. StageZero recommends using the data for Machine Learning (ML), Deep Learning, and Speech Recognition. Global businesses and organizations buy Natural Language Processing (NLP) Data from StageZero to fuel their analytics and enrichment.

Who can use Bulgarian audio dataset for speech recognition 20 hours (2/4)?

This product is best suited if you’re a Small Business, Medium-sized Business, or Enterprise looking for Natural Language Processing (NLP) Data. Get in touch with StageZero to see what their data can do for your business and find out which integrations they provide.

Which countries does Bulgarian audio dataset for speech recognition 20 hours (2/4) cover?

This product includes data covering 1 country like Bulgaria. StageZero is headquartered in Finland.

How much does Bulgarian audio dataset for speech recognition 20 hours (2/4) cost?

Pricing for Bulgarian audio dataset for speech recognition 20 hours (2/4) starts at EUR2,500 per purchase. Connect with StageZero to get a quote and arrange custom pricing models based on your data requirements.

How can I get Bulgarian audio dataset for speech recognition 20 hours (2/4)?

Businesses can buy Natural Language Processing (NLP) Data from StageZero and get the data via Email. Depending on your data requirements and subscription budget, StageZero can deliver this product in .json format.

What is the data quality of Bulgarian audio dataset for speech recognition 20 hours (2/4)?

You can compare and assess the data quality of StageZero using Datarade’s data marketplace. StageZero appears on selected Datarade top lists ranking the best data providers, including Who’s New on Datarade? May Edition.

What are similar products to Bulgarian audio dataset for speech recognition 20 hours (2/4)?

This product has 3 related products. These alternatives include Nexdata In-Cabin Speech Data 15,000 Hours AI Training Data Speech Recognition Data Audio Data Natural Language Processing (NLP) Data, Bulgarian audio dataset for speech recognition 20 hours (1/4), and TagX - 5000+ Face Anti Spoofing Data Anti Spoofing Detection Face Recognition Fraud Detection KYC authentication Global coverage. You can compare the best Natural Language Processing (NLP) Data providers and products via Datarade’s data marketplace and get the right data for your use case.

Starts at
€2,500 / purchase
License Starts at
One-off purchase
€2,500 / purchase
Monthly License Not available
Yearly License Not available
Usage-based Not available