Bulgarian audio dataset for speech recognition 10 hours (4/4) product image in hero

Bulgarian audio dataset for speech recognition 10 hours (4/4)

StageZero
No reviews yetBadge iconVerified Data Provider
#
Speaker 1
Speaker 2
Transcription text
1 xxxxxxxxxx Xxxxxxxxx xxxxxx
2 xxxxxxxxxx Xxxxx Xxxxxx
3 Xxxxxxxxxx Xxxxxx Xxxxxxxxx
4 Xxxxxxxxxx xxxxxxxxx Xxxxxxxxx
5 xxxxxxxxx Xxxxxxx xxxxxx
6 Xxxxx xxxxxxxxxx xxxxxx
7 Xxxxxxxxxx xxxxxx Xxxxx
8 Xxxxxx xxxxx xxxxxxxx
9 xxxxxxx Xxxxx Xxxxxxxx
10 xxxxxxxxxx xxxxxx Xxxxxxxxx
... xxxxxx Xxxxxxxxx Xxxxxxxxx
Request Data Sample
Volume
10
hours
Avail. Format
.json
File
Coverage
1
Country

Data Dictionary

Product Attributes
Attribute Type Example Mapping
Speaker 1
Speaker 2
Transcription text

Description

Fourth dataset of 10 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-quality transcriptions.
Specifications: - Each user has a unique ID across the entire dataset. - Maximum four hours of speech per person in the dataset. - Speech is recorded and transcribed on separate tracks. - High-quality transcriptions come with the data in JSON format. - No noise and high-quality recordings with both male and female speakers. - Metadata includes: gender, age, and location. - License terms: you pay once and you can use the data commercially in your products, but you cannot resell the data.

Country Coverage

Europe (1)
Bulgaria

Volume

10 hours

Pricing

License Starts at
One-off purchase
€1,250 / purchase
Monthly License Not available
Yearly License Not available
Usage-based Not available

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Delivery

Methods
Email
Format
.json

Use Cases

Machine Learning (ML)
Deep Learning Speech Recognition

Categories

Related Products

40K Hours
98% sentence/word
52 countries covered
The speech data is collected from native English speakers in 40 countries,covering a varity of pronunciation habits and characteristics. The script is design...
20 hours
Lithuania covered
Fourth dataset of 20 hours of Lithuanian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-qual...
20K voice memos
240 countries covered
We help clients source, curate, and transcribe data for AI and machine learning models. Our services include customized audio data collection and transcripti...
5K Videos
100% Quality
249 countries covered
We offer face anti-spoofing dataset designed to combat deceptive attacks on facial recognition systems, such as deepfakes and imprinted images. Our dataset i...

Frequently asked questions

What is Bulgarian audio dataset for speech recognition 10 hours (4/4)?

Fourth dataset of 10 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-quality transcriptions.

What is Bulgarian audio dataset for speech recognition 10 hours (4/4) used for?

This product has 3 key use cases. StageZero recommends using the data for Machine Learning (ML), Deep Learning, and Speech Recognition. Global businesses and organizations buy Natural Language Processing (NLP) Data from StageZero to fuel their analytics and enrichment.

Who can use Bulgarian audio dataset for speech recognition 10 hours (4/4)?

This product is best suited if you’re a Small Business, Medium-sized Business, or Enterprise looking for Natural Language Processing (NLP) Data. Get in touch with StageZero to see what their data can do for your business and find out which integrations they provide.

Which countries does Bulgarian audio dataset for speech recognition 10 hours (4/4) cover?

This product includes data covering 1 country like Bulgaria. StageZero is headquartered in Finland.

How much does Bulgarian audio dataset for speech recognition 10 hours (4/4) cost?

Pricing for Bulgarian audio dataset for speech recognition 10 hours (4/4) starts at EUR1,250 per purchase. Connect with StageZero to get a quote and arrange custom pricing models based on your data requirements.

How can I get Bulgarian audio dataset for speech recognition 10 hours (4/4)?

Businesses can buy Natural Language Processing (NLP) Data from StageZero and get the data via Email. Depending on your data requirements and subscription budget, StageZero can deliver this product in .json format.

What is the data quality of Bulgarian audio dataset for speech recognition 10 hours (4/4)?

You can compare and assess the data quality of StageZero using Datarade’s data marketplace. StageZero appears on selected Datarade top lists ranking the best data providers, including Who’s New on Datarade? May Edition.

What are similar products to Bulgarian audio dataset for speech recognition 10 hours (4/4)?

This product has 3 related products. These alternatives include Nexdata Multilingual Native & Accented English Speech Data 40,000 Hours Audio Data Speech Recognition Data Natural Language Processing (NLP) Data, Lithuanian audio dataset for speech recognition 20 hours (4/5), and FileMarket 20,000 Voice Memos Multilingual Training Data for Conversational AI Machine Learning (ML) Data. You can compare the best Natural Language Processing (NLP) Data providers and products via Datarade’s data marketplace and get the right data for your use case.

Starts at
€1,250 / purchase
License Starts at
One-off purchase
€1,250 / purchase
Monthly License Not available
Yearly License Not available
Usage-based Not available