Lithuanian audio dataset for speech recognition 20 hours (2/5)

StageZero
No reviews yet
Verified Data Provider
Data sample preview (values masked). Columns: Speaker 1, Speaker 2, Transcribed text.
Volume: 20 hours
Available format: .json file
Coverage: 1 country

Data Dictionary

Product Attributes
Attribute Type Example Mapping
Speaker 1
Speaker 2
Transcribed text

Description

This is the second dataset of 20 hours of Lithuanian dialogue (two speakers recorded on separate tracks) about general topics. The recordings are noise-free and come with high-quality transcriptions.
Specifications:
- Each user has a unique ID across the entire dataset.
- Maximum four hours of speech per person in the dataset.
- Speech is recorded and transcribed on separate tracks.
- High-quality transcriptions come with the data in JSON format.
- No noise and high-quality recordings with both male and female speakers.
- Metadata includes: gender, age, and location.
- License terms: you pay once and you can use the data commercially in your products, but you cannot resell the data.
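The specifications above can be checked on receipt of the data. The following is a minimal Python sketch, not the provider's tooling: the listing does not publish the JSON schema, so the field names `speaker_id`, `duration_seconds`, `gender`, `age`, and `location`, and the sample records themselves, are hypothetical. It sums speech time per speaker ID and reports any speaker above the stated four-hour cap.

```python
from collections import defaultdict

# Cap stated in the listing: at most four hours of speech per person.
MAX_HOURS_PER_SPEAKER = 4.0


def hours_per_speaker(records):
    """Sum recorded hours for each speaker ID.

    Each record is assumed (hypothetically) to carry a 'speaker_id'
    and a 'duration_seconds' field.
    """
    totals = defaultdict(float)
    for rec in records:
        totals[rec["speaker_id"]] += rec["duration_seconds"] / 3600.0
    return dict(totals)


def speakers_over_cap(records, cap_hours=MAX_HOURS_PER_SPEAKER):
    """Return the sorted IDs of speakers whose total exceeds the cap."""
    totals = hours_per_speaker(records)
    return sorted(sid for sid, hours in totals.items() if hours > cap_hours)


# Hypothetical records, invented for illustration only.
records = [
    {"speaker_id": "spk_001", "duration_seconds": 7200,
     "gender": "female", "age": 34, "location": "Vilnius"},
    {"speaker_id": "spk_001", "duration_seconds": 9000,
     "gender": "female", "age": 34, "location": "Vilnius"},
    {"speaker_id": "spk_002", "duration_seconds": 3600,
     "gender": "male", "age": 41, "location": "Kaunas"},
]

print(speakers_over_cap(records))
```

In a real check, the records would be loaded from the delivered .json files with `json.load`, and the field names adjusted to the schema StageZero actually ships.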

Country Coverage

Europe (1)
Lithuania

Volume

20 hours

Pricing

License: Starts at
One-off purchase: €2,500 / purchase
Monthly License: Not available
Yearly License: Not available
Usage-based: Not available

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Delivery

Methods: Email
Format: .json

Use Cases

Machine Learning (ML)
Deep Learning
Speech Recognition

Related Products

50K Hours
98% sentence/word
29 countries covered
The recorded text is a mixture of multi-language sentences, covering general scenes and human-computer interaction scenes. The audio data is rich in content and...

20K voice memos
240 countries covered
We help clients source, curate, and transcribe data for AI and machine learning models. Our services include customized audio data collection and transcripti...

20 hours
Bulgaria covered
The third dataset of 20 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-qu...

5K Videos
100% Quality
249 countries covered
We offer a face anti-spoofing dataset designed to combat deceptive attacks on facial recognition systems, such as deepfakes and imprinted images. Our dataset i...

Frequently asked questions

What is Lithuanian audio dataset for speech recognition 20 hours (2/5)?

This is the second dataset of 20 hours of Lithuanian dialogue (two speakers recorded on separate tracks) about general topics. The recordings are noise-free and come with high-quality transcriptions.

What is Lithuanian audio dataset for speech recognition 20 hours (2/5) used for?

This product has 3 key use cases. StageZero recommends using the data for Machine Learning (ML), Deep Learning, and Speech Recognition. Global businesses and organizations buy Machine Learning (ML) Data from StageZero to fuel their analytics and enrichment.

Who can use Lithuanian audio dataset for speech recognition 20 hours (2/5)?

This product is best suited to Small Businesses, Medium-sized Businesses, and Enterprises looking for Machine Learning (ML) Data. Get in touch with StageZero to see what their data can do for your business and find out which integrations they provide.

Which countries does Lithuanian audio dataset for speech recognition 20 hours (2/5) cover?

This product covers 1 country: Lithuania. StageZero is headquartered in Finland.

How much does Lithuanian audio dataset for speech recognition 20 hours (2/5) cost?

Pricing for Lithuanian audio dataset for speech recognition 20 hours (2/5) starts at €2,500 per purchase. Connect with StageZero to get a quote and arrange custom pricing models based on your data requirements.

How can I get Lithuanian audio dataset for speech recognition 20 hours (2/5)?

Businesses can buy Machine Learning (ML) Data from StageZero and receive it via Email. StageZero delivers this product in .json format.

What is the data quality of Lithuanian audio dataset for speech recognition 20 hours (2/5)?

You can compare and assess the data quality of StageZero using Datarade’s data marketplace. StageZero appears on selected Datarade top lists ranking the best data providers, including Who’s New on Datarade? May Edition.

What are similar products to Lithuanian audio dataset for speech recognition 20 hours (2/5)?

This product has 3 related products. These alternatives include Nexdata Multilingual Code-switching Speech Data 5,000 Hours Audio Data Speech Recognition Data AI Training Data, FileMarket 20,000 Voice Memos Multilingual Training Data for Conversational AI Machine Learning (ML) Data, and Bulgarian audio dataset for speech recognition 20 hours (3/4). You can compare the best Machine Learning (ML) Data providers and products via Datarade’s data marketplace and get the right data for your use case.
