Lithuanian audio dataset for speech recognition 20 hours (1/5)

StageZero
Verified Data Provider
Volume: 20 hours
Available format: .json file
Coverage: 1 country

Data Dictionary

Product Attributes
Attribute Type Example Mapping
Speaker 1
Speaker 2
Transcribed text

Description

The first dataset (1 of 5) of 20 hours of Lithuanian dialogue about general topics, with two speakers recorded on separate tracks. The recordings are noise-free and come with high-quality transcriptions.
Specifications:
- Each user has a unique ID across the entire dataset.
- Maximum four hours of speech per person in the dataset.
- Speech is recorded and transcribed on separate tracks.
- High-quality transcriptions come with the data in JSON format.
- No noise and high-quality recordings with both male and female speakers.
- Metadata includes: gender, age, and location.
- License terms: you pay once and you can use the data commercially in your products, but you cannot resell the data.
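A buyer may want to check a delivered file against the specifications above. The following is a minimal sketch only: the listing does not publish the JSON schema, so the field names `speaker_id`, `duration_seconds`, `gender`, `age`, and `location`, the flat list-of-records layout, and the sample values are all assumptions made for illustration. It uses the per-speaker ID to total speech time and flags speakers above the four-hour cap or records lacking the listed metadata.

```python
import json
from collections import defaultdict

# From the listing: "Maximum four hours of speech per person in the dataset."
MAX_SECONDS_PER_SPEAKER = 4 * 3600
# From the listing: "Metadata includes: gender, age, and location."
REQUIRED_METADATA = ("gender", "age", "location")


def check_records(records):
    """Return a list of human-readable problems found in the records.

    The record layout (keys such as 'speaker_id' and 'duration_seconds')
    is hypothetical; adapt it to the schema of the delivered file.
    """
    problems = []
    seconds_by_speaker = defaultdict(float)
    for i, rec in enumerate(records):
        speaker = rec.get("speaker_id")
        if speaker is None:
            problems.append(f"record {i}: missing speaker_id")
            continue
        for key in REQUIRED_METADATA:
            if key not in rec:
                problems.append(f"record {i}: missing {key}")
        # Speaker IDs are unique across the dataset, so they can key the totals.
        seconds_by_speaker[speaker] += float(rec.get("duration_seconds", 0.0))
    for speaker, total in sorted(seconds_by_speaker.items()):
        if total > MAX_SECONDS_PER_SPEAKER:
            problems.append(f"speaker {speaker}: {total / 3600:.2f} h exceeds 4 h")
    return problems


# Hypothetical sample data, not taken from the real dataset.
sample = json.loads("""
[
  {"speaker_id": "spk-001", "gender": "female", "age": 34,
   "location": "Vilnius", "duration_seconds": 9000},
  {"speaker_id": "spk-001", "gender": "female", "age": 34,
   "location": "Vilnius", "duration_seconds": 6000},
  {"speaker_id": "spk-002", "gender": "male", "age": 41,
   "duration_seconds": 1200}
]
""")

problems = check_records(sample)
for line in problems:
    print(line)
```

With a real delivery, `sample` would be replaced by the parsed contents of the provided .json file, and the key names adjusted to match its actual structure.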

Country Coverage

Europe (1)
Lithuania

Volume

20 hours

Pricing

One-off purchase: starts at €2,500 / purchase
Monthly License: Not available
Yearly License: Not available
Usage-based: Not available

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Delivery

Methods
Email
Format
.json

Use Cases

Deep Learning
Speech Recognition

Categories

Related Products

15K Hours
98% sentence/word
61 countries covered
The Natural Language Processing (NLP) Data of in-car speech covers 20+ languages, including read, wake-up word, command word, code-switching, multimodal and n...
20 hours
Lithuania covered
Third dataset of 20 hours of Lithuanian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-quali...
20K voice memos
240 countries covered
We help clients source, curate, and transcribe data for AI and machine learning models. Our services include customized audio data collection and transcripti...
598M records
249 countries covered
Clean Data is an excellent solution for companies with limited information engineering capabilities and those who want to reduce time to value. Dataset consi...

Frequently asked questions

What is Lithuanian audio dataset for speech recognition 20 hours (1/5)?

It is the first dataset (1 of 5) of 20 hours of Lithuanian dialogue about general topics, with two speakers recorded on separate tracks. The recordings are noise-free and come with high-quality transcriptions.

What is Lithuanian audio dataset for speech recognition 20 hours (1/5) used for?

This product has 2 key use cases. StageZero recommends using the data for Deep Learning and Speech Recognition. Global businesses and organizations buy Natural Language Processing (NLP) Data from StageZero to fuel their analytics and enrichment.

Who can use Lithuanian audio dataset for speech recognition 20 hours (1/5)?

This product is best suited if you’re a Small Business, Medium-sized Business, or Enterprise looking for Natural Language Processing (NLP) Data. Get in touch with StageZero to see what their data can do for your business and find out which integrations they provide.

Which countries does Lithuanian audio dataset for speech recognition 20 hours (1/5) cover?

This product covers 1 country: Lithuania. StageZero is headquartered in Finland.

How much does Lithuanian audio dataset for speech recognition 20 hours (1/5) cost?

Pricing for Lithuanian audio dataset for speech recognition 20 hours (1/5) starts at €2,500 per purchase. Connect with StageZero to get a quote and arrange custom pricing models based on your data requirements.

How can I get Lithuanian audio dataset for speech recognition 20 hours (1/5)?

Businesses can buy Natural Language Processing (NLP) Data from StageZero and receive it by email. StageZero delivers this product in .json format.

What is the data quality of Lithuanian audio dataset for speech recognition 20 hours (1/5)?

You can compare and assess the data quality of StageZero using Datarade’s data marketplace. StageZero appears on selected Datarade top lists ranking the best data providers, including Who’s New on Datarade? May Edition.

What are similar products to Lithuanian audio dataset for speech recognition 20 hours (1/5)?

This Tabular Data has 3 related products. These alternatives include Nexdata In-Car Speech Data 15,000 Hours AI Training Data Speech Recognition Data Audio Data Natural Language Processing (NLP) Data, Lithuanian audio dataset for speech recognition 20 hours (3/5), and FileMarket 20,000 Voice Memos Multilingual Training Data for Conversational AI Machine Learning (ML) Data. You can compare the best Natural Language Processing (NLP) Data providers and products via Datarade’s data marketplace and get the right data for your use case.
