Lithuanian audio dataset for speech recognition 20 hours (4/5) product image in hero

Lithuanian audio dataset for speech recognition 20 hours (4/5)

StageZero
No reviews yetBadge iconVerified Data Provider
#
Speaker 1
Speaker 2
Transcription text
1 xxxxxxxxxx Xxxxxxxxx xxxxxx
2 xxxxxxxxxx Xxxxx Xxxxxx
3 Xxxxxxxxxx Xxxxxx Xxxxxxxxx
4 Xxxxxxxxxx xxxxxxxxx Xxxxxxxxx
5 xxxxxxxxx Xxxxxxx xxxxxx
6 Xxxxx xxxxxxxxxx xxxxxx
7 Xxxxxxxxxx xxxxxx Xxxxx
8 Xxxxxx xxxxx xxxxxxxx
9 xxxxxxx Xxxxx Xxxxxxxx
10 xxxxxxxxxx xxxxxx Xxxxxxxxx
... xxxxxx Xxxxxxxxx Xxxxxxxxx
Request Data Sample
Volume
20
hours
Avail. Format
.json
File
Coverage
1
Country

Data Dictionary

Product Attributes
Attribute Type Example Mapping
Speaker 1
Speaker 2
Transcription text

Description

Fourth dataset of 20 hours of Lithuanian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-quality transcriptions.
Specifications: - Each user has a unique ID across the entire dataset. - Maximum four hours of speech per person in the dataset. - Speech is recorded and transcribed on separate tracks. - High-quality transcriptions come with the data in JSON format. - No noise and high-quality recordings with both male and female speakers. - Metadata includes: gender, age, and location. - License terms: you pay once and you can use the data commercially in your products, but you cannot resell the data.

Country Coverage

Europe (1)
Lithuania

Volume

20 hours

Pricing

License Starts at
One-off purchase
€2,500 / purchase
Monthly License Not available
Yearly License Not available
Usage-based Not available

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Delivery

Methods
Email
Format
.json

Use Cases

Machine Learning (ML)
Deep Learning Speech Recognition

Categories

Related Products

40K Hours
98% sentence/word
55 countries covered
The speech data is collected from native English speakers in 40 countries,covering a varity of pronunciation habits and characteristics. The script is design...
10 hours
Bulgaria covered
Fourth dataset of 10 hours of Bulgarian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-quali...
20K voice memos
240 countries covered
We help clients source, curate, and transcribe data for AI and machine learning models. Our services include customized audio data collection and transcripti...
350K calls per month
63 countries covered
1 years of historical data
Access a vast collection of transcribed customer call records tailored to your needs. Ideal for in-depth analysis of customer interactions and behavior trend...

Frequently asked questions

What is Lithuanian audio dataset for speech recognition 20 hours (4/5)?

Fourth dataset of 20 hours of Lithuanian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-quality transcriptions.

What is Lithuanian audio dataset for speech recognition 20 hours (4/5) used for?

This product has 3 key use cases. StageZero recommends using the data for Machine Learning (ML), Deep Learning, and Speech Recognition. Global businesses and organizations buy Natural Language Processing (NLP) Data from StageZero to fuel their analytics and enrichment.

Who can use Lithuanian audio dataset for speech recognition 20 hours (4/5)?

This product is best suited if you’re a Small Business, Medium-sized Business, or Enterprise looking for Natural Language Processing (NLP) Data. Get in touch with StageZero to see what their data can do for your business and find out which integrations they provide.

Which countries does Lithuanian audio dataset for speech recognition 20 hours (4/5) cover?

This product includes data covering 1 country like Lithuania. StageZero is headquartered in Finland.

How much does Lithuanian audio dataset for speech recognition 20 hours (4/5) cost?

Pricing for Lithuanian audio dataset for speech recognition 20 hours (4/5) starts at EUR2,500 per purchase. Connect with StageZero to get a quote and arrange custom pricing models based on your data requirements.

How can I get Lithuanian audio dataset for speech recognition 20 hours (4/5)?

Businesses can buy Natural Language Processing (NLP) Data from StageZero and get the data via Email. Depending on your data requirements and subscription budget, StageZero can deliver this product in .json format.

What is the data quality of Lithuanian audio dataset for speech recognition 20 hours (4/5)?

You can compare and assess the data quality of StageZero using Datarade’s data marketplace. StageZero appears on selected Datarade top lists ranking the best data providers, including Who’s New on Datarade? May Edition.

What are similar products to Lithuanian audio dataset for speech recognition 20 hours (4/5)?

This product has 3 related products. These alternatives include Nexdata Multilingual Native & Accented English Speech Data 40,000 Hours Audio Data Speech Recognition Data Natural Language Processing (NLP) Data, Bulgarian audio dataset for speech recognition 10 hours (4/4), and FileMarket 20,000 Voice Memos Multilingual Training Data for Conversational AI Machine Learning (ML) Data. You can compare the best Natural Language Processing (NLP) Data providers and products via Datarade’s data marketplace and get the right data for your use case.

Starts at
€2,500 / purchase
License Starts at
One-off purchase
€2,500 / purchase
Monthly License Not available
Yearly License Not available
Usage-based Not available