Let data providers come to you!

Post your request to reach 1240+ data providers and find the best match for your data needs

How it works

Tell us what you need
2-3 mins
Receive proposals
within 24 hours
Connect with providers
Post request now
Post your data request

Best Translation Datasets & Databases

Easily explore, compare & preview top Translation Datasets via Datarade.
10 Results

Parallel Corpus Data | 200 Million Pairs | Machine Translation Data | Natural Language Processing Data | Translation Data

by Nexdata
Audio Data and 800TB of Annotated Imagery Data. ... Off-the-shelf parallel corpus data (Translation Data) covers many fields including spoken language, traveling
Available for 109 countries
200 million pairs
10 years of historical data
90% Accuracy
Starts at
$10,000 / purchase
Free sample preview
5.0(1)

TAUS Language Translation Data | Parallel translation for Colloquial English into various languages for Machine Learning

by TAUS
Need more data? ... A carefully selected part of the colloquial corpus has been translated and reviewed by native speakers
Available for 15 countries
1M words per language pair
7 months of historical data
100% words
Starts at
€100,000 / purchase

Data Validation by EPIC Translations: AI & ML Translation Quality Data Evaluation

Geo-Local Data Evaluation . ... that can manipulate and translate data to a given output.
Available for 249 countries
100K sentences
12 months of historical data
100% match rate
Pricing available upon request
10% Datarade discount
10% revenue share
5.0(1)

TAUS Language Translation Data | Parallel translation for E- Commerce, various language pairs

by TAUS
Based on that, we've applied TAUS proprietary Matching Data technology to extract the data from the TAUS ... Data Cloud, a large industry-shared repository of parallel corpora.
Available for 11 countries
1M words per language pair
1 years of historical data
100% words
Starts at
€5,000 / purchase
5.0(1)

WebAutomation Off the Shelf Datasets | Audio Data for AI & ML Training | 600+ Hours of Recording | Speech Recognition, Natural Language Processing

We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings ... High-Quality Recordings: We prioritize the quality of our audio data, ensuring clear and professional
Available for 64 countries
600 Hours of Recording
Pricing available upon request

Data Annotation by EPIC Translations: Image Annotation Data for AI & ML

that can manipulate and translate data to a given output. ... Machine Learning Pipeline – That is, from Data Collection, Data Preprocessing, selection of algorithm
Available for 249 countries
50K images
10 months of historical data
Pricing available upon request
10% Datarade discount
10% revenue share
5.0(1)

TAUS Language Translation Data | Parallel translation for Covid-19, Medical and Healthcare, various languages for Machine Learning

by TAUS
TAUS also generated corpora by applying Matching Data selection to DataCloud and ParaCrawl data. ... The selected data is related to virology, epidemic, medicine, and healthcare.
Available for 19 countries
123M Target words in total
1 years of historical data
100% words
Starts at
€5,000 / purchase

Data Collection by EPIC Translations: Copywriting, Text & Audio Data Data for AI & ML Training

Data Entry 11. Data Mining 12. ... Our Data Collection services: 1. AI Training Data 2. Crowdsourcing 3. Data Processing 4.
Available for 215 countries
50K sentences
12 weeks of historical data
100% match rate
Pricing available upon request
10% Datarade discount
10% revenue share
5.0(1)

TAUS Language Translation Data | Parallel translation for Legal contracts and obligations, various language pairs

by TAUS
Other than some other Matching Data corpora that focus on business and legal communications, this corpus
Available for 7 countries
5M Million words per language
1 years of historical data
100% words
Starts at
€5,000 / purchase
5.0(1)

TAUS Language Translation Data | Parallel translation for Medical / Pharmaceutical, various language pairs for Machine Learning

by TAUS
This is a must-have corpus for anyone seeking for pharma-related data. ... High fidelity MT training data is always important, even more so when it comes to medical subjects.
Available for 6 countries
3M Million words per language
1 years of historical data
100% words
Starts at
€5,000 / purchase

Can't find the data you're looking for?

Let data providers come to you by posting your request

Post your request

More Translation Data Products

Discover related translation data products.
50K sentences
100% match rate
215 countries covered
Our Data Collection services: 1. AI Training Data 2. Crowdsourcing 3. Data Processing 4. Copywriting 5. Text Data Collection 6. Audio Data Collection...
50K images
249 countries covered
10 months of historical data
. Audio Classification . Acoustic Data Classification . Environmental Sound Classification . Natural Language . Smart Labeling . Entity Annotation . E...
100K sentences
100% match rate
249 countries covered
. Content Moderation . Geo-Local Data Evaluation . Machine Translation Quality Evaluation
1M words per language pair
100% words
11 countries covered
Reliable product descriptions and information are a crucial asset in any e-commerce environment. In these corpora you'll find carefully filtered and cleaned ...
1M words per language pair
100% words
15 countries covered
A carefully selected part of the colloquial corpus has been translated and reviewed by native speakers in many long-tail languages, to get the highest-qualit...
3M Million words per language
100% words
6 countries covered
High fidelity MT training data is always important, even more so when it comes to medical subjects. This is a must-have corpus for anyone seeking for pharma-...
200 million pairs
90% Accuracy
109 countries covered
Off-the-shelf parallel corpus data (Translation Data) covers many fields including spoken language, traveling, medical treatment,news, and finance. Data clea...
600 Hours of Recording
64 countries covered
We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings. Our audio datasets are meticulously curated and de...
100K sentences
100% match rate
249 countries covered
. Content Moderation . Geo-Local Data Evaluation . Machine Translation Quality Evaluation
50K images
249 countries covered
10 months of historical data
. Audio Classification . Acoustic Data Classification . Environmental Sound Classification . Natural Language . Smart Labeling . Entity Annotation . E...
50K sentences
100% match rate
215 countries covered
Our Data Collection services: 1. AI Training Data 2. Crowdsourcing 3. Data Processing 4. Copywriting 5. Text Data Collection 6. Audio Data Collection...
5M Million words per language
100% words
7 countries covered
When settling an agreement, there should be no doubt about the conditions and mutual obligations. Contracts and agreements are subject to close scrutiny, so ...