Best Translation Datasets, Databases & APIs

Find the right Translation Data: Search, preview & buy data securely via Datarade.

Recommended Translation Data Products

8 Results
Rating: 5.0 (1 review)

TAUS Language Translation Data | Parallel translation for E-Commerce, various language pairs

by TAUS
Based on that, we’ve applied TAUS proprietary Matching Data technology to extract the data from the TAUS ... Data Cloud, a large industry-shared repository of parallel corpora.
Available for 11 countries
1M words per language pair
1 year of historical data
100% words
Starts at
€5,000 / purchase
Free sample available

Data Validation by EPIC Translations: AI & ML Translation Quality Data Evaluation

Machine Translation Quality Evaluation. What does EPIC Translations bring to the table? ... Geo-Local Data Evaluation.
Available for 249 countries
100K sentences
12 months of historical data
100% match rate
Pricing available upon request
10% Datarade discount
Free sample available
10% revenue share
Rating: 5.0 (1 review)

TAUS Language Translation Data | Parallel translation for Legal contracts and obligations, various language pairs

by TAUS
Other than some other Matching Data corpora that focus on business and legal communications, this corpus
Available for 7 countries
5M words per language
1 year of historical data
100% words
Starts at
€5,000 / purchase
Free sample available

Data Annotation by EPIC Translations: Image Annotation Data for AI & ML

Machine Learning Pipeline – That is, from Data Collection, Data Preprocessing, selection of algorithm ... The collection of data, labelling of data, development of machine learning algorithm that was used as
Available for 249 countries
50K images
10 months of historical data
Pricing available upon request
10% Datarade discount
Free sample available
10% revenue share
Rating: 5.0 (1 review)

TAUS Language Translation Data | Parallel translation for Colloquial English into various languages for Machine Learning

by TAUS
Need more data?
Available for 15 countries
1M words per language pair
7 months of historical data
100% words
Starts at
€100,000 / purchase
Free sample available

Data Collection by EPIC Translations: Copywriting, Text & Audio Data for AI & ML Training

Our Data Collection services: AI Training Data, Crowdsourcing, Data Processing, Copywriting ... Text Data Collection, Audio Data Collection, Chatbot Training Data, Copywriting, Crowdsourcing
Available for 215 countries
50K sentences
12 weeks of historical data
100% match rate
Pricing available upon request
10% Datarade discount
Free sample available
10% revenue share
Rating: 5.0 (1 review)

TAUS Language Translation Data | Parallel translation for Covid-19, Medical and Healthcare, various languages for Machine Learning

by TAUS
These corpora are the result of a collective industry charity effort where participants contributed their own translation ... TAUS also generated corpora by applying Matching Data selection to DataCloud and ParaCrawl data.
Available for 19 countries
123M target words in total
1 year of historical data
100% words
Starts at
€5,000 / purchase
Free sample available
Rating: 5.0 (1 review)

TAUS Language Translation Data | Parallel translation for Medical / Pharmaceutical, various language pairs for Machine Learning

by TAUS
This is a must-have corpus for anyone seeking pharma-related data. ... High-fidelity MT training data is always important, even more so when it comes to medical subjects.
Available for 6 countries
3M words per language
1 year of historical data
100% words
Starts at
€5,000 / purchase
Free sample available

More Translation Data Products

Discover related translation data products.
50K images
249 countries covered
10 months of historical data
Audio Classification, Acoustic Data Classification, Environmental Sound Classification, Natural Language, Smart Labeling, Entity Annotation, E...
50K sentences
100% match rate
215 countries covered
Our Data Collection services: 1. AI Training Data 2. Crowdsourcing 3. Data Processing 4. Copywriting 5. Text Data Collection 6. Audio Data Collection...
100K sentences
100% match rate
249 countries covered
Content Moderation, Geo-Local Data Evaluation, Machine Translation Quality Evaluation
1M words per language pair
100% words
11 countries covered
Reliable product descriptions and information are a crucial asset in any e-commerce environment. In these corpora you'll find carefully filtered and cleaned ...
1M words per language pair
100% words
15 countries covered
A carefully selected part of the colloquial corpus has been translated and reviewed by native speakers in many long-tail languages, to get the highest-qualit...
3M words per language
100% words
6 countries covered
High-fidelity MT training data is always important, even more so when it comes to medical subjects. This is a must-have corpus for anyone seeking pharma-...
5M words per language
100% words
7 countries covered
When settling an agreement, there should be no doubt about the conditions and mutual obligations. Contracts and agreements are subject to close scrutiny, so ...
123M target words in total
100% words
19 countries covered
These corpora are the result of a collective industry charity effort where participants contributed their own translation memories covering this domain so th...

The Ultimate Guide to Translation Data 2023

Learn about translation data analytics, sources, and collection.

Where can I buy Translation Data?

Data providers and vendors listed on Datarade sell Translation Data products and samples. Popular Translation Data products and datasets available on our platform include: TAUS Language Translation Data | Parallel translation for E-Commerce, various language pairs, by TAUS; Data Validation by EPIC Translations: AI & ML Translation Quality Data Evaluation, by EPIC Translations; and TAUS Language Translation Data | Parallel translation for Legal contracts and obligations, various language pairs, by TAUS.

How can I get Translation Data?

You can get Translation Data via a range of delivery methods; the right one for you depends on your use case. For example, historical Translation Data is usually available to download in bulk and is delivered via an S3 bucket. If your use case is time-critical, you can instead buy real-time Translation Data APIs, feeds and streams to access the most up-to-date data.
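
As a rough illustration of working with a bulk download once it has been delivered, the Python sketch below reads a parallel corpus and counts the words on each side. It assumes a tab-separated layout with one source segment and one target segment per line. That layout is an assumption made for this example, since providers deliver in different formats, and the function names `load_parallel_tsv` and `word_counts` are hypothetical.

```python
import csv
import io


def load_parallel_tsv(handle):
    """Read (source, target) segment pairs from a tab-separated file object.

    Assumed layout (hypothetical): one pair per line, source<TAB>target.
    Lines without exactly two non-empty fields are skipped.
    """
    pairs = []
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) != 2:
            continue  # malformed line: wrong number of fields
        source, target = row[0].strip(), row[1].strip()
        if source and target:
            pairs.append((source, target))
    return pairs


def word_counts(pairs):
    """Return whitespace-delimited word totals for the source and target sides."""
    source_words = sum(len(source.split()) for source, _ in pairs)
    target_words = sum(len(target.split()) for _, target in pairs)
    return source_words, target_words


# A tiny in-memory sample standing in for a downloaded file.
sample = io.StringIO(
    "Free shipping on all orders\tKostenloser Versand für alle Bestellungen\n"
    "malformed line without a tab\n"
    "Add to cart\tIn den Warenkorb\n"
)
pairs = load_parallel_tsv(sample)
print(len(pairs), word_counts(pairs))
```

For a real file, the same function can be called with `open(path, encoding="utf-8", newline="")` in place of the in-memory sample. A whitespace split is only a rough word count and does not suit languages written without spaces.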

What are similar data types to Translation Data?

Translation Data is similar to Natural Language Processing (NLP) Data, Annotated Imagery Data, Machine Learning (ML) Data, Deep Learning (DL) Data, and Synthetic Data. These data categories are commonly used for Natural Language Processing (NLP).

What are the most common use cases for Translation Data?

The top use case for Translation Data is Natural Language Processing (NLP).