Best Translation Datasets & Databases
Easily explore, compare & preview top Translation Datasets via Datarade.
Refine your data search
Refine your data search
Recommended Translation Data Products
10 Results
TAUS Language Translation Data | Parallel translation for E- Commerce, various language pairs
by
TAUS
Based on that, we’ve applied TAUS proprietary Matching Data technology to extract the data from the TAUS ... Data Cloud, a large industry-shared repository of parallel corpora.
Available for 11 countries
1M words per language pair
1 years of historical data
100% words
Starts at
€5,000 / purchase
Nexdata | Multilingual Parallel Corpus Data | 200 Million Pairs | Text AI Training Data | Natural Language Processing Data | Translation Data
by
Nexdata
Off-the-shelf parallel corpus data (Translation Data) covers many fields including spoken language, traveling ... Overview
Off-the-shelf parallel corpus data (Translation Data) covers many fields including spoken language
Available for 109 countries
200 million pairs
10 years of historical data
90% Accuracy
Starts at
$5,000 / purchase
Free sample preview
Data Validation by EPIC Translations: AI & ML Translation Quality Data Evaluation
Machine Translation Quality Evaluation
WHAT DOES EPIC TRANSLATIONS BRING TO THE TABLE? ... Geo-Local Data Evaluation
.
Available for 249 countries
100K sentences
12 months of historical data
100% match rate
Pricing available upon request
10% Datarade discount
10% revenue share
TAUS Language Translation Data | Parallel translation for Legal contracts and obligations, various language pairs
by
TAUS
Other than some other Matching Data corpora that focus on business and legal communications, this corpus
Available for 7 countries
5M Million words per language
1 years of historical data
100% words
Starts at
€5,000 / purchase
WebAutomation Off the Shelf Datasets | Audio Data for AI & ML Training | 600+ Hours of Recording | Speech Recognition, Natural Language Processing
We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings ... Key Features of Our Audio Data Datasets:
Vast Collection: Our repository consists of over 600 hours
Available for 64 countries
600 Hours of Recording
Pricing available upon request
Data Annotation by EPIC Translations: Image Annotation Data for AI & ML
Machine Learning Pipeline – That is, from Data Collection, Data Preprocessing, selection of algorithm ... The collection of data, labelling of data, development of machine learning algorithm that was used as
Available for 249 countries
50K images
10 months of historical data
Pricing available upon request
10% Datarade discount
10% revenue share
TAUS Language Translation Data | Parallel translation for Medical / Pharmaceutical, various language pairs for Machine Learning
by
TAUS
This is a must-have corpus for anyone seeking for pharma-related data. ... High fidelity MT training data is always important, even more so when it comes to medical subjects.
Available for 6 countries
3M Million words per language
1 years of historical data
100% words
Starts at
€5,000 / purchase
Data Collection by EPIC Translations: Copywriting, Text & Audio Data Data for AI & ML Training
Our Data Collection services:
AI Training Data
Crowdsourcing
Data Processing
Copywriting ... Text Data Collection
Audio Data Collection
Chatbot Training Data
Copywriting
Crowdsourcing
Available for 215 countries
50K sentences
12 weeks of historical data
100% match rate
Pricing available upon request
10% Datarade discount
10% revenue share
TAUS Language Translation Data | Parallel translation for Colloquial English into various languages for Machine Learning
by
TAUS
Need more data?
Available for 15 countries
1M words per language pair
7 months of historical data
100% words
Starts at
€100,000 / purchase
TAUS Language Translation Data | Parallel translation for Covid-19, Medical and Healthcare, various languages for Machine Learning
by
TAUS
corpora are the result of a collective industry charity effort where participants contributed their own translation ... TAUS also generated corpora by applying Matching Data selection to DataCloud and ParaCrawl data.
Available for 19 countries
123M Target words in total
1 years of historical data
100% words
Starts at
€5,000 / purchase
Monetize data on Datarade Marketplace
List your data on our global B2B marketplace to reach 100k monthly buyers
More Translation Data Products
Discover related translation data products.
200 million pairs
90% Accuracy
109 countries covered
Off-the-shelf parallel corpus data (Translation Data) covers many fields including spoken language, traveling, medical treatment,news, and finance. Data clea...
50K sentences
100% match rate
215 countries covered
Our Data Collection services:
1. AI Training Data
2. Crowdsourcing
3. Data Processing
4. Copywriting
5. Text Data Collection
6. Audio Data Collection...
50K images
249 countries covered
10 months of historical data
. Audio Classification
. Acoustic Data Classification
. Environmental Sound Classification
. Natural Language
. Smart Labeling
. Entity Annotation
. E...
100K sentences
100% match rate
249 countries covered
. Content Moderation
. Geo-Local Data Evaluation
. Machine Translation Quality Evaluation
1M words per language pair
100% words
11 countries covered
Reliable product descriptions and information are a crucial asset in any e-commerce environment. In these corpora you'll find carefully filtered and cleaned ...
1M words per language pair
100% words
15 countries covered
A carefully selected part of the colloquial corpus has been translated and reviewed by native speakers in many long-tail languages, to get the highest-qualit...
200 million pairs
90% Accuracy
109 countries covered
Off-the-shelf parallel corpus data (Translation Data) covers many fields including spoken language, traveling, medical treatment,news, and finance. Data clea...
600 Hours of Recording
64 countries covered
We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings. Our audio datasets are meticulously curated and de...
100K sentences
100% match rate
249 countries covered
. Content Moderation
. Geo-Local Data Evaluation
. Machine Translation Quality Evaluation
50K images
249 countries covered
10 months of historical data
. Audio Classification
. Acoustic Data Classification
. Environmental Sound Classification
. Natural Language
. Smart Labeling
. Entity Annotation
. E...
50K sentences
100% match rate
215 countries covered
Our Data Collection services:
1. AI Training Data
2. Crowdsourcing
3. Data Processing
4. Copywriting
5. Text Data Collection
6. Audio Data Collection...
5M Million words per language
100% words
7 countries covered
When settling an agreement, there should be no doubt about the conditions and mutual obligations. Contracts and agreements are subject to close scrutiny, so ...