Best Natural Language Processing (NLP) Datasets & Databases
Easily explore, compare & preview top Natural Language Processing (NLP) Datasets via Datarade.
Refine your data search
Refine your data search
Recommended Natural Language Processing (NLP) Data Products
50+ Results
Nexdata | Audio Annotation Services | AI-assisted Labeling |Speech Data | AI Training Data | Natural Language Processing (NLP) Data
by
Nexdata
Language Processing (NLP) Data, etc. ... Overview
We provide various types of Natural Language Processing (NLP) Data services, including:
Available for 118 countries
100K hours per month
5 years of historical data
99.5% word accuracy
Available Pricing:
One-off purchase
Free sample preview
WebAutomation Off the Shelf Datasets | Audio Data for AI & ML Training | 600+ Hours of Recording | Speech Recognition, Natural Language Processing
language processing, voice assistants, and more. ... language processing, voice assistants, and more.
Available for 64 countries
600 Hours of Recording
Pricing available upon request
Nexdata | Native & Accented English Speech Data |40,000 Hours | Audio Data|Speech Recognition Data| Natural Language Processing (NLP) Data
by
Nexdata
About Nexdata
Nexdata owns off-the-shelf PB-level Large Language Model(LLM) Data, 1 million hours of ... Audio Data and 800TB of Annotated Imagery Data.
Available for 54 countries
40K Hours
10 years of historical data
98% sentence/word
Starts at
$20,000 / purchase
Free sample preview
AI & ML Training Data | 800M Profiles for LLMs, Generative AI, NLP & Predictive Models
by
Xverum
From natural language processing (NLP) to predictive analytics, our data empowers a wide range of industries ... Primary Use Cases and Verticals
Natural Language Processing (NLP):
Train models for named entity recognition
Available for 250 countries
730M Individual Profiles
3 years of historical data
99% Complete and Fully Updated Data
Starts at
$1,000$900 / month
Free sample preview
10% Datarade discount
Kieli NLP Data - Fully-labelled dataset of Arabic language for Machine Learning & AI platforms
by
Kieli
language processing techniques. ... Kieli is a professional data analytic company dedicated to solving human language challenges using natural
Available for 242 countries
Pricing available upon request
Textual Data | NLP-enriched Data | Transcription Data | Entity Extraction & Disambiguation | Ready-to-use
・Natural Disasters: Earthquakes, floods, hurricanes, and other weather-related incidents.・
・Legal and ... We also offer bespoke integrations, leveraging your data to enhance the accuracy of event detection and
Available for 250 countries
55 languages
5 years of historical data
99.95% SLA
Pricing available upon request
Free sample preview
TAUS Language Translation Data | Parallel translation for E- Commerce, various language pairs
by
TAUS
Data is available in parallel format and new language pairs can be created quickly:
French - Dutch ... Based on that, we’ve applied TAUS proprietary Matching Data technology to extract the data from the TAUS
Available for 11 countries
1M words per language pair
1 years of historical data
100% words
Starts at
€5,000 / purchase
Bitext NLP Labeling for Gen AI Data Annotation and Labeling (DAL) projects
by
bitext
English, Spanish, French, German, Italian, Portuguese, Arabic, Chinese, Japanese, Korean…)
Multiple NLP ... Bitext, we offer advanced linguistic tools designed for automated pre-labeling of datasets to help scale Data
Available for 240 countries
Pricing available upon request
Free sample preview
Nexdata |Text Annotation Services | AI-assisted Labeling |Text Labeling for AI & ML | Text Data |Natural Language Processing (NLP) Data
by
Nexdata
Language Processing (NLP) Data, etc. ... Nexdata provides high-quality Natural Language Processing (NLP) Data annotation for text cleaning, entity
Available for 115 countries
50 TB per month
5 years of historical data
98% accuracy
Starts at
$20,000 / purchase
Free sample preview
Knuckle Head Data Annotation and Labelling Services (NLP Data for English, French, Spanish, Italian, Portuguese, Japanese, Indian)
by
Knuckle Head
We have been working on several projects for Data Annotation, Data-Collection and data labeling services ... Image Annotation and Labeling
Face Recognition and Emotions
Audio / Video Annotation
Medical Annotation
Data
Available for 191 countries
Pricing available upon request
Can't find the data you're looking for?
Let data providers come to you by posting your request
Post your request
More Natural Language Processing (NLP) Data Products
Discover related natural language processing (nlp) data products.
50 Hours
99% Accurate
South Africa covered
50 hours of simulated, unscripted agent-caller dialogue.
Domains include: Insurance, Retail, Debt Collection, Travel.
46 participants from Western Cape, No...
50 TB per month
98% accuracy
115 countries covered
Nexdata provides high-quality Natural Language Processing (NLP) Data annotation for text cleaning, entity tagging, named entity tagging, text classification ...
100K hours per month
99.5% word accuracy
118 countries covered
Nexdata provides high-quality Speech Data services for speech cleaning, speech transcription, phoneme annotation etc, with word accuracy of 99.5% and phoneme...
35 million records
248 countries covered
Clean data is an excellent data solution for companies with limited data engineering capabilities and those who want to reduce time to value. Dataset consist...
50 Hours
99% Accurate
South Africa covered
50 hours of simulated, unscripted agent-caller dialogue.
Domains include: Insurance, Retail, Debt Collection, Travel.
49 participants from Limpopo, North-W...
50 Hours
99% Accurate
South Africa covered
50 hours of simulated, unscripted agent-caller dialogue.
Domains include: Insurance, Retail, Debt Collection, Travel.
50+ participants from KwaZulu-Natal, ...
50M Records
100% Data Coverage
61 countries covered
APISCRAPY's AI & ML training data is meticulously curated and labelled to ensure the best quality. Our training data comes from a variety of areas, including...
600 Hours of Recording
64 countries covered
We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings. Our audio datasets are meticulously curated and de...
598M records
249 countries covered
Clean Data is an excellent solution for companies with limited information engineering capabilities and those who want to reduce time to value. Dataset consi...
400 hours
95% sentence accuracy
60 countries covered
The AI Training Data is recorded by native speaker, with authentic accent and sweet sound. The phoneme coverage is balanced. Professional phonetician partici...
35 million records
248 countries covered
Clean data is an excellent data solution for companies with limited data engineering capabilities and those who want to reduce time to value. Dataset consist...
20K Hours of Audio
95% Match Rate
215 countries covered
We help the client source, curate, & transcribe the right set of data required to train AI/ML model, with utmost precision. We offered audio data collection ...
2M pairs
95% Accuracy
50 countries covered
Off-the-shelf 2 millions pairs SFT text data. Contains 12 types of SFT QA, and the accuracy is not less than 95%. All prompts are manually written to meet di...
1 PB
90% Accuracy
8 countries covered
Off-the-shelf 50 Million Test Questions Text Parsing And Processing Data. Each question contains title, answer, parse, subject, grade, question type; The edu...
55 languages
99.95% SLA
250 countries covered
Track specific events that influence the market you operate in.
NewsCatcher scans news articles from over 90,000 outlets worldwide, including hyper-local ...
730M Individual Profiles
99% Complete and Fully Updated Data
250 countries covered
Xverum’s Machine Learning (ML) data will help you to train LLMs and generative AI with 800M B2B profiles. 100+ attributes, global coverage, and GDPR-complian...
26M records
249 countries covered
45 months of historical data
Easily find and get job postings from any industry and location. Job postings API allows you to use a wide selection of filters to discover job listings you'...
240 countries covered
At Bitext, we offer advanced linguistic tools designed for automated pre-labeling of datasets to help scale Data Annotation and Labeling (DAL) projects.