Best Textual datasets & Databases
Easily explore, compare & preview top Textual datasets via Datarade.
Refine your data search
Refine your data search
Recommended Textual data Products
50+ Results
Textual Data | NLP-enriched Data | Transcription Data | Entity Extraction & Disambiguation | Ready-to-use
We also offer bespoke integrations, leveraging your data to enhance the accuracy of event detection and
Available for 250 countries
55 languages
5 years of historical data
99.95% SLA
Pricing available upon request
Free sample preview
PDF Scraping Textual Data | Transcription Data |Â E-Receipt Data, PDF Text Extraction
by
Reomnify
Use Reomnify’s expert data science team and proprietary algorithms to build customers to build bespoke ... We can provide you with structured data via PDF Scraping from company financial statements and reports
Available for 249 countries
1 hourly update (at least)
Pricing available upon request
FileMarket | Text Recognition Data | 50,000 Images | Computer Vision Data | AI Model Training Data | Textual data | Annotated Imagery Data
by
FileMarket
This dataset is part of our extensive offerings, which also include Textual Data, Object Detection Data ... , Large Language Model (LLM) Data, and Deep Learning (DL) Data.
Available for 160 countries
50K images
97% accuracy
Pricing available upon request
Free sample preview
Bitext | AI Training Data | Textual Data | 9 Languages for Synthetic Text Data | 100% Utterances Semantically Equivalent | 20 Verticals Covered
by
bitext
trusted source for top-tier Textual Data. ... Enhance your AI models with Bitext’s comprehensive Textual Data and access high-quality data with 100%
Available for 249 countries
9 Languages
100% Utterances Semantically Equivalent
Available Pricing:
One-off purchase
Monthly License
Yearly License
Free sample preview
Factori AI & ML Training Data | Consumer Data | USA | Machine Learning Data
by
Factori
data is gathered and aggregated via surveys, digital services, and public data sources. ... Our comprehensive data enrichment solution includes a variety of data sets that can help you address
Available for 1 countries
300 + Million Profiles
1 years of historical data
97% fill rate
Starts at
$360,000 / year
Free sample preview
Machine Learning (ML) Data | 800M+ B2B Profiles | AI-Ready for Deep Learning (DL), NLP & LLM Training
by
Xverum
What Makes Our Data Unique? ... How Is the Data Sourced?
Available for 250 countries
730M Individual Profiles
3 years of historical data
100% Open Web Data
Starts at
$1,000$900 / month
Free sample preview
10% Datarade discount
Parallel Corpus Data | 200 Million Pairs | Machine Translation Data | Natural Language Processing Data | Translation Data
by
Nexdata
Specifications
Storage format : TXT
Data content : Parallel Corpus Data
Data size : 200 million pairs ... , 1 million hours of Audio Data and 800TB of Annotated Imagery Data.
Available for 108 countries
200 million pairs
10 years of historical data
90% Accuracy
Starts at
$10,000 / purchase
Free sample preview
Coresignal | Clean Data | Company Data | AI-Enriched Datasets | Global / 35M+ Records / Updated Weekly
by
Coresignal
Clean data is an excellent data solution for companies with limited data engineering capabilities and ... It’s an excellent data solution for companies with limited data engineering capabilities and those who
Available for 248 countries
35 million records
Available Pricing:
One-off purchase
Monthly License
Yearly License
Usage-based
Free sample preview
WebAutomation Off the Shelf Datasets | Audio Data for AI & ML Training | 600+ Hours of Recording | Speech Recognition, Natural Language Processing
We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings ... Key Features of Our Audio Data Datasets:
Vast Collection: Our repository consists of over 600 hours
Available for 64 countries
600 Hours of Recording
Pricing available upon request
AI Training Data | US Transcription Data| Unique Consumer Sentiment Data: Transcription of the calls to the companies
, Consumer Behavior Data, Consumer Sentiment Data, Consumer Review Data, AI Training Data, Textual Data ... , Consumer Sentiment Data, Consumer Review Data, AI Training Data and Transcription Data applications
Available for 63 countries
350K calls per month
1 years of historical data
Starts at
$5,000$4,500 / purchase
Free sample preview
10% Datarade discount
Can't find the data you're looking for?
Let data providers come to you by posting your request
Post your request
More Textual data Products
Discover related textual data products.
438M records
249 countries covered
45 months of historical data
Coresignal Job Postings Data is your guide to the job market. With our job posting dataset or Jobs API, you can access millions of new and historical job pos...
9 Languages
100% Utterances Semantically Equivalent
249 countries covered
Enhance your AI models with Bitext's comprehensive Textual Data and access high-quality data with 100% semantically equivalent utterances across 20 verticals.
15K Hours
98% sentence/word
82 countries covered
The Natural Language Processing (NLP) Data of in-car speech covers 20+ languages, including read, wake-up word, commend word, code-swithing, multimodal and n...
400 hours
95% sentence accuracy
60 countries covered
Speech Synthesis speech data is recorded by native speaker, with authentic accent and sweet sound. The phoneme coverage is balanced. Professional phonetician...
50 Hours
99% Accurate
South Africa covered
50 hours of simulated, unscripted agent-caller dialogue.
Domains include: Insurance, Retail, Debt Collection, Travel.
63 participants from all South Africa...
55 languages
99.95% SLA
250 countries covered
Track specific events that influence the market you operate in.
NewsCatcher scans news articles from over 90,000 outlets worldwide, including hyper-local ...
USA covered
FactSquared Analyze offers unique data-driven insights into what public figures are -- and aren’t -- saying in their public comments on market-moving topics.
240 countries covered
Combined location data, map data, and strategic intelligence to provide clients with the best possible picture of real-world human activity.
600 Hours of Recording
64 countries covered
We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings. Our audio datasets are meticulously curated and de...
598M records
249 countries covered
Clean Data is an excellent solution for companies with limited information engineering capabilities and those who want to reduce time to value. Dataset consi...
35M records
249 countries covered
Comprehensive data on companies worldwide. Discover and analyze businesses from any industry with all the necessary information at hand. This multi-source co...
20K voice memos
240 countries covered
We help clients source, curate, and transcribe data for AI and machine learning models. Our services include customized audio data collection and transcripti...
20K Hours
98% accuracy
71 countries covered
Off-the-shelf 20,000 hours Unscripted Call Center Telephony Speech Data, covering 30+ languages including English, German, French, Spanish, Italian, Portugue...
2M pairs
95% Accuracy
50 countries covered
Off-the-shelf 2 millions pairs SFT text data. Contains 12 types of SFT QA, and the accuracy is not less than 95%. All prompts are manually written to meet di...
1B Records
250 countries covered
1 years of historical data
Comprehensive training data on 1M+ stores across the US & Canada. Includes detailed menus, inventory, pricing, and availability. Ideal for AI/ML models, powe...
1 PB
90% Accuracy
8 countries covered
Off-the-shelf 50 Million Test Questions Text Parsing And Processing Data. Each question contains title, answer, parse, subject, grade, question type; The edu...
20K hours
98% Word Accuracy Rate
41 countries covered
Off-the-shelf 20,000 hours of Real-world Casual Conversation Speech data, covering 30+ languages. Covering diverse domains like self-media, conversations, li...
55 languages
99.95% SLA
250 countries covered
Track specific events that influence the market you operate in.
NewsCatcher scans news articles from over 90,000 outlets worldwide, including hyper-local ...