Best Textual Datasets & Databases
Easily explore, compare & preview top Textual datasets via Datarade.
Refine your data search
Recommended Textual Data Products
50+ Results
Textual Data | NLP-enriched Data | Transcription Data | Entity Extraction & Disambiguation | Ready-to-use
We also offer bespoke integrations, leveraging your data to enhance the accuracy of event detection and
Available for 250 countries
55 languages
5 years of historical data
99.95% SLA
Pricing available upon request
Free sample preview
FileMarket | Text Recognition Data | 50,000 Images | Computer Vision Data | AI Model Training Data | Textual data | Annotated Imagery Data
by FileMarket
This dataset is part of our extensive offerings, which also include Textual Data, Object Detection Data ... , Large Language Model (LLM) Data, and Deep Learning (DL) Data.
Available for 160 countries
50K images
97% accuracy
Pricing available upon request
Free sample preview
Textual Data API | Deep Learning Data | Full Text | Firehose | 3.5M+ daily news articles | Noise-free
by Webz.io
What makes our Textual Data unique? ... How is our News Data sourced?
Available for 250 countries
200 Countries
16 years of historical data
Pricing available upon request
PDF Scraping Textual Data | Transcription Data | E-Receipt Data, PDF Text Extraction
by Reomnify
Use Reomnify’s expert data science team and proprietary algorithms to build bespoke ... We can provide you with structured data via PDF scraping from company financial statements and reports
Available for 249 countries
Updated at least hourly
Pricing available upon request
Bitext | AI Training Data | Textual Data | 9 Languages for Synthetic Text Data | 100% Utterances Semantically Equivalent | 20 Verticals Covered
by Bitext
trusted source for top-tier Textual Data. ... Enhance your AI models with Bitext’s comprehensive Textual Data and access high-quality data with 100%
Available for 249 countries
9 Languages
100% Utterances Semantically Equivalent
Available Pricing:
One-off purchase
Monthly License
Yearly License
Free sample preview
Factori AI & ML Training Data | Consumer Data | USA | Machine Learning Data
by Factori
data is gathered and aggregated via surveys, digital services, and public data sources. ... Our comprehensive data enrichment solution includes a variety of data sets that can help you address
Available for 1 country
300+ Million Profiles
1 year of historical data
97% fill rate
Starts at
$360,000 / year
Free sample preview
Coresignal | Clean Data | Company Data | AI-Enriched Datasets | Global / 35M+ Records / Updated Weekly
by Coresignal
Clean data is an excellent data solution for companies with limited data engineering capabilities and those who want to reduce time to value.
Available for 248 countries
35 million records
Available Pricing:
One-off purchase
Monthly License
Yearly License
Usage-based
Free sample preview
Nexdata | Multilingual Parallel Corpus Data | 200 Million Pairs | Text AI Training Data | Natural Language Processing Data | Translation Data
by Nexdata
Specifications
Storage format: TXT
Data content: Parallel Corpus Data
Data size: 200 million pairs
... Off-the-shelf parallel corpus data (Translation Data) covers many fields including spoken language, traveling
Available for 109 countries
200 million pairs
10 years of historical data
90% Accuracy
Starts at
$5,000 / purchase
Free sample preview
WebAutomation Off the Shelf Datasets | Audio Data for AI & ML Training | 600+ Hours of Recording | Speech Recognition, Natural Language Processing
We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings ... Key Features of Our Audio Data Datasets:
Vast Collection: Our repository consists of over 600 hours
Available for 64 countries
600 Hours of Recording
Pricing available upon request
Web Scraping | data parsing | and processing services
by AnaChart
data solutions through our expertise in web scraping, data parsing, and processing services followed ... Data Quality and Assurance Standards
We maintain rigorous U.S.
Available for 6 countries
Pricing available upon request
Free sample preview
More Textual Data Products
Discover related textual data products.
400 hours
95% sentence accuracy
61 countries covered
The AI Training Data is recorded by native speakers with authentic accents and a pleasant sound. The phoneme coverage is balanced. Professional phonetician partici...
50K Hours
98% sentence/word
29 countries covered
The recorded text is a mixture of multi-language sentences, covering general scenes and human-computer interaction scenes. The audio data is rich in content and...
55 languages
99.95% SLA
250 countries covered
Track specific events that influence the market you operate in.
NewsCatcher scans news articles from over 90,000 outlets worldwide, including hyper-local ...
438M records
249 countries covered
45 months of historical data
Job Postings Data is your guide to the job market. With Coresignal's job posting datasets or Jobs API, you can access millions of new and historical job post...
35 million records
248 countries covered
Clean data is an excellent data solution for companies with limited data engineering capabilities and those who want to reduce time to value. Dataset consist...
10K recordings
95% accuracy
64 countries covered
Authentic and spoofed faces recorded with different mobile phone cameras, showcasing both men and women, with and without glasses, under indoor and outdoor l...
20K Hours of Audio
95% Match Rate
215 countries covered
We help clients source, curate, and transcribe the right set of data required to train AI/ML models, with the utmost precision. We offered audio data collection ...
99% accuracy
240 countries covered
Conversational AI training data generated for specific custom use cases. We have a large pool of customer support agents all over the world to generate AI vo...
USA covered
FactSquared Transcribe provides automated, full-text, searchable, indexed feeds of audio and video content.
USA covered
FactSquared Analyze offers unique data-driven insights into what public figures are -- and aren’t -- saying in their public comments on market-moving topics.
191 countries covered
We have been working on several data annotation, data collection, and data labeling projects since September 2019. The volume of our annotators a...
249 countries covered
2 years of historical data
The Snippets database has sound, audio, and sonic recordings across all kinds of venues (restaurants, bars, arenas, churches, movie theaters, retail and many more)...
Updated at least hourly
249 countries covered
Use Reomnify's expert data science team and proprietary algorithms to build bespoke, trustworthy datasets.
We can provide you with stru...
200 Countries
250 countries covered
16 years of historical data
Get 50TB of 10+ years of historical data continuously, with a live API and on-demand historical datasets. We offer a firehose option, with 170+ languages and c...
6 countries covered
AnaChart has developed expertise in web scraping, data parsing, and processing services, as well as testing for quality assurance. AnaChart offers services to...
26M records
249 countries covered
45 months of historical data
Easily find and get job postings from any industry and location. Job postings API allows you to use a wide selection of filters to discover job listings you'...
35M records
249 countries covered
Comprehensive data on companies worldwide. Discover and analyze businesses from any industry with all the necessary information at hand. This multi-source co...