Refine your data search
50+ Results
5.0(2)

Textual Data | NLP-enriched Data | Transcription Data | Entity Extraction & Disambiguation | Ready-to-use

We also offer bespoke integrations, leveraging your data to enhance the accuracy of event detection and
Available for 250 countries
55 languages
5 years of historical data
99.95% SLA
Pricing available upon request
Free sample preview

PDF Scraping Textual Data | Transcription Data | E-Receipt Data, PDF Text Extraction

Use Reomnify’s expert data science team and proprietary algorithms to build customers to build bespoke ... We can provide you with structured data via PDF Scraping from company financial statements and reports
Available for 249 countries
1 hourly update (at least)
Pricing available upon request

FileMarket | Text Recognition Data | 50,000 Images | Computer Vision Data | AI Model Training Data | Textual data | Annotated Imagery Data

This dataset is part of our extensive offerings, which also include Textual Data, Object Detection Data ... , Large Language Model (LLM) Data, and Deep Learning (DL) Data.
Available for 160 countries
50K images
97% accuracy
Pricing available upon request
Free sample preview

Textual Data API | Deep Learning Data | Full Text | Firehose | 3.5M+ daily news articles | Noise-free

by Webz.io
Matthieu Vaxelaire CEO Mention What makes our Textual Data unique? ... How is our News Data sourced?
Available for 250 countries
200 Countries
16 years of historical data
Pricing available upon request
Free sample preview

Bitext | AI Training Data | Textual Data | 9 Languages for Synthetic Text Data | 100% Utterances Semantically Equivalent | 20 Verticals Covered

by bitext
trusted source for top-tier Textual Data. ... Enhance your AI models with Bitext’s comprehensive Textual Data and access high-quality data with 100%
Available for 249 countries
9 Languages
100% Utterances Semantically Equivalent
Available Pricing:
One-off purchase
Monthly License
Yearly License
Free sample preview
4.9(2)

Factori AI & ML Training Data | Consumer Data | USA | Machine Learning Data

by Factori
data is gathered and aggregated via surveys, digital services, and public data sources. ... Our comprehensive data enrichment solution includes a variety of data sets that can help you address
Available for 1 countries
300 + Million Profiles
1 years of historical data
97% fill rate
Starts at
$360,000 / year
Free sample preview
5.0(1)

AI & ML Training Data | 800M Profiles for LLMs, Generative AI, NLP & Predictive Models

by Xverum
What Makes Our Data Unique? ... How Is the Data Sourced?
Available for 250 countries
730M Individual Profiles
4 years of historical data
99% Complete and Fully Updated Data
Available Pricing:
One-off purchase
Monthly License
Yearly License
Usage-based
Free sample preview
10% Datarade discount
5.0(1)

WebAutomation Off the Shelf Datasets | Audio Data for AI & ML Training | 600+ Hours of Recording | Speech Recognition, Natural Language Processing

We offer a comprehensive collection of audio data, amounting to over 600 hours of high-quality recordings ... Key Features of Our Audio Data Datasets: Vast Collection: Our repository consists of over 600 hours
Available for 64 countries
600 Hours of Recording
Pricing available upon request

Nexdata | Multilingual Parallel Corpus Data | 200 Million Pairs | Text AI Training Data | Natural Language Processing Data | Translation Data

by Nexdata
Specifications Storage format : TXT Data content : Parallel Corpus Data Data size : 200 million pairs ... , 1 million hours of Audio Data and 800TB of Annotated Imagery Data.
Available for 108 countries
200 million pairs
10 years of historical data
90% Accuracy
Starts at
$5,000 / purchase
Free sample preview
4.9(7)

AI & ML Training Data | Artificial Intelligence (AI) | Machine Learning (ML) Datasets | Deep Learning Datasets | Easy to Integrate | Free Sample

[Related tags:AI Training Data, Textual data, Machine Learning (ML) Data, Deep Learning (DL) Data, ... Annotated Imagery Data, Synthetic Data, Audio Data, Large Language Model (LLM) Data,ML Training Data,
Available for 61 countries
50M Records
30 days of historical data
100% Data Coverage
Starts at
$25 / month

Monetize data on Datarade Marketplace

List your data on our global B2B marketplace to reach 100k monthly buyers

More Textual data Products

Discover related textual data products.
50 Hours
99% Accurate
South Africa covered
50 hours of simulated, unscripted agent-caller dialogue. Domains include: Insurance, Retail, Debt Collection, Travel. 49 participants from Limpopo, North-W...
350K calls per month
63 countries covered
1 years of historical data
Access a vast collection of transcribed customer call records tailored to your needs. Ideal for in-depth analysis of customer interactions and behavior trend...
240 countries covered
At Bitext, we offer advanced linguistic tools designed for automated pre-labeling of datasets to help scale Data Annotation and Labeling (DAL) projects.
200 Countries
250 countries covered
16 years of historical data
Get 50TB of 10+ Years of Historical Data continuously, with live API and on demand historical datasets. We offer a firehose option, with 170+ languages and c...
438M records
249 countries covered
45 months of historical data
Job Postings Data is your guide to the job market. With Coresignal's job posting datasets or Jobs API, you can access millions of new and historical job post...
598M records
249 countries covered
Clean Data is an excellent solution for companies with limited information engineering capabilities and those who want to reduce time to value. Dataset consi...
35M records
249 countries covered
Comprehensive data on companies worldwide. Discover and analyze businesses from any industry with all the necessary information at hand. This multi-source co...
20K Hours of Audio
95% Match Rate
215 countries covered
We help the client source, curate, & transcribe the right set of data required to train AI/ML model, with utmost precision. We offered audio data collection ...
191 countries covered
We have been working on several projects for Data Annotation, Data-Collection and data labeling services since September 2019. The volume of our annotators a...
438M records
249 countries covered
45 months of historical data
Job Postings Data is your guide to the job market. With Coresignal's job posting datasets or Jobs API, you can access millions of new and historical job post...
400 hours
95% sentence accuracy
60 countries covered
The AI Training Data is recorded by native speaker, with authentic accent and sweet sound. The phoneme coverage is balanced. Professional phonetician partici...
50M Records
100% Data Coverage
61 countries covered
APISCRAPY's AI & ML training data is meticulously curated and labelled to ensure the best quality. Our training data comes from a variety of areas, including...
1 PB
90% Accuracy
88 countries covered
Off-the-shelf 1PB unsupervised text data covers test questions, textbooks, e-books, papers, parallel copora, online Q&A, chating dialogue and etc.
20K hours
98% Word Accuracy Rate
41 countries covered
Off-the-shelf 20,000 hours of Casual Conversation Speech data, covering 30+ languages. Covering diverse domains like self-media, conversations, live streams,...
55 languages
99.95% SLA
250 countries covered
Track specific events that influence the market you operate in. NewsCatcher scans news articles from over 90,000 outlets worldwide, including hyper-local ...
1 hourly update (at least)
249 countries covered
Use Reomnify's expert data science team and proprietary algorithms to build customers to build bespoke, trustworthy datasets. We can provide you with stru...
200 Countries
250 countries covered
16 years of historical data
Get 50TB of 10+ Years of Historical Data continuously, with live API and on demand historical datasets. We offer a firehose option, with 170+ languages and c...
730M Individual Profiles
99% Complete and Fully Updated Data
250 countries covered
Xverum’s Machine Learning (ML) data will help you to train LLMs and generative AI with 800M B2B profiles. 100+ attributes, global coverage, and GDPR-complian...