Let data providers come to you!

Post your request to reach 1240+ data providers and find the best match for your data needs

How it works

Tell us what you need (2-3 mins)
Receive proposals (within 24 hours)
Connect with providers

Best Textual datasets & Databases

Easily explore, compare & preview top Textual datasets via Datarade.
50+ Results

FileMarket | Text Recognition Data | 50,000 Images | Computer Vision Data | AI Model Training Data | Textual data | Annotated Imagery Data

This dataset is part of our extensive offerings, which also include Textual Data, Object Detection Data ... , Large Language Model (LLM) Data, and Deep Learning (DL) Data.
Available for 160 countries
50K images
97% accuracy
Pricing available upon request
Free sample preview
4.9(2)

Factori AI & ML Training Data | Consumer Data | USA | Machine Learning Data

by Factori
Our US consumer graph database is a comprehensive dataset that can be used to train AI & ML models. ... data is gathered and aggregated via surveys, digital services, and public data sources.
Available for 1 country
300+ Million Profiles
1 year of historical data
97% fill rate
Starts at
$360,000 / year
Free sample preview
5.0(2)

AI Training Data | US Transcription Data | Unique Consumer Sentiment Data: Transcriptions of Calls to Companies

, Consumer Behavior Data, Consumer Sentiment Data, Consumer Review Data, AI Training Data, Textual Data ... , Consumer Sentiment Data, Consumer Review Data, AI Training Data and Transcription Data applications
Available for 63 countries
350K calls per month
1 year of historical data
Starts at
$4,500 / purchase (was $5,000)
Free sample preview
10% Datarade discount
5.0(1)

Machine Learning (ML) Data | 800M+ B2B Profiles | AI-Ready for Deep Learning (DL), NLP & LLM Training

by Xverum
How Is the Data Sourced? ... What Makes Our Data Unique?
Available for 250 countries
730M Individual Profiles
3 years of historical data
100% Open Web Data
Starts at
$900 / month (was $1,000)
Free sample preview
10% Datarade discount

Parallel Corpus Data | 200 Million Pairs | Machine Translation Data | Natural Language Processing Data | Translation Data

by Nexdata
Audio Data and 800TB of Annotated Imagery Data. ... Specifications: Storage format: TXT; Data content: Parallel Corpus Data; Data size: 200 million pairs
Available for 109 countries
200 million pairs
10 years of historical data
90% Accuracy
Starts at
$10,000 / purchase
Free sample preview
4.8(12)

Coresignal | Clean Data | Company Data | AI-Enriched Datasets | Global / 35M+ Records / Updated Weekly

Clean data is an excellent data solution for companies with limited data engineering capabilities and ... Dataset consists of company data, 35M+ records in total.
Available for 248 countries
35 million records
Available Pricing:
One-off purchase
Monthly License
Yearly License
Usage-based
Free sample preview

AI Training Data | Annotated Checkout Flows for Retail, Restaurant, and Marketplace Websites

by MealMe
data across hundreds of real merchants. ... AI Training Data featuring meticulously annotated checkout flows from leading retail, restaurant, and
Available for 1 country
10K Annotated Flows
Pricing available upon request
Free sample preview
4.9(7)

AI & ML Training Data | Artificial Intelligence (AI) | Machine Learning (ML) Datasets | Deep Learning Datasets | Easy to Integrate | Free Sample

Related tags: AI Training Data, Textual data, Machine Learning (ML) Data, Deep Learning (DL) Data ... , Annotated Imagery Data, Synthetic Data, Audio Data, Large Language Model (LLM) Data, ML Training Data
Available for 61 countries
50M Records
30 days of historical data
100% Data Coverage
Starts at
$25 / month

China Investor Relation Activity Analytics (CIRA) | Investor Relation Event Data | China earnings call transcript | Alternative Data | Daily Update

The dataset provides: 1) insightful text analytics of IR events, 2) rich data fields about events and ... CIRA integrates these three types of IR events from 11 sources into 6 unified data tables.
Available for 1 country
5K A-share
14 years of historical data
Available Pricing:
Yearly License
Free sample preview

FileMarket | 20,000 Voice Memos | Multilingual Training Data for Conversational AI | Machine Learning (ML) Data

Whether you require Transcription Data, Machine Learning (ML) Data, Large Language Model (LLM) Data, ... Deep Learning (DL) Data, or Audio Data, we are equipped to provide comprehensive solutions that align
Available for 240 countries
20K voice memos
Pricing available upon request
Free sample preview

More Textual data Products

Discover related textual data products.
350K calls per month
63 countries covered
1 year of historical data
Access a vast collection of transcribed customer call records tailored to your needs. Ideal for in-depth analysis of customer interactions and behavior trend...
598M records
249 countries covered
Clean Data is an excellent solution for companies with limited information engineering capabilities and those who want to reduce time to value. Dataset consi...
10K recordings
95% accuracy
64 countries covered
Authentic and spoofed faces recorded with different mobile phone cameras, showcasing both men and women, with and without glasses, under indoor and outdoor l...
5M reviews
63 countries covered
This review dataset captures buyer feedback about online marketplaces, products, sellers, and customer service.
40K Hours
98% sentence/word
55 countries covered
The speech data is collected from native English speakers in 40 countries, covering a variety of pronunciation habits and characteristics. The script is design...
438M records
249 countries covered
45 months of historical data
Job Postings Data is your guide to the job market. With Coresignal's job posting datasets or Jobs API, you can access millions of new and historical job post...
65K Hours
98% sentence/word
103 countries covered
Off-the-shelf Scripted Monologues Speech Datasets cover 100+ languages. All the Machine Learning (ML) Data are collected from native speakers, with signed au...
4.5M images
100% Image attachment
249 countries covered
Wirestock's AI/ML Image Training Data, 4.5M Files with Metadata: This data product offers a vast collection of images and associated metadata, ideal for trai...
20 hours
Norway covered
The fourth part of 20 hours of Norwegian dialogue (two people, separate tracks) about general topics. The dataset is high quality with no noise and high-qual...
20K voice memos
240 countries covered
We help clients source, curate, and transcribe data for AI and machine learning models. Our services include customized audio data collection and transcripti...
191 countries covered
We have been working on several projects for data annotation, data collection, and data labeling services since September 2019. The volume of our annotators a...
249 countries covered
2 years of historical data
Snippets database has sound / audio / sonic recordings across all kinds of venues (restaurants, bars, arenas, churches, movie theaters, retail and many more)...
10K Annotated Flows
USA covered
AI Training Data featuring meticulously annotated checkout flows from leading retail, restaurant, and marketplace websites. Includes detailed step-by-step us...
5K A-share
China covered
14 years of historical data
CIRA integrates these three types of IR events from 11 sources into 6 unified data tables. The dataset provides: 1) insightful text analytics of IR events, 2...
250 countries covered
85 years of historical data
To order data visit: https://weatherdata.ai/order-single-location-data/ WeatherDataAI Single-Point lets you download daily historical weather data for any...
20K Hours
98% accuracy
72 countries covered
Off-the-shelf 20,000 hours Unscripted Call Center Telephony Speech Data, covering 30+ languages including English, German, French, Spanish, Italian, Portugue...
2M pairs
95% Accuracy
51 countries covered
Off-the-shelf 2 million pairs of SFT text data. Contains 12 types of SFT QA, and the accuracy is not less than 95%. All prompts are manually written to meet di...