Best Large Language Model (LLM) Datasets & Databases

Easily explore, compare & preview top Large Language Model (LLM) Datasets via Datarade.
Filter by
62 Large Language Model (LLM) Data Datasets
Logo of Nexdata

Unsupervised Speech Data |1 Million Hours | Spontaneous Speech | LLM | Pre-training |Large Language Model(LLM) Data

by Nexdata
Available in
USA
UK
Germany
France
Italy
and 35 more countries
Logo of MealMe

Large Language Model (LLM) Data | Machine Learning (ML) Data | AI Training Data (RAG) for 1M+ Global Grocery, Restaurant, and Retail Stores

by MealMe
Latitude
ZIP Code
City Name
URL
State Abbreviation
and 6 more attributes
Available in
USA
UK
Germany
France
Italy
and 245 more countries
Logo of Silencio Network

Large Language Model (LLM) Data | 10 Million POI Average Noise Levels | 35 B + Data Points | 100% Traceable Consent

by Silencio Network
Latitude
Longitude
City Name
POI Name
POI ID
and 5 more attributes
Available in
USA
UK
Germany
France
Italy
and 231 more countries
Logo of TagX

TagX | 10000+ Multilingual Image Dataset | Text Detection | Global coverage | LLM data | LLM finetuning

by TagX
4.9
Available in
UK
Germany
France
Italy
Spain
and 97 more countries
Logo of Dappier

Dappier | Breaking News Data | RAG API, LLM Compatible | Real-Time Updates | Unlimited Data

by Dappier
5.0
URL
Company Domain
Website
Available in
USA
UK
Germany
France
Italy
and 245 more countries
Logo of Xverum

Machine Learning (ML) Data | 800M+ B2B Profiles | AI-Ready for Deep Learning (DL), NLP & LLM Training

by Xverum
5.0
Available in
USA
UK
Germany
France
Italy
and 245 more countries
Logo of CrawlBee

CrawlBee | ML Training Data | LLM Data | Generative AI Data | Code Base Training Data | Healthcare Training Data

by CrawlBee
4.8
Available in
USA
Logo of Canaria Inc.

Canaria | Indeed Job Postings Data | U.S. | 4M+ Monthly Indeed Job Postings Data | AI-LLM Enhanced with 3 Years of Historical Indeed Job Postings Data

by Canaria Inc.
5.0
Company Name
ZIP Code
Company Industry
City Name
Company ID
and 10 more attributes
Available in
USA
Logo of Nexdata

Foundation Model Data Collection and Data Annotation | Large Language Model(LLM) Data | SFT Data| Red Teaming Services

by Nexdata
Available in
USA
UK
Germany
France
Italy
and 110 more countries
Logo of Silencio Network

Large Language Model (LLM) Noise Data | Noise Complaints + Urban Noise Levels | CCPA, GDPR Compliant | 100% Traceable Consent

by Silencio Network
Latitude
Longitude
Country Code Alpha-2
Available in
USA
UK
Germany
France
Italy
and 231 more countries

Can't find the data you're looking for?

Let data providers come to you by posting your request

Post your request

More Large Language Model (LLM) Data Products

Discover related large language model (llm) data products.