Best Large Language Model (LLM) Datasets & Databases

Easily explore, compare & preview top Large Language Model (LLM) Datasets via Datarade.
Filter by
73 Large Language Model (LLM) Data Datasets
Logo of Nexdata

Unsupervised Speech Data |1 Million Hours | Spontaneous Speech | LLM | Pre-training |Large Language Model(LLM) Data

by Nexdata
Available in
USA
UK
Germany
France
Italy
and 42 more countries
Logo of FileMarket

FileMarket | 20,000 photos | AI Training Data | Large Language Model (LLM) Data | Machine Learning (ML) Data | Deep Learning (DL) Data |

by FileMarket
Available in
USA
UK
Germany
France
Italy
and 244 more countries
Logo of MealMe

Large Language Model (LLM) Data | Machine Learning (ML) Data | AI Training Data (RAG) for 1M+ Global Grocery, Restaurant, and Retail Stores

by MealMe
ZIP Code
Latitude
State Abbreviation
City Name
URL
and 6 more attributes
Available in
USA
UK
Germany
France
Italy
and 245 more countries
Logo of Silencio Network

Large Language Model (LLM) Noise Level Data | Noise Complaints | CCPA, GDPR Compliant | 160k Data Points | 100% Traceable Consent

by Silencio Network
Latitude
Longitude
Country Code Alpha-2
Available in
USA
UK
Germany
France
Italy
and 231 more countries
Logo of TagX

TagX | 10000+ Multilingual Image Dataset | Text Detection | Global coverage | LLM data | LLM finetuning

by TagX
4.9
Available in
UK
Germany
France
Italy
Spain
and 97 more countries
Logo of Xverum

Machine Learning (ML) Data | 800M+ B2B Profiles | AI-Ready for Deep Learning (DL), NLP & LLM Training

by Xverum
5.0
Available in
USA
UK
Germany
France
Italy
and 245 more countries
Logo of Nexdata

Foundation Model Data Collection and Data Annotation | Large Language Model(LLM) Data | SFT Data| Red Teaming Services

by Nexdata
Available in
USA
UK
Germany
France
Italy
and 109 more countries
Logo of Silencio Network

Large Language Model (LLM) Training Data | 180+ Countries | AI-Enhanced Ground Truth Based | 10M+ Hours of Measurements | 100% Traceable Consent

by Silencio Network
Latitude
Longitude
Country Code Alpha-2
Available in
USA
UK
Germany
France
Italy
and 245 more countries
Logo of TagX

TagX Data collection for AI/ ML training | LLM data | Data collection for AI development & model finetuning | Text, image, audio, and document data

by TagX
4.9
Product Name
Available in
USA
UK
Germany
France
Italy
and 244 more countries
Logo of Oxford Languages

French Language Datasets | 150+ Years of Research | AI | NLP | LLMs | Dictionary Display | Translation Data | EU, Africa, Canada Coverage

by Oxford Languages
Available in
France
Canada
Switzerland
Belgium
Vietnam
and 32 more countries

Can't find the data you're looking for?

Let data providers come to you by posting your request

Post your request

More Large Language Model (LLM) Data Products

Discover related large language model (llm) data products.