Radiology Medical Dataset | 45 M Studies | 10 M Patients | 7 Modalities | Diagnostic AI Data

FileMarket
No reviews yet
Verified Data Provider
Volume: 45M studies
Coverage: 1 country
History: 10 years

Data Dictionary

[Sample] Dataset Samples
Attribute | Type | Example | Mapping
Name | String | Sample |
Link | String | https://www.dropbox.com/scl/fo/ox5my8r9u50fonmm85csv/ALS-... |

Description

45 M radiology studies from 10 M de-identified patients (about 4.5 studies per patient): X-ray, CT, MRI, mammography, dental, fluoroscopy, PET/SPECT. Each exam carries age/sex, body region, vendor, year, and pathology tags. Region-balanced, spanning 2015-2025. Ready-to-use training data for disease detection, triage, and multimodal clinical AI.
1. Volume & Composition
Studies: 45 000 000 DICOM exams (images & reports)
Patients: 10 000 000 unique IDs, average 4.5 studies each
Modalities (share):
- X-ray / digital fluoroscopy – 30.6 M (68 %)
- CT – 6.75 M (15 %)
- MRI – 2.25 M (5 %)
- Mammography – 1.80 M (4 %)
- Dental intra-oral & OPG – 2.03 M (4.5 %)
- Interventional & fluoroscopy – 1.35 M (3 %)
- PET / SPECT / nuclear – 0.23 M (0.5 %)

2. Body-Region Coverage
- Chest / thorax – 18 M (40 %)
- Extremities – 9 M (20 %)
- Head / brain – 5.85 M (13 %)
- Abdomen / pelvis – 4.5 M (10 %)
- Spine – 3.15 M (7 %)
- Breast – 1.8 M (4 %)
- Cardiac / vascular / ENT & other – 2.7 M (6 %)

3. Demographics
Sex: 47 % male (4.7 M) · 53 % female (5.3 M)
Age bands (IDs): 0-17 15 % · 18-44 35 % · 45-64 30 % · 65+ 20 %

4. Temporal Spread
Studies acquired 2015-2025; yearly volume grows from 2.95 M (2015) to 5.87 M (2022) with ongoing 2025 inflow.

6. Pathology Labels (radiologist-verified subsets)
- Normal / no significant findings – 27 M (60 %)
- Respiratory infections (pneumonia, COVID-19, TB) – 2.7 M (6 %)
- Trauma / fractures – 2.7 M (6 %)
- Oncology overall – 3.15 M (7 %) with lung (2.5 %) & breast (1.5 %) highlights
- Cardiovascular, musculoskeletal degenerative, neuro, renal, congenital and others cover the remaining 21 %.

7. Provenance & Compliance
All files were de-identified at source; acquisition sites provided consent for research redistribution under HIPAA-safe-harbor and GDPR guidelines. Internal audits confirm removal of PHI and pixel burn-ins.

8. Core Use-Cases
- Large-scale pre-training of radiology foundation models (vision transformers, CLIP-style image-text pairs)
- Fine-tuning for disease detection (TB, COVID-19, lung nodules, fractures, cancers)
- Cross-modal fusion (report-generation, reasoning)
- Bias & fairness studies across sex, age, vendor, and body-region strata
- Synthetic data generation and active-learning bootstraps

Take your diagnostic-AI pipeline from proof-of-concept to production with 45 million expertly-curated, richly-annotated imaging studies.
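
The modality and body-region figures in the description above can be cross-checked against the stated totals. The following is a minimal sketch of that check, not something supplied by the provider: the numbers are copied from the listing, while the dictionary layout, the helper name check_breakdown, and the 0.01 M rounding tolerance are assumptions made for illustration.

```python
# Figures copied from the listing; names and tolerance are illustrative assumptions.
TOTAL_STUDIES = 45_000_000
TOTAL_PATIENTS = 10_000_000

# name -> (stated count in millions, stated share in percent)
MODALITIES = {
    "X-ray / digital fluoroscopy": (30.6, 68.0),
    "CT": (6.75, 15.0),
    "MRI": (2.25, 5.0),
    "Mammography": (1.80, 4.0),
    "Dental intra-oral & OPG": (2.03, 4.5),
    "Interventional & fluoroscopy": (1.35, 3.0),
    "PET / SPECT / nuclear": (0.23, 0.5),
}

BODY_REGIONS = {
    "Chest / thorax": (18.0, 40.0),
    "Extremities": (9.0, 20.0),
    "Head / brain": (5.85, 13.0),
    "Abdomen / pelvis": (4.5, 10.0),
    "Spine": (3.15, 7.0),
    "Breast": (1.8, 4.0),
    "Cardiac / vascular / ENT & other": (2.7, 6.0),
}


def check_breakdown(rows, total, tolerance_m=0.01):
    """Return rows whose stated count (in millions) differs from
    share * total by more than tolerance_m (also in millions)."""
    mismatches = []
    for name, (count_m, share_pct) in rows.items():
        expected_m = total * share_pct / 100 / 1_000_000
        if abs(expected_m - count_m) > tolerance_m:
            mismatches.append((name, count_m, expected_m))
    return mismatches


studies_per_patient = TOTAL_STUDIES / TOTAL_PATIENTS
modality_share_sum = sum(share for _, share in MODALITIES.values())
region_share_sum = sum(share for _, share in BODY_REGIONS.values())
modality_mismatches = check_breakdown(MODALITIES, TOTAL_STUDIES)
region_mismatches = check_breakdown(BODY_REGIONS, TOTAL_STUDIES)

print(f"Studies per patient: {studies_per_patient}")
print(f"Modality shares sum to {modality_share_sum} %, mismatches: {modality_mismatches}")
print(f"Body-region shares sum to {region_share_sum} %, mismatches: {region_mismatches}")
```

Under these assumptions both breakdowns sum to 100 % and every stated count matches its share of 45 M within rounding. The same check could be rerun on a delivered manifest to confirm that the sample received reflects the advertised composition.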

Country Coverage

Europe (1)
Belarus

History

10 years of historical data

Volume

45 million studies

Pricing

License | Starts at
One-off purchase | $90,000,000 / purchase
Monthly License | Not available
Yearly License | Not available
Usage-based | Not available
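
For budgeting, the starting price can be converted into a rough unit cost. This is only a back-of-the-envelope sketch: it assumes the one-off starting price covers the full 45 M studies and 10 M patients, which the listing does not state explicitly, and the variable names are illustrative.

```python
# Figures from the listing; the assumption that the starting price buys
# the complete dataset is ours, not the provider's.
STARTING_PRICE_USD = 90_000_000
TOTAL_STUDIES = 45_000_000
TOTAL_PATIENTS = 10_000_000

price_per_study = STARTING_PRICE_USD / TOTAL_STUDIES
price_per_patient = STARTING_PRICE_USD / TOTAL_PATIENTS

print(f"Implied price per study: ${price_per_study:.2f}")
print(f"Implied price per patient: ${price_per_patient:.2f}")
```

Actual pricing may differ, since the listing describes the figure as a starting point and invites custom quotes.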

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Delivery

Methods
S3 Bucket
SFTP
REST API
Azure Blob Storage
Frequency
on-demand

Use Cases

Artificial Intelligence (AI)
Machine Learning (ML)
Data-Efficient Machine Learning
Deep Learning Medical Imaging

Frequently asked questions

What is Radiology Medical Dataset 45 M Studies 10 M Patients 7 Modalities Diagnostic AI Data?

45 M radiology studies from 10 M de-identified patients (about 4.5 studies per patient): X-ray, CT, MRI, mammography, dental, fluoroscopy, PET/SPECT. Each exam carries age/sex, body region, vendor, year, and pathology tags. Region-balanced, spanning 2015-2025. Ready-to-use training data for disease detection, triage, and multimodal clinical AI.

What is Radiology Medical Dataset 45 M Studies 10 M Patients 7 Modalities Diagnostic AI Data used for?

This product has 4 key use cases. FileMarket recommends using the data for Artificial Intelligence (AI), Machine Learning (ML), Data-Efficient Machine Learning, and Deep Learning Medical Imaging. Global businesses and organizations buy Medical Imagery Data from FileMarket to fuel their analytics and enrichment.

Who can use Radiology Medical Dataset 45 M Studies 10 M Patients 7 Modalities Diagnostic AI Data?

This product is best suited if you’re a Small Business, Medium-sized Business, or Enterprise looking for Medical Imagery Data. Get in touch with FileMarket to see what their data can do for your business and find out which integrations they provide.

How far back does the data in Radiology Medical Dataset 45 M Studies 10 M Patients 7 Modalities Diagnostic AI Data go?

This product has 10 years of historical coverage. It can be delivered on demand.

Which countries does Radiology Medical Dataset 45 M Studies 10 M Patients 7 Modalities Diagnostic AI Data cover?

This product covers 1 country: Belarus. FileMarket is headquartered in the United States of America.

How much does Radiology Medical Dataset 45 M Studies 10 M Patients 7 Modalities Diagnostic AI Data cost?

Pricing for Radiology Medical Dataset 45 M Studies 10 M Patients 7 Modalities Diagnostic AI Data starts at USD 90,000,000 per purchase. Connect with FileMarket to get a quote and arrange custom pricing models based on your data requirements.

How can I get Radiology Medical Dataset 45 M Studies 10 M Patients 7 Modalities Diagnostic AI Data?

Businesses can buy Medical Imagery Data from FileMarket and get the data via S3 Bucket, SFTP, REST API, and Azure Blob Storage.

What is the data quality of Radiology Medical Dataset 45 M Studies 10 M Patients 7 Modalities Diagnostic AI Data?

You can compare and assess the data quality of FileMarket using Datarade’s data marketplace.

What are similar products to Radiology Medical Dataset 45 M Studies 10 M Patients 7 Modalities Diagnostic AI Data?

This product has 3 related products. These alternatives include Gesture Recognition Data 10,000 ID Computer Vision Data AI Training Data Machine Learning (ML) Data, Selfie Video Dataset 3K+ videos Global Coverage Face & Voice Biometrics Computer-Vision Data, and Large Language Model (LLM) Data 10 M Hours of Urban Noise Level Measurement CCPA, GDPR Compliant 35 B + Data Points 100% Traceable Consent. You can compare the best Medical Imagery Data providers and products via Datarade’s data marketplace and get the right data for your use case.

FileMarket

Unique Audio and Multimedia Datasets for AI

Verified Provider
100% Response rate
