HistAI Data Hub
The World's Largest
WSI Data Hub
A curated repository of whole slide images spanning hundreds of tissue types, stains, and disease states — fully licensed for commercial AI development and available for instant download.
Real, diverse cases
76,355 de-identified patient cases across cancer and non-cancer diagnoses, with paired metadata for cohort building.
H&E + IHC + special stains
218K H&E slides plus 83K IHC and special stains spanning 400+ markers and clones.
Commercial-ready rights
Full perpetual commercial license. Train, fine-tune, and deploy AI models with no usage restrictions and no hidden fees.
Dataset at a glance
The full distributions behind the Data Hub — explore the demographics, organ systems, diagnoses, scanners, and stains that make up the dataset.
Cancer vs. non-cancer
Diagnosis category across all cases
Gender
Patient gender across all cases
Age distribution
Cases grouped into 10-year deciles.
Scanners
Number of slide files by scanner
Top organ systems
Number of cases by organ system (top 20)
Top cancer diagnoses
Top 20 by case count
Top non-cancer diagnoses
Top 20 by case count
Top 100 stains
Files by stain. 218,434 H&E slides plus a long tail of IHC clones and special stains.
| # | Stain | Files | Share |
|---|---|---|---|
| 1 | H&E | 218,434 | 77% |
| 2 | Ki67 (MM1) | 3,765 | 1.3% |
| 3 | ER (6F11) | 3,446 | 1.2% |
| 4 | Giemsa | 3,217 | 1.1% |
| 5 | Progesterone Receptor | 2,968 | 1.0% |
| 6 | HER2 (c-erbB-2) | 2,355 | 0.8% |
| 7 | CD10 (56C6) | 1,446 | 0.5% |
| 8 | CD20 (MJ1) | 1,413 | 0.5% |
| 9 | Ki67 (30-9) | 1,300 | 0.5% |
| 10 | CK7(OV-TL 12/30) | 1,270 | 0.4% |
| 11 | Bcl-6 (LN22) | 1,187 | 0.4% |
| 12 | MCK (AE1& AE3) | 1,183 | 0.4% |
| 13 | Bcl-2 (bcl-2/100/D5) | 1,168 | 0.4% |
| 14 | HER2/neu | 1,163 | 0.4% |
| 15 | CK20 (Ks20.8) | 1,141 | 0.4% |
| 16 | Ki67 (SP6) | 1,079 | 0.4% |
| 17 | CD3 (LN10) VET | 1,079 | 0.4% |
| 18 | CD5 (4C7) | 1,035 | 0.4% |
| 19 | Synaptophysin | 1,022 | 0.4% |
| 20 | p63 7JUL | 1,000 | 0.4% |
| 21 | S100(4C4.9) | 975 | 0.3% |
| 22 | CD23 (1B12) | 945 | 0.3% |
| 23 | TTF-1(SPT24) | 944 | 0.3% |
| 24 | MUM1 Protein (MUM1p) | 825 | 0.3% |
| 25 | Pax-8 (Polyclonal) | 813 | 0.3% |
| 26 | HER2/neu (4B5) | 801 | 0.3% |
| 27 | GATA-3 (L50-823) | 800 | 0.3% |
| 28 | CDX2 (EPR2764Y) | 798 | 0.3% |
| 29 | Cyclin D1 (D1-GM) | 784 | 0.3% |
| 30 | CD34 (QBEnd/10) | 783 | 0.3% |
| 31 | CD138 (B-A38) | 762 | 0.3% |
| 32 | PD-L1 (28-8) | 727 | 0.3% |
| 33 | MCK (AE1/AE3) | 712 | 0.2% |
| 34 | CD30 (1G12) | 696 | 0.2% |
| 35 | CD45 (X16/99) | 664 | 0.2% |
| 36 | CD20(L26) | 617 | 0.2% |
| 37 | CK14 (LL002) | 596 | 0.2% |
| 38 | P40 (BC28) | 591 | 0.2% |
| 39 | Chromogranin A (LK2H10) | 578 | 0.2% |
| 40 | SMA(1A4) | 555 | 0.2% |
| 41 | Ki67 (MIB-1) VET | 554 | 0.2% |
| 42 | ER (SP1) | 548 | 0.2% |
| 43 | Vimentin (V9) | 536 | 0.2% |
| 44 | CK HMW | 523 | 0.2% |
| 45 | AMACR | 522 | 0.2% |
| 46 | P53 (DO-7) | 507 | 0.2% |
| 47 | PR (16) | 503 | 0.2% |
| 48 | MSH6 (PU29) | 494 | 0.2% |
| 49 | PR (1E2) | 481 | 0.2% |
| 50 | WT1 WT49 | 477 | 0.2% |
| # | Stain | Files | Share |
|---|---|---|---|
| 51 | CD56 (CD564) | 475 | 0.2% |
| 52 | SOX-10 (EP268) | 459 | 0.2% |
| 53 | Pax-5 (1EW) | 447 | 0.2% |
| 54 | Desmin (DE-R-11) | 446 | 0.2% |
| 55 | CD138 (MI15) | 435 | 0.2% |
| 56 | E-Cadherin (36B5) | 435 | 0.2% |
| 57 | Epstein Barr Virus (CS1-4) | 434 | 0.2% |
| 58 | Chromogranin A(DAK-A3) | 413 | 0.1% |
| 59 | CK7(RN7) | 401 | 0.1% |
| 60 | CD56 (123C3.D5) | 373 | 0.1% |
| 61 | CD3 (MRQ -39) | 361 | 0.1% |
| 62 | MLH1 (MLH1) | 361 | 0.1% |
| 63 | S100 (Polyclonal) | 354 | 0.1% |
| 64 | PMS2 (MRQ-28) | 342 | 0.1% |
| 65 | p63 (i27-i) | 340 | 0.1% |
| 66 | CD117 (T595) | 333 | 0.1% |
| 67 | Melan-A (A103) | 332 | 0.1% |
| 68 | ER (1D5) | 326 | 0.1% |
| 69 | Estrogen Receptor | 309 | 0.1% |
| 70 | MSH2 (25D12) | 292 | 0.1% |
| 71 | Progesterone Receptor (PgR636) | 288 | 0.1% |
| 72 | EMA (GP1.4) | 286 | 0.1% |
| 73 | HMB45 (HMB-45) | 275 | 0.1% |
| 74 | CD68 (514H12) | 267 | 0.1% |
| 75 | CD4 (4B12) | 266 | 0.1% |
| 76 | Napsin A (poly) | 257 | 0.1% |
| 77 | P16(R19-D) | 256 | 0.1% |
| 78 | CK 5/6 D5/16B4 | 254 | 0.1% |
| 79 | TTF1 (SPT24) | 248 | 0.1% |
| 80 | Van Gieson | 244 | 0.1% |
| 81 | MSH2 (G219-1129) | 237 | 0.1% |
| 82 | CK 8 & 18 (B22.1&B23.1) | 226 | 0.1% |
| 83 | Calretinin (CAL6) | 226 | 0.1% |
| 84 | Calponin-1 (EP798Y) | 224 | 0.1% |
| 85 | Argentum | 223 | 0.1% |
| 86 | Lambda Light Chain (SHL53) | 221 | 0.1% |
| 87 | Kappa Light Chain (CH15) | 218 | 0.1% |
| 88 | CD15 (MMA) | 204 | 0.1% |
| 89 | CD99 (EPR3097Y) | 202 | 0.1% |
| 90 | DOG-1 (K9) | 200 | 0.1% |
| 91 | PAS | 194 | 0.1% |
| 92 | Cyclin D1 (SP4-R) | 191 | 0.1% |
| 93 | SATB2 (EP281) | 186 | 0.1% |
| 94 | Synaptophysin (27G12) | 183 | 0.1% |
| 95 | Arginase-1 (EP261) | 183 | 0.1% |
| 96 | CD79a (11E3) | 179 | 0.1% |
| 97 | INSM1 | 176 | 0.1% |
| 98 | Alcian Blue Shifa | 172 | 0.1% |
| 99 | CK5/6 (D5/16B4) | 170 | 0.1% |
| 100 | PD-L1 (ZR3) | 169 | 0.1% |
Build the next generation of pathology AI.
Filter, preview, and download cohorts tailored to your research — with transparent pricing and a commercial license that travels with the data.