Pathology Without Barriers
From datasets to digital workflows, we empower pathologists and researchers to accelerate discoveries without the usual hurdles.
Trusted by over 3,000 researchers at leading organizations
The World’s Largest WSI Data Hub
At Your Fingertips
Access thousands of high-quality, curated whole slide images.
Choose from a variety of tissues, stains, clones, and diseases.
Secure a full, perpetual commercial license suitable for research and development.
Nothing to negotiate: transparent and competitive pricing.
Pay and go. Cohorts are formed and available for download within minutes.
Analyze statistical distributions of your cohorts before you buy.
Removing Hurdles. Accelerating Research.
Huge, curated, diverse dataset with transparent pricing and licensing.
Cloud-native slide sharing, viewing & annotation.
Cloud-based scalable storage on Microsoft Azure.
CellDX Platform
A Complete Digital Pathology Environment
Buy curated WSIs with commercial licenses.
Seamless cloud-based exploration and collaboration.
A variety of tools for manual slide annotation.
Fast-growing library of state-of-the-art specialized pathology models.
From managing slides to simultaneous annotation, CellDX has you covered.
A breakthrough in medical AI is coming soon to CellDX.
Built for Every Stakeholder in Pathology.
Pathologists. Save time with intuitive tools and AI-assisted workflows.
Researchers. Access the largest datasets and accelerate discoveries.
Pharma & Biotech. Accelerate biomarker development with AI models trained on diverse patient cohorts.
Trusted by Innovators. Driven by Open Source.
HistAI is the largest open-source contributor in the industry
News & Publications
News
10/8/2025
The largest curated dataset, with more than 100K WSIs, is publicly available.
8/20/2025
The Hibou Family of Pathology Foundation Models, with over 1.5M downloads on Hugging Face, is now available in the Microsoft Azure AI Foundry Model Catalog.
6/3/2025
HistAI has released a landmark open-source dataset of 112,000 whole slide images and fully digitized pathology reports.
Publications
Dmitry Nechaev, Alexey Pchelnikov, Ekaterina Ivanova · arxiv · 5/17/2025
The HISTAI dataset is a large, open-access collection of over 100,000 multimodal whole slide images with rich metadata.
Dmitry Nechaev, Alexey Pchelnikov, Ekaterina Ivanova · arxiv · 4/7/2025
SPIDER is the largest open-access, patch-level dataset spanning multiple organs with expert annotations and strong baseline models.
Dmitry Nechaev, Alexey Pchelnikov, Ekaterina Ivanova · arxiv · 8/20/2024
The Hibou family of vision transformers, pretrained on over 1 million whole slide images.
CellDX Pricing
billed annually
- 10GB Storage included
- 20 AI Widget runs
- Access to proprietary Data Hub (100,000+ WSIs)
- Access to proprietary AI models
- Real-time multi-user collaboration
- Advanced WSI labelling tools
- Optimized performance
- Add-on packs of storage
- Add-on packs of AI Widget runs
- Powerful GPU instances
- Up to unlimited storage, or use of the client's own storage
- Up to unlimited AI Widget runs
- Unlimited seats
- Task management and tracking
- Role-based access
- Download JSON files with manual and automatic annotations
- API access
- Shared Workspace
- Preferred Data Hub pricing
- GPU queue priority
- Custom AI models development
- Custom integrations