Datasets

The MIDOG challenge allows the use of all publicly available datasets for training or model selection. However, we provide some notable datasets that participants should be aware of:

The MIDOG++ dataset contains mitosis annotations for 11,937 mitotic figures across 503 individual tumor cases, spanning 7 different tumor types. It is the largest dataset of its kind to date.

The MITOS_WSI_CMC dataset contains whole slide image (WSI) annotations for mitosis in canine breast cancer. In total, 13,907 mitotic figure annotations are provided across 21 WSIs.

The MITOS_WSI_CCMCT dataset contains whole slide image annotations for mitosis in canine mast cell tumor. In total, 44,880 mitotic figure objects were annotated across 32 whole slide images.

The AMi-Br dataset containts atypical mitotic figure subclassification for the MIDOG 2021 and the TUPAC16 challenge datasets. The dataset provides the original coordinates but also patches extracted around the objects. In total, 3,720 annotations are provided, split into 832 atypical and 2,888 normal mitotic figures.

The MIDOG 2025 Atypical Training Set contains atypical mitotic figure subclassification for the entire MIDOG++ dataset, encompassing 11,939 mitotic figures from all 7 domains of MIDOG++ for which we provide a three expert blinded majority vote result.

Downloads are available from multiple sources:
– deepmicroscopy.org [images] [labels]
– google drive [images+labels]
– more coming up