MILK10k Benchmark consists of paired clinical close-up and dermatoscopic image for a set of lesions. The dataset’s metadata include age (in 5-year intervals), sex, anatomic site, and skin tone. Skin tone is categorized into six levels, ranging from very dark (0) to very light (5), intentionally distinct from the Fitzpatrick skin types to avoid confusion. Most patients had skin tones in the middle ranges. Diagnoses were mapped to a simplified classification based on the ISIC2018/2019 challenge and HAM10000 diagnostic categories. The dataset includes 11 broad diagnostic categories:
Although these broad diagnostic categories align with those in MILK10k, there can be different underlying granular diagnoses, primarily in the broad categories “other benign” and “other malignant proliferations”.
Furthermore, all images have been annotated using the MONET framework, with probabilities for the following concept term groups included in the metadata:
MILK10k Benchmark is the accompanying test set to the MILK10k dataset and covers the same diagnostic categories. MILK10k is available on the ISIC Archive.
Images were provided by the following institutions: