Collections (55)
Name
15 Exemplar Infundibulocystic Basal Cell Carcinomas
This collection contains the 15 images uploaded for "Dermoscopic Features of Infundibulocystic Basal Cell Carcinoma (IBCC): An Observational Study."
2025-11-07 15 images
BCN20000
The BCN20000 dataset consists of dermoscopic images of skin lesions taken between 2010 and 2016 at the Hospital Clínic in Barcelona. It aims to tackle the challenge of classifying dermoscopic images of skin cancer under diverse conditions, including lesions in challenging locations (such as nails and mucosa), large lesions exceeding the dermoscopy device's aperture, and hypo-pigmented lesions. A detailed data descriptor is available: Hernández-Pérez, C., Combalia, M., Podlipnik, S. et al. BCN20000: Dermoscopic Lesions in the Wild. Sci Data 11, 641 (2024). https://doi.org/10.1038/s41597-024-03387-w
2023-06-29 18,946 images
BRAAFF-Annotated Acral Lesions Dataset (BALD)
The BRAAFF-Annotated Acral Lesions Dataset (BALD): A curated set of dermatoscopic images of acral melanoma and nevi from various sources. Müller C, Tschandl P, Rinner C, Kyrgidis A, Koga H, Moscarella E, Apalla Z, Di Stefani A, Kobayashi K, Lazaridou E, Longo C, Phan A, Saida T, Sotiriou E, Tanaka M, Thomas L, Zalaudek I, Argenziano G, Lallas A, Kittler H. The BRAAFF-Annotated Acral Lesions Dataset (BALD): A Curated Set of Dermatoscopic Images of Acral Melanoma and Nevi from Various Sources. J Invest Dermatol. 2025 Jan 17:S0022-202X(25)00021-1 [Additional metadata available here](https://github.com/kittler/BALD)
2024-11-22 666 images
Challenge 2016: Test
Test set from the ISIC 2016 Challenge. The "Skin Lesion Analysis Towards Melanoma Detection" challenge leverages a dataset of annotated skin lesion images from the ISIC Archive, The dataset contains a representative mix of images of both malignant and benign skin lesions.
2021-11-11 379 images
Challenge 2016: Training
Training set from the ISIC 2016 Challenge. The "Skin Lesion Analysis Towards Melanoma Detection" challenge leverages a dataset of annotated skin lesion images from the ISIC Archive, The dataset contains a representative mix of images of both malignant and benign skin lesions.
2021-11-11 900 images
Challenge 2017: Test
Test set from the ISIC 2017 Challenge.
2021-11-11 600 images
Challenge 2017: Training
Training set from the ISIC 2017 Challenge.
2021-11-11 2,000 images
Challenge 2017: Validation
Validation set from the ISIC 2017 Challenge.
2021-11-11 150 images
Challenge 2018: Task 1-2: Test
Test set from the ISIC 2018 Challenge. The lesion images come from the [HAM10000 Dataset](https://doi.org/10.7910/DVN/DBW86T), and were acquired with a variety of [dermatoscope types](https://dermoscopedia.org/Principles_of_dermoscopy), from all anatomic sites (excluding mucosa and nails), from a historical sample of patients presented for skin cancer screening, from several different institutions. Images were collected with approval of the Ethics Review Committee of University of Queensland (Protocol-No. 2017001223) and Medical University of Vienna (Protocol-No. 1804/2017). The distribution of disease states represent a modified "real world" setting whereby there are more benign lesions than malignant lesions, but an over-representation of malignancies. When using the ISIC 2018 datasets in your research, please cite the following works: > [1] Noel Codella, Veronica Rotemberg, Philipp Tschandl, M. Emre Celebi, Stephen Dusza, David Gutman, Brian Helba, Aadi Kalloo, Konstantinos Liopyris, Michael Marchetti, Harald Kittler, Allan Halpern: "Skin Lesion Analysis Toward Melanoma Detection 2018: A Challenge Hosted by the International Skin Imaging Collaboration (ISIC)", 2018; arxiv.org/abs/1902.03368 > > [2] Tschandl, P., Rosendahl, C. & Kittler, H. The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Sci. Data 5, 180161 doi:10.1038/sdata.2018.161 (2018).
2021-11-11 1,000 images
Challenge 2018: Task 1-2: Training
Training set from the ISIC 2018 Challenge. When using the ISIC 2018 datasets in your research, please cite the following works: > [1] Noel Codella, Veronica Rotemberg, Philipp Tschandl, M. Emre Celebi, Stephen Dusza, David Gutman, Brian Helba, Aadi Kalloo, Konstantinos Liopyris, Michael Marchetti, Harald Kittler, Allan Halpern: "Skin Lesion Analysis Toward Melanoma Detection 2018: A Challenge Hosted by the International Skin Imaging Collaboration (ISIC)", 2018; arxiv.org/abs/1902.03368 > > [2] Tschandl, P., Rosendahl, C. & Kittler, H. The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Sci. Data 5, 180161 doi:10.1038/sdata.2018.161 (2018).
2021-11-11 2,594 images
Challenge 2018: Task 1-2: Validation
Validation set from the ISIC 2018 Challenge. When using the ISIC 2018 datasets in your research, please cite the following works: > [1] Noel Codella, Veronica Rotemberg, Philipp Tschandl, M. Emre Celebi, Stephen Dusza, David Gutman, Brian Helba, Aadi Kalloo, Konstantinos Liopyris, Michael Marchetti, Harald Kittler, Allan Halpern: "Skin Lesion Analysis Toward Melanoma Detection 2018: A Challenge Hosted by the International Skin Imaging Collaboration (ISIC)", 2018; arxiv.org/abs/1902.03368 > > [2] Tschandl, P., Rosendahl, C. & Kittler, H. The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Sci. Data 5, 180161 doi:10.1038/sdata.2018.161 (2018).
2021-11-11 100 images
Challenge 2018: Task 3: Test
Test set from the ISIC 2018 Challenge. When using the ISIC 2018 datasets in your research, please cite the following works: > [1] Noel Codella, Veronica Rotemberg, Philipp Tschandl, M. Emre Celebi, Stephen Dusza, David Gutman, Brian Helba, Aadi Kalloo, Konstantinos Liopyris, Michael Marchetti, Harald Kittler, Allan Halpern: "Skin Lesion Analysis Toward Melanoma Detection 2018: A Challenge Hosted by the International Skin Imaging Collaboration (ISIC)", 2018; arxiv.org/abs/1902.03368 > > [2] Tschandl, P., Rosendahl, C. & Kittler, H. The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Sci. Data 5, 180161 doi:10.1038/sdata.2018.161 (2018).
2021-11-11 1,512 images
Challenge 2018: Task 3: Training
Training set from the ISIC 2018 Challenge. When using the ISIC 2018 datasets in your research, please cite the following works: > [1] Noel Codella, Veronica Rotemberg, Philipp Tschandl, M. Emre Celebi, Stephen Dusza, David Gutman, Brian Helba, Aadi Kalloo, Konstantinos Liopyris, Michael Marchetti, Harald Kittler, Allan Halpern: "Skin Lesion Analysis Toward Melanoma Detection 2018: A Challenge Hosted by the International Skin Imaging Collaboration (ISIC)", 2018; arxiv.org/abs/1902.03368 > > [2] Tschandl, P., Rosendahl, C. & Kittler, H. The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Sci. Data 5, 180161 doi:10.1038/sdata.2018.161 (2018).
2021-11-11 10,015 images
Challenge 2018: Task 3: Validation
Validation set from the ISIC 2018 Challenge. When using the ISIC 2018 datasets in your research, please cite the following works: > [1] Noel Codella, Veronica Rotemberg, Philipp Tschandl, M. Emre Celebi, Stephen Dusza, David Gutman, Brian Helba, Aadi Kalloo, Konstantinos Liopyris, Michael Marchetti, Harald Kittler, Allan Halpern: "Skin Lesion Analysis Toward Melanoma Detection 2018: A Challenge Hosted by the International Skin Imaging Collaboration (ISIC)", 2018; arxiv.org/abs/1902.03368 > > [2] Tschandl, P., Rosendahl, C. & Kittler, H. The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Sci. Data 5, 180161 doi:10.1038/sdata.2018.161 (2018).
2021-11-11 193 images
Challenge 2019: Test
Test set from the ISIC 2019 Challenge.
2021-11-11 8,238 images
Challenge 2019: Training
Training set from the ISIC 2019 Challenge.
2021-11-11 25,331 images
Challenge 2020: Test
Evaluation set from the ML challenge: [SIIM-ISIC Melanoma Classification](https://www.kaggle.com/c/siim-isic-melanoma-classification/leaderboard).
2021-11-11 10,982 images
Challenge 2020: Training
2021-11-11 33,126 images
Challenge 2024: Training
Official collection
2024-05-09 401,059 images
Collection for ISBI 2016: 100 Lesion Classification
2022-02-23 100 images
Consecutive biopsies for melanoma across year 2020
Consecutive biopsies of lesions with nevus, melanoma, lentigo, etc. in the clinical OR histologic diagnosis at Memorial Sloan Kettering Cancer Center between 1/1/2020 and 12/31/2020.
2022-06-09 1,295 images
Consumer AI apps
A small study evaluated the accuracy of smartphone and web-based dermatology apps offering AI diagnostics using an independent test set of clinical images. Sun MD, Kentley J, Mehta P, Dusza S, Halpern AC, Rotemberg V. Accuracy of commercially available smartphone applications for the detection of melanoma. Br J Dermatol. 2022;186(4):744-746. doi:10.1111/bjd.20903
2021-11-12 35 images
DERM12345
Skin lesion datasets provide essential information for understanding various skin conditions and developing effective diagnostic tools. They aid the artificial intelligence-based early detection of skin cancer, facilitate treatment planning, and contribute to medical education and research. Published large datasets have partially coverage the subclassifications of the skin lesions. This limitation highlights the need for more expansive and varied datasets to reduce false predictions and help improve the failure analysis for skin lesions. This study presents a diverse dataset comprising 12,345 dermatoscopic images with 40 subclasses of skin lesions, collected in Turkiye, which comprises different skin types in the transition zone between Europe and Asia. Each subgroup contains high-resolution images and expert annotations, providing a strong and reliable basis for future research. The detailed analysis of each subgroup provided in this study facilitates targeted research endeavors and enhances the depth of understanding regarding the skin lesions. This dataset distinguishes itself through a diverse structure with its 5 super classes, 15 main classes, 40 subclasses and 12,345 high-resolution dermatoscopic images. Yilmaz, A., Yasar, S.P., Gencoglan, G. et al. DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 40 Subclasses. Sci Data 11, 1302 (2024). [https://doi.org/10.1038/s41597-024-04104-3](https://doi.org/10.1038/s41597-024-04104-3)
2024-09-10 12,345 images
EASY Dermoscopy Expert Agreement Study
A collection of 248 melanocytic lesions that were submitted by experts as exemplars for 1 out of 31 dermoscopic features (8 images per feature), and used for evaluating agreement among (unrelated) experts on (1) malignancy, (2) feature presence, and (3) feature localization within a lesion. The repository of image masks and superpixel annotations is here: https://github.com/ISIC-Research/expert-annotation-agreement-data
2022-08-05 248 images
HAM10000
Dermatoscopic images of the most common classes of pigmented skin lesions: Pigmented Actinic Keratoses / Bowen's disease, Basal Cell Carcinoma, Benign Keratoses (Seborrheic Keratosis, Solar Lentigo and Lichen-Planus Like Keratosis), Dermatofibroma, Melanocytic Nevi, Melanoma and Vascular lesions. Images are made available in preparation for the "Human-Against-Machine with 10000 training images" study, and originate mainly from the ViDIR Group (Department of Dermatology, Medical University of Vienna) and a skin cancer office in Australia (School of Medicine, University of Queensland). Data is provided under the CC BY-NC 4.0 license, attribution should be made by referencing the data descriptor manuscript: Tschandl, P., Rosendahl, C. & Kittler, H. The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Sci. Data 5, 180161 doi:10.1038/sdata.2018.161 (2018).
2023-04-06 11,720 images
HIBA Skin Lesions
A dataset of clinical and dermoscopy images of skin lesions collected in Argentina as a reference for the evaluation of AI tools in this population
2023-03-23 1,635 images
Hospital Italiano de Buenos Aires Skin Lesions
A dataset of clinical and dermoscopy images of skin lesions collected in Argentina as a reference for the evaluation of AI tools in this population
2023-03-23 1,635 images
Hospital Italiano de Buenos Aires - Skin Lesions Images (2019-2022)
A dataset of clinical and dermoscopy images of skin lesions collected in Argentina as a reference for the evaluation of AI tools in this population. Supplementary data and an exploratory analysis of the data are publicly available at https://github.com/piashiba/HIBASkinLesionsDataset.
2023-08-29 1,616 images
IMA++
This collection contains all the images associated with the [IMA++ dataset](https://doi.org/10.5281/zenodo.14201692). The **IMA++ dataset** is the largest publicly available multi-annotator skin lesion segmentation (SLS) dataset, collected from the ISIC Archive to facilitate skin lesion image segmentation research. It contains **17,684 segmentation masks** spanning **14,967 dermoscopic images**, where **2,394 dermoscopic images have 2-5 segmentations per image** from the ISIC Archive, annotated by **16 distinct annotators**, with at least one annotation per image. The dataset captures a wide range of segmentation styles influenced by annotator expertise, tools used, and manual review processes, making it a valuable resource for developing and evaluating SLS models.
2026-03-05 14,967 images
ISIC Balanced
These are the images used in the paper: Analysis of the ISIC image datasets: Usage, benchmarks and recommendations Paper Link: https://www.sciencedirect.com/science/article/pii/S1361841521003509 They have also been used by newer versions such as: Skin Lesion Classification Using Dermoscopic Images and Clinical Metadata: Insights from Multimodal Models Paper Link: https://openaccess.thecvf.com/content/CVPR2025W/MULA2025/papers/Ahammed_Skin_Lesion_Classification_Using_Dermoscopic_Images_and_Clinical_Metadata_Insights_CVPRW_2025_paper.pdf https://api.isic-archive.com/collections/469/
2025-11-06 9,810 images
ISIC-DICM-17K
ISIC-DICM-17K (ISIC Dermoscopic Images and Clinical Metadata 17K) is a curated and balanced dataset derived from the International Skin Imaging Collaboration (ISIC) Archive Gallery. It comprises 17,060 dermoscopic images and clinical metadata (8,530 melanoma and 8,530 non-melanoma classes). For more details, please follow the project’s GitHub repository: [https://github.com/mmu-dermatology-research/isic-dicm-17k](https://github.com/mmu-dermatology-research/isic-dicm-17k) This dataset was used in this study and benchmark to explore the effectiveness of multimodal learning for skin lesion classification: S. Ahammed, X. Cui, W. Lu and M. H. Yap, "Skin Lesion Classification using Dermoscopic Images and Clinical Metadata: Insights from Multimodal Models," 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Nashville, TN, USA, 2025, pp. 222-230, DOI: 10.1109/CVPRW67362.2025.00027
2025-10-03 17,060 images
iToBoS 2024 - Skin Lesion Detection with 3D-TBP
## The iToBoS dataset: skin region images extracted from 3D total body photographs for lesion detection The early detection of skin cancer is critical for improving patient outcomes. Traditionally, dermatologists rely on dermoscopy to examine pigmented skin lesions. While this non-invasive technique enhances diagnostic accuracy, its effectiveness is highly dependent on the clinician’s expertise. Additionally, capturing dermoscopic images for every suspicious lesion is a labor-intensive process. Given these challenges, there is an increasing need for computer-aided diagnosis (CAD) systems that utilize conventional cameras. Such systems can support general physicians and other non-specialist practitioners in identifying potential malignant lesion, improving early detection and intervention. Moreover, they facilitate longitudinal tracking of lesions, aiding researchers in studying disease progression and treatment efficacy. This dataset provides high-resolution skin patch images extracted from 3D total body photographs to support the development of advanced machine learning models for lesion detection. It serves as a valuable resource for researchers working on automated skin lesion analysis, particularly in the context of total body photography (TBP). This dataset contains 59,997 lesion identifying regions-of-interest (ROIs) embedded in 16,954 images stemming from 100 patients. ## Dataset Description: The iToBoS dataset consists of 16,954 high-resolution images of skin regions obtained from anonymized 3D avatars of patients. These avatars were generated using the Canfield VECTRA WB360 system, a cutting-edge imaging technology that captures comprehensive, full-body skin images using 92 fixed cameras arranged in 46 stereo pairs with xenon flash lighting. The images were collected from patients at two clinical sites: the Clinical Hospital of Barcelona (Spain) (n=7,729) and the University of Queensland (Australia) (n=9,225). The dataset provides diverse anatomical locations, including the torso, arms, and legs, with each image having an average resolution of 1012x827 pixels and a 45-pixel overlap between adjacent images. The images are extracted from 3D avatars while ensuring compliance with GDPR regulations by automatically removing patient facial features. Each image is accompanied by metadata, including patient age range, body location, and sun damage score, allowing for in-depth analysis and stratification. ## Significance of the Dataset: 1. **Facilitates Automated Skin Lesion Detection:** The dataset supports the development of AI-based lesion detection models that can improve early diagnosis of skin cancer, particularly in regions with limited access to dermatological expertise. 2. **Supports Total Body Photography Research:** Leveraging 3D TBP for lesion detection is an emerging field, and this dataset provides a benchmark for further exploration. 3. **Enhances Machine Learning Applications:** The dataset serves as a benchmark for developing state-of-the-art computer vision and deep learning models for detection of skin lesions. ## PNG Format The image files were originally captured in PNG format, but are published here in compressed JPEG format. Our internal testing indicates that over 97% of the JPEG images achieve a PSNR greater than 35dB when compared to the original PNG versions, while being only ~6% of the original dataset size. Additionally, the original PNG files are available [on Figshare](https://figshare.com/articles/dataset/iToBoS_2024_-_Skin_Lesion_Detection_with_3D-TBP/28452545). ## Funding EC | EU Framework Programme for Research and Innovation H2020 | H2020 Priority Societal Challenges | H2020 Health (H2020 Societal Challenges - Health, Demographic Change and Well-being) - SC1-BHC-06-2020-965221
2025-05-15 16,954 images
Longitudinal Images with Various Types for Lesion Viewer Testing
127 dermoscopic and clinical images matched to already published longitudinal images in the Archive
2024-04-18 127 images
Longitudinal overview images of posterior trunks
Digital camera photographs of the posterior trunk of patients at risk for melanoma, taken at UPMC Hillman Cancer Center as part of the 96-099 image/tissue banking protocol. Images from "Development and narrow validation of computer vision approach to facilitate assessment of change in pigmented cutaneous lesions," corresponding author John M Kirkwood.
2022-09-06 36 images
Melanocytic lesions used for dermoscopic feature annotations
248 dermoscopic images of melanocytic lesions (nevi and melanomas). The images were selected for exemplars of specific features, were contributed by 21 dermoscopy experts from around the world, and are comprised of 113 nevi and 133 melanomas (plus 1 AIMP and 1 dermatofibroma).
2022-02-16 248 images
Melanoma and Nevus Dermoscopy Images with Confirmed Histopathological Diagnosis
Collection defined by Dr. Jorge A. Rios-Duarte for a research project.
2023-11-14 18,133 images
MEL-SELF - Dermoscopic
Dermoscopic lesion images (close-up views of benign and malignant lesions) from the MEL-SELF trial (the Melanoma Self Surveillance trial).
2026-03-30 3,008 images
MILK10k
*MILK10k* consists of 10480 images, each representing a paired clinical close-up and dermatoscopic image for 5240 lesions. The dataset’s metadata include age (in 5-year intervals), sex, anatomic site, skin tone, diagnosis, method of ground truth establishment (histopathology or other means), and, if a dermatoscopic image of the same lesion was previously included in ISIC, its corresponding ISIC identifier. Skin tone is categorized into six levels, ranging from very dark (0) to very light (5), intentionally distinct from the Fitzpatrick skin types to avoid confusion. Most patients had skin tones in the middle ranges. Of the 5240 lesions, 95.7% were biopsied or excised, with histopathology serving as the gold standard for diagnosis. Diagnoses were mapped to both the ISIC-Dx diagnostic scheme and a simplified classification based on the ISIC2018/2019 challenge and HAM10000 diagnostic categories. The dataset includes 11 broad diagnostic categories: 1. Basal cell carcinoma (bcc) 2. Melanocytic nevus (nv) 3. Benign keratinocytic lesion (bkl) 4. Squamous cell carcinoma/keratoacanthoma (sccka) 5. Melanoma (mel) 6. Actinic keratosis/intraepidermal carcinoma (akiec) 7. Dermatofibroma (df) 8. Inflammatory and infectious conditions (inf) 9. Vascular lesions and hemorrhage (vasc) 10. Other benign proliferations including collision tumors (ben_oth) 11. Other malignant proliferations including collision tumors (mal_oth) Additionally, we provide the most specific ISIC-Dx diagnosis and its parent branch in the ISIC-Dx diagnostic tree. In cases where a dermatoscopic image of the same lesion was already included in the ISIC archive, its ISIC identifier is reported in the metadata. Furthermore, all images have been annotated using the MONET framework, with probabilities for the following concept term groups included in the metadata: 1. Ulceration, crust 2. Hair 3. Vasculature, vessels 4. Erythema 5. Pigmentation 6. Gel, water drop, fluid, dermoscopy liquid 7. Skin markings, pen ink, purple pen In addition to *MILK10k*, we have curated a smaller benchmark dataset, called *MILK10k Benchmark* derived from the same sources and covering the same diagnostic categories. This dataset is available as part of a live challenge within the ISIC framework and can be accessed on ISIC. Images were provided by the following institutions: - Department of Dermatology, Medical University of Vienna, Vienna, Austria - Medicine Faculty Department of Dermatology, Ankara University, Ankara, Turkey - Mayne Academy of General Practice, Medical School, The University of Queensland, Australia - Dermatology Service, Memorial Sloan Kettering Cancer Center, New York, USA - Independent Researcher, 1000 Skopje, North Macedonia
2025-03-31 10,480 images
MILK10k Benchmark
*MILK10k Benchmark* consists of paired clinical close-up and dermatoscopic image for a set of lesions. The dataset’s metadata include age (in 5-year intervals), sex, anatomic site, and skin tone. Skin tone is categorized into six levels, ranging from very dark (0) to very light (5), intentionally distinct from the Fitzpatrick skin types to avoid confusion. Most patients had skin tones in the middle ranges. Diagnoses were mapped to a simplified classification based on the ISIC2018/2019 challenge and HAM10000 diagnostic categories. The dataset includes 11 broad diagnostic categories: 1. Basal cell carcinoma (bcc) 2. Melanocytic nevus (nv) 3. Benign keratinocytic lesion (bkl) 4. Squamous cell carcinoma/keratoacanthoma (sccka) 5. Melanoma (mel) 6. Actinic keratosis/intraepidermal carcinoma (akiec) 7. Dermatofibroma (df) 8. Inflammatory and infectious conditions (inf) 9. Vascular lesions and hemorrhage (vasc) 10. Other benign proliferations including collision tumors (ben_oth) 11. Other malignant proliferations including collision tumors (mal_oth) Although these broad diagnostic categories align with those in MILK10k, there can be different underlying granular diagnoses, primarily in the broad categories “other benign” and “other malignant proliferations”. Furthermore, all images have been annotated using the MONET framework, with probabilities for the following concept term groups included in the metadata: 1. Ulceration, crust 2. Hair 3. Vasculature, vessels 4. Erythema 5. Pigmentation 6. Gel, water drop, fluid, dermoscopy liquid 7. Skin markings, pen ink, purple pen *MILK10k Benchmark* is the accompanying test set to the *MILK10k* dataset and covers the same diagnostic categories. *MILK10k* is available on the ISIC Archive. Images were provided by the following institutions: - Department of Dermatology, Medical University of Vienna, Vienna, Austria - Medicine Faculty Department of Dermatology, Ankara University, Ankara, Turkey - Mayne Academy of General Practice, Medical School, The University of Queensland, Australia - Dermatology Service, Memorial Sloan Kettering Cancer Center, New York, USA - Independent Researcher, 1000 Skopje, North Macedonia
2025-03-31 958 images
MSK-1
Both benign and malignant melanocytic lesions. Almost all diagnoses were confirmed by histopathology reports; the remainder consists of benign lesions confirmed by clinical follow-up. Images were not taken with modern digital cameras.
2015-06-30 1,678 images
MSK-2
Biopsy-confirmed melanocytic and non-melanocytic skin lesions. This dataset includes over 500 melanomas. Many images have polarized and contact variants.
2015-06-26 4,880 images
MSK-3
Assorted images and lesions, mostly nevi and basal cell carcinomas. These images were found based on a search not filtered for any particular pathology. All diagnoses confirmed by histopathology.
2016-09-30 466 images
MSK-4
Images found based on a search for patients with a personal history, clinical diagnosis, or differential diagnosis of melanoma. All diagnoses confirmed by histopathology.
2016-11-04 2,050 images
MSK-5
Seborrheic keratoses obtained from patients during a clinical visit. These lesions were not biopsied and were determined to be seborrheic keratoses by agreement of three experts.
2016-11-11 111 images
MSKCC Consecutive biopsies across year 2020_cohort
Consecutive biopsies of lesions with nevus, melanoma, lentigo, etc. in the clinical OR histologic diagnosis at MSKCC between 1/1/2020 and 12/31/2020.
2022-06-09 1,295 images
MSKCC Skin Tone Labeling Dataset
This dataset contains detailed skin tone annotations collected from a prospective, single-center observational study performed at Memorial Sloan Kettering Cancer Center from 2023-2024. The cohort consists of 64 adult patients who underwent full-body skin examinations by board-certified dermatologists. To ensure diverse representation across the spectrum of skin tones, patients were recruited to achieve a balanced distribution across all six Fitzpatrick Skin Types. This dataset was developed to evaluate the reliability of different skin tone labeling methods and to support fairness research in dermatologic AI. The dataset comprises both patient-level and site-level metadata for skin tone classification using the Fitzpatrick Skin Type scale, Monk Skin Tone scale, Pantone SkinTone Guide, and colorimeter readings (SkinColorCatch, Delfin Technologies). A total of 4,879 dermoscopic images are included. Skin tone assessments were collected across both lesional and non-lesional (normal skin) sites, mapped to standardized anatomic locations. All skin lesions are assumed to be benign, as they were imaged immediately following dermatologic evaluation. All data were collected under an IRB-approved protocol with informed consent. The dataset has been fully de-identified in accordance with HIPAA regulations, and no protected health information (PHI) is included.
2024-12-23 4,879 images
My
2025-03-24 18,946 images
Newly-acquired and longer-existing acquired melanoma and nevi
A collection of dermoscopic images of biopsied melanocytic lesions. This was used to test for performance degradation of an AI melanoma classifier in discriminating melanomas from nevi among newly-acquired melanocytic lesions versus longer-existing acquired melanocytic lesions.
2023-05-18 187 images
PAD-UFES-20
Full name: PAD-UFES-20: a skin lesion dataset composed of patient data and clinical images collected from smartphones Full data descriptor published by Pacheco et al. at https://doi.org/10.1016/j.dib.2020.106221 Pacheco AGC, Lima GR, Salomão AS, et al. PAD-UFES-20: A skin lesion dataset composed of patient data and clinical images collected from smartphones. Data Brief. 2020;32:106221. Published 2020 Aug 25. doi:10.1016/j.dib.2020.106221 Summary description - The PAD-UFES-20 dataset was collected along with the Dermatological and Surgical Assistance Program (in Portuguese: Programa de Assistência Dermatológica e Cirurgica - PAD) at the Federal University of Espírito Santo (UFES-Brazil), which is a nonprofit program that provides free skin lesion treatment, in particular, to low-income people who cannot afford private treatment. - The dataset consists of 2,298 samples of six different types of skin lesions. Each sample consists of a clinical image and up to 22 clinical features including the patient's age, skin lesion location, Fitzpatrick skin type, and skin lesion diameter. - The skin lesions are: Basal Cell Carcinoma (BCC), Squamous Cell Carcinoma (SCC), Actinic Keratosis (ACK), Seborrheic Keratosis (SEK), Bowen’s disease (BOD), Melanoma (MEL), and Nevus (NEV). As the Bowen’s disease is considered SCC in situ, we clustered them together, which results in six skin lesions in the dataset, three skin cancers (BCC, MEL, and SCC) and three skin disease (ACK, NEV, and SEK) - All BCC, SCC, and MEL are biopsy-proven. The remaining ones may have clinical diagnosis according to a consensus of a group of dermatologists. In total, approximately 58% of the samples in this dataset are biopsy-proven. This information is described in the metadata. - The images present in the dataset have different sizes because they are collected using different smartphone devices. All images are available in .png format. - The metadata associated with each skin lesion is composed of up to 26 features. All features are available in a CSV document in which each line represents a skin lesion and each column a metadata feature. - In total, there are 1,373 patients and 2,298 images present in the dataset. Each image/sample has a reference to the patient and the skin lesion in the metadata. Ethics statement The dataset was collected along with the Dermatological and Surgical Assistance Program (PAD) of the Federal University of Espírito Santo. The program is managed by the Department of Specialized Medicine and was approved by the university ethics committee (nº 500002/478) and the Brazilian government through Plataforma Brasil (nº 4.007.097), the Brazilian agency responsible for research involving human beings. In addition, all data is collected under patient consent and the patient’s privacy is completely preserved. Data is also available on Mendeley at https://doi.org/10.17632/zr7vgbcyr2.1
2024-11-05 2,298 images
PROVe-AI
We conducted a prospective, observational clinical validation study to assess the diagnostic accuracy of the AI algorithm (ADAE) in predicting melanoma from dermoscopy skin lesion images. Patients who had consented for a skin biopsy to exclude melanoma were eligible. All lesions underwent biopsy.
2022-11-14 603 images