Submitted:
12 June 2025
Posted:
12 June 2025
You are already at the latest version
Abstract

Keywords:
Introduction
Literature Review
AI in Dermatology: Benefits and Bias Risks
Dataset Representation Challenges
Limitations of the Fitzpatrick Scale
Alternatives: Monk Skin Tone Scale and Color Space Methods
Augmentation as a Potential Fairness Tool
Fairness-Oriented Model Adjustments
Relevance to This Study
Methods
Data
- md5hash – unique image identifier
- label – disease class
- fitzpatrick_scale – integer (1 to 6 or -1 for missing)
- url – source location of the image
- qc – quality control flag (optional)
- A multi-class disease classification label (20 classes used in this study)
- A Fitzpatrick skin tone label (6 categories used after filtering)
- Fitzpatrick Type II had the most samples (4,808 images)
- Fitzpatrick Type VI had only 635 images
- Fitzpatrick = -1 (missing) occurred in approximately 565 images, which were excluded from the final dataset
- Images labeled with Fitzpatrick = -1 were excluded
- Entries with broken or inaccessible URLs were skipped
- All images were resized to 224×224 pixels
-
Channel-wise normalization was applied using ImageNet means and standard deviations:
- -
- Mean = [0.485, 0.456, 0.406]
- -
- Std = [0.229, 0.224, 0.225]
-
Augmentations were applied during training (not testing), including:
- -
- Random horizontal flip (p = 0.5)
- -
- Random rotation (up to ±15°)
- -
- Random resized cropping
- -
- Occasional brightness and contrast jitter
Training-Time Preprocessing and Augmentation
Predictors and Outcome Measures
- Disease classification — a 20-class multi-class output
- Fitzpatrick skin type classification — a 6-class categorical output
- Per-class precision, recall, and F1-score
- Macro-average ROC AUC
- Fitzpatrick-type-specific accuracy
Model Architecture and Training
- One head for disease classification (20 outputs, softmax activation)
- One head for Fitzpatrick classification (6 outputs, softmax activation)
- Batch size: 32
- Initial learning rate: 0.01
- Momentum: 0.9
- Scheduler: StepLR (step size = 30 epochs, gamma = 0.1)
Evaluation Strategy
Results
Overall Model Performance
Confusion Matrix Analysis

Class-Level Performance
Fitzpatrick Skin Tone Performance
Discussion
Summary of Results
Limitations
Future Directions
Practical Implications
References
- Buolamwini, J., and T. Gebru. 2018. Gender shades: Intersectional accuracy disparities in commercial gender classification. Proceedings of the 1st Conference on Fairness, Accountability and Transparency 81: 77–91. [Google Scholar]
- Daneshjou, R., K. Vodrahalli, W. Liang, R. A. Novoa, M. Jenkins, V. Rotemberg, J. Ko, S. M. Swetter, E. E. Bailey, O. Gevaert, P. Mukherjee, M. Phung, K. Yekrang, B. Fong, R. Sahasrabudhe, J. Zou, and A. Chiou. 2021. Disparities in dermatology ai: Assessments using diverse clinical images [Evaluates state-of-the-art dermatology AI models on diverse clinical images]. arXiv arXiv:2111.08006. [Google Scholar]
- Du, S., B. Hers, N. Bayasi, G. Hamarneh, and R. Garbi. 2023. Fairdisco: Fairer ai in dermatology via disentanglement contrastive learning. Computer Vision–ECCV 2022 Workshops: Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part IV; pp. 185–202. [Google Scholar]
- Groh, M., C. Harris, L. R. Soenksen, F. Lau, D. Kim, A. Abid, and J. Zou. 2022. Evaluating deep neural networks trained on clinical images in dermatology with the fitzpatrick 17k dataset. Scientific Data 9, 1: 1–13. [Google Scholar]
- Guo, L. N. 2021. Algorithmic bias in dermatology: An emerging health equity concern. The Lancet Digital Health 3, 7: e399–e400. [Google Scholar]
- He, K., X. Zhang, S. Ren, and J. Sun. 2016. Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; pp. 770–778. [Google Scholar]
- Hernandez, M. 2023. Ai fairness in dermatology: When more data is not enough. Journal of Biomedical Informatics 135: 104241. [Google Scholar]
- Kalb, T., K. Kushibar, C. Cintas, K. Lekadir, O. Díaz, and R. Osuala. 2023. Revisiting skin tone fairness in dermatological lesion classification. Artificial Intelligence in Medicine 13911: 185–194. [Google Scholar] [CrossRef]
- Lee, C. S., and S. Nagpal. 2022. Skin tone analysis and ai equity: Assessing performance on diverse datasets. npj Digital Medicine 5, 1: 121. [Google Scholar]
- Monk, E. P. 2023. The cost of color: Skin tone, image datasets, and algorithmic bias. Science Advances 9, 5: eadf9380. [Google Scholar]
- Pundhir, P., A. Zhou, and R. Sarkar. 2024. When frequency isn’t enough: Complexity-aware evaluation of dermatology classifiers. Proceedings of the AAAI Conference on Artificial Intelligence 38, 3: 2114–2123. [Google Scholar]



| Class | Precision | Recall | F1-Score | Support |
|---|---|---|---|---|
| erythema annulare centrifigum | 1.000 | 0.923 | 0.960 | 13 |
| tungiasis | 0.917 | 0.917 | 0.917 | 12 |
| hailey hailey disease | 0.826 | 0.950 | 0.884 | 20 |
| pediculosis lids | 0.889 | 0.842 | 0.865 | 19 |
| congenital nevus | 0.889 | 0.800 | 0.842 | 10 |
| hidradenitis | 0.750 | 0.900 | 0.818 | 10 |
| papilomatosis confluentes and reticulate | 0.739 | 0.895 | 0.810 | 19 |
| fordyce spots | 1.000 | 0.667 | 0.800 | 9 |
| folliculitis | 0.821 | 0.742 | 0.780 | 31 |
| myiasis | 0.750 | 0.750 | 0.750 | 4 |
| acrodermatitis enteropathica | 0.778 | 0.700 | 0.737 | 10 |
| nevus sebaceous of jadassohn | 0.667 | 0.800 | 0.727 | 5 |
| disseminated actinic porokeratosis | 0.667 | 0.800 | 0.727 | 5 |
| acne vulgaris | 0.688 | 0.759 | 0.721 | 29 |
| syringoma | 0.750 | 0.692 | 0.720 | 13 |
| mucous cyst | 0.833 | 0.625 | 0.714 | 8 |
| pityriasis rubra pilaris | 0.607 | 0.850 | 0.708 | 20 |
| necrobiosis lipoidica | 0.700 | 0.700 | 0.700 | 10 |
| nematode infection | 0.643 | 0.750 | 0.692 | 24 |
| keloid | 0.625 | 0.769 | 0.690 | 13 |
| fixed eruptions | 0.706 | 0.667 | 0.686 | 18 |
| stasis edema | 0.833 | 0.556 | 0.667 | 9 |
| keratosis pilaris | 0.750 | 0.600 | 0.667 | 10 |
| squamous cell carcinoma | 0.756 | 0.596 | 0.667 | 52 |
| lymphangioma | 0.692 | 0.643 | 0.667 | 14 |
| scleromyxedema | 0.667 | 0.667 | 0.667 | 12 |
| ehlers danlos syndrome | 0.588 | 0.769 | 0.667 | 13 |
| neurofibromatosis | 0.609 | 0.700 | 0.651 | 20 |
| vitiligo | 0.714 | 0.588 | 0.645 | 17 |
| pityriasis rosea | 0.667 | 0.615 | 0.640 | 13 |
| acquired autoimmune bullous diseaseherpes gestationis | 0.000 | 0.000 | 0.000 | 6 |
| striae | 0.000 | 0.000 | 0.000 | 7 |
| basal cell carcinoma morpheiform | 0.000 | 0.000 | 0.000 | 4 |
| xanthomas | 0.000 | 0.000 | 0.000 | 9 |
| dermatomyositis | 0.091 | 0.067 | 0.077 | 15 |
| epidermal nevus | 0.143 | 0.143 | 0.143 | 7 |
| solid cystic basal cell carcinoma | 0.143 | 0.143 | 0.143 | 7 |
| stevens johnson syndrome | 0.167 | 0.143 | 0.154 | 7 |
| calcinosis cutis | 0.111 | 0.250 | 0.154 | 4 |
| sun damaged skin | 0.200 | 0.143 | 0.167 | 7 |
| lupus subacute | 0.222 | 0.143 | 0.174 | 14 |
| behcets disease | 0.143 | 0.250 | 0.182 | 4 |
| lentigo maligna | 0.200 | 0.200 | 0.200 | 5 |
| eczema | 0.273 | 0.176 | 0.214 | 17 |
| pustular psoriasis | 0.200 | 0.250 | 0.222 | 4 |
| dermatofibroma | 0.143 | 0.500 | 0.222 | 2 |
| rosacea | 0.200 | 0.273 | 0.231 | 11 |
| dariers disease | 0.231 | 0.231 | 0.231 | 13 |
| malignant melanoma | 0.250 | 0.250 | 0.250 | 8 |
| epidermolysis bullosa | 0.333 | 0.222 | 0.267 | 9 |
| nevocytic nevus | 0.333 | 0.222 | 0.267 | 9 |
| dyshidrotic eczema | 0.222 | 0.333 | 0.267 | 6 |
| photodermatoses | 0.265 | 0.273 | 0.269 | 33 |
| porphyria | 0.273 | 0.273 | 0.273 | 11 |
| drug eruption | 0.250 | 0.333 | 0.286 | 15 |
| pilomatricoma | 0.500 | 0.200 | 0.286 | 5 |
| livedo reticularis | 0.333 | 0.286 | 0.308 | 7 |
| ichthyosis vulgaris | 0.400 | 0.250 | 0.308 | 8 |
| paronychia | 0.250 | 0.429 | 0.316 | 7 |
| porokeratosis of mibelli | 0.300 | 0.333 | 0.316 | 9 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).