Submitted:
23 August 2025
Posted:
26 August 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
Data Collection
- Cytological Changes (CC) – including features such as clear cell cytoplasm, oncocytic transformation, granular cell transformation, and eosinophilic cytoplasmic inclusion bodies.
- Architectural Changes (A) – comprising suprabasal melanocytes, pseudogranulomatous structures, plexiform arrangements, and angioadnexocentric patterns.
- Changes in the Extracellular Matrix (CEM) – including increased elastic fiber prominence, osseous metaplasia (Osteonevus of Nanta), and mucin deposition.
- Changes Imitating Non-Melanocytic Components (CINC) – such as pseudolacunae, Pseudo Dabska-like patterns, neurotization (C-cell and pseudomeissnerian types), lipidization, and glandular/tubular-like formations.
- Interactions with Adjacent Structures (IAS) – including epidermal interactions (IAS-E), folliculitis, and cystic formations (epidermal, dermal, or trichilemmal).
3. Results
| Predictor | Melanocitic lesion | N | Q1/Med/Q3 | AM± SD | MW, p value | Squared correlation ratio |
|---|---|---|---|---|---|---|
| Age | 182 | 33.00 / 42.00 / 58.00 | 45.51 ± 18.10 | |||
| M | 42 | 55.75 / 66.50 / 74.75 | 66.48 ± 12.41 | < 0.0001 | 0.4048 | |
| N | 140 | 30.00 / 37.00 / 48.00 | 39.21 ± 14.44 | |||
| dh (cm) | 182 | 0.50 / 0.80 / 1.20 | 1.00 ± 0.73 | |||
| M | 42 | 1.20 / 1.55 / 2.45 | 1.80 ± 0.85 | < 0.0001 | 0.3554 | |
| N | 140 | 0.40 / 0.70 / 1.00 | 0.76 ± 0.49 | |||
| dv (cm) | 182 | 0.20 / 0.40 / 0.58 | 0.42 ± 0.27 | |||
| M | 42 | 0.30 / 0.50 / 0.60 | 0.51 ± 0.30 | 0.0171 | 0.0306 | |
| N | 140 | 0.20 / 0.35 / 0.50 | 0.39 ± 0.26 | |||
| Predictor | Melanocitic lesion | Significance by cell(Fisher's exact test) | χ2 test |
Assosiation coefficients | |||
|---|---|---|---|---|---|---|---|
| Melanoma (M) | Nevi (N) | ||||||
| Frequency (Proportion) | Frequency (Proportion) | M | N | p-value | |||
| Gender | Female | 17 (0.093) | 109 (0.599) | < | > |
< 0.0001 |
Odds ratio 0.193 [0.094;0.400] |
| Male | 25 (0.137) | 31 (0.170) | > | < | |||
| Location | 0 | 3 (0.016) | 24 (0.132) | < | 0.0030 |
Cramer's V 0.2967 |
|
| 1 | 10 (0.055) | 55 (0.302) | < | ||||
| 2 | 3 (0.016) | 5 (0.027) | |||||
| 3 | 19 (0.104) | 52 (0.286) | |||||
| 4 | 7 (0.038) | 4 (0.022) | > | < | |||
| Predictor | Category | Melanocitic Lesion | Significance by Cell(Fisher's Exact Test) | χ2 test | Odds Ratio [95% CI] | ||
|---|---|---|---|---|---|---|---|
| Melanoma (M) | Nevi (N) | ||||||
| Frequency (Proportion) | Frequency (Proportion) | M | N | p-Value | |||
| CC | 0 | 24 (0.132) | 132 (0.725) | < | > | < 0.0001 | 0.081 [0.032;0.203] |
| 1 | 18 (0.099) | 8 (0.044) | > | < | |||
| AA | 0 | 27 (0.148) | 78 (0.429) |
0.3241 |
1.431 [0.707;2.898] | ||
| 1 | 15 (0.082) | 62 (0.341) | |||||
| CINC-L | 0 | 42 (0.231) | 116 (0.637) | > | < |
0.0040 |
|
| 1 | 0 (0.000) | 24 (0.132) | < | > | |||
| CINC-T | 0 | 42 (0.231) | 110 (0.604) | > | < | 0.0010 |
|
| 1 | 0 (0.000) | 30 (0.165) | < | > | |||
| CEM-BL | 0 | 17 (0.093) | 11 (0.060) | > | < |
< 0.0001 |
7.975 [3.386;18.782] |
| 1 | 25 (0.137) | 129 (0.709) | < | > | |||
| CEM-TL | 0 | 27 (0.148) | 26 (0.143) | > | < |
< 0.0001 |
7.892 [3.719;16.747] |
| 1 | 15 (0.082) | 114 (0.626) | < | > | |||
| CEM-S | 0 | 23 (0.126) | 28 (0.154) | > | < |
< 0.0001 |
4.842 [2.339;10.024] |
| 1 | 19 (0.104) | 112 (0.615) | < | > | |||
| IAS-F | 0 | 31 (0.170) | 113 (0.621) |
0.3342 |
0.673 [0.305;1.489] | ||
| 1 | 11 (0.060) | 27 (0.148) | |||||
| IAS-T | 0 | 38 (0.209) | 134 (0.736) |
0.1914 |
0.425 [0.121;1.491] | ||
| 1 | 4 (0.022) | 6 (0.033) | |||||
| IAS-E | 0 | 39 (0.214) | 69 (0.379) | > | < |
< 0.0001 |
13.377 [4.270;41.903] |
| 1 | 3 (0.016) | 71 (0.390) | < | > | |||
| PIT | 0 | 40 (0.220) | 102 (0.560) | > | < |
0.0021 |
7.451 [1.971;28.170] |
| 1 | 2 (0.011) | 38 (0.209) | < | > | |||
Machine Learning Models




| Sensitivity | Specificity | Pos Pred Value | Neg Pred Value | F1 | Balanced Accuracy | |
|---|---|---|---|---|---|---|
| Training set | 0.833 | 0.990 | 0.962 | 0.951 | 0.893 | 0.912 |
| Test set | 0.750 | 0.976 | 0.900 | 0.932 | 0.893 | 0.863 |
| Predicted | |||
|---|---|---|---|
| M | N | ||
| Training set | M | 25 | 1 |
| N | 5 | 97 | |
| Test set | M | 9 | 1 |
| N | 3 | 41 | |

4. Discussion
5. Conclusions
Abbreviations
| ML | Machine Learning |
| GLMNET | Elastic Net Regression |
| RF | Random Forest |
| PLS | Partial Least Squares |
| KNN | k-Nearest Neighbors |
| CTREE | Conditional Inference Trees |
| AUC | Area Under the Receiver Operating Characteristic Curve |
| OR | Odds Ratio |
| CC | Cytological Changes |
| AA | Architectural Changes |
| CEM | Changes in the Extracellular Matrix |
| CINC | Changes Imitating Non-Melanocytic Components |
| IAS | Interactions with Adjacent Structures |
| PIT | Pityrosporum |
| DH | Horizontal Diameter |
| DV | Vertical Diameter |
| SD | Standard Deviation |
| Q1 / Q3 | First and Third Quartile |
| CI | Confidence Interval |
| FFPE | Formalin-Fixed Paraffin-Embedded |
| H&E | Hematoxylin and Eosin |
| AM | Arithmetic Mean |
| PPV | Positive Predictive Value |
| NPV | Negative Predictive Value |
| F1 | F1-Score |
| ROC | Receiver Operating Characteristic |
| IQR | Interquartile Range |
| XLSTAT | XLSTAT Pro statistical software |
| Jamovi | Jamovi statistical software |
| SnowCluster | SnowCluster module for Jamovi |
References
- Siegel, R. L. , Miller, K. D., Fuchs, H. E., & Jemal, A. (2023). Cancer statistics, 2023. CA: A Cancer Journal for Clinicians, 73. [CrossRef]
- Leiter, U. , Keim, U., Eigentler, T., Katalinic, A., Holleczek, B., Martus, P.,... Garbe, C. (2020). Incidence, mortality, and trends of non melanoma skin cancer in Germany. Journal of Investigative Dermatology 140(3), 579–588. [CrossRef]
- Leiter, U. , Keim, U., & Garbe, C. (2020). Epidemiology of skin cancer: Update 2020. Advances in Experimental Medicine and Biology, 1268, 123–139. [CrossRef]
- Marchetti MA, Liopyris K, Dusza SW, Codella NCF, Gutman DA, Helba B, Kalloo A, Halpern AC; International Skin Imaging Collaboration. Computer algorithms show potential for improving dermatologists' accuracy to diagnose cutaneous melanoma: Results of the International Skin Imaging Collaboration 2017. J Am Acad Dermatol. 2020 Mar;82(3):622-627. [CrossRef] [PubMed]
- Waqar S, George S, Jean-Baptiste W, Yusuf Ali A, Inyang B, Koshy FS, George K, Poudel P, Chalasani R, Goonathilake MR, Mohammed L. Recognizing Histopathological Simulators of Melanoma to Avoid Misdiagnosis. Cureus. 2022 Jun 20;14(6):e26127. [CrossRef] [PubMed]
- Ahmed Alsayyah, Differentiating between early melanomas and melanocytic nevi: A state-of-the-art review, Pathology - Research and Practice, Volume 249, 2023, 154734, ISSN 0344-0338. [CrossRef]
- Kassem, M. A. , Hosny, K. M., Damaševičius, R., & Eltoukhy, M. M. (2021). Machine Learning and Deep Learning Methods for Skin Lesion Classification and Diagnosis: A Systematic Review. Diagnostics, 11, 8, 1390. [CrossRef]
- Brinker, T. J. , Hekler, A., Enk, A. H., Berking, C., Haferkamp, S., Hauschild, A.,... von Kalle, C. (2019). Deep learning outperformed 136 of 157 dermatologists in a head-to-head dermoscopic melanoma image classification task. European Journal of Cancer, 113, 47–54. [CrossRef]
- Tschandl, P. , Rinner, C., Apalla, Z., Argenziano, G., Codella, N., Halpern, A.,... Kittler, H. (2020). Human–computer collaboration for skin cancer recognition. Nature Medicine, 26, 1229–1234. [CrossRef]
- Bancroft, J. D. , & Gamble, M. (2008). Theory and practice of histological techniques (6th ed.). Churchill Livingston.
- Suvarna, S. K. , Layton, C., & Bancroft, J. D. (2019). Bancroft’s theory and practice of histological techniques.
- Addinsoft. (2024). XLSTAT statistical and data analysis solution (Version 2024.1) [Computer software]. https://www.xlstat.com.
- The Jamovi Project. (2023). Jamovi (Version 2.4) [Computer software]. https://www.jamovi.org.
- Ratner, B. (2023). SnowCluster: Machine learning module for Jamovi [Computer software]. https://www.jamovi.org.
- Wold, S. , Sjöström, M., & Eriksson, L. (2001). PLS-regression: A basic tool of chemometrics. Chemometrics and Intelligent Laboratory Systems, 58, 2, 109–130. [CrossRef]
- Hothorn, T. , Hornik, K., & Zeileis, A. (2006). Unbiased recursive partitioning: A conditional inference framework. Journal of Computational and Graphical Statistics, 15, 3, 651–674. [CrossRef]
- Breiman, L. (2001). Random forests. Machine Learning, 45, 1, 5–32. [CrossRef]
- Zou, H. , & Hastie, T. (2005). Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 67, 2, 301–320. [CrossRef]
- Cover, T. , & Hart, P. (1967). Nearest neighbor pattern classification. IEEE Transactions on Information Theory 13(1), 21–27. [CrossRef]
- James, G. , Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning: With applications in R. Springer. [CrossRef]
- Kuhn, M. , & Johnson, K. (2013). Applied predictive modeling. Springer. [CrossRef]
- Waseh, S. , & Lee, J. B. (2023). Advances in melanoma: Epidemiology, diagnosis, and prognosis. Frontiers in Medicine, 10, Article 1268479. [CrossRef]
- Waqar S, George S, Jean-Baptiste W, Yusuf Ali A, Inyang B, Koshy FS, George K, Poudel P, Chalasani R, Goonathilake MR, Mohammed L. Recognizing Histopathological Simulators of Melanoma to Avoid Misdiagnosis. Cureus. 2022 Jun 20;14(6):e26127. [CrossRef] [PubMed]
- Yuan Z, Li Y, Zhang S, Wang X, Dou H, Yu X, Zhang Z, Yang S, Xiao M. Extracellular matrix remodeling in tumor progression and immune escape: from mechanisms to treatments. Mol Cancer. 2023 Mar 11;22(1):48. [CrossRef] [PubMed]
- Mooi, W. , & Krausz, T. (2007). Pathology of melanocytic disorders (2nd ed.). CRC Press. [CrossRef]
- Kuhn, M. , & Johnson, K. (2013). Applied predictive modeling. Springer. [CrossRef]
- James, G. , Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning: With applications in R. Springer. [CrossRef]
- Esteva A, Chou K, Yeung S, Naik N, Madani A, Mottaghi A, Liu Y, Topol E, Dean J, Socher R. Deep learning-enabled medical computer vision. NPJ Digit Med. 2021 Jan 8;4(1):5. [CrossRef] [PubMed]
- Bechelli S, Delhommelle J. Machine Learning and Deep Learning Algorithms for Skin Cancer Classification from Dermoscopic Images. Bioengineering (Basel). 2022 Feb 27;9(3):97. [CrossRef] [PubMed]
- Kassem, M. A. , Hosny, K. M., Damaševičius, R., & Eltoukhy, M. M. (2021). Machine Learning and Deep Learning Methods for Skin Lesion Classification and Diagnosis: A Systematic Review. Diagnostics, 11(8), 1390. [CrossRef]
- Zou, H. , & Hastie, T. (2005). Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 67(2), 301–320. [CrossRef]
- Addinsoft. (2024). XLSTAT statistical and data analysis solution (Version 2024.1) [Computer software]. https://www.xlstat.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).