Submitted:
05 December 2024
Posted:
06 December 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Model Design
2.1.1. Pre-trained Models
2.1.2. Architecture
2.2. Datasets
2.2.1. ISIC 2018
2.2.2. CBIS-DDSM
2.2.3. Shenzhen Hospital & Montgomery County Chest X-ray Sets
2.3. Implementation Details
2.4. Data Preprocessing
2.5. Loss Functions
2.6. Evaluation Metrics
3. Results
3.1. Dermatoscopic Images
3.2. Mammography Images
3.3. Lung X-ray Images
4. Discussion
5. Conclusions
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Kar, M.K.; Nath, M.K.; Neog, D.R. A Review on Progress in Semantic Image Segmentation and Its Application to Medical Images. SN Comput. Sci. 2021, 2, 1–30. [Google Scholar] [CrossRef]
- Gupta, M.; Mishra, A. A systematic review of deep learning based image segmentation to detect polyp. Artif. Intell. Rev. 2024, 57, 1–53. [Google Scholar] [CrossRef]
- Minaee, S.; Boykov, Y.Y.; Porikli, F.; Plaza, A.J.; Kehtarnavaz, N.; Terzopoulos, D. Image Segmentation Using Deep Learning: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 2021, PP, 1. [Google Scholar] [CrossRef] [PubMed]
- Alnasser, T.N.; Abdulaal, L.; Maiter, A.; Sharkey, M.; Dwivedi, K.; Salehi, M.; Garg, P.; Swift, A.J.; Alabed, S. Advancements in cardiac structures segmentation: a comprehensive systematic review of deep learning in CT imaging. Front. Cardiovasc. Med. 2024, 11, 1323461. [Google Scholar] [CrossRef]
- P. C. Bauer, Computational Social Science: Theory & Application. Accessed: Nov. 28, 2024. [Online]. Available: https://bookdown.org/paul/2021_computational_social_science/.
- Siddique, N.; Paheding, S.; Elkin, C.P.; Devabhaktuni, V. U-Net and Its Variants for Medical Image Segmentation: A Review of Theory and Applications. IEEE Access 2021, 9, 82031–82057. [Google Scholar] [CrossRef]
- Ronneberger, O.F.P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation[C]. Medical image computing and computer-assisted intervention–MICCAI 2015: 18th international conference, Munich, Germany, October 5-9, 2015, proceedings, part III 18. Springer International Publishing, 2015: 234-241. [CrossRef]
- Azad, R.; Aghdam, E.K.; Rauland, A.; Jia, Y.; Avval, A.H.; Bozorgpour, A.; Karimijafarbigloo, S.; Cohen, J.P.; Adeli, E.; Merhof, D. Medical Image Segmentation Review: The Success of U-Net. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 10076–10095. [Google Scholar] [CrossRef] [PubMed]
- Cui, H.; Hu, L.; Chi, L. Advances in Computer-Aided Medical Image Processing. Appl. Sci. 2023, 13, 7079. [Google Scholar] [CrossRef]
- Yao, W.; Bai, J.; Liao, W.; Chen, Y.; Liu, M.; Xie, Y. From CNN to Transformer: A Review of Medical Image Segmentation Models. J. Imaging Informatics Med. 2024, 37, 1529–1547. [Google Scholar] [CrossRef] [PubMed]
- Rayed, E.; Islam, S.S.; Niha, S.I.; Jim, J.R.; Kabir, M.; Mridha, M. Deep learning for medical image segmentation: State-of-the-art advancements and challenges. Informatics Med. Unlocked 2024, 47. [Google Scholar] [CrossRef]
- Liu, X.; Song, L.; Liu, S.; Zhang, Y. A Review of Deep-Learning-Based Medical Image Segmentation Methods. Sustainability 2021, 13, 1224. [Google Scholar] [CrossRef]
- Aggarwal, C.C. Neural Networks and Deep Learning; Springer Nature: Dordrecht, GX, Netherlands, 2018. [Google Scholar]
- M. F. Uddin, N. M. F. Uddin, N. Tafshir, and M. M. Khan, "A Comparative Analysis Towards Melanoma Classification Using Transfer Learning by Analyzing Dermoscopic Images," Dec. 2023; 02, arXiv:arXiv:2312.01212. [Google Scholar] [CrossRef]
- Oktay et al., "Attention U-Net: Learning Where to Look for the Pancreas," May 20, 2018. arXiv:arXiv:1804.03999. [CrossRef]
- X. Chen, L. X. Chen, L. Yao, and Y. Zhang, "Residual Attention U-Net for Automated Multi-Class Segmentation of COVID-19 Chest CT Images," Apr. 12, 2020. arXiv:arXiv:2004.05645. [CrossRef]
- L.-C. Chen, Y. L.-C. Chen, Y. Zhu, G. Papandreou, F. Schroff, and H. Adam, "Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation," Aug. 22, 2018. arXiv:arXiv:1802.02611. [CrossRef]
- Matsoukas, C.; Haslum, J.F.; Sorkhei, M.; Soderberg, M.; Smith, K. "What Makes Transfer Learning Work For Medical Images: Feature Reuse & Other Factors," Jun. 09, 2022, arXiv: arXiv:2203.01825.
- N. Codella et al., "Skin Lesion Analysis Toward Melanoma Detection 2018: A Challenge Hosted by the International Skin Imaging Collaboration (ISIC)," Mar. 29, 2019. arXiv:arXiv:1902.03368. [CrossRef]
- Tschandl, P.; Rosendahl, C.; Kittler, H. The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Sci. Data 2018, 5, 180161. [Google Scholar] [CrossRef] [PubMed]
- CBIS-DDSM," The Cancer Imaging Archive (TCIA). Accessed: Nov. 28, 2024. [Online]. Available: https://www.cancerimagingarchive.net/collection/cbis-ddsm/.
- Lee, R.S.; Gimenez, F.; Hoogi, A.; Miyake, K.K.; Gorovoy, M.; Rubin, D.L. A curated mammography data set for use in computer-aided detection and diagnosis research. Sci. Data 2017, 4, 170177–170177. [Google Scholar] [CrossRef]
- Clark, K.; Vendt, B.; Smith, K.; Freymann, J.; Kirby, J.; Koppel, P.; Moore, S.; Phillips, S.; Maffitt, D.; Pringle, M.; et al. The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository. J. Digit. Imaging 2013, 26, 1045–1057. [Google Scholar] [CrossRef]
- Jaeger, S.; Karargyris, A.; Candemir, S.; Folio, L.; Siegelman, J.; Callaghan, F.; Xue, Z.; Palaniappan, K.; Singh, R.K.; Antani, S.; et al. Automatic Tuberculosis Screening Using Chest Radiographs. IEEE Trans. Med Imaging 2013, 33, 233–245. [Google Scholar] [CrossRef] [PubMed]
- Candemir, S.; Jaeger, S.; Palaniappan, K.; Musco, J.P.; Singh, R.K.; Xue, Z.; Karargyris, A.; Antani, S.; Thoma, G.; McDonald, C.J. Lung Segmentation in Chest Radiographs Using Anatomical Atlases With Nonrigid Registration. IEEE Trans. Med Imaging 2013, 33, 577–590. [Google Scholar] [CrossRef]
- Stirenko, S.; Kochura, Y.; Alienin, O.; Rokovyi, O.; Gordienko, Y.; Gang, P.; Zeng, W. Chest X-Ray Analysis of Tuberculosis by Deep Learning with Segmentation and Augmentation. In Proceedings of the 2018 IEEE 38th International Conference on Electronics and Nanotechnology (ELNANO), Kyiv, Ukraine, 24–26 April 2018; pp. 422–428. [Google Scholar]
- Rahman, T.; Khandakar, A.; Kadir, M.A.; Islam, K.R.; Islam, K.F.; Mazhar, R.; Hamid, T.; Islam, M.T.; Kashem, S.; Bin Mahbub, Z.; et al. Reliable Tuberculosis Detection Using Chest X-Ray With Deep Learning, Segmentation and Visualization. IEEE Access 2020, 8, 191586–191601. [Google Scholar] [CrossRef]
- Jadon, S. A survey of loss functions for semantic segmentation. In Proceedings of the 2020 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB); 2020, Institute of Electrical and Electronics Engineers (IEEE); pp. 1–7.
- Xu, Y.; Quan, R.; Xu, W.; Huang, Y.; Chen, X.; Liu, F. Advances in Medical Image Segmentation: A Comprehensive Review of Traditional, Deep Learning and Hybrid Approaches. Bioengineering 2024, 11, 1034. [Google Scholar] [CrossRef] [PubMed]
- R. R. Shamir, Y. R. R. Shamir, Y. Duchin, J. Kim, G. Sapiro, and N. Harel, "Continuous Dice Coefficient: a Method for Evaluating Probabilistic Segmentations," Jun. 26, 2019. arXiv:arXiv:1906.11031. [CrossRef]
- Müller, D.; Soto-Rey, I.; Kramer, F. Towards a guideline for evaluation metrics in medical image segmentation. BMC Res. Notes 2022, 15, 1–8. [Google Scholar] [CrossRef]
- J. Wang, L. J. Wang, L. Wei, L. Wang, Q. Zhou, L. Zhu, and J. Qin, "Boundary-aware Transformers for Skin Lesion Segmentation," Oct. 08, 2021. arXiv:arXiv:2110.03864. [CrossRef]
- D. Jha, M. A. D. Jha, M. A. Riegler, D. Johansen, P. Halvorsen, and H. D. Johansen, "DoubleU-Net: A Deep Convolutional Neural Network for Medical Image Segmentation," Jun. 27, 2020. arXiv:arXiv:2006.04868. [CrossRef]
- Bencevic, M.; Galic, I.; Habijan, M.; Babin, D. Training on Polar Image Transformations Improves Biomedical Image Segmentation. IEEE Access 2021, 9, 133365–133375. [Google Scholar] [CrossRef]
- Ibtehaz, N.; Rahman, M.S. MultiResUNet : Rethinking the U-Net architecture for multimodal biomedical image segmentation. Neural Netw. 2020, 121, 74–87. [Google Scholar] [CrossRef]
- J. Chen et al., "TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation," Feb. 08, 2021. arXiv:arXiv:2102.04306. [CrossRef]
- J. Wang et al., "ProMISe: Promptable Medical Image Segmentation using SAM," Sep. 28, 2024. arXiv:arXiv:2403.04164. [CrossRef]
- F. Tang, Q. F. Tang, Q. Huang, J. Wang, X. Hou, J. Su, and J. Liu, "DuAT: Dual-Aggregation Transformer Network for Medical Image Segmentation," Dec. 21, 2022. arXiv:arXiv:2212.11677. [CrossRef]
- Wang, H.; Cao, P.; Wang, J.; Zaiane, O.R. UCTransNet: Rethinking the Skip Connections in U-Net from a Channel-Wise Perspective with Transformer. Proc. Conf. AAAI Artif. Intell. 2022, 36, 2441–2449. [Google Scholar] [CrossRef]
- X. Huang, Z. X. Huang, Z. Deng, D. Li, and X. Yuan, "MISSFormer: An Effective Medical Image Segmentation Transformer," Dec. 19, 2021. arXiv:arXiv:2109.07162. [CrossRef]
- F. Touazi, D. F. Touazi, D. Gaceb, M. Chirane, and S. Herzallah, "Two-Stage Approach for Semantic Image Segmentation of Breast Cancer : Deep Learning and Mass Detection in Mammographic images".
- Kamran, S.A.; Hossain, K.F.; Tavakkoli, A.; Bebis, G.; Baker, S. Swin-SFTNet : Spatial Feature Expansion and Aggregation Using Swin Transformer for Whole Breast Micro-Mass Segmentation. 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI). LOCATION OF CONFERENCE, ColombiaDATE OF CONFERENCE; pp. 1–5.
- Rai, H.M.; Dashkevych, S.; Yoo, J. Next-Generation Diagnostics: The Impact of Synthetic Data Generation on the Detection of Breast Cancer from Ultrasound Imaging. Mathematics 2024, 12, 2808. [Google Scholar] [CrossRef]
- Wang, Y.; Wang, S.; Chen, J.; Wu, C. Whole mammographic mass segmentation using attention mechanism and multiscale pooling adversarial network. J. Med Imaging 2020, 7, 054503. [Google Scholar] [CrossRef]
- Liu, D.; Wu, B.; Li, C.; Sun, Z.; Zhang, N. TrEnD: A transformer-based encoder-decoder model with adaptive patch embedding for mass segmentation in mammograms. Med Phys. 2023, 50, 2884–2899. [Google Scholar] [CrossRef] [PubMed]
- Hatamizadeh, A.; Tang, Y.; Nath, V.; Yang, D.; Myronenko, A.; Landman, B.; Roth, H.R.; Xu, D. “UNETR: Transformers for 3D Medical Image Segmentation,” Oct. 09, 2021, arXiv: arXiv:2103.10504.















| VGG19 | ResNet50 | DenseNet121 | |
|---|---|---|---|
| U-Net | 0.717 | 0.878 | 0.848 |
| DeepLabV3+ | 0.815 | 0.860 | 0.853 |
| U-Net | Attention U-Net | Residual Attention U-Net | DeepLabV3+ | |
|---|---|---|---|---|
| Dice Coefficient | 0.878 | 0.843 | 0.829 | 0.860 |
| DL Architectures | U-Net | Attention U-Net | Residual Attention U-Net | DeepLabV3+ |
|---|---|---|---|---|
| Parameters | 32.6 M | 52.9 M | 58.9 M | 17.8 M |
| BCE 1 | Dice Loss | BCE + Dice Loss | Focal Tversky Loss | Log-Cosh Dice Loss | Jaccard Loss | |
|---|---|---|---|---|---|---|
| U-Net | 0.878 | 0.863 | 0.869 | 0.863 | 0.854 | 0.845 |
| DeepLabV3+ | 0.860 | 0.868 | 0.877 | 0.852 | 0.902 | 0.861 |
| U-Net BCE |
U-Net Log-Cosh Dice |
DeepLabV3+ BCE |
DeepLabV3+ Log-Cosh Dice |
|
|---|---|---|---|---|
| Dice Coefficient | 0.748 | 0.756 | 0.750 | 0.789 |
| U-Net BCE |
U-Net Log-Cosh Dice |
DeepLabV3+ BCE |
DeepLabV3+ Log-Cosh Dice |
|
|---|---|---|---|---|
| Dice Coefficient | 0.9540 | 0.905 | 0.9544 | 0.951 |
| Model | Dice |
|---|---|
| DoubleU-Net [33] | 0.896 |
| Polar Res-U-Net++ [34] | 0.925 |
| MultiResUNet [35] | 0.869 |
| TransUNet [36] | 0.850 |
| ProMISe [37] | 0.921 |
| DuAT [38] | 0.923 |
| UCTransNet [39] | 0.890 |
| MISSFormer [40] | 0.866 |
| DeepLabV3+ (Our Approach) | 0.903 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).