Submitted:
20 June 2023
Posted:
21 June 2023
Read the latest preprint version here
Abstract
Keywords:
1. Introduction
2. Related Work
2.1. U-Net Approaches
2.2. Attention-Based and Transformer Based
2.3. Other Approaches
2.3. Motivation
3. Material and Methods
3.1. U-Net Architecture
3.2.1. Attention Module Analysis
3.3. Dataset Collection and Description
3.4. Data Preprocessing
256. The preprocessing was applied to all examples in the dataset.
3.5. Training Setup
3.5.1. Loss Function
3.5.2. Evaluation Metrics
4. Experimental Results
5. Conclusions
Author Contributions
Funding
Data Availability Statement
References
- Oktay et al., "Attention U-Net: Learning Where to Look for the Pancreas, arXiv 2018, arXiv:1804.03999.
- Z. Kong et al., "Automated Maxillofacial Segmentation in Panoramic Dental X-Ray Images Using an Efficient Encoder-Decoder Network," IEEE Access, vol. 8, pp. 207822–207833, 2020. [CrossRef]
- 3. C.-W. Wang et al., "A benchmark for comparison of dental radiography analysis algorithms," Medical Image Analysis, vol. 31, pp. 63–76, Jul. 2016. [CrossRef]
- Wirtz, A. , Mirashi, S.G. and Wesarg, S. (2018) 'Automatic teeth segmentation in panoramic X-ray images using a coupled shape model in combination with a neural network', Medical Image Computing and Computer Assisted Intervention – MICCAI 2018, pp. 712–719. [CrossRef]
- Sabarudin, Akmal, and Yu Jin Tiau. "Image Quality Assessment in Panoramic Dental Radiography: A Comparative Study between Conventional and Digital Systems." Quantitative Imaging in Medicine and Surgery, vol. 3, no. 1, 1 Feb. 2013, pp. 43–48, www.ncbi.nlm.nih.gov/pubmed/23483085. Available online: www.ncbi.nlm.nih.gov/pubmed/23483085 (accessed on 24 November 2022). [CrossRef]
- M. L. Kantor, A. B. Reiskin, and A. G. Lurie, "A clinical comparison of X-ray films for detection of proximal surface caries," The Journal of the American Dental Association, vol. 111, no. 6, pp. 967–969, Dec. 1985. [CrossRef]
- R. Fitzgerald, "Error in Radiology," Clinical Radiology, vol. 56, no. 12, pp. 938–946, Dec. 2001. [CrossRef]
- Brady, R. Ó. Laoide, P. McCarthy, and R. McDermott, "Discrepancy and error in radiology: concepts, causes and consequences," The Ulster medical journal, vol. 81, no. 1, pp. 3–9, 2012. Available online: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3609674/.
- P. L. Lin, Y. H. Lai, and P. W. Huang, "An effective classification and numbering system for dental bite-wing radiographs using teeth region and contour information," Pattern Recognition, vol. 43, no. 4, pp. 1380–1392, Apr. 2010. [CrossRef]
- M. H. Mahoor and M. Abdel-Mottaleb, "Classification and numbering of teeth in dental bite-wing images," Pattern Recognition, vol. 38, no. 4, pp. 577–586, Apr. 2005. [CrossRef]
- Yuniarti, A. S. Nugroho, B. Amaliah, and A. Z. Arifin, "Classification and Numbering of Dental Radiographs for an Automated Human Identification System," TELKOMNIKA (Telecommunication Computing Electronics and Control), vol. 10, no. 1, p. 137, Mar. 2012. [CrossRef]
- Z. Li, W. Yang, S. Peng, and F. Liu, "A Survey of Convolutional Neural Networks: Analysis, Applications, and Prospects,". arXiv 2020, arXiv:2004.02806.
- Ronneberger, P. Fischer, and T. Brox, "U-Net: Convolutional Networks for Biomedical Image Segmentation," arXiv.org, , 2015. https://arxiv.org/abs/1505. 18 May 0459. [Google Scholar]
- K. Kamnitsas et al., "Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation," Medical Image Analysis, vol. 36, pp. 61–78, Feb. 2017. [CrossRef]
- R. Wang, T. Lei, R. Cui, B. Zhang, H. Meng, and A. K. Nandi, "Medical image segmentation using deep learning: A survey," IET Image Processing, Jan. 2022. [CrossRef]
- W. Li, F. Jia, and Q. Hu, "Automatic Segmentation of Liver Tumor in CT Images with Deep Convolutional Neural Networks," Journal of Computer and Communications, vol. 03, no. 11, pp. 146–151, 2015. [CrossRef]
- V. Cherukuri, P. Ssenyonga, B. Warf, A. Kulkarni, V. Monga, and S. Schiff, "Learning Based Segmentation of CT Brain Images: Application to Post-operative Hydrocephalic Scans," IEEE Transactions on Biomedical Engineering, 2017. Available online: https://www.semanticscholar.org/paper/Learning-Based-Segmentation-of-CT-Brain-Images%3A-to-Cherukuri-Ssenyonga/f4c38fca0d2df65278e222f535b08e66830b03df (accessed on 29 May 2023).
- Y. Liu, S. Chen, L. Wee, A. Dekker, and I. Bermejo, "Deep learning based unpaired image-to-image translation applications for medical physics: a systematic review," vol. 68, no. 5, pp. 05TR01–05TR01, Feb. 2023. [CrossRef]
- Z. Zhou, M. Rahman, N. Tajbakhsh, and J. Liang, "UNet++: A Nested U-Net Architecture for Medical Image Segmentation. arXiv 2018, arXiv:1807.1016.
- F. Isensee et al., "nnU-Net: Self-adapting Framework for U-Net-Based Medical Image Segmentation, arXiv 2018, arXiv:1809.10486.
- L. F. Machado, P. C. A. Watanabe, G. A. Rodrigues, and L. O. M. Junior, "Deep learning for automatic mandible segmentation on dental panoramic x-ray images," Biomedical Physics & Engineering Express, vol. 9, no. 3, p. 035015, Mar. 2023. [CrossRef]
- C. Rohrer, J. Krois, J. Patel, H. Meyer-Lueckel, J. A. Rodrigues, and F. Schwendicke, "Segmentation of Dental Restorations on Panoramic Radiographs Using Deep Learning," Diagnostics, vol. 12, no. 6, p. 1316, 22. 20 May. [CrossRef]
- I.-S. Song et al., "Deep learning-based apical lesion segmentation from panoramic radiographs," Imaging Science in Dentistry, vol. 52, 2022. [CrossRef]
- Rini Widyaningrum, Ika Candradewi, A. Seno, and Rizki Aulianisa, "Comparison of Multi-Label U-Net and Mask R-CNN for panoramic radiograph segmentation to detect periodontitis," vol. 52, no. 4, pp. 383–383, Jan. 2022. [CrossRef]
- Vaswani et al., "Attention Is All You Need,". arXiv 2017, arXiv:1706.03762.
- Y. Zhang, X. Wang, X. Jiang, C. Shi, and Y. Ye, "Hyperbolic Graph Attention Network,". arXiv 2019, arXiv:1912.03046.
- D.-K. Nguyen and T. Okatani, "Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering,". arXiv 2018, arXiv:1804.00775.
- Y. Liang et al., "Exploring Forensic Dental Identification with Deep Learning," 2021. Accessed: Jun. 02, 2023. [Online]. Available online: https://proceedings.neurips.cc/paper_files/paper/2021/file/1a423f7c07a179ec243e82b0c017a034-Paper.pdf.
- W. Li et al., "Segmentation and accurate identification of large carious lesions on high quality x-ray images based on Attentional U-Net model. A proof of concept study," Journal of Applied Physics 132, 033103 (2022), vol. 132, no. 3, pp. 033103–033103, Jul. 2022. [CrossRef]
- Dosovitskiy et al., "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale,". arXiv 2020, arXiv:2010.11929.
- Almalki and L. J. Latecki, "Self-Supervised Learning with Masked Image Modeling for Teeth Numbering, Detection of Dental Restorations, and Instance Segmentation in Dental Panoramic Radiographs, arXiv 2022, arXiv:2210.11404.
- P. Harsh, R. Chakraborty, S. Tripathi, and K. Sharma, "Attention U-Net Architecture for Dental Image Segmentation," IEEE Xplore, Jun. 01, 2021. Available online: https://ieeexplore.ieee.org/document/9498422 (accessed on 11 May 2022).
- M. Biswas, R. Pramanik, S. Sen, A. Sinitca, D. Kaplun, and R. Sarkar, "Microstructural segmentation using a union of attention guided U-Net models with different color transformed images," Scientific Reports, vol. 13, no. 1, Apr. 2023. [CrossRef]
- D. John and C. Zhang, "An attention-based U-Net for detecting deforestation within satellite sensor imagery," International Journal of Applied Earth Observation and Geoinformation, vol. 107, p. 102685, Mar. 2022. [CrossRef]
- R. Karthik, M. Radhakrishnan, R. Rajalakshmi, and J. Raymann, "Delineation of ischemic lesion from brain MRI using attention gated fully convolutional network," Biomedical Engineering Letters, vol. 11, no. 1, pp. 3–13, Nov. 2020. [CrossRef]
- Dayı, H. Üzen, İ. B. Çiçek, and Ş. B. Duman, "A Novel Deep Learning-Based Approach for Segmentation of Different Type Caries Lesions on Panoramic Radiographs," Diagnostics, vol. 13, no. 2, p. 202, Jan. 2023. [CrossRef]
- N. Nafi'iyah, C. Fatichah, D. Herumurti, E. Renwi Astuti, and R. Hardani Putra, "MobileNetV2 Ensemble Segmentation for Mandibular on Panoramic Radiography," Intelligent Engineering & Systems, vol. 16, no. 2, pp. 546–560, Feb. 2023. [CrossRef]
- S. Arora, S. K. Tripathy, R. Gupta, and R. Srivastava, "Exploiting multimodal CNN architecture for automated teeth segmentation on dental panoramic X-ray images," Proceedings of the Institution of Mechanical Engineers, Part H: Journal of Engineering in Medicine, p. 095441192311571, Feb. 2023. [CrossRef]
- K. Panetta, R. Rajendran, A. Ramesh, S. Rao, and S. Agaian, "Tufts Dental Database: A Multimodal Panoramic X-Ray Dataset for Benchmarking Diagnostic Systems," IEEE journal of biomedical and health informatics, vol. 26, no. 4, pp. 1650–1659, Apr. 2022. [CrossRef]
- Saumya Jetley, N. A. Lord, N. Lee, and Philip, "Learn to Pay Attention," OpenReview, , 2023. Available online: https://openreview.net/forum?id=HyzbhfWRW (accessed on 27 May 2023).
- Y. Yoshimi et al., "Image preprocessing with contrast-limited adaptive histogram equalization improves the segmentation performance of deep learning for the articular disk of the temporomandibular joint on magnetic resonance images," Apr. 2023. [CrossRef]
- Z.-L. Ni et al., "RAUNet: Residual Attention U-Net for Semantic Segmentation of Cataract Surgical Instruments," pp. 139–149, Dec. 2019. [CrossRef]
- F. Oztekin et al., "Automatic semantic segmentation for dental restorations in panoramic radiography images using U-Net model," International Journal of Imaging Systems and Technology, vol. 32, no. 6, pp. 1990–2001, Sep. 2022. [CrossRef]






is the pixel accuracy,
is the IoU, and
is the dice score.
is the pixel accuracy,
is the IoU, and
is the dice score.

| Fold | Metrics (%) | ||
| Dice | IoU | PA | |
| 1 | 90.18 | 82.12 | 97.66 |
| 2 | 92.01 | 85.2 | 98.13 |
| 3 | 93.16 | 87.2 | 98.34 |
| 4 | 94.77 | 90.06 | 98.76 |
| 5 | 95.71 | 91.78 | 98.99 |
| 6 | 96.25 | 92.78 | 99.14 |
| 7 | 96.73 | 93.66 | 99.2 |
| 8 | 96.98 | 94.13 | 99.3 |
| 9 | 97.15 | 94.45 | 99.32 |
| 10 | 97.21 | 94.56 | 99.37 |
| Average | 95.01 | 90.6 | 98.82 |
| ` | Backbone | Metrics (%) | Data Split | Epochs | Batch Size | Learning Rate | Weight Decay | |||
| Dice | IoU | PA | Train | Test | ||||||
| FPN [39] | ResNet18 | 92.24 | 86.37 | 95.17 | 85% | 15% | 150 | 16 | ||
| U-Net [39] | ResNet18 | 92.27 | 86.42 | 95.11 | 85% | 15% | 150 | 16 | ||
| U-Net++ [39] | ResNet18 | 92.43 | 86.54 | 95.15 | 85% | 15% | 150 | 16 | ||
| PSPNet [39] | ResNet18 | 91.49 | 85.66 | 94.76 | 85% | 15% | 150 | 16 | ||
| DeepLabV3 [39] | ResNet18 | 91.87 | 86.02 | 94.91 | 85% | 15% | 150 | 16 | ||
| DeepLabV3+ [39] | ResNet18 | 91.80 | 86.41 | 95.13 | 85% | 15% | 150 | 16 | ||
| nnU-Net [39] | -- | 90.86 | 86.11 | 94.91 | 85% | 15% | 150 | 16 | ||
| CE-Net [39] | -- | 86.62 | 81.64 | 92.67 | 85% | 15% | 400 | 16 | ||
| Attention U-Net | -- | 95.01 | 90.6 | 98.82 | 10-Fold | 100 | 8 | -- | ||
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).