Submitted:
22 April 2024
Posted:
25 April 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
- •
- We have discussed the characteristics of different types of deep learning methods for document image binarization and have also illustrated the receptive fields of the CNN and Transformer models.
- •
- By incorporating the Mobile ViT block into the U-Net structure, we aim to broaden the receptive fields of the image, capturing both global and local characteristics more effectively. This marks the first application of Mobile ViT in document image binarization. The parameters of the proposed model are only one-fourth of those of a similar model [63] based on U-Net for document image binarization.
- •
- MVT-Unet is a straightforward end-to-end model trained only once, without employing pre- or post-processing steps or ensemble methods.
- •
- MVT-Unet demonstrates superior performance compared to state-of-the-art results on two standard Document Binarization Competition (DIBCO) datasets, for both handwritten and machine-printed documents.
2. Related Work
2.1. Traditional Binarization Techniques
2.2. Deep Learning Binarization Approaches
3. Methodology
3.1. Baseline Network for Document Image Binarization
3.2. Improving Network Receptive Field Range
3.3. Proposed Model
4. Experiments and Analysis
4.1. Introduction of Experiments
4.2. Qualitative evaluation
4.3. Quantitative Evaluation
| Predicted Values\Ground Truth | Possitive (1) | Negtive (0) |
|---|---|---|
| Possitive (1) | TP | FP |
| Negtive (0) | FN | TN |
5. Ablation experiment
5.1. Experiments on the Performance of Mobile ViT Block
5.2. Experiments on the Number of Channels in Mobile ViT Block
6. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Pan, Y.F.; Hou, X.; Liu, C.L. Text Localization in Natural Scene Images Based on Conditional Random Field. 2009 10th International Conference on Document Analysis and Recognition, 2009, pp. 6–10. [CrossRef]
- Gupta, M.R.; Jacobson, N.P.; Garcia, E.K. OCR binarization and image pre-processing for searching historical documents. Pattern Recognition 2007, 40, 389–397. [Google Scholar] [CrossRef]
- Saabni, R.; Asi, A.; El-Sana, J. Text line extraction for historical document images. Pattern Recognit. Lett. 2014, 35, 23–33. [Google Scholar] [CrossRef]
- He, S.; Wiering, M.; Schomaker, L. Junction detection in handwritten documents and its application to writer identification. Pattern Recognit. 2015, 48, 4036–4048. [Google Scholar] [CrossRef]
- Giotis, A.P.; Sfikas, G.; Gatos, B.; Nikou, C. A survey of document image word spotting techniques. Pattern Recognit. 2017, 68, 310–332. [Google Scholar] [CrossRef]
- Kumar, G.; Bhatia, P.K. A Detailed Review of Feature Extraction in Image Processing Systems. 2014 Fourth International Conference on Advanced Computing & Communication Technologies, 2014, pp. 5–12. [CrossRef]
- Smith, R.W. An Overview of the Tesseract OCR Engine. Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) 2007, 2, 629–633. [Google Scholar] [CrossRef]
- Gatos, B.; Ntirogiannis, K.; Pratikakis, I. ICDAR 2009 Document Image Binarization Contest (DIBCO 2009). 2009 10th International Conference on Document Analysis and Recognition, 2009, pp. 1375–1382. [CrossRef]
- Pratikakis, I.; Gatos, B.; Ntirogiannis, K. H-DIBCO 2010 - Handwritten Document Image Binarization Competition. 2010 12th International Conference on Frontiers in Handwriting Recognition, 2010, pp. 727–732. [CrossRef]
- Pratikakis, I.; Gatos, B.; Ntirogiannis, K. ICDAR 2011 Document Image Binarization Contest (DIBCO 2011). 2011 International Conference on Document Analysis and Recognition, 2011, pp. 1506–1510. [CrossRef]
- Pratikakis, I.; Gatos, B.; Ntirogiannis, K. ICFHR 2012 Competition on Handwritten Document Image Binarization (H-DIBCO 2012). 2012 International Conference on Frontiers in Handwriting Recognition, 2012, pp. 817–822. [CrossRef]
- Pratikakis, I.; Gatos, B.; Ntirogiannis, K. ICDAR 2013 Document Image Binarization Contest (DIBCO 2013). 2013 12th International Conference on Document Analysis and Recognition, 2013, pp. 1471–1476. [CrossRef]
- Ntirogiannis, K.; Gatos, B.; Pratikakis, I. ICFHR2014 Competition on Handwritten Document Image Binarization (H-DIBCO 2014). 2014 14th International Conference on Frontiers in Handwriting Recognition, 2014, pp. 809–813. [CrossRef]
- Pratikakis, I.; Zagoris, K.; Barlas, G.; Gatos, B. ICFHR2016 Handwritten Document Image Binarization Contest (H-DIBCO 2016). 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), 2016, pp. 619–623. [CrossRef]
- Pratikakis, I.; Zagoris, K.; Barlas, G.; Gatos, B. ICDAR2017 Competition on Document Image Binarization (DIBCO 2017). 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), 2017, Vol. 01, pp. 1395–1403. [CrossRef]
- Pratikakis, I.; Zagori, K.; Kaddas, P.; Gatos, B. ICFHR 2018 Competition on Handwritten Document Image Binarization (H-DIBCO 2018). 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), 2018, pp. 489–493. [CrossRef]
- Pratikakis, I.; Zagoris, K.; Karagiannis, X.; Tsochatzidis, L.; Mondal, T.; Marthot-Santaniello, I. ICDAR 2019 Competition on Document Image Binarization (DIBCO 2019). 2019 International Conference on Document Analysis and Recognition (ICDAR), 2019, pp. 1547–1556. [CrossRef]
- Seuret, M.; Nicolaou, A.; Stutzmann, D.; Maier, A.; Christlein, V. ICFHR 2020 Competition on Image Retrieval for Historical Handwritten Fragments. 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR), 2020, pp. 216–221. [CrossRef]
- Calvo-Zaragoza, J.; Gallego, A.J. A selectional auto-encoder approach for document image binarization. Pattern Recognition 2019, 86, 37–47. [Google Scholar] [CrossRef]
- Otsu, N. A Threshold Selection Method from Gray-Level Histograms. IEEE Transactions on Systems, Man, and Cybernetics 1979, 9, 62–66. [Google Scholar] [CrossRef]
- Niblack, W. An introduction to digital image processing, 1986.
- Sauvola, J.; Pietikäinen, M. Adaptive document image binarization. Pattern Recognition 2000, 33, 225–236. [Google Scholar] [CrossRef]
- Wolf, C.; Jolion, J.M. Extraction and recognition of artificial text in multimedia documents. Formal Pattern Analysis & Applications 2004, 6, 309–326. [Google Scholar]
- Bernsen, J. Dynamic thresholding of grey-level images. ICPR`86: Proceedings of International Conference on Pattern Recognition, 1986, pp. 1251–1255.
- Gatos, B.; Pratikakis, I.; Perantonis, S. Adaptive degraded document image binarization. Pattern Recognition 2006, 39, 317–327. [Google Scholar] [CrossRef]
- Khurshid, K.; Siddiqi, I.; Faure, C.; Vincent, N. Comparison of Niblack inspired binarization methods for ancient documents. Electronic imaging, 2009.
- Jiang, L.; Chen, K.; Yan, S.; Zhou, Y.; Guan, H. Adaptive Binarization for Degraded Document Images. 2009 International Conference on Information Engineering and Computer Science, 2009, pp. 1–4. [CrossRef]
- Bataineh, B.; Abdullah, S.N.H.S.; Omar, K. An adaptive local binarization method for document images based on a novel thresholding method and dynamic windows. Pattern Recognition Letters 2011, 32, 1805–1813. [Google Scholar] [CrossRef]
- Su, B.; Lu, S.; Tan, C.L. Robust Document Image Binarization Technique for Degraded Document Images. IEEE Transactions on Image Processing 2013, 22, 1408–1417. [Google Scholar] [CrossRef] [PubMed]
- Hadjadj, Z.; Meziane, A.; Cherfa, Y.; Cheriet, M.; Setitra, I. ISauvola: Improved Sauvola’s Algorithm for Document Image Binarization. Image Analysis and Recognition, 2016, Vol. 9730, pp. 737–745. [CrossRef]
- Mustafa, W.A.; Kader, M.M.M.A. Binarization of Document Image Using Optimum Threshold Modification. Journal of Physics: Conference Series 2018, 1019. [Google Scholar] [CrossRef]
- Zemouri, E.T.; Chibani, Y.; Brik, Y. Enhancement of Historical Document Images by Combining Global and Local Binarization Technique. International Journal of Information Engineering and Electronic Business 2014, 4. [Google Scholar] [CrossRef]
- Ntirogiannis, K.; Gatos, B.; Pratikakis, I. A combined approach for the binarization of handwritten document images. Pattern Recognit. Lett. 2014, 35, 3–15, Frontiersin Handwriting Processing. [Google Scholar] [CrossRef]
- Chaudhary, P.; Ambedkar, B. AN EFFECTIVE AND ROBUST TECHNIQUE FOR THE BINARIZATION OF DEGRADED DOCUMENT IMAGES. Int. J. Res. Eng. Technol. 2014, 03, 140–145. [Google Scholar]
- Saddami, K.; Arnia, F.; Away, Y.; Munadi, K. Kombinasi Metode Nilai Ambang Lokal dan Global untuk Restorasi Dokumen Jawi Kuno. J. Teknol. Inf. Dan Ilmu Komput. 2020, 7, 163–170. [Google Scholar] [CrossRef]
- Lu, S.; Su, B.; Tan, C. Document image binarization using background estimation and stroke edges. Int. J. Doc. Anal. Recognit. (IJDAR) 2010, 13, 303–314. [Google Scholar] [CrossRef]
- Santhanaprabhu, G.; Karthick, B.; Srinivasan, P.; Vignesh, R.; Sureka, K. Extraction and Document Image Binarization Using Sobel Edge Detection. J. Eng. Res. Appl. 2014, 4, 15–21. [Google Scholar]
- Lelore, T.; Bouchara, F. FAIR: A Fast Algorithm for Document Image Restoration. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 35, 2039–2048. [Google Scholar] [CrossRef]
- Holambe, S.; Shinde, U.; Choudhari, B. Image Binarization for Degraded Document Images. Int. J. Comput. Appl. 2015, 128, 38–43. [Google Scholar]
- Jia, F.; Shi, C.; He, K.; Wang, C.; Xiao, B. Document image binarization using structural symmetry of strokes. 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR). IEEE, 2016, pp. 411–416.
- Lai, A.N.; Lee, G. Binarization by local k-means clustering for Korean text extraction. 2008 IEEE International Symposium on Signal Processing and Information Technology. IEEE, 2008, pp. 117–122.
- Tong, L.J.; Chen, K.; Zhang, Y.; Fu, X.L.; Duan, J.Y. Document image binarization based on NFCM. 2009 2nd International Congress on Image and Signal Processing. IEEE, 2009, pp. 1–5.
- Biswas, B.; Bhattacharya, U.; Chaudhuri, B.B. A global-to-local approach to binarization of degraded document images. 2014 22nd International Conference on Pattern Recognition. IEEE, 2014, pp. 3008–3013.
- Soua, M.; Kachouri, R.; Akil, M. GPU parallel implementation of the new hybrid binarization based on Kmeans method (HBK). J. Real-Time Image Process 2018, 14, 363–377. [Google Scholar] [CrossRef]
- Annabestani, M.; Saadatmand-Tarzjan, M. A new threshold selection method based on fuzzy expert systems for separating text from the background of document images. Iran. J. Sci. Technol. Trans. Electr. Eng. 2019, 43, 219–231. [Google Scholar] [CrossRef]
- Xiong, W.; Zhou, L.; Yue, L.; Li, L.; Wang, S. An enhanced binarization framework for degraded historical document images. EURASIP Journal on Image and Video Processing 2021, 2021, 13. [Google Scholar] [CrossRef]
- Pastor-Pellicer, J.; España-Boquera, S.; Zamora-Martínez, F.; Afzal, M.Z.; Castro-Bleda, M.J. Insights on the Use of Convolutional Neural Networks for Document Image Binarization. Advances in Computational Intelligence; Rojas, I., Joya, G., Catala, A., Eds.; Springer International Publishing: Cham, 2015; pp. 115–126. [Google Scholar]
- Long, J.; Shelhamer, E.; Darrell, T. Fully convolutional networks for semantic segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 3431–3440.
- Tensmeyer, C.; Martinez, T. Document image binarization with fully convolutional neural networks. 2017 14th IAPR international conference on document analysis and recognition (ICDAR). IEEE, 2017, Vol. 1, pp. 99–104.
- Calvo-Zaragoza, J.; Vigliensoni, G.; Fujinaga, I. Pixel-wise binarization of musical documents with convolutional neural networks. 2017 Fifteenth IAPR International Conference on Machine Vision Applications (MVA), 2017, pp. 362–365. [CrossRef]
- Vo, Q.N.; Kim, S.H.; Yang, H.J.; Lee, G. Binarization of degraded document images based on hierarchical deep supervised network. Pattern Recognit. 2018, 74, 568–586. [Google Scholar] [CrossRef]
- Ma, K.; Shu, Z.; Bai, X.; Wang, J.; Samaras, D. Docunet: Document image unwarping via a stacked u-net. Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 4700–4709.
- He, S.; Schomaker, L. DeepOtsu: Document enhancement and binarization using iterative deep learning. Pattern Recognit. 2019, 91, 379–390. [Google Scholar] [CrossRef]
- Bezmaternykh, P.; Ilin, D.; Nikolaev, D. U-Net-bin: hacking the document image binarization contest. Comput. Opt. 2019, 43, 825–832. [Google Scholar] [CrossRef]
- Ayyalasomayajula, K.R.; Malmberg, F.; Brun, A. PDNet: Semantic segmentation integrated with a primal-dual network for document binarization. Pattern Recognit. Lett. 2019, 121, 52–60. [Google Scholar] [CrossRef]
- Huang, X.; Li, L.; Liu, R.; Xu, C.; Ye, M. Binarization of degraded document images with global-local U-Nets. Optik 2020, 203, 164025. [Google Scholar] [CrossRef]
- Xiong, W.; Jia, X.; Yang, D.; Ai, M.; Li, L.; Wang, S. DP-LinkNet: A convolutional network for historical document image binarization. KSII Transactions on Internet and Information Systems (TIIS) 2021, 15, 1778–1797. [Google Scholar]
- Xiong, W.; Yue, L.; Zhou, L.; Wei, L.; Li, M. FD-Net: A Fully Dilated Convolutional Network for Historical Document Image Binarization. Pattern Recognition and Computer Vision: 4th Chinese Conference, PRCV 2021, Beijing, China, October 29–November 1, 2021, Proceedings, Part I 4. Springer, 2021, pp. 518–529.
- Kang, S.; Iwana, B.K.; Uchida, S. Complex image processing with less data—Document image binarization by integrating multiple pre-trained U-Net modules. Pattern Recognition 2021, 109, 107577. [Google Scholar] [CrossRef]
- Dey, A.; Das, N.; Nasipuri, M. Variational Augmentation for Enhancing Historical Document Image Binarization. arXiv preprint arXiv:2211.06581, arXiv:2211.06581 2022.
- Yang, Z.; Xiong, Y.; Wu, G. GDB: Gated convolutions-based Document Binarization. arXiv preprint arXiv:2302.02073, arXiv:2302.02073 2023.
- Zhao, P.; Wang, W.; Zhang, G.; Lu, Y. Alleviating pseudo-touching in attention U-Net-based binarization approach for the historical Tibetan document images. Neural Comput. Appl. 2023, 35, 13791–13802. [Google Scholar] [CrossRef]
- Vahid, R.; Konstantin, B.; Clemens, N. A hybrid cnn-transformer model for historical document image binarization. HIP ’23: 7th International Workshop on Historical Document Imaging and Processing, San Jose CA USA, 2023, pp. 79–84.
- Zhao, J.; Shi, C.; Jia, F.; Wang, Y.; Xiao, B. Document image binarization with cascaded generators of conditional generative adversarial networks. Pattern Recognit. 2019, 96, 106968. [Google Scholar] [CrossRef]
- De, R.; Chakraborty, A.; Sarkar, R. Document Image Binarization Using Dual Discriminator Generative Adversarial Networks. IEEE Signal Processing Letters 2020, PP, 1–1. [Google Scholar] [CrossRef]
- Souibgui, M.A.; Kessentini, Y. DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 1180–1191. [Google Scholar] [CrossRef] [PubMed]
- Kumar, A.; Ghose, S.; Chowdhury, P.N.; Roy, P.P.; Pal, U. UDBNET: Unsupervised Document Binarization Network via Adversarial Game. 2020 25th International Conference on Pattern Recognition (ICPR). IEEE, 2021, pp. 7817–7824.
- Suh, S.; Kim, J.; Lukowicz, P.; Lee, Y.O. Two-stage generative adversarial networks for binarization of color document images. Pattern Recognition, 2022; 108810. [Google Scholar]
- Jemni, S.K.; Souibgui, M.A.; Kessentini, Y.; Fornés, A. Enhance to read better: A multi-task adversarial network for handwritten document image enhancement. Pattern Recognition 2022, 123, 108370. [Google Scholar] [CrossRef]
- Racjesh, B.; Agrawal, M.K.; Bhuva, M.; Kishore, K.; Javed, M. Document Image Binarization in JPEG Compressed Domain using Dual Discriminator Generative Adversarial Networks. In Computer Vision and Machine Intelligence: Proceedings of CVMI 2022; Springer, 2023; pp. 761–774.
- Fathallah, A.; El Yacoubi, M.; Amara, N.B. EHDI: enhancement of historical document images via generative adversarial network. 18th International Conference on Computer Vision Theory and Applications (VISAPP). SCITEPRESS-Science and Technology Publications, 2023, Vol. 4, pp. 238–245.
- Guo, Y.; Ji, C.; Zheng, X.; Wang, Q.; Luo, X. Multi-scale multi-attention network for moiré document image binarization. Signal Process. Image Commun. 2021, 90, 116046. [Google Scholar] [CrossRef]
- Pandey, S.; Bharti, J. Document Enhancement and Binarization Using Deep Learning Approach. Proceedings of Third International Conference on Intelligent Computing, Information and Control Systems: ICICCS 2021. Springer, 2022, pp. 133–145.
- Peng, X.; Wang, C.; Cao, H. Document binarization via multi-resolutional attention model with DRD loss. 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 2019, pp. 45–50.
- Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative Adversarial Networks. ML 2014, [1406.2661].
- Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015; Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F., Eds.; Springer International Publishing: Cham, 2015; pp. 234–241. [Google Scholar]
- Nikitin, F.; Dokholyan, V.; Zharikov, I.; Strijov, V. U-Net Based Architectures for Document Text Detection and Binarization, 2019. [CrossRef]
- Detsikas, N.; Mitianoudis, N.; Papamarkos, N. A Dilated MultiRes Visual Attention U-Net for historical document image binarization. Signal Processing: Image Communication 2024, 122, 117102. [Google Scholar] [CrossRef]
- Dumoulin, V.; Visin, F. A guide to convolution arithmetic for deep learning, 2018. arXiv:stat.ML/1603.07285].
- Ioffe, S.; Szegedy, C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. International conference on machine learning. pmlr, 2015, pp. 448–456.
- Souibgui, M.A.; Biswas, S.; Jemni, S.K.; Kessentini, Y.; Fornés, A.; Lladós, J.; Pal, U. Docentr: An end-to-end document image enhancement transformer. 2022 26th International Conference on Pattern Recognition (ICPR). IEEE, 2022, pp. 1699–1705.
- Burie, J.C.; Coustaty, M.; Hadi, S.; Kesiman, M.W.A.; Ogier, J.M.; Paulus, E.; Sok, K.; Sunarya, I.M.G.; Valy, D. ICFHR2016 Competition on the Analysis of Handwritten Text in Images of Balinese Palm Leaf Manuscripts. 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), 2016, pp. 596–601. [CrossRef]
- Ayatollahi, S.M.; Ziaei Nafchi, H. Persian heritage image binarization competition (PHIBC 2012). 2013 First Iranian Conference on Pattern Recognition and Image Analysis (PRIA), 2013, pp. 1–4. [CrossRef]
- Nicolaou, A.; Christlein, V.; Riba, E.; Shi, J.; Vogeler, G.; Seuret, M. TorMentor: Deterministic dynamic-path, data augmentations with fractals. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 2707–2711.
- Lu, H.; Kot, A.; Shi, Y. Distance-reciprocal distortion measure for binary document images. IEEE Signal Processing Letters 2004, 11, 228–231. [Google Scholar] [CrossRef]
- Tan, M.; Le, Q.V. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. CoRR 2019, abs/1905.11946, [1905.11946]. [Google Scholar]














| Algorithm | PSNR | FM | pFM | DRD |
|---|---|---|---|---|
| Otsu [20] | 15.03 | 80.18 | 82.65 | 26.45 |
| Sauvola at el. [22] | 16.71 | 82.89 | 87.95 | 6.59 |
| Xiong at el. [46] | 21.68 | 94.26 | 95.16 | 2.08 |
| Kang at el. [59] | 21.37 | 95.16 | 96.44 | 1.13 |
| Tensmeyer at el. [49] | 20.60 | 92.53 | 96.67 | 2.48 |
| Zhao at el. [64] | 21.91 | 94.96 | 96.15 | 1.55 |
| Jemni at el. [69] | 22.00 | 95.18 | 94.63 | 1.62 |
| Souibgui at el. [81] | 22.29 | 95.31 | 96.29 | 1.60 |
| Model 1 [63] | 23.16 | 96.02 | 97.31 | 1.14 |
| Model 2 [63] | 23.24 | 96.26 | 97.29 | 1.12 |
| Model 3 [63] | 23.24 | 96.25 | 97.51 | 1.12 |
| Ensenbel model [63] | 23.27 | 96.25 | 97.58 | 1.11 |
| Proposed method | 23.32 | 96.37 | 97.73 | 1.08 |
| Algorithm | PSNR | FM | pFM | DRD |
|---|---|---|---|---|
| Otsu [20] | 13.83 | 77.73 | 77.89 | 15.54 |
| Sauvola at el. [22] | 14.25 | 77.11 | 84.1 | 8.85 |
| Xiong at el. [46] | 17.99 | 89.37 | 90.80 | 5.51 |
| Kang at el. [59] | 15.85 | 91.57 | 93.55 | 2.92 |
| Winer Algorithm [15] | 18.28 | 91.04 | 92.86 | 3.40 |
| Zhao at el. [64] | 17.83 | 90.73 | 92.58 | 3.58 |
| Jemni at el. [69] | 17.45 | 89.80 | 89.95 | 4.03 |
| Souibgui at el. [81] | 19.11 | 92.53 | 95.15 | 2.37 |
| Model 1 [63] | 18.99 | 92.50 | 95.05 | 2.49 |
| Model 2 [63] | 19.04 | 92.60 | 94.83 | 2.44 |
| Ensenbel Model [63] | 19.04 | 93.01 | 95.42 | 2.29 |
| Proposed method | 19.29 | 93.23 | 95.90 | 2.22 |
| Algorithm | PSNR | FM | pFM | DRD |
|---|---|---|---|---|
| Otsu [20] | 9.74 | 51.45 | 53.05 | 59.07 |
| Sauvola at el. [22] | 13.78 | 67.81 | 74.08 | 17.69 |
| Xiong at el. [46] Winner | 19.11 | 88.34 | 90.37 | 4.93 |
| Kang at el. [59] | 19.39 | 89.71 | 91.62 | 2.51 |
| Zhao at el. [64] | 18.37 | 87.73 | 90.60 | 4.58 |
| Jemni at el. [69] | 20.18 | 92.41 | 94.35 | 2.60 |
| Souibgui at el. [81] | 19.46 | 90.59 | 93.97 | 3.35 |
| Model 1 [63] | 19.79 | 90.65 | 93.50 | 3.63 |
| Model 2 [63] | 19.94 | 91.87 | 95.62 | 2.77 |
| Model 3 [63] | 19.88 | 91.46 | 95.00 | 3.00 |
| Ensenbel Model [63] | 20.29 | 92.47 | 95.99 | 2.50 |
| Proposed method | 19.517 | 90.5907 | 94.796 | 3.29 |
| Image name | PSNR | FM | pFM | DRD |
|---|---|---|---|---|
| 1-2018 | 20.23 | 89.96 | 97.52 | 3.18 |
| 2-2018 | 16.70 | 78.35 | 83.88 | 8.78 |
| 3-2018 | 18.40 | 95.38 | 99.05 | 1.5 |
| 4-2018 | 20.77 | 85.61 | 96.81 | 2.65 |
| 5-2018 | 19.02 | 90.17 | 93.96 | 4.25 |
| 6-2018 | 21.96 | 96.67 | 97.91 | 1.76 |
| 7-2018 | 22.86 | 93.21 | 94.87 | 2.25 |
| 8-2018 | 16.78 | 90.31 | 95.09 | 2.71 |
| 9-2018 | 23.39 | 97.01 | 97.44 | 1.23 |
| 10-2018 | 15.05 | 89.24 | 91.44 | 5.61 |
| Average | 19.52 | 90.59 | 94.80 | 3.39 |
| Algorithm | PSNR | FM | pFM | DRD |
|---|---|---|---|---|
| Otsu [20] | 9.029 | 47.67 | 48.01 | 109.84 |
| Sauvola at el. [22] | 13.12 | 64.71 | 66.37 | 21.24 |
| Xiong at el. [46] | 11.84 | 46.61 | 47.06 | 24.13 |
| Model-9.0M [63] | 14.99 | 65.81 | 68.10 | 9.98 |
| Model-37.0M [63] | 15.64 | 72.07 | 73.46 | 7.72 |
| Proposed Model-8.9M | 15.25 | 65.92 | 66.20 | 8.97 |
| Name | PSNR | FM | pFM | DRD |
|---|---|---|---|---|
| M0 | 18.56 | 91.57 | 94.91 | 2.94 |
| M1 | 18.84 | 92.45 | 95.37 | 2.55 |
| M2 | 18.88 | 92.39 | 95.12 | 2.75 |
| M3 | 19.29 | 93.23 | 95.99 | 2.22 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).