Submitted:
17 March 2023
Posted:
17 March 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction

2. Related Works
| .Article | Contribution | Shortcomings and future improvements |
|---|---|---|
| Chen et al. [11] | They put forward a model for multi-directional nameplate text recognition, which converts curved text into nearly horizontal text. | Unable to process extremely long text, lack of long text data set verification. |
| Ning et al. [12] | This study proposes a method of using the improved Faster-RCNN as a label positioning algorithm. | Typical nameplates have visible locations, so basic target detection algorithms are adequate and faster. |
| Zhang et al. [13] | The study proposed a similar edge detection method to locate and transform the position of nameplates. | CTPN is a powerful text detection algorithm, but now there are stronger text detection algorithms available. |
| Panhwar et al. [14] | This research suggests a framework for signboard text detection and recognition in natural environment. | The accuracy of model identification needs to be improved now, and the speed of experimental results is not improved. |
| Wu et al. [15] | The study proposes a region-based method for providing good coverage of the corners of plaques. | Despite the algorithm’s impressive performance, its limited background fails to prove its applicability in complex industrial settings. |
| Zhao et al. [16] | Hough transform is incorporated into the deep learning framework, and a new semantic line detection method in natural scenes is proposed. | Hough transform is common in line segment and simple figure prediction, and this improved method can be used to predict other shapes in complex background. |
| Kagawa et al. [17] | A robust method for detecting aging sockets to ensure normal use of IC chips in sockets was proposed in the article. | More images need examining, applying this method to different types of diffuse reflection, and analyzing the properties of diffused reflected light in detail is necessary. |
| Aslani et al. [18] | A method for counting incomplete or fragmented red blood cells was proposed in the article, which was preceded by image preprocessing. | Despite saving researchers a lot of time, this method still has significant errors in medicine and requires more precise performance. |
| Marzougui et al. [19] | A visual-based lane tracking method was proposed in this study. | Errors in traffic scenarios can be deadly, making this method challenging to use in practical applications. |
| Kumar et al. [20] | An efficient method for detecting highway lanes was proposed in this study. | Method detects both curved and straight lane lines but needs enhanced accuracy for real road conditions. |
| Ahmad et al. [21] | The study used the probabilistic Hough transform for clustering, effectively correcting skewed documents. | Detecting densely packed text can be challenging, requiring additional processing. Although it may be effective in clean environments, it has not been demonstrated to be robust across varied backgrounds. |
| Ma et al. [22] | This paper suggests using deep compression learning to calculate edges in high-resolution images of deep-sea mining. | Data compression is inevitably accompanied by the loss of accuracy, so it is necessary to extract and retain key information modules. |
| Yang et al. [23] | A multi-feature fusion network is proposed and a new end-to-end 3D object detection framework is designed. | The accuracy of single-stage detection algorithm is often lower than other methods, and the detection speed of this method has no obvious advantage. |
3. The Proposed Method
3.1. Hough Transform and Its Variants
3.2. Two-Sided Detection Utilizing Probabilistic Hough Transform
3.3. Relevant Principles of Image Correction Based on Probabilistic Hough Transform

4. Experiments & Analysis
4.1. Dataset
4.2. Metrics
| / | Actual class | ||
|---|---|---|---|
| Positive | Negative | ||
| Predicted class | Positive | True Positive(TP) | False Positive(FP) |
| Negative | False Negtive(FN) | True Negtive(TN) | |
4.3. Perspective Transform Based on Probabilistic Hough Transform
4.4. Comparative Experiment
| Dataset | Precision | Recall | F-measure | FPS |
|---|---|---|---|---|
| TD500 | 0.86 | 0.79 | 0.82 | 66 |
| TD500 trained model in MEND | 0.34 | 0.14 | 0.84 | 58 |
| SynthText pre-trained in MEND | 0.14 | 0.01 | 0.02 | 61 |
| SynthText pre-trained, and trained in new pictures in MEND |
0.84 | 0.81 | 0.82 | 52 |
| Dataset | Precision | Recall | F-measure | FPS |
|---|---|---|---|---|
| TD500 | 0.91 | 0.80 | 0.85 | 40 |
| TD500 trained model in MEND | 0.36 | 0.26 | 0.30 | 28 |
| SynthText pre-trained in MEND | 0.06 | 0.16 | 0.86 | 25 |
| SynthText pre-trained, and trained in new pictures in MEND |
0.86 | 0.84 | 0.85 | 36 |
| Dataset | Precision | Recall | F-measure | FPS |
|---|---|---|---|---|
| ICDAR2015 | 0.91 | 0.83 | 0.87 | 12 |
| TD500 pre-trained model in MEND | 0.09 | 0.083 | 0.08 | 10 |
| ICDAR2015 pre-trained, and trained in new pictures in MEND |
0.72 | 0.45 | 0.55 | 11 |
| TOTALTEXT | 0.86 | 0.84 | 0.85 | 25 |
| TOTALTEXT pre-trained model in MEND |
0.17 | 0.08 | 0.10 | 25 |
| TOTALTEXT pre-trained and trained in new pictures in MEND |
0.91 | 0.86 | 0.89 | 25 |
| Dataset | Precision | Recall | F-measure | FPS |
|---|---|---|---|---|
| Faster-RCNN trained in MEND | 0.71 | 0.73 | 0.72 | 14 |
| Faster-RCNN trained in new | ||||
| pictures in MEND(ours) | 0.85 | 0.75 | 0.80 | 14 |
| Yolov3 trained model in MEND | 0.72 | 0.45 | 0.55 | 25 |
| Yolov3 trained in new | ||||
| pictures in MEND (ours) | 0.80 | 0.82 | 0.81 | 25 |
5. Conclusions
Funding
References
- Yu, X.; Ye, X.; Zhang, S. Floating pollutant image target extraction algorithm based on immune extremum region. Digital Signal Processing 2022, 123, 103442. [Google Scholar] [CrossRef]
- Yu, X.; Tian, X. A fault detection algorithm for pipeline insulation layer based on immune neural network. International Journal of Pressure Vessels and Piping 2022, 196, 104611. [Google Scholar] [CrossRef]
- Liang, T.; Bao, H.; Pan, W.; Pan, F. Traffic sign detection via improved sparse R-CNN for autonomous vehicles. Journal of Advanced Transportation 2022, 2022, 1–16. [Google Scholar] [CrossRef]
- Liao, M.; Wan, Z.; Yao, C.; Chen, K.; Bai, X. Real-time scene text detection with differentiable binarization. Proceedings of the AAAI conference on artificial intelligence, 2020, Vol. 34, pp. 11474–11481.
- Sabu, A.M.; Das, A.S. A Survey on various Optical Character Recognition Techniques. 2018 conference on emerging devices and smart systems (ICEDSS). IEEE, 2018, pp. 152–155.
- Wu, W.; Xing, J.; Yang, C.; Wang, Y.; Zhou, H. Texts as Lines: Text Detection with Weak Supervision. Mathematical Problems in Engineering 2020, 2020, 1–12. [Google Scholar] [CrossRef]
- Li, J.; Huang, T.; Yang, Y.; Xu, Q. Detection and Recognition of Characters on the Surface of Metal Workpieces with Complex Background. 2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC), 2020.
- Yuan, J.; Guo, M.; Huang, B.; Hu, R.; Dian, S. Processing and Recognition of Characters Image in Complex Environment. 2022 International Conference on Innovations and Development of Information Technologies and Robotics (IDITR), 2022, pp. 100–104.
- Khan, T.; Sarkar, R.; Mollah, A.F. Deep learning approaches to scene text detection: a comprehensive review. Artificial Intelligence Review 2021, 54, 3239–3298. [Google Scholar] [CrossRef]
- Long, S.; He, X.; Yao, C. Scene text detection and recognition: The deep learning era. International Journal of Computer Vision 2021, 129, 161–184. [Google Scholar] [CrossRef]
- Chen, X.; Zhang, Z.; Qiao, Y.; Lai, J.; Jiang, J.; Zhang, Z.; Fu, B. Orientation robust scene text recognition in natural scene. 2019 IEEE International Conference on Robotics and Biomimetics (ROBIO). IEEE, 2019, pp. 901–906.
- Baifeng, N.; Ganzi, H.; Yu, Y. Research on nameplate image recognition algorithm based on R-CNN and SSD deep learning detection methods. 2022 IEEE 2nd International Conference on Power, Electronics and Computer Applications (ICPECA). IEEE, 2022, pp. 580–584.
- Shuliang, Z.; Xin, H.; Haochen, Z.; Jianyu, W. Design of text position detection method for electrical equipment nameplate. 2021 International Conference on Cyber-Physical Social Intelligence (ICCSI). IEEE, 2021, pp. 1–4.
- Panhwar, M.A.; Memon, K.A.; Abro, A.; Zhongliang, D.; Khuhro, S.A.; Memon, S. Signboard detection and text recognition using artificial neural networks. 2019 IEEE 9th international conference on electronics information and emergency communication (ICEIEC). IEEE, 2019, pp. 16–19.
- Wu, Y.; Li, Z.; Wang, Y.; Huang, Z.; Zheng, Z. Application Research of Feature Extraction Method of Power Equipment Nameplate. Computer Science and Application 2019, 09, 2084–2097. [Google Scholar] [CrossRef]
- Zhao, K.; Han, Q.; Zhang, C.B.; Xu, J.; Cheng, M.M. Deep hough transform for semantic line detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 2021, 44, 4793–4806. [Google Scholar] [CrossRef] [PubMed]
- Kagawa, T.; Ikemoto, M.; Ohtake, S. A robust method of IC seating inspection in burn-in sockets using Hough transform. 2022 IEEE International Conference on Consumer Electronics-Taiwan. IEEE, 2022, pp. 1–2.
- Aslani, A.A.; Zolfaghari, M.; Sajedi, H. Automatic Counting Red Blood Cells in the Microscopic Images by EndPoints Method and Circular Hough Transform. 2022 16th International Conference on Ubiquitous Information Management and Communication (IMCOM). IEEE, 2022, pp. 1–5.
- Marzougui, M.; Alasiry, A.; Kortli, Y.; Baili, J. A lane tracking method based on progressive probabilistic Hough transform. IEEE access 2020, 8, 84893–84905. [Google Scholar] [CrossRef]
- Kumar, S.; Jailia, M.; Varshney, S. An efficient approach for highway lane detection based on the Hough transform and Kalman filter. Innovative infrastructure solutions 2022, 7, 290. [Google Scholar] [CrossRef]
- Ahmad, R.; Naz, S.; Razzak, I. Efficient skew detection and correction in scanned document images through clustering of probabilistic hough transforms. Pattern Recognition Letters 2021, 152, 93–99. [Google Scholar] [CrossRef]
- Ma, C.; Li, X.; Li, Y.; Tian, X.; Wang, Y.; Kim, H.; Serikawa, S. Visual information processing for deep-sea visual monitoring system. Cognitive Robotics 2021, 1, 3–11. [Google Scholar] [CrossRef]
- Yang, S.; Lu, H.; Li, J. Multifeature fusion-based object detection for intelligent transportation systems. IEEE Transactions on Intelligent Transportation Systems 2022. [Google Scholar] [CrossRef]
- Lai, J.; Guo, L.; Qiao, Y.; Chen, X.; Zhang, Z.; Liu, C.; Li, Y.; Fu, B. Robust text line detection in equipment nameplate images. 2019 IEEE International Conference on Robotics and Biomimetics (ROBIO). IEEE, 2019, pp. 889–894.
- Li, J.; Zhang, W.; Han, R. Application of machine vision in defects inspection and character recognition of nameplate surface. 2014 13th International Symposium on Distributed Computing and Applications to Business, Engineering and Science. IEEE, 2014, pp. 295–298.
- Nakayama, Y.; Lu, H.; Li, Y.; Kamiya, T. WideSegNeXt: semantic image segmentation using wide residual network and NeXt dilated unit. IEEE Sensors Journal 2020, 21, 11427–11434. [Google Scholar] [CrossRef]
- Gupta, A.; Vedaldi, A.; Zisserman, A. Synthetic data for text localisation in natural images. Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 2315–2324.
- Yao, C.; Bai, X.; Liu, W.; Ma, Y.; Tu, Z. Detecting texts of arbitrary orientations in natural images. 2012 IEEE conference on computer vision and pattern recognition. IEEE, 2012, pp. 1083–1090.
- Karatzas, D.; Gomez-Bigorda, L.; Nicolaou, A.; Ghosh, S.; Bagdanov, A.; Iwamura, M.; Matas, J.; Neumann, L.; Chandrasekhar, V.R.; Lu, S.; others. ICDAR 2015 competition on robust reading. 2015 13th international conference on document analysis and recognition (ICDAR). IEEE, 2015, pp. 1156–1160.
- Ch’ng, C.K.; Chan, C.S. Total-text: A comprehensive dataset for scene text detection and recognition. 2017 14th IAPR international conference on document analysis and recognition (ICDAR). IEEE, 2017, Vol. 1, pp. 935–942.
- Tian, Z.; Huang, W.; He, T.; He, P.; Qiao, Y. Detecting text in natural image with connectionist text proposal network. Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part VIII 14. Springer, 2016, pp. 56–72.
- Zhou, X.; Yao, C.; Wen, H.; Wang, Y.; Zhou, S.; He, W.; Liang, J. East: an efficient and accurate scene text detector. Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, 2017, pp. 5551–5560.
- Wang, P.; Zhang, C.; Qi, F.; Huang, Z.; En, M.; Han, J.; Liu, J.; Ding, E.; Shi, G. A single-shot arbitrarily-shaped text detector based on context attended multi-task learning. Proceedings of the 27th ACM international conference on multimedia, 2019, pp. 1277–1285.
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770–778.
- He, T.; Zhang, Z.; Zhang, H.; Zhang, Z.; Xie, J.; Li, M. Bag of tricks for image classification with convolutional neural networks. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019, pp. 558–567.





Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).