Submitted:
13 September 2024
Posted:
13 September 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Works
2.1. Traditional Machine Learning-Based Image Classification Methods
2.2. Deep Learning-Based Image Classification Methods
2.3. High-Resolution Image Detection Related Research
2.4. Image Retrieval Methods
3. Methodology
3.1. Classification of Carriage Line-Array Images
3.2. Template Matching-Based Image Recognition Method
3.3. Weighed Radial Basis Function for Coordinate Transformation
3.4. Objective Function Design
3.5. Alignment Quality Assessment Method
4. Experiments
4.1. Parameter Selection
4.2. Image Recognition Accuracy Evaluation
4.3. Comparison of Image Recognition Speeds
4.4. Algorithm Robustness Evaluation
4.4.1. Evaluation of Resistance to Local Nonlinear Distortions
4.4.2. Evaluation of Resistance to False Detections
4.4.3. Evaluation of Resistance to Missed Detections
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Chang, L.; Liu, Z.; Shen,Y.; Zhang,G. Novel multistate fault diagnosis and location method for key components of high-speed trains. IEEE Trans. Ind. Electron. 2020, 68, 3537-3547. [CrossRef]
- Lu, S.; Liu, Z.; Shen, Y. Automatic fault detection of multiple targets in railway maintenance based on time-scale normalization. IEEE Trans. Instrum. Meas. 2018, 67, 849-865. [CrossRef]
- Fu, Z.; Pan, X.; Zhang, G. Linear array image alignment under nonlinear scale distortion for train fault detection. IEEE Sens. J.. [CrossRef]
- Chen, H.; Jiang, B. A review of fault detection and diagnosis for the traction system in high-speed trains. IEEE Trans. Intell. Transp. Syst. 2019, 21, 450-465. [CrossRef]
- Chandra, M.A.; Bedi, S.S. Survey on SVM and their application in image classification. Int. J. Inf. Technol. 2021, 13, 1-11. [CrossRef]
- Bosch, A.; Zisserman, A.; Munoz, X. Image classification using random forests and ferns. In Proc. 2007 IEEE 11th Int. Conf. Comput. Vis. 2007, 1-8. [CrossRef]
- Amato, G.; Falchi, F. kNN based image classification relying on local feature similarity. In Proc. 3rd Int. Conf. Similarity Search Appl. 2010, 101-108. [CrossRef]
- Timofte, R.; Tuytelaars, T.; Van Gool, L. Naive Bayes image classification: beyond nearest neighbors. In Proc. Asian Conf. Comput. Vis., Berlin, Heidelberg: Springer, 2012, 689-703. [CrossRef]
- Wang, C.; Huang, K. How to use bag-of-words model better for image classification. Image Vis. Comput. 2015, 38, 65-74. [CrossRef]
- Tan, M.; Le, Q. Efficientnet: Rethinking model scaling for convolutional neural networks. In Proc. Int. Conf. Mach. Learn., PMLR, 2019, 6105-6114.
- Tan, M.; Le, Q. Efficientnetv2: Smaller models and faster training. In Proc. Int. Conf. Mach. Learn., PMLR, 2021,10096-10106.
- Touvron, H.; Bojanowski, P.; Caron, M. ResMLP: Feedforward networks for image classification with data-efficient training. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 45, 5314-5321. [CrossRef]
- He, K.; Zhang, X.; Ren, S. Deep residual learning for image recognition. In Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2016, 770-778.
- Liang, J. Image classification based on RESNET. J. Phys.: Conf. Ser. 2020, 1634, 012110. [CrossRef]
- Cheng, D.; Meng, G.; Cheng, G. SeNet: Structured edge network for sea–land segmentation. IEEE Geosci. Remote Sens. Lett. 2016, 14, 247-251. [CrossRef]
- Zhang, X.; Zhou, X.; Lin, M. Shufflenet: An extremely efficient convolutional neural network for mobile devices. In Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2018, 6848-6856. [CrossRef]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A. An image is worth 16x16 words: Transformers for image recognition at scale.2020, arXiv preprint, arXiv:2010.11929. [CrossRef]
- Zheng, L.; Ai, P.; Wu, Y. Building recognition of UAV remote sensing images by deep learning. In Proc. IGARSS 2020 IEEE Int. Geosci. Remote Sens. Symp., 2020, 1185-1188. [CrossRef]
- Hou, X.; Ao, W.; Xu, F. End-to-end automatic ship detection and recognition in high-resolution Gaofen-3 spaceborne SAR images. In Proc. IGARSS 2019 IEEE Int. Geosci. Remote Sens. Symp., 2019, 9486-9489. [CrossRef]
- Sun, Z.; Xiong, B.; Lei, Y. Ship classification in high-resolution SAR images based on CNN regional feature fusion. In Proc. 2021 CIE Int. Conf. Radar, 2021,1445-1449. [CrossRef]
- Liu, Y.; Chang, M.; Xu, J. High-resolution remote sensing image information extraction and target recognition based on multiple information fusion. IEEE Access 2020, 8, 121486-121500. [CrossRef]
- Liu, Z.; Gao, G.; Sun, L.; HRDNet: High-resolution detection network for small objects. In Proc. 2021 IEEE Int. Conf. Multimedia Expo (ICME), 2021, 1-6. [CrossRef]
- Růžička, V.; Franchetti, F. Fast and accurate object detection in high resolution 4K and 8K video using GPUs. In Proc. 2018 IEEE High Perform. Extreme Comput. Conf. (HPEC), 2018, 1-7. [CrossRef]
- Li, Y. Research and application of deep learning in image recognition. In Proc. 2022 IEEE 2nd Int. Conf. Power Electron. Comput. Appl. (ICPECA), 2022, 994-999. [CrossRef]
- Liu, H.; Wang, R.; Shan, S. Deep supervised hashing for fast image retrieval. In Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2016, 2064-2072.
- Gordo, A.; Almazan, J. Revaud, J. End-to-end learning of deep visual representations for image retrieval. Int. J. Comput. Vis. 2017, 124, 237-254. [CrossRef]
- Liu, C.; Yu, G.; Volkovs, M. Guided similarity separation for image retrieval. Adv. Neural Inf. Process. Syst. 2019, 32.
- Ramzi, E.; Audebert, N.; Rambour, C. Optimization of rank losses for image retrieval. 2023, arXiv preprint, arXiv:2309.08250. [CrossRef]
- Smeulders, A. W. M.; Worring, M.; Santini, S. Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 2000, 22, 1349-1380. [CrossRef]
- Noh, H.; Araujo, A.; Sim, J. Large-scale image retrieval with attentive deep local features. In Proc. IEEE Int. Conf. Comput. Vis., 2017, 3456-3465. [CrossRef]
- Henkel, C. Efficient large-scale image retrieval with deep feature orthogonality and hybrid-swin-transformers.2021, arXiv preprint, arXiv:2110.03786. [CrossRef]
- Lee, C. C.; Chung, P.; Tsai, J. Robust radial basis function neural networks. IEEE Trans. Syst. Man, Cybern. B 1999, 29, 674-685. [CrossRef]
- Ranganathan, A. The Levenberg-Marquardt algorithm. Tutoral on LM Algorithm 2004, 11, 101-110.
- Alcantarilla, P. F.; Solutions, T. Fast explicit diffusion for accelerated features in nonlinear scale spaces. IEEE Trans. Pattern Anal. Mach. Intell. 2011, 34, 1281-1298. [CrossRef]
- Leutenegger, S.; Chli, M.; Siegwart, R. BRISK: Binary robust invariant scalable keypoints. In Proc. 2011 Int. Conf. Comput. Vis., 2011, 2548-2555. [CrossRef]
- Rublee, E.; Rabaud, V.; Konolige, K. ORB: An efficient alternative to SIFT or SURF. In Proc. 2011 Int. Conf. Comput. Vis., 2011, 2564-2571. [CrossRef]
- Lowe, D.G.; Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis., 2004, 60, 91-110. [CrossRef]
- DeTone, D.; Malisiewicz, T.; Rabinovich, A. Superpoint: Self-supervised interest point detection and description. In Proc. IEEE Conf. Comput. Vis. Pattern Recognit. Workshops, 2018, 224-236. [CrossRef]
- Yoo, J. C.; Han, T. H. Fast normalized cross-correlation. Circuits, Systems and Signal Processing 2009, 28, 819-843. [CrossRef]








| 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | 0.6 | 0.7 | 0.8 | 0.9 | 1.0 | |
| Mean Recognition Accuracy (%) | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 87.50 |
| Mean Matching Ratio (%) | 99.77 | 99.84 | 99.71 | 99.96 | 100.00 | 97.46 | 96.86 | 96.66 | 96.25 | 95.32 |
| Class | 76-category | 19-category | |||||||||||
| AKAZE[34] | BRISK[35] | ORB[36] | SIFT[37] | Super[38] | NCC[39] | Ours | AKAZE[34] | BRISK[35] | ORB[36] | SIFT[37] | Super[38] | Ours | |
| B22 | 0.087 | 0.095 | 0.11 | 0.13 | 0.122 | 0.059 | 1.0 | 0.133 | 0.255 | 0.225 | 0.69 | 0.355 | 1.0 |
| B22-1 | 0.397 | 0.225 | 0.23 | 0.27 | 0.32 | 0.238 | 1.0 | 0.488 | 0.278 | 0.183 | 0.71 | 0.773 | 1.0 |
| B23 | 0.14 | 0.167 | 0.112 | 0.1275 | 0.17 | 0.093 | 1.0 | 0.44 | 0.518 | 0.365 | 0.227 | 0.304 | 1.0 |
| B23-1 | 0.445 | 0.502 | 0.477 | 0.455 | 0.477 | 0.421 | 1.0 | 0.688 | 0.82 | 0.988 | 0.30 | 0.361 | 1.0 |
| BDL1 | 1.0 | 1.0 | 0.975 | 0.998 | 0.985 | 0.942 | 1.0 | 1.0 | 0.715 | 0.59 | 0.295 | 0.613 | 1.0 |
| BH1 | 0.16 | 0.09 | 0.057 | 0.237 | 0.45 | 0.149 | 1.0 | 0.308 | 0.113 | 0.14 | 0.41 | 0.19 | 1.0 |
| BX1K | 0.112 | 0.109 | 0.108 | 0.152 | 0.252 | 0.097 | 1.0 | 0.444 | 0.438 | 0.385 | 0.644 | 1.0 | 1.0 |
| BX1K-1 | 0.082 | 0.129 | 0.102 | 0.087 | 0.112 | 0.052 | 1.0 | 0.179 | 0.185 | 0.185 | 0.35 | 0.448 | 1.0 |
| C64K | 1.0 | 0.982 | 1.0 | 1.0 | 0.93 | 0.932 | 1.0 | 1.0 | 0.63 | 0.565 | 0.69 | 0.68 | 1.0 |
| C70E | 0.243 | 0.043 | 0.058 | 0.23 | 0.02 | 0.069 | 1.0 | 0.8 | 1.0 | 1.0 | 0.68 | 0.57 | 1.0 |
| C70 | 1.0 | 0.968 | 1.0 | 0.998 | 0.995 | 0.942 | 1.0 | 1.0 | 0.578 | 0.595 | 0.843 | 0.503 | 1.0 |
| JSQ5 | 0.273 | 0.175 | 0.168 | 0.188 | 0.335 | 0.178 | 1.0 | 0.388 | 0.315 | 0.401 | 0.688 | 0.995 | 1.0 |
| JSQ6 | 0.089 | 0.164 | 0.196 | 0.199 | 0.139 | 0.107 | 1.0 | 0.181 | 0.544 | 0.628 | 0.818 | 0.74 | 1.0 |
| JSQ6-1 | 0.163 | 0.155 | 0.143 | 0.155 | 0.203 | 0.114 | 1.0 | 0.315 | 0.3325 | 0.45 | 0.2525 | 0.435 | 1.0 |
| NX17K | 0.133 | 0.233 | 0.165 | 0.118 | 0.118 | 0.103 | 1.0 | 0.14 | 0.388 | 0.36 | 0.308 | 0.463 | 1.0 |
| NX70A | 0.998 | 0.995 | 0.963 | 0.998 | 0.995 | 0.940 | 1.0 | 1.0 | 1.0 | 0.963 | 0.998 | 0.593 | 1.0 |
| NX70 | 0.515 | 0.503 | 0.548 | 0.470 | 0.483 | 0.454 | 1.0 | 0.53 | 0.683 | 0.538 | 0.373 | 0.123 | 1.0 |
| X68BK | 0.5 | 0.573 | 0.543 | 0.455 | 0.473 | 0.459 | 1.0 | 0.728 | 0.363 | 0.18 | 0.413 | 0.303 | 1.0 |
| X70 | 0.1 | 0.113 | 0.143 | 0.280 | 0.018 | 0.081 | 1.0 | 0.16 | 0.793 | 0.668 | 0.605 | 1.0 | 1.0 |
| Ave. ACC | 0.391 | 0.379 | 0.373 | 0.397 | 0.399 | 0.338 | 1.0 | 0.515 | 0.51 | 0.481 | 0.538 | 0.549 | 1.0 |
| Class | 76-category | 19-category | ||||||||||||
| Efficient NetV2 [11] |
Resmlp [12] |
Resnet [13] |
Senet [15] |
Shufflenet [16] |
Vit [17] |
Ours | Efficient NetV2 [11] |
Resmlp [12] |
Resnet [13] |
Senet [15] |
Shufflenet [16] |
Vit [17] |
Ours | |
| B22 | 0.123 | 0.513 | 0.513 | 0.600 | 0.128 | 0.253 | 1.0 | 0.165 | 0.193 | 0.680 | 0.787 | 0.573 | 0.655 | 1.0 |
| B22-1 | 0.475 | 0.380 | 0.380 | 0.348 | 0.438 | 0.338 | 1.0 | 0.673 | 0.440 | 0.818 | 0.448 | 0.328 | 0.168 | 1.0 |
| B23 | 0.298 | 0.488 | 0.488 | 0.393 | 0.158 | 0.383 | 1.0 | 0.563 | 0.406 | 0.405 | 0.993 | 0.660 | 0.790 | 1.0 |
| B23-1 | 0.500 | 0.503 | 0.503 | 0.580 | 0.498 | 0.535 | 1.0 | 0.750 | 0.120 | 0.635 | 0.545 | 0.553 | 0.693 | 1.0 |
| BDL1 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 0.520 | 0.678 | 0.750 | 0.663 | 0.838 | 1.0 |
| BH1 | 0.215 | 0.308 | 0.308 | 0.220 | 0.085 | 0.120 | 1.0 | 0.228 | 0.513 | 0.713 | 0.815 | 0.808 | 0.011 | 1.0 |
| BX1K | 0.149 | 0.423 | 0.423 | 0.460 | 0.168 | 0.402 | 1.0 | 0.149 | 0.703 | 0.418 | 0.753 | 0.413 | 0.305 | 1.0 |
| BX1K-1 | 0.139 | 0.309 | 0.309 | 0.386 | 0.094 | 0.215 | 1.0 | 0.160 | 0.548 | 0.680 | 0.785 | 0.500 | 0.453 | 1.0 |
| C64K | 1.0 | 1.0 | 1.0 | 1.0 | 0.985 | 1.0 | 1.0 | 1.0 | 0.730 | 0.728 | 0.238 | 0.185 | 0.703 | 1.0 |
| C70E | 0.033 | 0.260 | 0.260 | 0.160 | 0.213 | 0.035 | 1.0 | 0.033 | 0.693 | 1.0 | 0.873 | 0.423 | 0.455 | 1.0 |
| C70 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 0.443 | 1.0 | 0.765 | 0.137 | 0.565 | 1.0 |
| JSQ5 | 0.788 | 0.510 | 0.510 | 0.623 | 0.300 | 0.625 | 1.0 | 0.908 | 0.485 | 0.693 | 0.560 | 0.965 | 0.708 | 1.0 |
| JSQ6 | 0.366 | 0.528 | 0.528 | 0.558 | 0.145 | 0.319 | 1.0 | 0.468 | 0.533 | 0.643 | 0.778 | 0.340 | 0.680 | 1.0 |
| JSQ6-1 | 0.250 | 0.268 | 0.268 | 0.378 | 0.158 | 0.433 | 1.0 | 0.355 | 0.573 | 0.500 | 1.0 | 0.513 | 1.0 | 1.0 |
| NX17K | 0.248 | 0.158 | 0.158 | 0.238 | 0.133 | 0.118 | 1.0 | 0.248 | 0.260 | 0.608 | 0.398 | 0.990 | 0.398 | 1.0 |
| NX70A | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 0.245 | 0.613 | 0.440 | 0.645 | 0.618 | 1.0 |
| NX70 | 0.520 | 0.513 | 0.513 | 0.513 | 0.513 | 0.513 | 1.0 | 0.523 | 0.343 | 0.659 | 0.928 | 0.570 | 0.280 | 1.0 |
| X68BK | 0.500 | 0.503 | 0.503 | 0.500 | 0.500 | 0.500 | 1.0 | 0.750 | 0.490 | 0.860 | 0.270 | 0.298 | 0.250 | 1.0 |
| X70 | 0.555 | 0.583 | 0.583 | 0.553 | 0.133 | 0.178 | 1.0 | 0.630 | 0.513 | 0.298 | 0.680 | 0.618 | 0.613 | 1.0 |
| Ave. ACC | 0.482 | 0.539 | 0.539 | 0.553 | 0.402 | 0.472 | 1.0 | 0.558 | 0.460 | 0.664 | 0.674 | 0.536 | 0.536 | 1.0 |
| Method | AKAZE | BRISK | ORB | SIFT | Superpoint | EfficientNetV2 | Resmlp | Resnet | Senet | Shufflenet | Vit | Ours |
| Time(s) | 3.263 | 4.586 | 5.312 | 2.274 | 4.762 | 0.082 | 0.031 | 0.054 | 0.063 | 0.139 | 0.082 | 0.024 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).