Submitted:
03 September 2025
Posted:
03 September 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Dataset, Environment, and Parameters
2.2. Loss Function and Model Evaluation Metrics
2.3. Improved YOLOv7 Network Architecture
2.3.1. YOLOv7 Network Architecture
2.3.2. YOLOv7-STE
2.3.3. Loss Function
3. Experimental Results and Discussion
3.1. Comparative Experimental Results Analysis
3.2. Ablation Experiment
3.3. Comparison of Different Models
4. Conclusions
Author Contributions
Funding
References
- Yang, H.; He, J.; Liu, Z.; Zhang, C. LLD-MFCOS: A Multiscale anchor-free detector based on label localization distillation for wheelset tread defect detection. EEE Trans. Instrum. Meas. 2024, 73, 5003815–1–15. [Google Scholar] [CrossRef]
- Song, Y.; Ji, Z.; Guo, X.; et al. A comprehensive laser image dataset for real-time measurement of wheelset geometric parameters. Scientific Data 2024, 11, 1. [Google Scholar] [CrossRef] [PubMed]
- Liu, J.; Jiang, S.; Wang, Z.; et al. Detection of Train Wheelset Tread Defects with Small Samples Based on Local Inference Constraint Network. Electronics 2024, 13, 11. [Google Scholar] [CrossRef]
- Zhang, C.; Xu, Y.; Yin, H.L. Deformable residual attention network for defect detection of train wheelset tread. Vis. Comput. 2024, 40, 1775–1785. [Google Scholar] [CrossRef]
- Xiangyang, Z.; Jiangping, L. Integrated intelligent system for rail flaw detection vehicle. Electric Drive for Locomotives 2021, 133–137. [Google Scholar]
- Jiangping, L.; Xizhuo, Y.; Jingwei, C.; et al. Intelligent rail flaw detection system based on deep learning and sup- port vector machine. Electric Drive for Locomotives 2021, 100–107. [Google Scholar]
- Xue, W.; Zhenbo, F. A new method of wavelet and sup- port vector machine for detection of the train wheel bruise. China Mech. Eng. 2004, 15, 1641–1643. [Google Scholar]
- Zhijun, X.; Jianzheng, C.. Tread profile of wheel detec- tion method based image processing and Hough transform . Electron. Meas. Technol. 2017, 40, 117–121.
- Zhenwen, S.; Guiyun, W.. Fast method of detect- ing packaging bottle defects based on ECA-EfficientDet. J. Sens. 2022, 2022, 9518910.
- Palazzetti, L.; Rangarajan, A.K.; Dinca, A.; et al. The hawk eye scan: Halyomorpha halys detection relying on aerial tele photos and neural networks. Comput. Electron. Agric. 2024, 226, 109365. [Google Scholar] [CrossRef]
- Shi, C.; Yang, H.; Cai, J.; et al. A Survey of Galaxy Pairs in the SDSS Photometric Images based on Faster-RCNN. Astron. J. 2024, 168, 90. [Google Scholar] [CrossRef]
- Guo, H.; Wu, T.; Gao, G.; et al. Lightweight safflower cluster detection based on YOLOv5. Sci. Rep. 2024, 14, 18579. [Google Scholar] [CrossRef]
- Li, Z.; Zhu, Y.; Sui, S.; et al. Real-time detection and counting of wheat ears based on improved YOLOv7. Comput. Electron. Agric. 2024, 218, 108670. [Google Scholar] [CrossRef]
- Changfan, Z.; Xinliang, H.; Jing, H.; et al. Yolov4 high-speed train wheelset tread defect detection system based on multiscale feature fusion. J. Adv. Transp. 2022, 2022, 1172654. [Google Scholar] [CrossRef]
- Bowen, Z.; Huacai, L.; Shengbo, Z.; et al. Night target detection algorithm based on improved YOLOv7. Sci. Rep. 2024, 14, 15771. [Google Scholar] [CrossRef] [PubMed]
- Wang, D.; Qian, Y.; Lu, J.; et al. Fs-yolo: fire-smoke detection based on improved YOLOv7. Multimed. Syst. 2024, 30, 215. [Google Scholar] [CrossRef]
- Liu, Y.; Jiang, B.; He, H.; et al. Helmet wearing detection algorithm based on improved YOLOv5. Sci. Rep. 2024, 14, 8768. [Google Scholar] [CrossRef] [PubMed]
- Ren, Y.; Zhang, H.; Sun, H.; et al. LightRay: Lightweight network for prohibited items detection in X-ray images during security inspection. Comput. Electr. Eng. 2022, 103, 108283. [Google Scholar] [CrossRef]
- Tian, Y.; Wang, S.; Li, E.; et al. MD-YOLO: Multi-scale Dense YOLO for small target pest detection. Comput. Electron. Agric. 2023, 213, 108233. [Google Scholar] [CrossRef]
- Wei, W.; Wei, P.; Yi, L. . Application of Improved YOLOv3 in Aerial Target Detection. Application of Improved YOLOv3 in Aerial Target Detection. Computer Engineering and Applications 2020, 56, 17–23. [Google Scholar]
- Kaiqi, H.; Xiaorong, L.; Maoyun, H. Research onsmall target detection method based on improved YOLOv3. Transducer and Microsystem Technologies 2022, 41, 52–55.
- Lin, Z.; Huang, M.; Zhou, Q. Infrared small target detection based on YOLO v4. Journal of Physics: Conference Series. IOP Publ. 2023, 2450, 012019.
- Cai, Y.; Yao, Z.; Jiang, H. ,et al.Rapid detection of fish with SVC symptoms based on machine vision combined with a NAM-YOLO v7 hybrid model. Aquaculture 2024, 582. [CrossRef]
- Kang, M.; Ting, C.M.; Ting, F.F.; et al. ASF-YOLO: A novel YOLO model with attentional scale sequence fusion for cell instance segmentation. Image Vis. Comput. 2024, 147, 105057. [Google Scholar] [CrossRef]
- Li, H.; Li, J.; Wei, H.; et al. Slim-neck by GSConv: A better design paradigm of detector architectures for autonomous vehicles. arxiv 2206, arXiv:2206.02424. [Google Scholar]
- Ghiasi, G.; Cui, Y.; Srinivas, A.; et al. Simple copy-paste is a strong data augmentation method for instance segmentation. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2021, 2918-2928.
- Karras, T.; Aittala, M.; Laine, S.; et al. Alias-free generative adversarial networks. Adv. Neural Inf. Process. Syst. 2021, 34, 852–863. [Google Scholar]
- Abayomi-Alli, O.O.; Damaševičius, R.; Misra, S.; et al. Cassava disease recognition from low-quality images using enhanced data augmentation model and deep learning. Expert Syst. 2021, 38, e12746. [Google Scholar] [CrossRef]
- Ye, Y.; Li, Y.; Ouyang, R.; et al. Improving machine learning based phase and hardness prediction of high-entropy alloys by using Gaussian noise augmented data. Comput. Mater. Sci. 2023, 223, 112140. [Google Scholar] [CrossRef]
- Yang, J.; Zhang, X.; Song, C. Research on a small target object detection method for aerial photography based on improved YOLOv7. Vis. Comput. 2024, 1–15. [Google Scholar] [CrossRef]
- Zheng, Z.; Wang, P.; Liu, W.; et al. Distance-IoU loss: Faster and better learning for bounding box regression. Proc. AAAI Conf. Artif. Intell. 2020, 34, 12993–13000. [Google Scholar] [CrossRef]
- Zhang, Y.F.; Ren, W.; Zhang, Z.; et al. Focal and efficient IOU loss for accurate bounding box regression. Neurocomputing 2022, 506, 146–157. [Google Scholar] [CrossRef]











| Categories | Number of targets | |||
|---|---|---|---|---|
| Training set | Validation set | Test set | ||
| pit | 720 | 40 | 40 | |
| bruise | 720 | 40 | 40 | |
| peel | 720 | 40 | 40 | |
| total | 2160 | 120 | 120 | |
| Hardware and Software | Configuration Parameter |
| Computer | Operating System: Windows10 |
| CPU: Intel(R) Core (TM) i9-9900K CPU@3.60GHz | |
| GPU: NVIDIA GeForce RTX 3090 | |
| RAM: 16 GB | |
| Video memory: 24 GB | |
| Software version | Python3.9.12 + PyTorch1.9.1 + CUDA11.7 + cuDNN8.2.1 + Opencv4.5.5+Visual Studio Code2022 (1.69.1) |
| Parameter | Value |
|---|---|
| Batch size | 64 |
| Learning rate | 0.01 |
| Warm-up epochs | 3 |
| Number of iterations | 120 |
| Momentum parameter | 0.937 |
| Image size | 640 × 640 |
| Optimizer | SGD |
| Model | Parameter size (MB) | mAP@0.5 (%) | mAP@0.5, 0.95(%) | FPS |
|---|---|---|---|---|
| YOLOv7-STE | 61.09 | 97.3 | 62.9 | 76.3 |
| YOLOv7 | 135 | 95.7 | 52.77 | 74.4 |
| YOLOv5 | 155.78 | 86.6 | 49.5 | 72.3 |
| SSD | 183.2 | 48.67 | 39.06 | 35.4 |
| Faster R-CNN | 216 | 59.33 | 33.6 | 8.64 |
| Loss function | Model volume | mAP@0.5, 0.95 | mAP@0.5(%) | |||
|---|---|---|---|---|---|---|
| (MB) | (%) | all classes | pit | peel | bruise | |
| CIoU | 135 | 51.33 | 93.9 | 91.6 | 97.3 | 92.9 |
| WIoU | 135 | 51.44 | 94.1 | 90 | 99.5 | 93 |
| SIoU | 135 | 51.34 | 93.9 | 91.5 | 97.3 | 93 |
| DIoU | 135 | 51.69 | 94.6 | 91.1 | 99.2 | 93.6 |
| EIoU | 135 | 52.77 | 95.7 | 92.6 | 99.5 | 95.1 |
| GSConv | STE | EIoU | Model Volume | mAP@0.5, 0.95 | mAP@0.5(%) | FPS | |||
|---|---|---|---|---|---|---|---|---|---|
| (MB) | (%) | all classes | pit | peel | bruise | ||||
| √ | 135 | 52.77 | 95.7 | 92.6 | 99.5 | 95.1 | 74.4 | ||
| √ | √ | 51 | 49.9 | 92.3 | 82.4 | 99.5 | 95.2 | 87.1 | |
| √ | √ | 149 | 51.12 | 95.9 | 97 | 96.9 | 93.8 | 61.3 | |
| √ | √ | √ | 61.09 | 54.9 | 97.3 | 96.8 | 99.6 | 95.4 | 76.3 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).