Submitted:
26 March 2025
Posted:
08 April 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. YOLOv8 Overview
2.1. Input End
2.2. Backbone
2.2.1. C2f
2.2.2. SPPF
2.3. Neck
2.4. Head
2.4.1. Decoupled Head
2.4.2. Anchor-Free
3. Improved YOLOv8s Object Detection Algorithm
3.1. Cloformer and CBAM Attention Mechanism

3.2. Model Lightweight Design
3.3. Improvement of the SPPF Module
4. Experimental Results and Analysis
4.1. Experimental Dataset
4.2. Experimental Environment and Parameter Settings
4.3. Evaluation Indicators
4.4. Melting Experiments and Result Analysis
4.5. Controlled Experiment
4.5.1. Comparison Before and After Improvement
4.5.2. Comparison with Other Algorithms
5. Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Fang, F.; Hu, X.-j.; Zhang, B.-m.; Xie, Z.-h.; Jiang, J.-q. Deformation of dual-structure medium carbon steel in cold drawing. Materials Science and Engineering: A 2013, 583, 78–83. [CrossRef]
- Zhou, L.; Gong, J.; Li, B. Image Information Restoration of Automotive Strip Steel Surface Based on Sparse Representation. Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences 2021, 48, 141–148. [CrossRef]
- Redmon, J.; Divvala, S.K.; Girshick, R.B.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2015, 779–788.
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 2017, 39, 1137–1149. [CrossRef]
- Feng, X.; Gao, X.-w.; Luo, L. X-SDD: A New Benchmark for Hot Rolled Steel Strip Surface Defects Detection. Symmetry 2021, 13, 706.
- Li, S.; Wang, X. YOLOv5-based Defect Detection Model for Hot Rolled Strip Steel. Journal of Physics: Conference Series 2022, 2171, 012040. [CrossRef]
- Shen, L.; Shi, T.; Liao, J. Surface Defect Detection Algorithm of Hot-Rolled Strip Based on Improved YOLOv7. IAENG International Journal of Computer Science 2024, 51, 345-354.
- Zhang, W.K.; Liu, J. Steel Surface Defect Detection Based on Improved YOLOv8s. Journal of Beijing Information Science & Technology University (Natural Science Edition) 2023, 33–40. [CrossRef]
- Wang, L.L.; Gong, Z.Z.; Liang, Z.Q. Surface Defect Detection of Strip Steel Based on Improved YOLOv5s Algorithm. Machine Tool & Hydraulics 2024, 181–186. [CrossRef]
- Dong, J.; Cheng, J.; Wu, J.; Zhang, C.; Zhao, S.; Tang, X. Real-Time AIoT for UAV Antenna Interference Detection via Edge-Cloud Collaboration. IEEE Internet of Things Journal 2024, 1-1. [CrossRef]
- Fan, Q.; Huang, H.; Guan, J.; He, R. Rethinking Local Perception in Lightweight Vision Transformer. 2023.
- Woo, S.; Park, J.; Lee, J.Y.; Kweon, I.S. CBAM: Convolutional Block Attention Module. Springer, Cham 2018.
- Chollet, F. Xception: Deep Learning with Depthwise Separable Convolutions. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 21-26 July 2017, 2017; pp. 1800-1807.
- Chen, J.; Kao, S.h.; He, H.; Zhuo, W.; Wen, S.; Lee, C.H.; Chan, S.H.G. Run, Don’t Walk: Chasing Higher FLOPS for Faster Neural Networks. In Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 17-24 June 2023, 2023; pp. 12021-12031.
- Li, C.; Li, L.; Jiang, H.; Weng, K.; Geng, Y.; Li, L.; Ke, Z.; Li, Q.; Cheng, M.; Nie, W.; et al. YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications. ArXiv 2022, abs/2209.02976.
- Hussain, M. YOLO-v1 to YOLO-v8, the Rise of YOLO and Its Complementary Nature toward Digital Manufacturing and Industrial Defect Detection. Machines 2023. [CrossRef]
- Zhu, X.H.; Li, G.W.; Chang, D.F.; Du, J.W. A Surface Defect Detection Method for Hot-Rolled Strip Steel Based on Improved YOLOv8s. Ship Engineering 2025, 124–131. [CrossRef]
- Bochkovskiy, A.; Wang, C.-Y.; Liao, H.-Y.M. YOLOv4: Optimal Speed and Accuracy of Object Detection. ArXiv 2020, abs/2004.10934.
- Sun, Y.; Chen, G.; Zhou, T.; Zhang, Y.; Liu, N. Context-aware Cross-level Fusion Network for Camouflaged Object Detection. In Proceedings of the International Joint Conference on Artificial Intelligence, 2021.
- Lin, T.Y.; Dollár, P.; Girshick, R.; He, K.; Hariharan, B.; Belongie, S. Feature Pyramid Networks for Object Detection. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 21-26 July 2017, 2017; pp. 936-944.
- Liu, S.; Qi, L.; Qin, H.; Shi, J.; Jia, J. Path Aggregation Network for Instance Segmentation. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 18-23 June 2018, 2018; pp. 8759-8768.
- Li, X.; Wang, W.; Wu, L.; Chen, S.; Hu, X.; Li, J.; Tang, J.; Yang, J. Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection. ArXiv 2020, abs/2006.04388.
- Zheng, Z.; Wang, P.; Liu, W.; Li, J.; Ye, R.; Ren, D. Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression. ArXiv 2019, abs/1911.08287.
- Huang, L.; Yang, Y.; Deng, Y.; Yu, Y. DenseBox: Unifying Landmark Localization with End to End Object Detection. ArXiv 2015, abs/1509.04874.
- Wu, C.K.; Huang, F.; Li, B.; Gu, L.L.; Liu, L.; Fang, Y.M. DMS-YOLOv8 Slab Detection Algorithm Based on Improved Depthwise Separable Convolution and Hybrid Attention Mechanism. Metallurgical Automation 2024, 31–39.
- Howard, A.G.; Zhu, M.; Chen, B.; Kalenichenko, D.; Wang, W.; Weyand, T.; Andreetto, M.; Adam, H. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. ArXiv 2017, abs/1704.04861.
- Jocher, G.; Stoken, A.; Chaurasia, A.; Borovec, J.; Kwon, Y.; Michael, K.; ... Thanh Minh, M. ultralytics/yolov5: v6.0-YOLOv5n ’Nano’ models, Roboflow integration, TensorFlow export, OpenCV DNN support. Zenodo 2021.
- Loshchilov, I.; Hutter, F. Decoupled Weight Decay Regularization. In Proceedings of the International Conference on Learning Representations, 2017.
- Buslaev, A.V.; Parinov, A.; Khvedchenya, E.; Iglovikov, V.I.; Kalinin, A.A. Albumentations: fast and flexible image augmentations. ArXiv 2018, abs/1809.06839.
- Chattopadhay, A.; Sarkar, A.; Howlader, P.; Balasubramanian, V.N. Grad-CAM++: Generalized Gradient-Based Visual Explanations for Deep Convolutional Networks. In Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), 12-15 March 2018, 2018; pp. 839-847.
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.E.; Fu, C.-Y.; Berg, A.C. SSD: Single Shot MultiBox Detector. In Proceedings of the European Conference on Computer Vision, 2015.
- Tan, M.; Pang, R.; Le, Q.V. EfficientDet: Scalable and Efficient Object Detection. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 13-19 June 2020, 2020; pp. 10778-10787.
- Zhao, Y.; Lv, W.; Xu, S.; Wei, J.; Wang, G.; Dang, Q.; Liu, Y.; Chen, J. DETRs Beat YOLOs on Real-time Object Detection. In Proceedings of the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2024, pp. 16965–16974.
- Wang, C.; He, W.; Nie, Y.; Guo, J.; Liu, C.; Wang, Y.; Han, K. Gold-YOLO: Efficient Object Detector via Gather-and-Distribute Mechanism. In Proceedings of the Advances in Neural Information Processing Systems; Oh, A.; Naumann, T.; Globerson, A.; Saenko, K.; Hardt, M.; Levine, S., Eds. Curran Associates, Inc., 2023, Vol. 36, pp. 51094–51112.














| Experiments | mAP@0.5 | Param(M) | FLOPs(G) | FPS |
|---|---|---|---|---|
| 1 | 76.8 | 12.69 | 15.17 | 98.8 |
| 2 | 77.4 | 12.51 | 15.48 | 94.6 |
| 3 | 77.5 | 11.81 | 15.52 | 89.1 |
| 4 | 77.1 | 11.57 | 15.09 | 102.3 |
| 5 | 76.6 | 11.75 | 14.79 | 102.9 |
| 6 | 76.1 | 12.45 | 14.74 | 102.7 |
| Exp | CBAM | DWConv | PConv | CloFomer | SimSPPF | Params | FLOPs | P | R | mAP@0.5 | FPS |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | / | / | / | / | / | 11.17 | 14.36 | 74.4 | 95 | 75.68 | 107.8 |
| 2 | ✔ | / | / | / | / | 11.51 | 14.36 | 74.3 | 95 | 76.65 | 102.4 |
| 3 | / | ✔ | / | / | / | 9.61 | 12.5 | 79.4 | 95 | 77.61 | 136.7 |
| 4 | / | / | ✔ | / | / | 8.31 | 10.35 | 71.1 | 97 | 75.23 | 173.5 |
| 5 | / | / | / | ✔ | / | 12.75 | 15.9 | 77.2 | 96 | 76.82 | 83.6 |
| 6 | / | / | / | / | ✔ | 11.17 | 14.36 | 76.8 | 94 | 75.88 | 129.9 |
| 7 | ✔ | ✔ | / | / | / | 9.95 | 12.5 | 74.9 | 96 | 78.43 | 122.6 |
| 8 | ✔ | / | ✔ | / | / | 8.62 | 10.35 | 73.5 | 96.5 | 76.9 | 168.3 |
| 9 | ✔ | / | / | ✔ | / | 12.98 | 15.9 | 76.8 | 95.8 | 77.2 | 85.2 |
| 10 | ✔ | / | / | / | ✔ | 11.51 | 14.36 | 75.2 | 95.5 | 76.4 | 115.6 |
| 11 | / | ✔ | ✔ | / | / | 7.85 | 10.35 | 75 | 96.8 | 76.1 | 155.4 |
| 12 | / | ✔ | / | ✔ | / | 10.2 | 13.2 | 78.6 | 96.2 | 78.9 | 105.7 |
| 13 | / | ✔ | / | / | ✔ | 9.61 | 12.5 | 78 | 95.5 | 77.8 | 130.5 |
| 14 | / | / | ✔ | ✔ | / | 10.03 | 12.8 | 75.3 | 97.2 | 76.5 | 142.1 |
| 15 | / | / | ✔ | / | ✔ | 8.31 | 10.35 | 72.8 | 96.8 | 75.9 | 165.2 |
| 16 | / | / | / | ✔ | ✔ | 12.75 | 15.9 | 77.8 | 96.5 | 77.6 | 90.4 |
| 17 | ✔ | ✔ | ✔ | / | / | 7.1 | 8.49 | 72.9 | 97 | 77.27 | 191.7 |
| 18 | ✔ | ✔ | / | ✔ | / | 10.75 | 13.2 | 77.5 | 97 | 79.2 | 110.3 |
| 19 | ✔ | ✔ | / | / | ✔ | 9.95 | 12.5 | 76.2 | 96.2 | 78.8 | 125 |
| 20 | ✔ | / | ✔ | ✔ | / | 9.35 | 12.8 | 76.8 | 97.5 | 78.5 | 135.8 |
| 21 | ✔ | / | ✔ | / | ✔ | 8.62 | 10.35 | 74 | 96 | 77.1 | 160.1 |
| 22 | / | ✔ | ✔ | ✔ | / | 9.1 | 11.2 | 78.2 | 97.5 | 78.7 | 148.9 |
| 23 | / | ✔ | ✔ | / | ✔ | 7.85 | 10.35 | 75.6 | 96.3 | 76.8 | 157.3 |
| 24 | ✔ | ✔ | ✔ | ✔ | / | 8.27 | 9.31 | 77 | 98 | 79.08 | 180.7 |
| 25 | ✔ | ✔ | ✔ | ✔ | ✔ | 8.28 | 9.31 | 77.5 | 98 | 80.1 | 182.4 |
| Method | mAP@0.5 | FLOPs | Params | FPS |
|---|---|---|---|---|
| YOLOv5s | 75.6 | 7.2 | 8.7 | 195 |
| YOLOv5n | 71.2 | 4.5 | 1.9 | 280 |
| YOLOv6s | 78.2 | 9.8 | 12.4 | 165 |
| YOLOv6n | 73.5 | 6.1 | 4.2 | 220 |
| YOLOv7 | 79.1 | 13.5 | 37.4 | 135 |
| YOLOv8n | 74.8 | 5.8 | 3.1 | 250 |
| YOLOv8s | 77.3 | 8.4 | 11.2 | 190 |
| YOLOv8m | 79.5 | 15.2 | 25.8 | 120 |
| YOLOv8l | 80.7 | 21.9 | 43.7 | 85 |
| YOLOv9-t | 77.8 | 7.6 | 9.8 | 175 |
| YOLOv11-n | 74.5 | 6.6 | 2.6 | 240 |
| YOLOv12-n | 76.2 | 6.7 | 2.6 | 235 |
| YOLOv10-n | 76.4 | 5.2 | 2.5 | 265 |
| GoldYOLO-n | 78.5 | 12.6 | 5.6 | 200 |
| RT-DETR-1 | 80.3 | 103 | 32 | 77 |
| Faster R-CNN | 76.4 | 41.5 | 134.2 | 22 |
| SSD | 68.9 | 26.8 | 35.4 | 125 |
| EfficientDet-D1 | 73.8 | 13.6 | 10.5 | 95 |
| Ours | 80.41 | 8.21 | 9.31 | 182.4 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).