Submitted:
07 July 2025
Posted:
08 July 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. YOLOv8 Network Model
3. YOLOv8 Improves the Network Model
3.1. GRF-SPPF
3.2. Coordinate Attention (CA) Attention Mechanism
3.3. Loss Function SIoU
4. Experimental Results and Analysis
4.1. Experimental Environment and Dataset
4.2. Evaluation Indicators
4.2.1. Precision (Accuracy) and Recall (Recall Rate)
4.2.2. AP (Average Precision, Average Accuracy)
4.2.3. MAP@0.5 (Mean Average Precision, Mean Average Accuracy)
4.3. Experiments and Comparison
4.3.1. Ablation Experiment
4.3.2. Comparative Experiments
4.4. Comparison of Experimental Results
5. Conclusions
Author Contributions
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Redmon, J.; Farhadi, A. YOLO9000: Better, faster, stronger. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017; pp. 7263–7271.
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You only look once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016; pp. 779–788.
- Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014; pp. 580–587.
- Girshick, R. Fast R-CNN. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2015; pp. 1440–1448.
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 39, 1137–1149.
- Shafiee, M.J.; Chywl, B.; Li, F.; Wong, A. Fast YOLO: A fast you only look once system for real-time embedded object detection in video. J. Comput. Vis. Imaging Syst. 2017, 3. [CrossRef]
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.; Fu, C.Y.; Berg, A.C. Ssd: Single shot multibox detector. In Computer Vision – ECCV 2016, Leibe, B., Matas, J., Sebe, N., Welling, M., Eds.; Springer, 2016; 9905 9905, pp. 21–37.
- Abdo, H.; Amin, K.M.; Hamad, A.M. Fall detection based on RetinaNet and MobileNet convolutional neural networks. In 2020 15th International Conference on Computer Engineering and Systems (ICCES), IEEE: Cairo, Egypt, 2020; pp. 1–7.
- Pu, M.; Huang, Y.; Liu, Y.; Guan, Q.; Ling, H. EDTER: Edge detection with transformer. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022; pp. 1402–1412.
- Zhu, X.; Lyu, S.; Wang, X.; Zhao, Q. TPH-YOLOv5: Improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2021; pp. 2778–2788.
- Bochkovskiy, A.; Wang, C.Y.; Liao, H.Y.M. Yolov4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934. 2020. [CrossRef]
- Sabour, S.; Frosst, N.; Hinton, G.E. Dynamic routing between capsules. Adv. Neural Inf. Process. Syst. 2017. [CrossRef]
- Varghese, R.; Sambath, M. Yolov8: A novel object detection algorithm with enhanced performance and robustness. In 2024 International Conference on Advances in Data Engineering and Intelligent Computing Systems (ADICS), IEEE: Chennai, India, 2024; pp. 1–6.
- Vijayakumar, A.; Vairavasundaram, S. Yolo-based object detection models: A review and its applications. Multimed. Tools Appl. 2024, 83, 83535–83574.
- Wang, C.; Wang, C.; Wang, L.; Li, Y.; Lan, Y. Real-time tracking based on improved YOLOv5 detection in orchard environment for dragon fruit. J. ASABE 2023, 66, 1109–1124.
- Zhang, P.; Liu, Y. A small target detection algorithm based on improved YOLOv5 in aerial image. PeerJ Comput. Sci. 2024, 10, e2007.
- Ouyang, D.; He, S.; Zhang, G.; Luo, M.; Guo, H.; Zhan, J.; Huang, Z. Efficient multi-scale attention module with cross-spatial learning. In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE: Rhodes Island, Greece, 2023; pp. 1–5.
- Xie, G.; Xu, Z.; Lin, Z.; Liao, X.; Zhou, T. GRFS-YOLOv8: an efficient traffic sign detection algorithm based on multiscale features and enhanced path aggregation. Signal Image Video Process. 2024, 18, 5519–5534.
- Hou, Q.; Zhou, D.; Feng, J. Coordinate attention for efficient mobile network design. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021; pp. 13713–13722.
- Chen, Z.; Liu, C.; Filaretov, V.F.; Yukhimets, D.A. Multi-scale ship detection algorithm based on YOLOv7 for complex scene SAR images. Remote Sens. 2023, 15, 2071.
- Zheng, Z.; Wang, P.; Ren, D.; Liu, W.; Ye, R.; Hu, Q.; Zuo, W. Enhancing geometric factors in model learning and inference for object detection and instance segmentation. IEEE Trans. Cybern. 2021, 52, 8574–8586.
- Zhang, H.; Zhang, S. Focaler-iou: More focused intersection over union loss. arXiv preprint arXiv:2401.10525 2024. [CrossRef]
- Yu, J.; Jiang, Y.; Wang, Z.; Cao, Z.; Huang, T. Unitbox: An advanced object detection network. In Proceedings of the 24th ACM International Conference on Multimedia, Association for Computing Machinery: New York, NY, USA, 2016; pp. 516–520.
- Zheng, S.; Luo, X.; Sun, Y. Pedestrian detection algorithm in fog based on improved YOLOv8. J. Hubei Univ. (Nat. Sci. Ed.) 2025, 1-9.
- Luo, Y.; Yang, W.; Wu, S.; Xu, Z.; Pan, N. Ship target detection algorithm in SAR images based on YOLOv8-CKS. J. Hubei Univ. (Nat. Sci. Ed.) 2025, 1-8.
- Xiao, Z.; Yan, S.; Qu, H. Safety helmet detection method in complex environment based on multiple mechanism optimization of YOLOv8. Comput. Eng. Appl. 2024, 60, 172–182.
- Sun, Y.; Zheng, S.; Luo, X. Sign language recognition algorithm based on improved YOLOv8n. J. Hubei Univ. (Nat. Sci. Ed.) 2025, 1-12.
- Liu, Z.; Zhang, P.; Guan, X.; Yu, S.; Zhang, X. Wood defect detection and classification based on improved YOLOv8. For. Eng. 2025, 1-16.







| Configure | Parameter |
|---|---|
| GPU | RTX4090 (24GB) 1 elevation configuration |
| CPU | 16vCPUIntel(R)Xeon(R)Platnum8352VCPU@210GHZ |
| internal storage: | 120GB |
| Hard disk system disk: | 30GB |
| PyTorch | 1.11.0 |
| python | 3.8 |
| cuda | 11.2 |
| Name | Set the number |
|---|---|
| epochs | 120 |
| batch | 32 |
| imgsz | 640 |
| optimizer | SGD |
| patience | 50 |
| lr0 | 0.01 |
| close_mosaic | 0 |
| weight_decay | 0.0005 |
| Model | GRFSPPF | CA | SIOU | mAP@0.5/% |
|---|---|---|---|---|
| YOLOv8 | × | × | × | 88.1 |
| A | √ | × | × | 88.7 |
| B | × | √ | × | 88.6 |
| C | × | × | √ | 88.5 |
| D | √ | √ | × | 89.2 |
| E | √ | × | √ | 89.1 |
| Proposed method | √ | √ | √ | 89.6 |
| Model | Params/M | GFLOPs/G | FPS/(f·s-1) | mAP@0.5/% |
|---|---|---|---|---|
| Faster R-CNN | 41.5 | 207.3 | 12 | 89.2 |
| SSD512 | 26.8 | 99.6 | 45 | 85.7 |
| YOLOv3-608 | 61.5 | 154.7 | 35 | 87.4 |
| YOLOv5n | 46.5 | 109.1 | 95 | 88.9 |
| YOLOv7n | 37.2 | 104.7 | 105 | 89.1 |
| YOLOv8s | 3.1 | 8.2 | 120 | 88.1 |
| Ours | 6.4 | 9.3 | 105 | 89.6 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).