Submitted:
22 May 2024
Posted:
23 May 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Basic Theory of the YOLOv8 Network
3. DAC-YOLO Method Construction
3.1. Multi-Branch Coordinate Attention
3.2. Deformable Convolution Based on MBCA
3.3. MBCADC2F Module
3.4. Deformable Convolutional Net-Attention-YOLO Object Detection Network
4. Example Verification
4.1. Experimental Dataset
4.2. Environmental Design and Evaluation Metrics
4.3. Experimental Results and Analysis
4.3.1. Ablation Experiment
4.3.2. Comparison Experiment of Different Detection Algorithms
5. Conclusion
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Xing, S.-L.; Ye, J.-S.; Jiang, C.-Y. Review the study on typical diseases and design countermeasures of China concrete curved bridges. 2010 International Conference on Mechanic Automation and Control Engineering. IEEE, 2010: 4805-4808.
- NI F, ZHANG J, CHEN Z. Zernike-moment measurement of thin-crack width in images enabled by dual-scale deep learning. Computer-Aided Civil and Infrastructure Engineering, 2019, 34(5): 367-384.
- Y. LeCun, Y. Y. LeCun, Y. Bengio, G. Hinton, Deep learning, Nature. 521 (2015) 436–444.
- Cha, Y.; Choi, W.; Büyüköztürk, O. Deep Learning-Based Crack Damage Detection Using Convolutional Neural Networks. Comput. Aided Civil Infrastruct. Eng 2017, 32, 361–378. [Google Scholar] [CrossRef]
- Ni, F.; Zhang, J.; Chen, Z. Pixel-level crack delineation in images with convolutional feature fusion. Struct. Control. Heal. Monit. 2018, 26, e2286. [Google Scholar] [CrossRef]
- Dung, C.V.; Anh, L.D. Autonomous concrete crack detection using deep fully convolutional neural network. Autom. Constr. 2018, 99, 52–58. [Google Scholar] [CrossRef]
- Ali, R.; Cha, Y.-J. Subsurface damage detection of a steel bridge using deep learning and uncooled micro-bolometer. Constr. Build. Mater. 2019, 226, 376–387. [Google Scholar] [CrossRef]
- Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation[C]. 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA. 2014. [CrossRef]
- GIRSHICK, R. Fast R-CNN. 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile. 2015.
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017: 1137-1149.
- HE K, GKIOXARI G, DOLLAR P, et al. Mask R-CNN. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020: 386-397.
- REDMON J, DIVVALA S, GIRSHICK R, et al. You Only Look Once: Unified, Real-Time Object Detection. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA. 2016.
- LIU W, ANGUELOV D, ERHAN D, et al. SSD: Single Shot MultiBox Detector[M]. Computer Vision – ECCV 2016, Lecture Notes in Computer Science. 2016: 21-37.
- REDMON J, FARHADI A. YOLO9000: Better, Faster, Stronger. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI. 2017.
- REDMON J, FARHADI A. YOLOv3: An Incremental Improvement. arXiv: Computer Vision and Pattern Recognition, 2018.
- BOCHKOVSKIY A, WANG C Y, LIAO H Y. YOLOv4: Optimal Speed and Accuracy of Object Detection. Cornell University - arXiv preprint arXiv, 2020.
- Ultralytics YOLOv5. Available online: https://github.com/ultralytics/yolov5.
- LI C, LI L, JIANG H, et al. YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications. 2022.
- WANG C Y, BOCHKOVSKIY A, LIAO H Y. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. arXiv preprint arXiv:2207.02696 , 2022.
- GE Z, LIU S, WANG F, et al. YOLOX: Exceeding YOLO Series in 2021. 2021.
- Ultralytics YOLOV8. Available online: https://github.com/ultralytics/ultralytics.
- Feng, C.; Zhong, Y.; Gao, Y.; Scott, M.R.; Huang, W. TOOD: Task-aligned One-stage Object Detection. International Conference on Computer Vision, 2021.
- ZHANG X B, LUO Z P, JI J H, et al. Intelligent Surface Cracks Detection in Bridges Using Deep Neural Network. International Journal of Structural Stability and Dynamics, 2023-07-06.
- Yu, Z. YOLO V5s-based Deep Learning Approach for Concrete Cracks Detection. SHS Web Conf. 2022, 144, 03015. [Google Scholar] [CrossRef]
- Jin, Q.; Han, Q.; Su, N.; Wu, Y.; Han, Y. A Deep Learning and Morphological Method for Concrete Cracks Detection. J. Circuits, Syst. Comput. 2023, 32. [Google Scholar] [CrossRef]
- Wu, Y.; Han, Q.; Jin, Q.; Li, J.; Zhang, Y. LCA-YOLOv8-Seg: An Improved Lightweight YOLOv8-Seg for Real-Time Pixel-Level Crack Detection of Dams and Bridges. Appl. Sci. 2023, 13, 10583. [Google Scholar] [CrossRef]
- Elfwing, S.; Uchibe, E.; Doya, K. Sigmoid-weighted linear units for neural network function approximation in reinforcement learning. Neural Networks 2018, 107, 3–11. [Google Scholar] [CrossRef] [PubMed]
- TSOTSOS J, K. A Computational Perspective on Visual Attention[M]. The MIT Presse Books. 2011.
- HOU Q, ZHOU D, FENG J. Coordinate Attention for Efficient Mobile Network Design. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA. 2021.
- DAI J, QI H, XIONG Y, et al. Deformable Convolutional Networks. 2017 IEEE International Conference on Computer Vision (ICCV), Venice. 2017.
- ZHANG J, QIAN S, TAN C. Automated bridge surface crack detection and segmentation using computer vision-based deep learning model[J/OL]. Engineering Applications of Artificial Intelligence, 2022: 105225.
- ROBBINS H, MONRO S. A Stochastic Approximation Method[J/OL]. The Annals of Mathematical Statistics: 400-407.
- SELVARAJU R R, COGSWELL M, DAS A, et al. Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization[J/OL]. International Journal of Computer Vision, 2020: 336-359.













| Labels(diseases) | Number | Labels(diseases) | Number |
|---|---|---|---|
| liefeng(crack) | 17636 | shenshui(seepage) | 9244 |
| boluo(spalling) | 11875 | fengwo(comb surface) | 8330 |
| kongdong(cavity) | 7082 | lujin(steel exposed) | 6584 |
| mamian(pockmark) | 8274 |
| Data set | Well-lit images |
Partial shadow or occlusion images | Low-lighting images |
Dark-lighting images |
total |
|---|---|---|---|---|---|
| Train | 6697 | 1798 | 2608 | 3425 | 14528 |
| Val | 923 | 239 | 298 | 356 | 1816 |
| Test | 876 | 225 | 326 | 389 | 1816 |
| Model | Parameters/M | FLOPs/G | FPS/f·s-1 | P/% | R/% | mAP0.5/% | mAP0.5:0.95/% |
|---|---|---|---|---|---|---|---|
| YOLOv8s | 11.1 | 28.7 | 70.9 | 89.1 | 82.0 | 87.4 | 68.9 |
| +MBCA | 11.2 | 28.7 | 68.9 | 90.2 | 82.7 | 87.9 | 68.9 |
| +DCNv2 | 11.2 | 27.5 | 73.8 | 90.0 | 82.5 | 88.1 | 70.2 |
| proposed method | 11.3 | 27.5 | 74.4 | 91.3 | 85.4 | 89.4 | 73.3 |
| Model | Parameters/M | FLOPs/G | FPS/f·s-1 | P/% | R/% | mAP0.5/% | mAP0.5:0.95/% |
|---|---|---|---|---|---|---|---|
| YOLOv3-tiny | 12.1 | 19.1 | 76.9 | 81.7 | 73.4 | 78.6 | 53.4 |
| YOLOv5s | 7.0 | 16.8 | 78.3 | 91.2 | 84.9 | 88.7 | 67.4 |
| YOLOv6s | 16.3 | 44.2 | 69.4 | 90.2 | 81.5 | 87.7 | 69.6 |
| YOLOv8s | 11.1 | 28.7 | 70.9 | 89.1 | 82.0 | 87.4 | 68.9 |
| Proposed method | 11.3 | 27.5 | 74.4 | 91.3 | 85.4 | 89.4 | 73.3 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).