Submitted:
05 October 2023
Posted:
06 October 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Underwater Robotics Visual Detection System
3. Materials and Methods
3.1. Feature extraction network
3.2. Feature Optimization Network
3.3. Feature fusion network
4. Experiments
4.1. Underwater Target Dataset
4.2. Experimental Setups
4.2.1. Experimental Environment and Training Parameters
4.2.2. Model Evaluation Metrics
4.3. Comparison with Mainstream Methods
4.4. Ablation Studies
4.5. Under Robotics Visual Detection System Performance Test
4.5.1. Underwater robotics Experimental Platform
4.5.2. Performance Comparison Test
5. Conclusions
References
- Fanzong, Z.; Xueting, Z.; Jingbiao, L.; Hao, L.; Zhengjing, Z.; Shihe, Z. Magnetic gradient tensor positioning method implemented on an autonomous underwater vehicle platform. Journal of Marine Science and Engineering 2023, 11(10), 1909. [Google Scholar]
- Tao, J.; Yize, S.; Hai, H.; Hongde, Q.; Xi, C.; Lingyu, L.; Zongyu, Z.; Xinyue, H. Binocular vision based non-singular fast terminal control for the UVMs small target grasp. Journal of Marine Science and Engineering 2023, 11(10), 1905. [Google Scholar]
- Jing, Y. Protection of underwater cultural heritage in China: new developments. International Journal of Cultural Policy 2019, 25, 756–764. [Google Scholar] [CrossRef]
- Pearson, N.; Thompson, B. S. Saving two fish with one wreck: Maximizing synergies in marine biodiversity conservation and underwater cultural heritage protection. Marine Policy 2023, 152, 105613. [Google Scholar] [CrossRef]
- Manley, J. E. Unmanned maritime vehicles, 20 years of commercial and technical evolution. OCEANS 2016 MTS/IEEE Monterey 2016, 1–6. [Google Scholar]
- An, D.; Mu, Y.; Wang, Y.; Li, B.; Wei, Y. Intelligent Path Planning Technologies of Underwater Vehicles: a Review. Journal of Intelligent & Robotic Systems 2023, 107, 22. [Google Scholar]
- Kot, R. Review of Obstacle Detection Systems for Collision Avoidance of Underwater roboticss Tested in a Real Environment. Electronics 2022, 11, 3615. [Google Scholar] [CrossRef]
- Qin, J.; Yang, K.; Li, M.; Zhong, J.; Zhang, H. Real-Time Positioning and Tracking for Vision-Based Unmanned Underwater Vehicles. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences 2022, 46, 163–168. [Google Scholar] [CrossRef]
- Fayaz, S.; Parah, S. A.; Qureshi, G. J. Underwater object detection: architectures and algorithms–a comprehensive review. Multimedia Tools and Applications 2022, 81, 20871–20916. [Google Scholar] [CrossRef]
- Forsyth, D. Object detection with discriminatively trained part-based models. Computer 2014, 47, 6–7. [Google Scholar] [CrossRef]
- Lowe, D. G. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 2004, 60, 91–110. [Google Scholar] [CrossRef]
- Dalal, N.; Triggs, B. (2005, June). Histograms of oriented gradients for human detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2005; Vol. 1, pp. 886-893.
- Cutter, G.; Stierhoff, K.; Zeng, J. Automated detection of rockfish in unconstrained underwater videos using haar cascades and a new image dataset: Labeled fishes in the wild. In 2015 IEEE Winter Applications and Computer Vision Workshops (pp. 57-62). IEEE.
- Rizzini, D. L.; Kallasi, F.; Oleari, F.; Caselli, S. Investigation of vision-based underwater object detection with multiple datasets. International Journal of Advanced Robotic Systems 2015, 12, 77. [Google Scholar] [CrossRef]
- Qiu, S.; JIN, W. Radon transform detection method for underwater moving target based on water surface characteristic wave. Acta Optica Sinica 2019, 39, 25–37. [Google Scholar]
- Chen, L.; Zhou, F.; Wang, S.; Dong, J.; Li, N.; Ma, H.; Zhou, H. SWIPENET: Object detection in noisy underwater images. arXiv preprint arXiv:2010.10006. arXiv:2010.10006.
- Lei, F.; Tang, F.; Li, S. Underwater target detection algorithm based on improved YOLOv5. Journal of Marine Science and Engineering 2022, 10, 310. [Google Scholar] [CrossRef]
- Yan, J.; Zhou, Z.; Zhou, D.; Su, B.; Xuanyuan, Z.; Tang, J.; Liang, W. Underwater object detection algorithm based on attention mechanism and cross-stage partial fast spatial pyramidal pooling. Frontiers in Marine Science 2022, 9, 1056300. [Google Scholar] [CrossRef]
- Song, P.; Li, P.; Dai, L.; Wang, T.; Chen, Z. Boosting R-CNN: Reweighting R-CNN samples by RPN’s error for underwater object detection. Neurocomputing 2023, 530, 150–164. [Google Scholar] [CrossRef]
- Zeng, L.; Sun, B.; Zhu, D. Underwater target detection based on Faster R-CNN and adversarial occlusion network. Engineering Applications of Artificial Intelligence 2021, 100, 104190. [Google Scholar] [CrossRef]
- Zhang, W.; Zhuang, P.; Sun, H. H.; Li, G.; Kwong, S.; Li, C. Underwater image enhancement via minimal color loss and locally adaptive contrast enhancement. IEEE Transactions on Image Processing 2022, 31, 3997–4010. [Google Scholar] [CrossRef]
- Shortis, M. Camera calibration techniques for accurate measurement underwater. 3D recording and interpretation for maritime archaeology 2019, 11–27. [Google Scholar]
- Chen, X. Q.; Xia, K.; Hu, W.; Cao, M.; Deng, K.; Fang, S. Extraction of underwater fragile artifacts: research status and prospect. Heritage Science 2022, 10, 9. [Google Scholar] [CrossRef]
- Hu, K.; Weng, C.; Zhang, Y.; Jin, J.; Xia, Q. An overview of underwater vision enhancement: From traditional methods to recent deep learning. Journal of Marine Science and Engineering 2022, 10, 241. [Google Scholar] [CrossRef]
- Wei, S. E.; Ramakrishna, V.; Kanade, T.; Sheikh, Y. Convolutional pose machines. In Proceedings of the IEEE conference on Computer Vision and Pattern Recognition. 2016; pp. 4724-4732.
- Dai, J.; Qi, H.; Xiong, Y.; Li, Y.; Zhang, G.; Hu, H.; Wei, Y. Deformable convolutional networks. In Proceedings of the IEEE international conference on computer vision. 2017; pp. 764-773.
- Yu, F.; Wang, D.; Shelhamer, E.; Darrell, T. Deep layer aggregation. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2018; pp. 2403-2412.
- Hu, J.; Shen, L.; Sun, G. Squeeze-and-excitation networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2018; pp. 7132-7141.
- Hu, J.; Shen, L.; Albanie, S.; Sun, G.; Vedaldi, A. Gather-excite: Exploiting feature context in convolutional neural networks. Advances in neural information processing systems 2018, 31. [Google Scholar]
- Zeiler, M. D.; Taylor, G. W.; Fergus, R. Adaptive deconvolutional networks for mid and high level feature learning. In Proceedings of the IEEE conference on computer vision. 2011; pp. 2018–2025. [CrossRef]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in neural information processing systems 2015, 28. [Google Scholar] [CrossRef] [PubMed]
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.; Fu, C. Y.; Berg, A. C. Ssd: Single shot multibox detector. In Proceedings of the European conference on computer vision. 2016; pp. 21–37.
- Jocher, G.; Chaurasia, A.; Stoken, A.; Borovec, J.; Kwon, Y.; Michael, K.; Mammana, L. ultralytics/yolov5: v6. 2-yolov5 classification models, apple m1, reproducibility, clearml and deci. ai integrations. Zenodo 2022.
- Wang, C. Y.; Bochkovskiy, A.; Liao, H. Y. M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023; pp. 7464-7475.
- Xu, G.; Liu, K.; Zhao, Y.; Li, S.; Wang, X. Research on the modeling and simulation technology of underwater vehicle. In OCEANS 2016-Shanghai (pp. 1-6).










| Method | Precision(%) | Recall(%) | F1(%) | mAP(%) |
|---|---|---|---|---|
| Faster-RCNN[31] | 90.2 | 88.5 | 89.3 | 89.4 |
| SSD[32] | 81.9 | 82.2 | 82.0 | 82.8 |
| YOLOv5-l[33] | 88.3 | 87.8 | 88.1 | 88.7 |
| YOLOv7[34] | 90.1 | 88.4 | 89.3 | 89.9 |
| Ours | 93.1 | 91.4 | 92.2 | 92.8 |
| Method | Precision(%) | Recall(%) | F1(%) | Map(%) |
|---|---|---|---|---|
| DLA[27] | 88.4 | 87.3 | 87.6 | 88.8 |
| MDLA | 89.6 | 89.2 | 89.4 | 90.5 |
| MDLA+DCN | 90.9 | 90.1 | 90.9 | 91.4 |
| MDLA+DCN+BAM | 93.1 | 91.4 | 92.2 | 92.8 |
| Parameters | Value |
|---|---|
| Maximum operating depth | 1000 m |
| Cruising speed | 2 knots |
| Maximum speed | 5 knots |
| Diameter | 350 mm |
| Length | 3.6 m |
| Weight in air | 250Kg |
| Method | mAP(%) | Params(M) | Input shape | FPS |
|---|---|---|---|---|
| SSD[32] | 82.8 | 24.5 | 640×640 | 15 |
| YOLOv5-l[33] | 88.7 | 46.5 | 640×640 | 11 |
| YOLOv5-s[33] | 86.5 | 14.1 | 640×640 | 21 |
| YOLOv7[34] | 89.9 | 74.4 | 640×640 | 10 |
| YOLOv7-tiny[34] | 86.4 | 13.2 | 640×640 | 22 |
| Ours | 92.8 | 18.9 | 640×640 | 18 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).