Submitted:
10 August 2025
Posted:
12 August 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Works
3. Materials and Methods
3.1. Dataset
3.2. Performance Evaluation Metrics
- Precision measures the proportion of correctly predicted bounding boxes () among all predicted boxes ():
- Recall measures the proportion of correctly predicted bounding boxes () among all ground truth boxes ():
3.3. Architecture of the Proposed YOLO-SSOD
3.3.1. Bi-level Routing Attention—BRA
4. Results and Discussions
4.1. Environmental Setup
4.2. Performance Evaluation on SAR Ship Dataset
4.3. Ablation Study of the Proposed YOLO-SSOD
4.3.1. Ablation Study on BRA Positioning in YOLO-SSOD
4.3.2. Ablation Study on the Small Object Detection Layer Applied to YOLOv10n
4.3.3. Ablation Study on TODL and BRA Applied to YOLOv10n
5. Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Milios, A.; Bereta, K.; Chatzikokolakis, K.; Zissis, D.; Matwin, S. Automatic Fusion of Satellite Imagery and AIS data for Vessel Detection. In Proceedings of the 2019 22th International Conference on Information Fusion (FUSION); 2019; pp. 1–5. [Google Scholar] [CrossRef]
- Yasir, M.; Liu, S.; Mingming, X.; Wan, J.; Pirasteh, S.; Dang, K.B. ShipGeoNet: SAR Image-Based Geometric Feature Extraction of Ships Using Convolutional Neural Networks. IEEE Trans. Geosci. Remote Sens. 2024, 62, 1–13. [Google Scholar] [CrossRef]
- Yasir, M.; Shanwei, L.; Mingming, X.; Jianhua, W.; Nazir, S.; Islam, Q.U.; Dang, K.B. SwinYOLOv7: Robust ship detection in complex synthetic aperture radar images. Appl. Soft Comput. 2024, 160, 111704. [Google Scholar] [CrossRef]
- Galdelli, A.; Mancini, A.; Ferrà, C.; Tassetti, A.N. A Synergic Integration of AIS Data and SAR Imagery to Monitor Fisheries and Detect Suspicious Activities. Sensors 2021, 21. [Google Scholar] [CrossRef]
- Humayun, M.F.; Nasir, F.A.; Bhatti, F.A.; Tahir, M.; Khurshid, K. YOLO-OSD: Optimized Ship Detection and Localization in Multiresolution SAR Satellite Images Using a Hybrid Data-Model Centric Approach. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2024, 17, 5345–5363. [Google Scholar] [CrossRef]
- Li, J.; Xu, C.; Su, H.; Gao, L.; Wang, T. Deep Learning for SAR Ship Detection: Past, Present and Future. Remote Sens. 2022, 14. [Google Scholar] [CrossRef]
- El-Darymli, K.; McGuire, P.; Power, D.; Moloney, C.R. Target detection in synthetic aperture radar imagery: A state-of-the-art survey. J. Appl. Remote Sens. 2013, 7, 071598. [Google Scholar] [CrossRef]
- Li, J.; Qu, C.; Shao, J. Ship detection in SAR images based on an improved faster R-CNN. In Proceedings of the 2017 SAR in Big Data Era: Models, Methods and Applications (BIGSARDATA). IEEE; 2017; pp. 1–6. [Google Scholar]
- Wang, Y.; Wang, C.; Zhang, H.; Dong, Y.; Wei, S. A SAR Dataset of Ship Detection for Deep Learning under Complex Backgrounds. Remote Sens. 2019, 11. [Google Scholar] [CrossRef]
- Wang, A.; Chen, H.; Liu, L.; Chen, K.; Lin, Z.; Han, J.; Ding, G. 2024; arXiv:YOLOv10: Real-Time End-to-End Object Detection, cs.CV/2405.14458].
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); 2016; pp. 779–788. [Google Scholar] [CrossRef]
- Yi, H.; Liu, B.; Zhao, B.; Liu, E. Small Object Detection Algorithm Based on Improved YOLOv8 for Remote Sensing. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2024, 17, 1734–1747. [Google Scholar] [CrossRef]
- Gao, Z.; Yu, X.; Rong, X.; Wang, W. Improved YOLOv8n for Lightweight Ship Detection. J. Mar. Sci. Eng. 2024, 12. [Google Scholar] [CrossRef]
- Li, Y.; Li, Q.; Pan, J.; Zhou, Y.; Zhu, H.; Wei, H.; Liu, C. SOD-YOLO: Small-Object-Detection Algorithm Based on Improved YOLOv8 for UAV Images. Remote Sens. 2024, 16. [Google Scholar] [CrossRef]
- Shen, L.; Lang, B.; Song, Z. DS-YOLOv8-Based Object Detection Method for Remote Sensing Images. IEEE Access 2023, 11, 125122–125137. [Google Scholar] [CrossRef]
- Song, Y.; Tang, L.; He, G. LW-SARDet: A Lightweight SAR Ship Detector via Decomposed Convolution. IEEE Geosci. Remote Sens. Lett. 2024, 21, 1–5. [Google Scholar] [CrossRef]
- Hong, Z.; Yang, T.; Tong, X.; Zhang, Y.; Jiang, S.; Zhou, R.; Han, Y.; Wang, J.; Yang, S.; Liu, S. Multi-Scale Ship Detection From SAR and Optical Imagery Via A More Accurate YOLOv3. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 6083–6101. [Google Scholar] [CrossRef]
- Xu, P.; Li, Q.; Zhang, B.; Wu, F.; Zhao, K.; Du, X.; Yang, C.; Zhong, R. On-Board Real-Time Ship Detection in HISEA-1 SAR Images Based on CFAR and Lightweight Deep Learning. Remote Sens. 2021, 13. [Google Scholar] [CrossRef]
- Zhou, K.; Zhang, M.; Wang, H.; Tan, J. Ship Detection in SAR Images Based on Multi-Scale Feature Extraction and Adaptive Feature Fusion. Remote Sens. 2022, 14. [Google Scholar] [CrossRef]
- Li, X.; Yang, W.; Jiang, Y.; Guan, S.; Zhang, Y. Research on SAR Image Ship Target Detection Algorithm with Improved YOLOv7. In Proceedings of the 2024 2nd International Conference on Signal Processing and Intelligent Computing (SPIC); 2024; pp. 443–448. [Google Scholar] [CrossRef]
- Shao, R.; Pan, X.; Ding, D.; Yang, L.; Han, M.; Huang, Z. Small Ship Detection Method of SAR Images Using Lightweight Feature-fused Enhancement Learning. In Proceedings of the 2024 IEEE International Conference on Computational Electromagnetics (ICCEM); 2024; pp. 1–3. [Google Scholar] [CrossRef]
- Haochen Wang, Juan Shi, H. K.F.L.; Wang, F. YOLOSAR-Lite: a lightweight framework for real-time ship detection in SAR imagery. Int. J. Digit. Earth 2024, 17, 2405525. [Google Scholar] [CrossRef]
- Xu, Y.; Du, W.; Deng, L.; Zhang, Y.; Wen, W. Ship target detection in SAR images based on SimAM attention YOLOv8. IET Commun. 2024. [Google Scholar] [CrossRef]
- Huang, Y.; Wang, D.; Huang, W.; An, D. A ViT Merged Oriented-Detector with Neuron Attention for Ship Detection in SAR Images. In Proceedings of the 2024 IEEE 7th International Conference on Electronic Information and Communication Technology (ICEICT); 2024; pp. 85–90. [Google Scholar] [CrossRef]
- Zhang, Y.; Dong, C.; Guo, L.; Meng, X.; Liu, Y.; Wei, Q. AFMSFFNet: An Anchor-Free-Based Feature Fusion Model for Ship Detection. Remote Sens. 2024, 16. [Google Scholar] [CrossRef]
- Yu, J.; Zhou, G.; Zhou, S.; Qin, M. A Fast and Lightweight Detection Network for Multi-Scale SAR Ship Detection under Complex Backgrounds. Remote Sens. 2022, 14. [Google Scholar] [CrossRef]
- Zhang, G.; Li, Z.; Li, X.; Yin, C.; Shi, Z. A Novel Salient Feature Fusion Method for Ship Detection in Synthetic Aperture Radar Images. IEEE Access 2020, 8, 215904–215914. [Google Scholar] [CrossRef]
- Lin, H.; Liu, J.; Li, X.; Wei, L.; Liu, Y.; Han, B.; Wu, Z. DCEA: DETR With Concentrated Deformable Attention for End-to-End Ship Detection in SAR Images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2024, 17, 17292–17307. [Google Scholar] [CrossRef]
- Wang, X.; Cui, Z.; Cao, Z.; Dang, S. Dense Docked Ship Detection via Spatial Group-Wise Enhance Attention in SAR Images. In Proceedings of the IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium; 2020; pp. 1244–1247. [Google Scholar] [CrossRef]
- Li, S.; Fu, X.; Dong, J. Improved Ship Detection Algorithm Based on YOLOX for SAR Outline Enhancement Image. Remote Sensing 2022, 14. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.u.; Polosukhin, I. Attention is All you Need. In Proceedings of the Advances in Neural Information Processing Systems; Guyon, I.; Luxburg, U.V.; Bengio, S.; Wallach, H.; Fergus, R.; Vishwanathan, S.; Garnett, R., Eds. Curran Associates, Inc., Vol. 30. 2017. [Google Scholar]
- Gong, H.; Mu, T.; Li, Q.; Dai, H.; Li, C.; He, Z.; Wang, W.; Han, F.; Tuniyazi, A.; Li, H.; et al. Swin-Transformer-Enabled YOLOv5 with Attention Mechanism for Small Object Detection on Satellite Images. Remote Sens. 2022, 14. [Google Scholar] [CrossRef]
- Chen, X.; Wei, C.; Xin, Z.; Zhao, J.; Xian, J. Ship Detection under Low-Visibility Weather Interference via an Ensemble Generative Adversarial Network. J. Mar. Sci. Eng. 2023, 11. [Google Scholar] [CrossRef]
- Chen, L.; Shi, W.; Deng, D. Improved YOLOv3 Based on Attention Mechanism for Fast and Accurate Ship Detection in Optical Remote Sensing Images. Remote Sens. 2021, 13. [Google Scholar] [CrossRef]
- Xu, K.; Ba, J.; Kiros, R.; Cho, K.; Courville, A.C.; Salakhutdinov, R.; Zemel, R.S.; Bengio, Y. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention. CoRR, 2015, abs/1502.03044, [1502.03044]. [Google Scholar]
- Carion, N.; Massa, F.; Synnaeve, G.; Usunier, N.; Kirillov, A.; Zagoruyko, S. End-to-End Object Detection with Transformers. In Proceedings of the Computer Vision – ECCV 2020; Vedaldi, A.; Bischof, H.; Brox, T.; Frahm, J.M., Eds., Cham; 2020; pp. 213–229. [Google Scholar]
- Zhu, L.; Wang, X.; Ke, Z.; Zhang, W.; Lau, R.W. Biformer: Vision transformer with bi-level routing attention. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition; 2023; pp. 10323–10333. [Google Scholar]
- Xie, Z.; Fu, M.; Liu, X. Detection of Fittings Based on the Dynamic Graph CNN and U-Net Embedded with Bi-Level Routing Attention. Electronics 2023, 12. [Google Scholar] [CrossRef]
- He, B.; Chen, Y.; Zhu, D.; Xu, Z. Domain adaptation via Wasserstein distance and discrepancy metric for chest X-ray image classification. Sci. Rep. 2024, 14, 2690. [Google Scholar] [CrossRef]
- Yao, G.; Zhu, S.; Zhang, L.; Qi, M. HP-YOLOv8: High-Precision Small Object Detection Algorithm for Remote Sensing Images. Sensors 2024, 24. [Google Scholar] [CrossRef]
- Jin, X.; Tong, A.; Ge, X.; Ma, H.; Li, J.; Fu, H.; Gao, L. YOLOv7-Bw: A Dense Small Object Efficient Detector Based on Remote Sensing Image. IECE Trans. Intell. Syst. 2024, 1, 30–39. [Google Scholar] [CrossRef]
- Li, J.; Feng, Y.; Shao, Y.; Liu, F. IDP-YOLOV9: Improvement of Object Detection Model in Severe Weather Scenarios from Drone Perspective. Appl. Sci. 2024, 14. [Google Scholar] [CrossRef]
- Li, J.; Xie, C.; Wu, S.; Ren, Y. UAV-YOLOv5: A Swin-Transformer-Enabled Small Object Detection Model for Long-Range UAV Images. Ann. Data Sci. 2024, pp. 1–30.
- Liu, Q.; Jiang, R.; Xu, Q.; Wang, D.; Sang, Z.; Jiang, X.; Wu, L. YOLOv8n_BT: Research on Classroom Learning Behavior Recognition Algorithm Based on Improved YOLOv8n. IEEE Access 2024, 12, 36391–36403. [Google Scholar] [CrossRef]
- Yasir, M.; Liu, S.; Pirasteh, S.; Xu, M.; Sheng, H.; Wan, J.; de Figueiredo, F.A.; Aguilar, F.J.; Li, J. YOLOShipTracker: Tracking ships in SAR images using lightweight YOLOv8. Int. J. Appl. Earth Obs. Geoinf. 2024, 134, 104137. [Google Scholar] [CrossRef]
- Yasir, M.; Shanwei, L.; Mingming, X.; Hui, S.; Hossain, M.S.; Colak, A.T.I.; Wang, D.; Jianhua, W.; Dang, K.B. Multi-scale ship target detection using SAR images based on improved Yolov5. Front. Mar. Sci. 2023, 9, 1086140. [Google Scholar] [CrossRef]
- Teixeira, E.; Araujo, B.; Costa, V.; Mafra, S.; Figueiredo, F. Literature review on ship localization, classification, and detection methods based on optical sensors and neural networks. Sensors 2022, 22, 6879. [Google Scholar] [CrossRef] [PubMed]
- Rocha, R.d.L.; de Figueiredo, F.A. Beyond Land: A Review of Benchmarking Datasets, Algorithms, and Metrics for Visual-Based Ship Tracking. Electronics 2023, 12, 2789. [Google Scholar] [CrossRef]
- Kang, M.; Ting, C.M.; Ting, F.F.; Phan, R.C.W. BGF-YOLO: enhanced YOLOv8 with multiscale attentional feature fusion for brain tumor detection. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention – MICCAI 2024: 27th International Conference, Marrakesh, Morocco, 2024, Proceedings, Part VIII; Linguraru, M.G.; et al.., Eds., Cham, 2024; Vol. 15008, Lecture Notes in Computer Science (LNCS), October 6–10; pp. 35–45. [CrossRef]
- Qu, Y.; Wang, C.; Xiao, Y.; Yu, J.; Chen, X.; Kong, Y. Optimization Algorithm for Surface Defect Detection of Aircraft Engine Components Based on YOLOv5. Appl. Sci. 2023, 13. [Google Scholar] [CrossRef]
- Li, J.; Xu, C.; Su, H.; Gao, L.; Wang, T. Deep Learning for SAR Ship Detection: Past, Present and Future. Remote Sens. 2022, 14. [Google Scholar] [CrossRef]
- Cui, Z.; Wang, X.; Liu, N.; Cao, Z.; Yang, J. Ship Detection in Large-Scale SAR Images Via Spatial Shuffle-Group Enhance Attention. IEEE Trans. Geosci. Remote Sens. 2021, 59, 379–391. [Google Scholar] [CrossRef]
- Chen, Y.; Yu, J.; Xu, Y. SAR Ship Target Detection for SSDv2 under Complex Backgrounds. In Proceedings of the 2020 International Conference on Computer Vision, Image and Deep Learning (CVIDL); 2020; pp. 560–565. [Google Scholar] [CrossRef]
- Chaudhary, Y.; Mehta, M.; Goel, N.; Bhardwaj, P.; Gupta, D.; Khanna, A. YOLOv3 Remote Sensing SAR Ship Image Detection. In Proceedings of the Data Analytics and Management; Khanna, A.; Gupta, D.; Pólkowski, Z.; Bhattacharyya, S.; Castillo, O., Eds., Singapore; 2021; pp. 519–531. [Google Scholar]
- Zhu, C.; Zhao, D.; Liu, Z.; Mao, Y. Hierarchical Attention for Ship Detection in SAR Images. In Proceedings of the IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium; 2020; pp. 2145–2148. [Google Scholar] [CrossRef]
- Wang, X.; Cui, Z.; Cao, Z.; Tian, Y. Ship Detection in Large Scale Sar Images Based on Bias Classification. In Proceedings of the IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium; 2020; pp. 1263–1266. [Google Scholar] [CrossRef]
- Wu, Z.; Hou, B.; Ren, B.; Ren, Z.; Wang, S.; Jiao, L. A Deep Detection Network Based on Interaction of Instance Segmentation and Object Detection for SAR Images. Remote Sens. 2021, 13. [Google Scholar] [CrossRef]
- Dong, Y.; Zhang, H.; Wang, C.; Zhang, B.; Li, L. Ship Detection based on M2Det for SAR images under Heavy Sea State. In Proceedings of the EUSAR 2021; 2021, 13th European Conference on Synthetic Aperture Radar; pp. 1–4.
- Li, J.; Liu, C.; Lu, X.; Wu, B. CME-YOLOv5: An Efficient Object Detection Network for Densely Spaced Fish and Small Targets. Water 2022, 14. [Google Scholar] [CrossRef]
- Kim, M.; Jeong, J.; Kim, S. ECAP-YOLO: Efficient Channel Attention Pyramid YOLO for Small Object Detection in Aerial Image. Remote Sens. 2021, 13. [Google Scholar] [CrossRef]
- Yang, W.; Ma, X.; Hu, W.; Tang, P. Lightweight Blueberry Fruit Recognition Based on Multi-Scale and Attention Fusion NCBAM. Agronomy 2022, 12. [Google Scholar] [CrossRef]







| Model | P | R | mAP | mAP- | layers | GFLOPS | FPS |
|---|---|---|---|---|---|---|---|
| YOLOv10n | 0,951 | 0,947 | 0,9736 | 0,687 | 385 | 8,4 | 769 |
| YOLO-SSOD (ours) | 0,950 | 0,948 | 0,9770 | 0,685 | 413 | 9,2 | 434 |
| Model | Train/Val/Test | P | R | mAP0.5 | mAP- |
|---|---|---|---|---|---|
| [9] | 7:2:1 | - | - | 0,8907 | - |
| [52] | 8:1:1 | - | - | 0,9470 | - |
| [27] | 7:2:1 | 0,944 | - | 0,9189 | - |
| [53] | 7:2:1 | - | - | 0,9107 | - |
| [54] | - | - | - | 0,9025 | - |
| [29] | 7:2:1* | - | - | 0,9390 | - |
| [55] | 7:1:2 | - | - | 0,9510 | - |
| [56] | 7:2:1* | - | - | 0,9240 | - |
| [18] | 7:2:1 | - | - | 0,9346 | - |
| [17] | 7:0:3 | 0,837 | 0,958 | 0,9552 | - |
| [57] | 7:0:3 | - | - | 0,9580 | - |
| [58] | 7:2:1* | - | - | 0,9439 | - |
| [26] | 8:0:2 | 0,911 | 0,922 | 0,9610 | - |
| [19] | 8:0:2 | - | - | 0,9510 | - |
| [25] | 7:2:1 | 0,924 | 0,957 | 0,9721 | - |
| [28] | 8:2:0 | - | - | 0,9620 | - |
| [22] | 7:1:2 | 0,927 | 0,914 | 0,9606 | - |
| YOLOv10n | 7:2:1 | 0,951 | 0,947 | 0,9736 | 0,687 |
| [23] | 8:0:2 | 0,947 | 0,945 | 0,9772 | 0,690 |
| YOLO-SSOD (ours) | 7:2:1 | 0,950 | 0,948 | 0,9770 | 0,685 |
| YOLO-SSOD (ours) | 8:0:2 | 0,952 | 0,951 | 0,9777 | 0,700 |
| BRA Positioning | P | R | mAP0.5 | mAP0.5-0.95 |
|---|---|---|---|---|
| YOLOv10n (baseline) | 0,925 | 0,938 | 0,972 | 0,678 |
| Layer 5 post C2f | 0,925 | 0,929 | 0,966 | 0,671 |
| Layer 7 post C2f | 0,945 | 0,907 | 0,971 | 0,676 |
| Layer 9 post C2f | 0,933 | 0,940 | 0,972 | 0,680 |
| Fusion 4 to 14 | 0,924 | 0,946 | 0,969 | 0,682 |
| Fusion 2 to 17 | 0,930 | 0,938 | 0,971 | 0,682 |
| Layer 9 + Fusion 2 to 18 | 0,944 | 0,927 | 0,974 | 0,678 |
| Fusion 2 to 19 + Fusion 4 to 14 | 0,948 | 0,930 | 0,975 | 0,690 |
| BRA Positioning+TODL | P | R | mAP0.5 | mAP0.5-0.95 |
|---|---|---|---|---|
| YOLOv10n (baseline) | 0,925 | 0,938 | 0,972 | 0,678 |
| Only TODL | 0,913 | 0,939 | 0,974 | 0,688 |
| Layer 5 post C2f | 0,937 | 0,934 | 0,973 | 0,685 |
| Layer 7 post C2f | 0,928 | 0,935 | 0,972 | 0,680 |
| Layer 9 post C2f | 0,917 | 0,921 | 0,968 | 0,679 |
| Fusion 4 to 14 | 0,920 | 0,948 | 0,969 | 0,681 |
| Fusion 2 to 17 | 0,925 | 0,944 | 0,970 | 0,681 |
| Fusion 2 to 19 + Fusion 4 to 14 | 0,928 | 0,925 | 0,968 | 0,683 |
| Model | P | R | mAP0.5 | mAP0.5-0.95 |
|---|---|---|---|---|
| YOLOv10n (baseline) | 0,925 | 0,938 | 0,972 | 0,678 |
| Only TODL | 0,913 | 0,939 | 0,974 | 0,688 |
| Only BRA Fusion 2 to 19 + Fusion 4 to 14 | 0,948 | 0,930 | 0,975 | 0,690 |
| Model | P | R | mAP0.5 | mAP0.5-0.95 |
|---|---|---|---|---|
| YOLOv10n (baseline) | 0,951 | 0,947 | 0,9736 | 0,687 |
| Only TODL | 0,950 | 0,948 | 0,9769 | 0,692 |
| Only BRA Fusion 2 to 19 + Fusion 4 to 14 | 0,950 | 0,948 | 0,9770 | 0,685 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).