Submitted:
26 May 2026
Posted:
26 May 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
2.1. Research on the Decoupling of Denoising and Detection Tasks in SAR Imagery
2.2. Research on Multi-Scale Object Detection and Anchor Box Generation Mechanism
2.3. Research on Bounding-Box Regression Losses and Hard-Sample Reweighting
3. The Proposed DN-AnchorNet Framework
3.1. Framework Overview and Optimization Criteria
3.1.1. Overall Architecture and Workflow
3.1.2. Unified Loss Function Design
3.2. Image Denoising Module
3.3. Feature Extraction Network
3.3.1. Backbone Network
3.3.2. Feature Pyramid Network (FPN) Neck
3.3.3. Oriented Detection Head
3.4. Region Proposal Network
3.5. ROI Head Module
4. Experiments and Analysis
4.1. Datasets and Evaluation Protocol
4.2. Implementation Details and Evaluation Metrics
4.3. Ablation Studies
4.3.1. Individual Module Ablation
4.3.2. Combined Module Ablation
4.4. Overall Performance Comparison
4.5. Cross-Dataset Generalization on HRSID
5. Conclusions and Future Work
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Cao, S.; Zhao, C.; Dong, J.; Fu, X. Ship Detection in Synthetic Aperture Radar Images under Complex Geographical Environments, Based on Deep Learning and Morphological Networks. Sensors 2024, 24, 4290. [CrossRef]
- Chen, H.; He, M.; Yang, Z.; Gan, L. MCEM: Multi-Cue Fusion with Clutter Invariant Learning for Real-Time SAR Ship Detection. Sensors 2025, 25, 5736. [CrossRef]
- Li, J.; Xu, C.; Su, H.; Gao, L.; Wang, T. Deep Learning for SAR Ship Detection: Past, Present and Future. Remote Sens. 2022, 14, 2712. [CrossRef]
- Shin, S.; Kim, Y.; Hwang, I.; Kim, J.; Kim, S. Coupling Denoising to Detection for SAR Imagery. Appl. Sci. 2021, 11, 5569. [CrossRef]
- Zhao, X.; Zhang, B.; Tian, Z.; Xu, C.; Wu, F.; Sun, C. An Anchor-Free Method for Arbitrary-Oriented Ship Detection in SAR Images. In Proceedings of the 2021 SAR in Big Data Era (BIGSARDATA), Nanjing, China, 22–24 September 2021; pp. 1–4. [CrossRef]
- Lin, T.-Y.; Goyal, P.; Girshick, R.; He, K.; Dollár, P. Focal Loss for Dense Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 42, 318–327. [CrossRef]
- Girshick, R. Fast R-CNN. In Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 7–13 December 2015; pp. 1440–1448. [CrossRef]
- Lee, J.S. Digital image enhancement and noise filtering by use of local statistics. IEEE Trans. Pattern Anal. Mach. Intell. 1980, PAMI-2, 165–168. [CrossRef]
- Frost, V.S.; Stiles, J.A.; Shanmugan, K.S.; Holtzman, J.C. A model for radar images and its application to adaptive digital filtering of multiplicative noise. IEEE Trans. Pattern Anal. Mach. Intell. 1982, PAMI-4, 157–166. [CrossRef]
- Lee, J.S.; Wen, J.H.; Ainsworth, T.L.; Chen, K.S.; Chen, A.J. Improved sigma filter for speckle filtering of SAR imagery. IEEE Trans. Geosci. Remote Sens. 2009, 47, 202–213. [CrossRef]
- Wang, P.; Zhang, H.; Patel, V.M. SAR image despeckling using a convolutional neural network. IEEE Signal Process. Lett. 2017, 24, 1763–1767. [CrossRef]
- Guo, J.; Li, Y.; Liu, H.; Li, W. Compact convolutional autoencoder for SAR target recognition. IET Radar Sonar Navig. 2020, 14, 967–972. [CrossRef]
- Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany, 5–9 October 2015; pp. 234–241. [CrossRef]
- Yao, X.; Shen, Y.; Lei, Y. FSMD–Net: Joint Spatial–Channel Spectral Modeling for SAR Ship Detection in Complex Inshore Scenarios. Remote Sens. 2026, 18, 1254. [CrossRef]
- Tang, J.; Zhang, F.; Ma, F.; Gao, F.; Yin, Q.; Zhou, Y. How SAR Image Denoise Affects the Performance of DCNN-Based Target Recognition Method. In Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Brussels, Belgium, 11–16 July 2021; pp. 3609–3612. [CrossRef]
- Zhang, X.; Huo, C.; Xu, N.; Jiang, H.; Cao, Y.; Ni, L.; Pan, C. Multitask Learning for Ship Detection From Synthetic Aperture Radar Images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 8048–8062. [CrossRef]
- Zhao, M.; Zhang, X.; Kaup, A. Multitask Learning for SAR Ship Detection With Gaussian-Mask Joint Segmentation. IEEE Transactions on Geoscience and Remote Sensing 2023, 61, 1–16. [CrossRef]
- Pan, Y.; Ye, L.; Xu, Y.; Liang, J. Integrating Prior Knowledge into Attention for Ship Detection in SAR Images. Appl. Sci. 2023, 13, 2941. [CrossRef]
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.; Fu, C.-Y.; Berg, A.C. SSD: Single Shot MultiBox Detector. In Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands, 11–14 October 2016; pp. 21–37. [CrossRef]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1137–1149. [CrossRef]
- Lin, T.-Y.; Dollár, P.; Girshick, R.; He, K.; Hariharan, B.; Belongie, S. Feature Pyramid Networks for Object Detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 936–944. [CrossRef]
- Liu, J.; Liao, D.; Wang, X.; Li, J.; Yang, B.; Chen, G. LCAS-DetNet: A Ship Target Detection Network for Synthetic Aperture Radar Images. Appl. Sci. 2024, 14, 5322. [CrossRef]
- Zhou, X.; Wang, D.; Krähenbühl, P. Objects as points. arXiv 2019, arXiv:1904.07850. https://arxiv.org/abs/1904.07850.
- Tian, Z.; Shen, C.; Chen, H.; He, T. FCOS: Fully Convolutional One-Stage Object Detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 9626–9635. [CrossRef]
- Law, H.; Deng, J. CornerNet: Detecting Objects as Paired Keypoints. Int. J. Comput. Vis. 2020, 128, 642–656. [CrossRef]
- Kong, T.; Sun, F.; Liu, H.; Jiang, Y.; Li, L.; Shi, J. FoveaBox: Beyond Anchor-based Object Detection. IEEE Trans. Image Process. 2020, 29, 7389–7398. [CrossRef]
- Yang, X.; Li, W.; Wang, H.; Wang, P.; Fu, K. R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtual Event, 2–9 February 2021; pp. 3163–3171. [CrossRef]
- Xie, X.; Cheng, G.; Wang, J.; Yao, X.; Han, J. Oriented R-CNN for Object Detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, QC, Canada, 11–17 October 2021; pp. 3500–3509. [CrossRef]
- Ding, J.; Xue, N.; Long, Y.; Xia, G.-S.; Lu, Q. Learning RoI Transformer for Oriented Object Detection in Aerial Images. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 2844–2853. [CrossRef]
- Xu, Y.; Fu, M.; Wang, Q.; Wang, Y.; Chen, K.; Xia, G.-S.; Bai, X. Gliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 43, 1452–1459. [CrossRef]
- Jocher, G.; Chaurasia, A.; Qiu, J. Ultralytics YOLO (Version 8.0.0) [Computer software]. Ultralytics, 2023. Available online: https://github.com/ultralytics/ultralytics (accessed on 20 April 2026).
- Zhang, X.; Yang, X.; Li, Y.; Yang, J.; Cheng, M.-M.; Li, X. RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 10–17 June 2025; pp. 7416–7426. [CrossRef]
- Rezatofighi, H.; Tsoi, N.; Gwak, J.Y.; Sadeghian, A.; Reid, I.; Savarese, S. Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 658–666. [CrossRef]
- Zheng, Z.; Wang, P.; Liu, W.; Li, J.; Ye, R.; Ren, D. Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; pp. 12993–13000. [CrossRef]
- Zhang, H.; Wang, Y.; Dayoub, F.; Sunderhauf, N. VarifocalNet: An IoU-aware Dense Object Detector. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 8510–8519. [CrossRef]
- Geng, Z.; Xu, Y.; Wang, B.-N.; Yu, X.; Zhu, D.-Y.; Zhang, G. Target Recognition in SAR Images by Deep Learning with Training Data Augmentation. Sensors 2023, 23, 941. [CrossRef]
- Zhao, P.; Chen, J.; Wan, H.; Cao, Y.; Wang, S.; Zhang, Y.; Li, Y.; Huang, Z.; Wu, B. Few-Shot Object Detection for SAR Images via Context-Aware and Robust Gaussian Flow Representation. Remote Sens. 2025, 17, 391. [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [CrossRef]
- Xu, C.; Su, H.; Li, J.; Liu, Y.; Yao, L.; Gao, L.; Yan, W.; Wang, T. RSDD-SAR: Rotated ship detection dataset in SAR images. J. Radars 2022, 11, 581–599. [CrossRef]
- Zhang, T.; Zhang, X.; Li, J.; Wang, B.; Xu, X. SAR Ship Detection Dataset (SSDD): Official Release and Comprehensive Data Analysis. Remote Sens. 2021, 13, 3690. [CrossRef]
- Wei, S.; Zeng, X.; Qu, Q.; Wang, M.; Su, H.; Shi, J. HRSID: A High-Resolution SAR Images Dataset for Ship Detection and Instance Segmentation. IEEE Access 2020, 8, 120234–120254. [CrossRef]








| Dataset | Images | Ship Instances |
| RSDD-SAR | 7,000 | 10,263 |
| SSDD+ | 1,160 | 2,456 |
| HRSID | 5,604 | 16,951 |
| Category | Specific Setting |
| Baseline model | Oriented R-CNN |
| Backbone network | ResNet-50 (ImageNet pretrained) |
| Improved modules | Denoising module, Adaptive Anchors and Adaptive Smooth L1 |
| Input size | 512 × 512 |
| Data augmentation | Random flipping, random rotation |
| Optimizer | SGD |
| Training hyperparameters | Initial learning rate 0.0025, momentum 0.9, weight decay 1×10⁻⁴ |
| Training scale | 20 epochs, batch size = 2 |
| Learning rate scheduling | Decay at the 8th and 11th epochs |
| Evaluation metrics | Recall, Precision, AP50, F1-score, FPR, FPPI |
| Recall | Precision | AP₅₀ | F1-score | FPR | FPPI | |
| baseline | 0.723 | 0.612 | 0.647 | 0.663 | 38.77 | 1.71 |
| +Denoising module | 0.787 | 0.689 | 0.686 | 0.735 | 31.13 | 1.33 |
| +Adaptive Anchors | 0.752 | 0.635 | 0.653 | 0.688 | 36.54 | 1.62 |
| +Adaptive Smooth L1 | 0.730 | 0.630 | 0.649 | 0.676 | 37.04 | 1.61 |
| Recall | Precision | AP₅₀ | F1-score | FPR | FPPI | |
| baseline | 0.640 | 0.582 | 0.579 | 0.609 | 41.8 | 1.72 |
| +Denoising module | 0.622 | 0.713 | 0.590 | 0.665 | 28.67 | 0.93 |
| +Adaptive Anchors | 0.622 | 0.637 | 0.591 | 0.629 | 36.30 | 1.32 |
| +Adaptive Smooth L1 | 0.622 | 0.615 | 0.588 | 0.618 | 38.51 | 1.46 |
| Recall | Precision | AP₅₀ | F1-score | FPR | FPPI | |
| Denoising module + Adaptive Anchor | 0.799 | 0.707 | 0.690 | 0.750 | 29.27 | 1.24 |
| Denoising module + Adaptive Smooth L1 | 0.770 | 0.731 | 0.684 | 0.750 | 26.91 | 1.06 |
| Adaptive Anchor + Adaptive Smooth L1 | 0.738 | 0.658 | 0.654 | 0.691 | 34.24 | 1.42 |
| Denoising module + Adaptive Anchor + Adaptive Smooth L1 | 0.762 | 0.752 | 0.699 | 0.757 | 24.83 | 0.94 |
| Recall | Precision | AP₅₀ | F1-score | FPR | FPPI | |
| Denoising module + Adaptive Anchor | 0.605 | 0.743 | 0.580 | 0.667 | 26.17 | 0.85 |
| Denoising module + Adaptive Smooth L1 | 0.651 | 0.651 | 0.590 | 0.651 | 26.91 | 0.89 |
| Adaptive Anchor + Adaptive Smooth L1 | 0.645 | 0.665 | 0.602 | 0.655 | 33.53 | 1.22 |
| Denoising module + Adaptive Anchor + Adaptive Smooth L1 | 0.685 | 0.726 | 0.610 | 0.689 | 25.39 | 0.80 |
| Recall | Precision | AP₅₀ | F1-score | FPR | FPPI | |
| Faster R-CNN | 0.765±0.012 | 0.429±0.011 | 0.616±0.006 | 0.549±0.011 | 57.14±2.14 | 3.82±0.31 |
| RoI Transformer | 0.768±0.008 | 0.550±0.005 | 0.648±0.005 | 0.641±0.006 | 45.01±1.58 | 2.36±0.22 |
| Gliding Vertex | 0.762±0.007 | 0.485±0.009 | 0.617±0.007 | 0.592±0.008 | 51.54±1.32 | 3.03±0.19 |
| YOLOv8-OBB | 0.370±0.014 | 0.743±0.012 | 0.507±0.009 | 0.594±0.013 | 23.10±1.88 | 0.85±0.28 |
| H2RBOX-URC | 0.512±0.018 | 0.790±0.011 | 0.493±0.011 | 0.621±0.015 | 18.08±1.92 | 0.51±0.30 |
| DN-AnchorNet | 0.762±0.007 | 0.752±0.006 | 0.699±0.007 | 0.757±0.006 | 24.83±0.99 | 0.94±0.16 |
| Recall | Precision | AP₅₀ | F1-score | FPR | FPPI | |
| Faster R-CNN | 0.581±0.007 | 0.490±0.008 | 0.474±0.005 | 0.532±0.009 | 50.98±1.12 | 2.26±0.41 |
| RoI Transformer | 0.674±0.006 | 0.671±0.010 | 0.576±0.008 | 0.672±0.008 | 32.95±0.96 | 1.24±0.37 |
| Gliding Vertex | 0.593±0.011 | 0.567±0.006 | 0.508±0.007 | 0.580±0.008 | 43.33±1.41 | 1.70±0.56 |
| YOLOv8-OBB | 0.344±0.017 | 0.712±0.012 | 0.450±0.012 | 0.454±0.015 | 27.69±1.27 | 0.90±0.49 |
| H2RBOX-URC | 0.493±0.014 | 0.810±0.008 | 0.430±0.021 | 0.613±0.011 | 15.89±1.66 | 0.46±0.65 |
| DN-AnchorNet | 0.685±0.007 | 0.726±0.010 | 0.610±0.009 | 0.689±0.009 | 25.39±0.88 | 0.80±0.32 |
| Recall | Precision | AP₅₀ | F1-score | FPR | FPPI | |
| Oriented R-CNN(baseline) | 0.651 | 0.685 | 0.594 | 0.668 | 18.93 | 0.51 |
| Faster R-CNN | 0.636 | 0.651 | 0.590 | 0.644 | 34.85 | 1.00 |
| RoI Transformer | 0.612 | 0.701 | 0.603 | 0.662 | 22.94 | 0.55 |
| Gliding Vertex | 0.630 | 0.707 | 0.601 | 0.666 | 29.32 | 0.77 |
| DN-AnchorNet | 0.668 | 0.716 | 0.619 | 0.689 | 17.94 | 0.42 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).