Submitted:
20 October 2023
Posted:
26 October 2023
You are already at the latest version
Abstract

Keywords:
1. Introduction
- Original features play a reinforcing role in fused features, enhancing residual functions and facilitating stable gradient propagation during backpropagation. However, feature pyramids fail to fully exploit the most original feature information.
- Convolutional neural networks are unable to aggregate information between distant pixels in the spatial domain, resulting in underutilization of long-range correlated information that adversely impacts detection results.
- Within the feature pyramid, we efficiently harness original feature information to process multi-scale features more effectively. We introduce a multi-scale fusion pyramid network that connects original features and fused features while shortening the information transmission paths. This connection extends from large-scale features to fused small-scale features, enabling the module to optimally utilize features at each stage.
- Drawing inspiration from attention mechanisms, we design a global feature context aggregation module to aggregate feature information within feature maps and weight them adaptively for each pixel. Through iterative learning of semantic information between features, we fuse useful global information into local regions, resulting in improved pixel-level attention for objects of interest.
- We introduce a novel object detector and conduct extensive experiments on three challenging datasets: the DIOR-R dataset, the HRSC2016 dataset, and the MAR20 dataset confirming the effectiveness of our approach. Experimental results demonstrate outstanding performance.
2. Releated Work
2.1. Object Detection in General Scenarios
2.2. Object Detection in Remote Sensing Scenarios
3. Methodology
3.1. Basic Rotated Detection Method as Baseline
3.2. Multiscale Feature Fusion Network
3.3. Global Feature Content Aggregation Module
3.4. MFCA
4. Experiments
4.1. Datasets and Evaluation Metrics
4.1.1. Datasets
4.1.2. Evaluation Metrics
4.2. Implementation Details
4.3. Comparisons with State-of-the-Art
4.3.1. Results on DIOR-R
4.3.2. Results on HRSC2016
4.3.3. Results on MAR20
4.4. Ablation Study
4.4.1. Ablation Test with Different Feature Fusion Methods in MFFM
4.4.2. Ablation Test of MFCA
5. Conclusions
Author Contributions
Funding
References
- Barmpoutis, P.; Papaioannou, P.; Dimitropoulos, K.; Grammalidis, N. A review on early forest fire detection systems using optical remote sensing. Sensors 2020, 20, 6442. [Google Scholar] [CrossRef] [PubMed]
- Mohan, A.; Singh, A.K.; Kumar, B.; Dwivedi, R. Review on remote sensing methods for landslide detection using machine and deep learning. Trans. Emerg. Telecommun. Technol. 2021, 32, e3998. [Google Scholar] [CrossRef]
- Karthikeyan, L.; Chawla, I.; Mishra, A.K. A review of remote sensing applications in agriculture for food security: Crop growth and yield, irrigation, and crop losses. J. Hydrol. 2020, 586, 124905. [Google Scholar] [CrossRef]
- Chawla, I.; Karthikeyan, L.; Mishra, A.K. A review of remote sensing applications for water security: Quantity, quality, and extremes. J. Hydrol. 2020, 585, 124826. [Google Scholar] [CrossRef]
- Bo, L.; Xiaoyang, X.; Xingxing, W.; Wenting, T. Ship detection and classification from optical remote sensing images: A survey. Chin. J. Aeronaut. 2021, 34, 145–163. [Google Scholar] [CrossRef]
- Lin, T.Y.; Dollár, P.; Girshick, R.; He, K.; Hariharan, B.; Belongie, S. Feature pyramid networks for object detection. Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 2117–2125.
- Yang, L.; Haining, W.; Yuqiang, F.; Shengjin, W.; Zhi, L.; JIANG, B. Learning power Gaussian modeling loss for dense rotated object detection in remote sensing images. Chin. J. Aeronaut. 2023. [Google Scholar]
- Zhang, W.; Jiao, L.; Li, Y.; Huang, Z.; Wang, H. Laplacian feature pyramid network for object detection in VHR optical remote sensing images. IEEE Trans. Geosci. Remote Sens. 2021, 60, 1–14. [Google Scholar] [CrossRef]
- Zhu, X.; Lyu, S.; Wang, X.; Zhao, Q. TPH-YOLOv5: Improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios. Proceedings of the IEEE/CVF international conference on computer vision, 2021, pp. 2778–2788.
- Chen, S.; Zhao, J.; Zhou, Y.; Wang, H.; Yao, R.; Zhang, L.; Xue, Y. Info-FPN: An Informative Feature Pyramid Network for object detection in remote sensing images. Expert Syst. Appl. 2023, 214, 119132. [Google Scholar] [CrossRef]
- Shen, F.; Ye, H.; Zhang, J.; Wang, C.; Han, X.; Yang, W. Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models. arXiv 2023, arXiv:2310.06313. [Google Scholar]
- Shen, F.; He, X.; Wei, M.; Xie, Y. A competitive method to vipriors object detection challenge. arXiv 2021, arXiv:2104.09059. [Google Scholar]
- Han, J.; Ding, J.; Li, J.; Xia, G.S. Align deep features for oriented object detection. IEEE Trans. Geosci. Remote Sens. 2021, 60, 1–11. [Google Scholar] [CrossRef]
- Redmon, J.; Farhadi, A. Yolov3: An incremental improvement. arXiv 2018, arXiv:1804.02767. [Google Scholar]
- Bochkovskiy, A.; Wang, C.Y.; Liao, H.Y.M. Yolov4: Optimal speed and accuracy of object detection. arXiv 2020, arXiv:2004.10934. [Google Scholar]
- Li, C.; Li, L.; Jiang, H.; Weng, K.; Geng, Y.; Li, L.; Ke, Z.; Li, Q.; Cheng, M.; Nie, W.; others. YOLOv6: A single-stage object detection framework for industrial applications. arXiv 2022, arXiv:2209.02976. [Google Scholar]
- Wang, C.Y.; Bochkovskiy, A.; Liao, H.Y.M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 7464–7475.
- Lyu, C.; Zhang, W.; Huang, H.; Zhou, Y.; Wang, Y.; Liu, Y.; Zhang, S.; Chen, K. Rtmdet: An empirical study of designing real-time object detectors. arXiv 2022, arXiv:2212.07784. [Google Scholar]
- Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition, 2014, pp. 580–587.
- Girshick, R. Fast r-cnn. Proceedings of the IEEE international conference on computer vision, 2015, pp. 1440–1448.
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster r-cnn: Towards real-time object detection with region proposal networks. Adv. Neural Inf. Process. Syst. 2015, 28. [Google Scholar] [CrossRef]
- Dai, J.; Li, Y.; He, K.; Sun, J. R-fcn: Object detection via region-based fully convolutional networks. Adv. Neural Inf. Process. Syst. 2016, 29. [Google Scholar]
- Li, Y.; Wang, H.; Dang, L.M.; Song, H.K.; Moon, H. ORCNN-X: Attention-Driven Multiscale Network for Detecting Small Objects in Complex Aerial Scenes. Remote Sens. 2023, 15, 3497. [Google Scholar] [CrossRef]
- Chen, J.; Hong, H.; Song, B.; Guo, J.; Chen, C.; Xu, J. MDCT: Multi-Kernel Dilated Convolution and Transformer for One-Stage Object Detection of Remote Sensing Images. Remote Sens. 2023, 15, 371. [Google Scholar] [CrossRef]
- Shen, F.; Xie, Y.; Zhu, J.; Zhu, X.; Zeng, H. Git: Graph interactive transformer for vehicle re-identification. IEEE Trans. Image Process. 2023. [Google Scholar] [CrossRef]
- Guo, M.H.; Xu, T.X.; Liu, J.J.; Liu, Z.N.; Jiang, P.T.; Mu, T.J.; Zhang, S.H.; Martin, R.R.; Cheng, M.M.; Hu, S.M. Attention mechanisms in computer vision: A survey. Comput. Vis. Media 2022, 8, 331–368. [Google Scholar] [CrossRef]
- Shen, F.; Shu, X.; Du, X.; Tang, J. Pedestrian-specific Bipartite-aware Similarity Learning for Text-based Person Retrieval 2023.
- Zheng, S.; Wu, Z.; Xu, Y.; Wei, Z. Instance-Aware Spatial-Frequency Feature Fusion Detector for Oriented Object Detection in Remote Sensing Images. IEEE Trans. Geosci. Remote Sens. 2023. [Google Scholar] [CrossRef]
- Wang, Q.; Wu, B.; Zhu, P.; Li, P.; Zuo, W.; Hu, Q. ECA-Net: Efficient channel attention for deep convolutional neural networks. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 11534–11542.
- Zhang, Q.L.; Yang, Y.B. Sa-net: Shuffle attention for deep convolutional neural networks. ICASSP 2021–2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2021, pp. 2235–2239.
- Lee, Y.; Park, J. Centermask: Real-time anchor-free instance segmentation. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 13906–13915.
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S. ; others. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv, arXiv:2010.11929.
- Carion, N.; Massa, F.; Synnaeve, G.; Usunier, N.; Kirillov, A.; Zagoruyko, S. End-to-end object detection with transformers. European conference on computer vision. Springer, 2020, pp. 213–229.
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin transformer: Hierarchical vision transformer using shifted windows. Proceedings of the IEEE/CVF international conference on computer vision, 2021, pp. 10012–10022.
- Shen, F.; Du, X.; Zhang, L.; Tang, J. Triplet Contrastive Learning for Unsupervised Vehicle Re-identification. arXiv 2023, arXiv:2301.09498. [Google Scholar]
- Chalavadi, V.; Jeripothula, P.; Datla, R.; Ch, S.B.; others. mSODANet: A network for multi-scale object detection in aerial images using hierarchical dilated convolutions. Pattern Recognit. 2022, 126, 108548. [Google Scholar] [CrossRef]
- Zhang, J.; Lei, J.; Xie, W.; Fang, Z.; Li, Y.; Du, Q. SuperYOLO: Super resolution assisted object detection in multimodal remote sensing imagery. IEEE Trans. Geosci. Remote Sens. 2023, 61, 1–15. [Google Scholar] [CrossRef]
- Qiao, C.; Shen, F.; Wang, X.; Wang, R.; Cao, F.; Zhao, S.; Li, C. A Novel Multi-Frequency Coordinated Module for SAR Ship Detection 2022. pp. 804–811.
- Li, J.; Li, Z.; Chen, M.; Wang, Y.; Luo, Q. A new ship detection algorithm in optical remote sensing images based on improved R3Det. Remote Sens. 2022, 14, 5048. [Google Scholar] [CrossRef]
- Liu, F.; Chen, R.; Zhang, J.; Ding, S.; Liu, H.; Ma, S.; Xing, K. ESRTMDet: An End-to-End Super-Resolution Enhanced Real-Time Rotated Object Detector for Degraded Aerial Images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2023. [Google Scholar] [CrossRef]
- Shen, F.; Zhu, J.; Zhu, X.; Xie, Y.; Huang, J. Exploring spatial significance via hybrid pyramidal graph network for vehicle re-identification. IEEE Trans. Intell. Transp. Syst. 2021, 23, 8793–8804. [Google Scholar] [CrossRef]
- Min, L.; Fan, Z.; Lv, Q.; Reda, M.; Shen, L.; Wang, B. YOLO-DCTI: Small Object Detection in Remote Sensing Base on Contextual Transformer Enhancement. Remote Sens. 2023, 15, 3970. [Google Scholar] [CrossRef]
- Cheng, G.; Wang, J.; Li, K.; Xie, X.; Lang, C.; Yao, Y.; Han, J. Anchor-free oriented proposal generator for object detection. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–11. [Google Scholar] [CrossRef]
- Tian, Z.; Shen, C.; Chen, H.; He, T. FCOS: A simple and strong anchor-free object detector. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 44, 1922–1933. [Google Scholar] [CrossRef]
- Wang, X.; Girdhar, R.; Yu, S.X.; Misra, I. Cut and learn for unsupervised object detection and instance segmentation. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 3124–3134.
- Yang, X.; Zhang, G.; Li, W.; Wang, X.; Zhou, Y.; Yan, J. H2RBox: Horizonal Box Annotation is All You Need for Oriented Object Detection. arXiv 2022, arXiv:2210.06742. [Google Scholar]
- Sun, X.; Cheng, G.; Pei, L.; Li, H.; Han, J. Threatening patch attacks on object detection in optical remote sensing images. IEEE Trans. Geosci. Remote Sens. 2023. [Google Scholar] [CrossRef]
- Wan, D.; Lu, R.; Wang, S.; Shen, S.; Xu, T.; Lang, X. YOLO-HR: Improved YOLOv5 for Object Detection in High-Resolution Optical Remote Sensing Images. Remote Sens. 2023, 15, 614. [Google Scholar] [CrossRef]
- Yang, X.; Yan, J.; Ming, Q.; Wang, W.; Zhang, X.; Tian, Q. Rethinking rotated object detection with gaussian wasserstein distance loss. International conference on machine learning. PMLR, 2021, pp. 11830–11841.
- Ge, Z.; Liu, S.; Wang, F.; Li, Z.; Sun, J. Yolox: Exceeding yolo series in 2021. arXiv 2021, arXiv:2107.08430. [Google Scholar]
- Liu, Z.; Mao, H.; Wu, C.Y.; Feichtenhofer, C.; Darrell, T.; Xie, S. A convnet for the 2020s. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2022, pp. 11976–11986.
- Ding, X.; Zhang, X.; Han, J.; Ding, G. Scaling up your kernels to 31x31: Revisiting large kernel design in cnns. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2022, pp. 11963–11975.
- Liu, S.; Qi, L.; Qin, H.; Shi, J.; Jia, J. Path aggregation network for instance segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 8759–8768.
- Wang, X.; Girshick, R.; Gupta, A.; He, K. Non-local neural networks. Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 7794–7803.
- Li, K.; Wan, G.; Cheng, G.; Meng, L.; Han, J. Object detection in optical remote sensing images: A survey and a new benchmark. ISPRS J. Photogramm. Remote Sens. 2020, 159, 296–307. [Google Scholar] [CrossRef]
- Liu, Z.; Yuan, L.; Weng, L.; Yang, Y. A high resolution optical satellite image dataset for ship recognition and some new baselines. International conference on pattern recognition applications and methods. SciTePress, 2017, Vol. 2, pp. 324–331.
- Yu, W.; Cheng, G.; Wang, M.; Yao, Y.; Xie, X.; Yao, X.; Han, J. MAR20: A Benchmark for Military Aircraft Recognition in Remote Sensing Images. Natl. Remote Sens. Bull. 2022. [Google Scholar] [CrossRef]
- Zhou, Y.; Yang, X.; Zhang, G.; Wang, J.; Liu, Y.; Hou, L.; Jiang, X.; Liu, X.; Yan, J.; Lyu, C. ; others. Mmrotate: A rotated object detection benchmark using pytorch. Proceedings of the 30th ACM International Conference on Multimedia, 2022, pp. 7331–7334.
- Ding, J.; Xue, N.; Long, Y.; Xia, G.S.; Lu, Q. Learning RoI transformer for oriented object detection in aerial images. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 2849–2858.
- Zhang, Y.; Wang, Y.; Zhang, N.; Li, Z.; Zhao, Z.; Gao, Y.; Chen, C.; Feng, H. RoI Fusion Strategy with Self-Attention Mechanism for Object Detection in Remote Sensing Images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2023. [Google Scholar] [CrossRef]
- Qian, X.; Zhang, N.; Wang, W. Smooth giou loss for oriented object detection in remote sensing images. Remote Sens. 2023, 15, 1259. [Google Scholar] [CrossRef]
- Han, J.; Ding, J.; Li, J.; Xia, G.S. Align deep features for oriented object detection. IEEE Trans. Geosci. Remote Sens. 2021, 60, 1–11. [Google Scholar] [CrossRef]
- Wang, Z.; Bao, C.; Cao, J.; Hao, Q. AOGC: Anchor-Free Oriented Object Detection Based on Gaussian Centerness. Remote Sens. 2023, 15, 4690. [Google Scholar] [CrossRef]
- Chen, W.; Han, B.; Yang, Z.; Gao, X. MSSDet: Multi-Scale Ship-Detection Framework in Optical Remote-Sensing Images and New Benchmark. Remote Sens. 2022, 14, 5460. [Google Scholar] [CrossRef]
- Yang, X.; Yan, J.; Feng, Z.; He, T. R3det: Refined single-stage detector with feature refinement for rotating object. Proceedings of the AAAI conference on artificial intelligence, 2021, Vol. 35, pp. 3163–3171.








| Method | Backbone | GF | VE | ETS | TS | CHI | ST | SH | HA | APL | TC | mAP |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| RoI Trans [59] | R-50 | 69.0 | 43.3 | 78.7 | 54.9 | 72.6 | 70.3 | 81.2 | 47.7 | 63.3 | 81.6 | |
| AOPG [43] | R-50 | 73.2 | 52.4 | 65.4 | 60.0 | 72.5 | 71.3 | 81.2 | 42.3 | 62.4 | 81.5 | |
| ROIF [60] | R-50 | 74.7 | 49.4 | 69.5 | 55.0 | 73.8 | 63.9 | 82.4 | 47.4 | 72.1 | 82.7 | |
| ROIF [60] | ConvNext-50 | 78.6 | 50.6 | 74.9 | 63.2 | 72.7 | 71.2 | 81.3 | 51.1 | 72.2 | 89.8 | |
| AOPG SGIoU [61] | R-50 | 79.5 | 55.9 | 72.9 | 62.6 | 77.4 | 78.3 | 89.7 | 52.6 | 69.6 | 81.5 | |
| RTMDet [18] | CSPNext-52 | 75.8 | 57.3 | 76.1 | 63.8 | 79.8 | 79.6 | 89.8 | 53.2 | 90.4 | 90.5 | |
| Ours | CSPNext-52 | 77.9 | 61.1 | 79.1 | 64.9 | 80.7 | 80.2 | 90.1 | 54.3 | 90.7 | 90.7 | |
| Method | Backbone | GTF | DA | BC | ESA | STA | APO | BF | BR | WM | OP | |
| RoI Trans [59] | R-50 | 82.7 | 26.9 | 87.5 | 68.1 | 78.2 | 37.9 | 71.8 | 40.7 | 65.5 | 55.6 | 63.87 |
| AOPG [43] | R-50 | 81.9 | 31.1 | 87.6 | 78.0 | 72.7 | 37.8 | 71.6 | 40.9 | 70.0 | 54.5 | 64.41 |
| ROIF [60] | R-50 | 84.0 | 29.2 | 82.6 | 78.1 | 80.7 | 39.0 | 72.9 | 40.8 | 67.4 | 55.5 | 65.12 |
| ROIF [60] | ConvNext-50 | 84.7 | 34.1 | 89.7 | 88.7 | 83.0 | 44.0 | 72.2 | 43.9 | 66.5 | 57.5 | 68.49 |
| AOPG SGIoU [61] | R-50 | 82.5 | 36.1 | 88.7 | 82.8 | 75.6 | 53.0 | 71.7 | 46.6 | 71.0 | 59.6 | 69.37 |
| RTMDet [18] | CSPNext-52 | 84.6 | 35.8 | 90.3 | 89.2 | 85.0 | 49.0 | 84.8 | 46.3 | 65.9 | 61.7 | 72.44 |
| Ours | CSPNext-52 | 84.8 | 42.0 | 90.5 | 89.1 | 86.6 | 53.0 | 88.5 | 50.2 | 73.2 | 62.8 | 74.51 |
| Method | Backbone | mAP (07)(%) | mAP (12)(%) |
|---|---|---|---|
| [62] | R-101 | 90.17 | 95.01 |
| AOGC [63] | R-50 | 89.80 | 95.20 |
| MSSDet [64] | R-101 | 76.60 | 95.30 |
| [7] | R-101 | 89.97 | 95.57 |
| MSSDet [64] | R-152 | 77.30 | 95.80 |
| [65] | R-101 | 89.26 | 96.01 |
| DCFPN [7] | R-101 | 89.98 | 96.12 |
| RTMDet [18] | CSPNext-52 | 89.10 | 96.51 |
| Ours | CSPNext-52 | 90.05 | 97.53 |
| Method | A1 | A2 | A3 | A4 | A5 | A6 | A7 | A8 | A9 | A10 | mAP |
|---|---|---|---|---|---|---|---|---|---|---|---|
| [57] | 82.6 | 81.6 | 86.2 | 80.8 | 76.9 | 90.0 | 84.7 | 85.7 | 88.7 | 90.8 | |
| Faster R-CNN [57] | 85.0 | 81.6 | 87.5 | 70.7 | 79.6 | 90.6 | 89.7 | 89.8 | 90.4 | 91.0 | |
| Oriented R-CNN [57] | 86.1 | 81.7 | 88.1 | 69.6 | 75.6 | 89.9 | 90.5 | 89.5 | 89.8 | 90.9 | |
| RoI Trans [57] | 85.4 | 81.5 | 87.6 | 78.3 | 80.5 | 90.5 | 90.2 | 87.6 | 87.9 | 90.9 | |
| RTMDet [18] | 85.5 | 96.0 | 94.6 | 90.9 | 86.0 | 90.9 | 95.1 | 98.7 | 90.9 | 90.9 | |
| Ours | 88.6 | 98.7 | 98.4 | 90.7 | 87.5 | 95.1 | 94.9 | 99.2 | 90.9 | 99.0 | |
| Method | A11 | A12 | A13 | A14 | A15 | A16 | A17 | A18 | A19 | A20 | |
| [57] | 81.7 | 86.1 | 69.6 | 82.3 | 47.7 | 88.1 | 90.2 | 62.0 | 83.6 | 79.8 | 81.1 |
| Faster R-CNN [57] | 85.5 | 88.1 | 63.4 | 88.3 | 42.4 | 88.9 | 90.5 | 62.2 | 78.3 | 77.7 | 81.4 |
| Oriented R-CNN [57] | 87.6 | 88.4 | 67.5 | 88.5 | 46.3 | 88.3 | 90.6 | 70.5 | 78.7 | 80.3 | 81.9 |
| RoI Trans [57] | 85.9 | 89.3 | 67.2 | 88.2 | 47.9 | 89.1 | 90.5 | 74.6 | 81.3 | 80.0 | 82.7 |
| RTMDet [18] | 82.8 | 90.7 | 88.8 | 90.1 | 84.6 | 90.5 | 90.7 | 94.8 | 86.6 | 89.4 | 90.43 |
| Ours | 89.6 | 90.7 | 89.7 | 90.3 | 89.1 | 90.5 | 90.6 | 97.6 | 87.2 | 89.9 | 92.41 |
| Baseline | Red | Orange | Purple | A1 | A2 | A3 | A4 | A5 | A6 | A7 | A8 | A9 | A10 | mAP |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ✓ | 85.5 | 96.0 | 94.6 | 90.9 | 86.0 | 90.9 | 95.1 | 98.7 | 90.9 | 90.9 | ||||
| ✓ | ✓ | 87.7 | 98.1 | 93.5 | 90.9 | 86.2 | 90.8 | 93.7 | 99.1 | 90.8 | 94.3 | |||
| ✓ | ✓ | ✓ | 86.3 | 90.8 | 98.7 | 90.6 | 87.2 | 92.2 | 95.3 | 99.3 | 90.9 | 99.7 | ||
| ✓ | ✓ | ✓ | ✓ | 88.1 | 90.7 | 97.0 | 90.8 | 86.5 | 97.7 | 95.9 | 99.3 | 90.9 | 98.6 | |
| Baseline | Red | Orange | Purple | A11 | A12 | A13 | A14 | A15 | A16 | A17 | A18 | A19 | A20 | |
| ✓ | 82.8 | 90.7 | 88.8 | 90.1 | 84.6 | 90.5 | 90.7 | 94.8 | 86.6 | 89.4 | 90.43 | |||
| ✓ | ✓ | 88.6 | 90.8 | 89.5 | 90.3 | 87.9 | 90.5 | 90.6 | 94.2 | 86.8 | 89.2 | 91.17 | ||
| ✓ | ✓ | ✓ | 85.4 | 90.4 | 89.7 | 90.5 | 83.4 | 90.5 | 90.8 | 96.0 | 90.0 | 90.3 | 91.40 | |
| ✓ | ✓ | ✓ | ✓ | 88.3 | 90.7 | 89.6 | 90.0 | 86.7 | 90.3 | 90.8 | 95.9 | 88.2 | 89.2 | 91.76 |
| Baseline | MFFN | GFCAM | A1 | A2 | A3 | A4 | A5 | A6 | A7 | A8 | A9 | A10 | mAP |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ✓ | 85.5 | 96.0 | 94.6 | 90.9 | 86.0 | 90.9 | 95.1 | 98.7 | 90.9 | 90.9 | |||
| ✓ | ✓ | 88.1 | 90.7 | 97.0 | 90.8 | 86.5 | 97.7 | 95.9 | 99.3 | 90.9 | 98.6 | ||
| ✓ | ✓ | 87.7 | 97.1 | 93.3 | 90.8 | 86.4 | 90.9 | 92.5 | 98.6 | 90.9 | 99.9 | ||
| ✓ | ✓ | ✓ | 88.6 | 98.7 | 98.4 | 90.7 | 87.5 | 95.1 | 94.9 | 99.2 | 90.9 | 99.0 | |
| Baseline | MFFN | GFCAM | A11 | A12 | A13 | A14 | A15 | A16 | A17 | A18 | A19 | A20 | |
| ✓ | 82.8 | 90.7 | 88.8 | 90.1 | 84.6 | 90.5 | 90.7 | 94.8 | 86.6 | 89.4 | 90.43 | ||
| ✓ | ✓ | 88.3 | 90.7 | 89.6 | 90.0 | 86.7 | 90.3 | 90.8 | 95.9 | 88.2 | 89.2 | 91.76 | |
| ✓ | ✓ | 87.0 | 90.8 | 89.7 | 90.1 | 82.4 | 90.6 | 90.6 | 97.0 | 89.9 | 90.1 | 91.32 | |
| ✓ | ✓ | ✓ | 89.6 | 90.7 | 89.7 | 90.3 | 89.1 | 90.5 | 90.6 | 97.6 | 87.2 | 89.9 | 92.41 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).