Submitted:
27 January 2026
Posted:
29 January 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
- We construct UAV-Flow, a benchmark dataset that fills the gap for small and non-rigid targets in water environments. Covering diverse hydrological and lighting conditions, it provides critical data support for researching algorithm robustness.
- We propose FLD-Net, a real-time detection framework robust against water surface interference. By integrating three novel mechanisms—DFEM, DCFN, and DANA—the model effectively overcomes the technical bottlenecks of deformation adaptation, feature loss, and background noise suppression.
- We implement a high-performance edge deployment scheme. Validation on embedded platforms demonstrates the system’s real-time capability and energy efficiency, offering a viable solution for low-cost, automated intelligent water monitoring.
2. Related Work
2.1. Surface Floating Debris Datasets
2.2. UAV Object Detection
3. UAV-Flow Dataset
3.1. Dataset Construction
3.2. Statistical Analysis and Characteristics
4. Methodology
4.1. Deformable Feature Extraction Module
4.2. Dynamic Cross-Scale Fusion Network
4.3. Dual-Domain Anti-Noise Attention
5. Experiments and Discussion
5.1. Experimental Setup
5.2. Ablation Studies
5.3. Comparative Experiments
5.4. Visualization Analysis
5.5. Real-time Edge Deployment and Energy Efficiency
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Bhatia, S.K.; Mehariya, S.; Bhatia, R.K.; Kumar, M.; Pugazhendhi, A.; Awasthi, M.K.; Atabani, A.; Kumar, G.; Kim, W.; Seo, S.O.; et al. Wastewater based microalgal biorefinery for bioenergy production: Progress and challenges. Science of the Total Environment 2021, 751, 141599.
- Van Sebille, E.; Wilcox, C.; Lebreton, L.; Maximenko, N.; Hardesty, D.P.; Van Franeker, J.A.; Eriksen, M.; Siegel, D.; Galgani, F.; Law, K.L. A global inventory of small floating plastic debris. Environmental Research Letters 2015, 10, 124006.
- Hong, S.; Lee, J.; Lim, S. Navigational threats by derelict fishing gear to navy ships in the Korean seas. Marine Pollution Bulletin 2017, 119, 100–105.
- Borrelle, S.B.; Ringma, J.; Law, K.L.; Monnahan, C.C.; Lebreton, L.; McGivern, A.; Murphy, E.; Jambeck, J.; Leonard, G.H.; Hilleary, M.A.; et al. Predicted growth in plastic waste exceeds efforts to mitigate plastic pollution. Science 2020, 369, 1515–1518.
- Di, J.; Xi, K.; Yang, Y. An enhanced YOLOv8 model for accurate detection of solid floating waste. Scientific Reports 2025, 15, 25015.
- Beatty, D.S.; Aoki, L.R.; Graham, O.J.; Yang, B. The future is big—and small: remote sensing enables cross-scale comparisons of microbiome dynamics and ecological consequences. Msystems 2021, 6, e01106–21.
- Lee, H.; Byeon, S.; Kim, J.H.; Shin, J.K.; Park, Y. Construction of a Real-Time Detection for Floating Plastics in a Stream Using Video Cameras and Deep Learning. Sensors 2025, 25, 2225.
- He, L.; Zhou, Y.; Yang, H.; Su, L.; Ma, J. A deep learning-based method for marine oil spill detection and its application in UAV imagery. Marine Pollution Bulletin 2026, 222, 118889.
- Chang, B.; Li, F.; Hu, Y.; Yin, H.; Feng, Z.; Zhao, L. Application of UAV remote sensing for vegetation identification: a review and meta-analysis. Frontiers in Plant Science 2025, 16, 1452053.
- Cao, Y.; Lei, R. Progress and prospect of forest fire monitoring based on the multi-source remote sensing data. National Remote Sensing Bulletin 2024, 28, 1854–1869.
- Wang, J.; Zhao, H. Improved yolov8 algorithm for water surface object detection. Sensors 2024, 24, 5059.
- Yang, Z.; Yu, X.; Dedman, S.; Rosso, M.; Zhu, J.; Yang, J.; Xia, Y.; Tian, Y.; Zhang, G.; Wang, J. UAV remote sensing applications in marine monitoring: Knowledge visualization and review. Science of The Total Environment 2022, 838, 155939.
- Song, F.; Zhang, W.; Yuan, T.; Ji, Z.; Cao, Z.; Xu, B.; Lu, L.; Zou, S. UAV quantitative remote sensing of riparian zone vegetation for river and lake health assessment: A review. Remote Sensing 2024, 16, 3560.
- Lin, T.Y.; Maire, M.; Belongie, S.; Hays, J.; Perona, P.; Ramanan, D.; Dollár, P.; Zitnick, C.L. Microsoft coco: Common objects in context. In Proceedings of the European conference on computer vision. Springer, 2014, pp. 740–755.
- Đuraš, A.; Ilioudi, A.; Wolf, B.; Palunko, I.; De Schutter, B. Seaclear Marine Debris Detection & Segmentation Dataset, 2024. Data set. [CrossRef]
- Yang, Z.; et al. River Floating Debris Dataset for UAV-based detection, 2024. [Data set]. Available: https://figshare.com/.
- Pinson, S.; Vollering, A. Floating macroplastic debris in rivers labelled dataset, 2025. Data set. [CrossRef]
- Gonçalves, G.; Andriolo, U.; Gonçalves, L.M.; Sobral, P.; Bessa, F. Beach litter survey by drones: Mini-review and discussion of a potential standardization. Environmental Pollution 2022, 315, 120370.
- Andriolo, U.; et al. Drones for litter monitoring on coasts and rivers: suitable flight altitude and image resolution. Marine Pollution Bulletin 2023, 195, 115521.
- Gallitelli, L.; Girard, P.; Andriolo, U.; Liro, M.; Suaria, G.; Martin, C.; Lusher, A.; Hancke, K.; Blettler, M.; Garcia-Garin, O.; et al. Monitoring macroplastics in aquatic and terrestrial ecosystems: Expert survey reveals visual and drone-based census as most effective techniques. Science of the Total Environment 2024, 955, 176528.
- Wu, X.; Li, W.; Hong, D.; Tao, R.; Du, Q. Deep learning for unmanned aerial vehicle-based object detection and tracking: A survey. IEEE Geoscience and Remote Sensing Magazine 2021, 10, 91–124.
- Tang, G.; Ni, J.; Zhao, Y.; Gu, Y.; Cao, W. A survey of object detection for UAVs based on deep learning. Remote Sensing 2023, 16, 149.
- Jiang, Z.; Wu, B.; Ma, L.; Zhang, H.; Lian, J. APM-YOLOv7 for small-target water-floating garbage detection based on multi-scale feature adaptive weighted fusion. Sensors 2023, 24, 50.
- Zhang, Y.; Ye, M.; Zhu, G.; Liu, Y.; Guo, P.; Yan, J. FFCA-YOLO for Small Object Detection in Remote Sensing Images. IEEE Transactions on Geoscience and Remote Sensing 2024, 62, 1–15.
- Hui, Y.; Wang, J.; Li, B. STF-YOLO: A small target detection algorithm for UAV remote sensing images based on improved SwinTransformer and class weighted classification decoupling head. Measurement 2024, 224, 113936.
- Yu, J.; Zheng, H.; Xie, L.; Zhang, L.; Yu, M.; Han, J. Enhanced YOLOv7 integrated with small target enhancement for rapid detection of objects on water surfaces. Frontiers in Neurorobotics 2023, 17, 1315251.
- Gui, S.; Song, S.; Qin, R.; Tang, Y. Remote sensing object detection in the deep learning era—a review. Remote Sensing 2024, 16, 327.
- Hu, M.; Zhang, Y.; Jiao, T.; Xue, H.; Wu, X.; Luo, J.; Han, S.; Lv, H. An Enhanced Feature-Fusion Network for Small-Scale Pedestrian Detection on Edge Devices. Sensors 2024, 24, 7308.
- Chen, H.; Wang, S.; Guo, H.; Lin, H.; Zhang, Y.; Long, Z.; Huang, H. Study of marine debris around a tourist city in East China: Implication for waste management. Science of the total environment 2019, 676, 278–289.
- Qiao, G.; Yang, M.; Wang, H. An annotated Dataset and Benchmark for Detecting Floating Debris in Inland Waters. Scientific Data 2025, 12, 385.
- Liao, Y.H.; Juang, J.G. Application of Unmanned Aerial Vehicles for Marine Trash Detection and Real-Time Monitoring System. Master’s thesis, Department of Communications, Navigation and Control Engineering, NTOU, ROC, 2021.
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016.
- Ultralytics. YOLOv8. GitHub, 2023. [Online]. Available: https://github.com/ultralytics/ultralytics.
- Jocher, G.; et al. Ultralytics YOLOv11. GitHub, 2024. [Online]. Available: https://github.com/ultralytics/ultralytics.
- Zhu, X.; Lyu, S.; Wang, X.; Zhao, Q. TPH-YOLOv5: Improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios. In Proceedings of the Proceedings of the IEEE/CVF international conference on computer vision, 2021, pp. 2778–2788.
- Zhang, Z. Drone-YOLO: An efficient neural network method for target detection in drone images. Drones 2023, 7, 526.
- Yi, H.; Liu, B.; Zhao, B.; Liu, E. Small object detection algorithm based on improved YOLOv8 for remote sensing. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 2023, 17, 1734–1747.
- Zhang, X.; Du, B.; Jia, Y.; Luo, K.; Jiang, L. MSD-YOLO11n: an improved small target detection model for high precision UAV aerial imagery. Journal of King Saud University Computer and Information Sciences 2025, 37, 1–17.
- Xu, C.; Wang, J.; Yang, W.; Yu, H.; Yu, L.; Xia, G.S. RFLA: Gaussian receptive field based label assignment for tiny object detection. In Proceedings of the European conference on computer vision. Springer, 2022, pp. 526–543.
- Xu, C.; Wang, J.; Yang, W.; Yu, H.; Yu, L.; Xia, G.S. Detecting tiny objects in aerial images: A normalized Wasserstein distance and a new benchmark. ISPRS Journal of Photogrammetry and Remote Sensing 2022, 190, 79–93.
- Ye, T.; Qin, W.; Zhao, Z.; Gao, X.; Deng, X.; Ouyang, Y. Real-time object detection network in UAV-vision based on CNN and transformer. IEEE Transactions on Instrumentation and Measurement 2023, 72, 1–13.
- Zhao, Y.; Lv, W.; Xu, S.; Wei, J.; Wang, G.; Dang, Q.; Liu, Y.; Chen, J. Detrs beat yolos on real-time object detection. In Proceedings of the Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2024, pp. 16965–16974.
- Fan, Q.; Li, Y.; Deveci, M.; Zhong, K.; Kadry, S. LUD-YOLO: A novel lightweight object detection network for unmanned aerial vehicle. Information Sciences 2025, 686, 121366.
- Zhuang, Y.; Liu, J.; Zhao, H.; Ma, L.; Fang, Z.; Li, L.; Wu, C.; Cui, W.; Liu, Z. A deep learning framework based on structured space model for detecting small objects in complex underwater environments. Communications Engineering 2025, 4, 24.
- Dai, J.; Qi, H.; Xiong, Y.; Li, Y.; Zhang, G.; Hu, H.; Wei, Y. Deformable convolutional networks. In Proceedings of the Proceedings of the IEEE international conference on computer vision, 2017, pp. 764–773.
- Zhu, X.; Hu, H.; Lin, S.; Dai, J. Deformable convnets v2: More deformable, better results. In Proceedings of the Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019, pp. 9308–9316.
- Xiao, Y.; Su, X.; Yuan, Q.; Liu, D.; Shen, H.; Zhang, L. Satellite video super-resolution via multiscale deformable convolution alignment and temporal grouping projection. IEEE Transactions on Geoscience and Remote Sensing 2021, 60, 1–19.
- Liu, H.; Zhou, X.; Wang, C.; Chen, S.; Kong, H. Fourier-deformable convolution network for road segmentation from remote sensing images. IEEE Transactions on Geoscience and Remote Sensing 2024, 62, 1–17.
- Lin, T.Y.; Dollár, P.; Girshick, R.; He, K.; Hariharan, B.; Belongie, S. Feature pyramid networks for object detection. In Proceedings of the Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 2117–2125.
- Liu, W.; Lu, H.; Fu, H.; Cao, Z. Learning to upsample by learning to sample. In Proceedings of the Proceedings of the IEEE/CVF international conference on computer vision, 2023, pp. 6027–6037.
- Jiang, Y.; Tan, Z.; Wang, J.; Sun, X.; Lin, M.; Li, H. GiraffeDet: A heavy-neck paradigm for object detection. arXiv preprint arXiv:2202.04256 2022.
- Xu, X.; Jiang, Y.; Chen, W.; Huang, Y.; Zhang, Y.; Sun, X. Damo-yolo: A report on real-time object detection design. arXiv preprint arXiv:2211.15444 2022.
- Wang, J.; Chen, K.; Xu, R.; Liu, Z.; Loy, C.C.; Lin, D. Carafe: Content-aware reassembly of features. In Proceedings of the Proceedings of the IEEE/CVF international conference on computer vision, 2019, pp. 3007–3016.
- Hu, J.; Shen, L.; Sun, G. Squeeze-and-excitation networks. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018, pp. 7132–7141.
- Woo, S.; Park, J.; Lee, J.Y.; Kweon, I.S. Cbam: Convolutional block attention module. In Proceedings of the Proceedings of the European conference on computer vision (ECCV), 2018, pp. 3–19.
- Hou, Q.; Zhou, D.; Feng, J. Coordinate attention for efficient mobile network design. In Proceedings of the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 13713–13722.
- Jocher, G.; et al. YOLOv5. GitHub, 2020. [Online]. Available: https://github.com/ultralytics/yolov5.
- Zhang, H.; Liu, K.; Gan, Z.; Zhu, G.N. UAV-DETR: efficient end-to-end object detection for unmanned aerial vehicle imagery. arXiv preprint arXiv:2501.01855 2025.










| Dataset | Platform | Environment | Scale | Small Obj. | Main Limitation |
|---|---|---|---|---|---|
| Floater | Shore Cams | Multiple Rivers | 3,000 | 16.3% | Static View |
| FMPD | River Monitor | Multiple Rivers | 2,229 | 36.5% | Static View |
| River Debris | DJI Mini 2 | Single River | 840 | - | Small Scale |
| HAIDA | UAV | Coastal/Port | 324 | 29.6% | Small Scale |
| UAV-Flow | DJI M350 | River/Lake/Flat | 4,593 | 78.9% | Limited Geo. Range |
| Setting | P (%) | R (%) | mAP50 (%) | Params (M) |
|---|---|---|---|---|
| Baseline | 77.85 | 66.18 | 71.96 | 9.40 |
| P2, P3 | 78.14 | 68.55 | 72.78 | 9.42 (+0.02) |
| P4, P5 (Ours) | 78.88 | 67.99 | 73.51 | 9.50 (+0.10) |
| All | 77.51 | 68.53 | 73.32 | 9.52 (+0.12) |
| Adjustment | P (%) | R (%) | mAP50 (%) | Params (M) |
|---|---|---|---|---|
| Baseline | 77.85 | 66.18 | 71.96 | 9.40 |
| + GFPN | 77.78 | 70.53 | 73.58 | 13.72 |
| + GFPN-Scale | 79.89 | 68.42 | 73.39 | 9.66 |
| + DySample(Ours) | 79.32 | 75.04 | 77.69 | 9.67 |
| DFEM | DCFN | DANA | P (%) | R (%) | mAP50 (%) |
|---|---|---|---|---|---|
| 77.85 | 66.18 | 71.96 | |||
| ✓ | 79.31 | 67.76 | 73.44 | ||
| ✓ | 79.32 | 75.04 | 77.69 | ||
| ✓ | 79.41 | 75.88 | 78.95 | ||
| ✓ | ✓ | 78.48 | 75.53 | 78.49 | |
| ✓ | ✓ | 80.66 | 75.00 | 78.71 | |
| ✓ | ✓ | 79.71 | 76.29 | 79.19 | |
| ✓ | ✓ | ✓ | 80.67 | 77.84 | 80.47 |
| Method | P (%) | R (%) | mAP50 (%) | Params (M) | FLOPs (G) | FPS |
|---|---|---|---|---|---|---|
| YOLOv5s [57] | 78.33 | 65.94 | 71.35 | 7.8 | 18.7 | 323.49 |
| YOLOv8s [33] | 78.09 | 67.40 | 72.13 | 9.8 | 23.4 | 309.36 |
| YOLOv11s [34] | 77.85 | 66.18 | 71.96 | 9.4 | 21.3 | 289.91 |
| TPH-YOLOv5 [35] | 78.57 | 69.64 | 75.73 | 45.36 | 245.1 | 8.22 |
| Drone-YOLO [36] | 78.07 | 69.11 | 74.10 | 34.65 | 7.9 | 12.4 |
| FFCA-YOLO [24] | 80.39 | 73.39 | 75.62 | 7.1 | 51.3 | 53.37 |
| RT-DETR [42] | 82.14 | 73.36 | 77.29 | 41.9 | 125.6 | 80.07 |
| UAV-DETR [58] | 82.72 | 74.91 | 79.75 | 44.6 | 161.4 | 35.09 |
| FLD-Net (Ours) | 80.67 | 77.84 | 80.47 | 9.9 | 25.1 | 156.72 |
| Method | HAIDA Trash Dataset | FMPD Dataset | ||||
|---|---|---|---|---|---|---|
| P (%) | R (%) | mAP50 (%) | P (%) | R (%) | mAP50 (%) | |
| YOLOv8 [33] | 69.2 | 67.3 | 71.2 | 37.30 | 43.23 | 33.09 |
| YOLOv11 [34] | 55.5 | 63.5 | 64.9 | 38.17 | 49.30 | 37.00 |
| RT-DETR [42] | 62.9 | 67.3 | 67.0 | 40.35 | 25.23 | 26.77 |
| FLD-Net(Ours) | 74.0 | 66.4 | 73.1 | 41.31 | 49.50 | 39.13 |
| Model | TensorRT | mAP50 (%) | FPS | Power (W) | Efficiency (FPS/W) |
|---|---|---|---|---|---|
| YOLOv11s | 71.96 | 54.0 | 14.2 | 3.80 | |
| YOLOv11s | ✓ | 71.57 | 83.2 | 14.2 | 5.86 |
| FLD-Net | 80.47 | 44.6 | 14.5 | 3.07 | |
| FLD-Net | ✓ | 80.29 | 69.6 | 14.5 | 4.80 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).