Submitted:
22 August 2025
Posted:
25 August 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Methodology
2.1. ADG-YOLO
2.1.1. Basic YOLOv11 Algorithm
2.1.2. C3Ghost
2.1.3. ADown
2.1.4. Proposed ADG-YOLO
2.2. Model Conversion and Edge Deployment
2.3. Target Monitoring Based on YOLOv11 Detection and EKF Tracking
2.4. Monocular Ranging for UAVs Using Similar Triangles
3. Experiment
3.1. Dataset
3.2. Experimental Environment and Experimental Parametes
3.3. Evaluation Metrics
3.4. Model Performance Analysis
4. Target Distance Estimation Experiment
4.1. Experimental Setup
4.2. Distance Measurement for UAV Targets
5. Discussion
6. Conclusions
References
- Lin Y H, Joubert D A, Kaeser S, et al. Field deployment of Wolbachia-infected Aedes aegypti using uncrewed aerial vehicle. Science Robotics 2024, 9, eadk7913. [Google Scholar] [CrossRef]
- Xin Zhou, et al. Swarm of micro flying robots in the wild. Science Robotics 2022, 7, eabm5954. [Google Scholar] [CrossRef]
- Han J, Yan Y, Zhang B. Towards Efficient Multi-UAV Air Combat: An Intention Inference and Sparse Transmission Based Multi-Agent Reinforcement Learning Algorithm. IEEE Transactions on Artificial Intelligence 2025.
- Karimov C, Y. THE ROLE OF UNMANNED AIRCRAFT VEHİCLES IN THE RUSSIAN-UKRAINIAN WAR. Endless light in science 2025, (30 апрель ELB), 83-89.
- Wennerholm, D. Above the trenches: Russian military lessons learned about drone warfare from Ukraine. 2025.
- Tang Z, Ma H, Qu Y, et al. UAV Detection with Passive Radar: Algorithms, Applications, and Challenges. Drones 2025, 9, 76. [Google Scholar] [CrossRef]
- Seidaliyeva U, Ilipbayeva L, Utebayeva D, et al. LiDAR Technology for UAV Detection: From Fundamentals and Operational Principles to Advanced Detection and Classification Techniques. Sensors 2025, 25, 2757. [Google Scholar] [CrossRef]
- Qiu Z, Lu Y, Qiu Z. Review of ultrasonic ranging methods and their current challenges. Micromachines 2022, 13, 520. [Google Scholar] [CrossRef]
- Rahmaniar W, Wang W J, Caesarendra W, et al. Distance measurement of unmanned aerial vehicles using vision-based systems in unknown environments. Electronics 2021, 10, 1647. [Google Scholar] [CrossRef]
- Tian X, Liu R, Wang Z, et al. High quality 3D reconstruction based on fusion of polarization imaging and binocular stereo vision. Information Fusion 2022, 77, 19–28. [Google Scholar] [CrossRef]
- Tang Y, Zhou H, Wang H, et al. Fruit detection and positioning technology for a Camellia oleifera C. Abel orchard based on improved YOLOv4-tiny model and binocular stereo vision. Expert systems with applications 2023, 211, 118573. [Google Scholar] [CrossRef]
- Bao D, Wang P. Vehicle distance detection based on monocular vision [C]//2016 International Conference on Progress in Informatics and Computing (PIC). IEEE, 2016: 187-191.
- Ali A A, Hussein H A. Distance estimation and vehicle position detection based on monocular camera [C]//2016 Al-Sadeq International Conference on Multidisciplinary in IT and Communication Science and Applications (AIC-MITCSA). IEEE, 2016: 1-4.
- Liu F, Shen C, Lin G. Deep convolutional neural fields for depth estimation from a single image [C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2015: 5162-5170.
- Li J, Klein R, Yao A. A two-streamed network for estimating fine-scaled depth maps from single rgb images [C]//Proceedings of the IEEE international conference on computer vision. 2017: 3372-3380.
- Eigen D, Fergus R. Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture [C]//Proceedings of the IEEE international conference on computer vision. 2015: 2650-2658.
- Eigen D, Puhrsch C, Fergus R. Depth map prediction from a single image using a multi-scale deep network. Advances in neural information processing systems 2014, 27.
- Jiao J, Cao Y, Song Y, et al. Look deeper into depth: Monocular depth estimation with semantic booster and attention-driven loss [C]//Proceedings of the European conference on computer vision (ECCV). 2018: 53-69.
- Zhe T, Huang L, Wu Q, et al. Inter-vehicle distance estimation method based on monocular vision using 3D detection. IEEE transactions on vehicular technology 2020, 69, 4907–4919. [Google Scholar] [CrossRef]
- Mallot H A, Bülthoff H H, Little J J, et al. Inverse perspective mapping simplifies optical flow computation and obstacle detection. Biological cybernetics 1991, 64, 177–185. [Google Scholar] [CrossRef]
- Tuohy S, O'Cualain D, Jones E, et al. Distance determination for an automobile environment using inverse perspective mapping in OpenCV [C]//IET Irish signals and systems conference (ISSC 2010). IET, 2010: 100-105.
- Wongsaree P, Sinchai S, Wardkein P, et al. Distance detection technique using enhancing inverse perspective mapping [C]//2018 3rd International Conference on Computer and Communication Systems (ICCCS). IEEE, 2018: 217-221.
- Huang L, Zhe T, Wu J, et al. Robust inter-vehicle distance estimation method based on monocular vision. IEEE Access 2019, 7, 46059–46070. [Google Scholar] [CrossRef]
- Qi S H, Li J, Sun Z P, et al. Distance estimation of monocular based on vehicle pose information [C]//Journal of Physics: Conference Series. IOP Publishing 2019, 1168, 032040. [Google Scholar]
- Jiafa M, Wei H, Weiguo S. Target distance measurement method using monocular vision. IET Image Processing 2020, 14, 3181–3187. [Google Scholar] [CrossRef]
- Yang R, Yu S, Yao Q, et al. Vehicle Distance Measurement Method of Two-Way Two-Lane Roads Based on Monocular Vision. Applied Sciences 2023, 13, 3468. [Google Scholar] [CrossRef]
- Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation [C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2014: 580-587.
- Girshick, R. Fast r-cnn [C]//Proceedings of the IEEE international conference on computer vision. 2015: 1440-1448.
- Ren S, He K, Girshick R, et al. Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in neural information processing systems, 2015, 28.
- Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, real-time object detection [C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 779-788.
- Redmon J, Farhadi A. YOLO9000: better, faster, stronger [C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 7263-7271.
- Redmon J, Farhadi A. Yolov3: An incremental improvement. arXiv, arXiv:1804.02767.
- Bochkovskiy A, Wang C Y, Liao H Y M. Yolov4: Optimal speed and accuracy of object detection. arXiv, arXiv:2004.10934.
- Wu W, Liu H, Li L, et al. Application of local fully Convolutional Neural Network combined with YOLO v5 algorithm in small target detection of remote sensing image. PloS one 2021, 16, e0259283. [Google Scholar]
- Li C, Li L, Jiang H, et al. YOLOv6: A single-stage object detection framework for industrial applications. arXiv 2022, arXiv:2209.02976.
- Wang C Y, Bochkovskiy A, Liao H Y M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors [C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2023: 7464-7475.
- Sohan M, Sai Ram T, Rami Reddy C V. A review on yolov8 and its advancements [C]//International Conference on Data Intelligence and Cognitive Informatics. Springer, Singapore, 2024: 529-545.
- Wang C Y, Yeh I H, Mark Liao H Y. Yolov9: Learning what you want to learn using programmable gradient information [C]//European conference on computer vision. Cham: Springer Nature Switzerland, 2024: 1-21.
- Wang A, Chen H, Liu L, et al. Yolov10: Real-time end-to-end object detection. Advances in Neural Information Processing Systems 2024, 37, 107984–108011. [Google Scholar]
- Khanam R, Hussain M. Yolov11: An overview of the key architectural enhancements. arXiv 2024, arXiv:2410.17725.
- Tian Y, Ye Q, Doermann D. Yolov12: Attention-centric real-time object detectors. arXiv 2025, arXiv:2502.12524.
- Cheng Q, Wang Y, He W, et al. Lightweight air-to-air unmanned aerial vehicle target detection model. Scientific Reports 2024, 14, 2609. [Google Scholar]
- Su J, Qin Y, Jia Z, et al. MPE-YOLO: enhanced small target detection in aerial imaging. Scientific Reports 2024, 14, 17799. [Google Scholar] [CrossRef]
- Wang C, Han Y, Yang C, et al. CF-YOLO for small target detection in drone imagery based on YOLOv11 algorithm. Scientific Reports 2025, 15, 1–18. [Google Scholar] [CrossRef]
- Zhou S, Yang L, Liu H, et al. Improved YOLO for long range detection of small drones. Scientific Reports 2025, 15, 12280. [Google Scholar] [CrossRef]
- Kanjalkar P, Kinhikar S, Zagade A, et al. Intelligent Surveillance Tower for Detection of the Drone from the Other Aerial Objects Using Deep Learning [C]//International Conference on Information Science and Applications. Singapore: Springer Nature Singapore, 2023: 39-51.
- Han K, Wang Y, Xu C, et al. GhostNets on heterogeneous devices via cheap operations. International Journal of Computer Vision 2022, 130, 1050–1069. [Google Scholar] [CrossRef]
- Ji C L, Yu T, Gao P, et al. Yolo-tla: an efficient and lightweight small object detection model based on YOLOv5. Journal of Real-Time Image Processing 2024, 21, 141. [Google Scholar] [CrossRef]
- Fang S, Chen C, Li Z, et al. YOLO-ADual: A Lightweight Traffic Sign Detection Model for a Mobile Driving System. World Electric Vehicle Journal 2024, 15, 323. [Google Scholar] [CrossRef]
















| Subset | Total Images | Training Set | Testing Set | Drone Models | Annotation Standard |
|---|---|---|---|---|---|
| Custom-Target | 2670 | 2136 | 534 | DJI 3TD / NEO / DWI-S811 | drone1/drone2/dro-ne3 labels |
| Generalization | 2664 | 2363 | 301 | Multi-brand quad/hexa-rotor | drone label |
| Total | 5334 | 4499 | 835 | - | YOLO format |
| Environment | Parameters |
|---|---|
| CPU | Intel(R) Xeon(R) Platinum 8358P |
| GPU | RTX 3090 |
| GPU memory size | 90GB |
| Operating system | ubuntu18.04 |
| Language | Python 3.8 |
| Frame | PyTorch 1.8.1 |
| CUDA version | CUDA 11.1 |
| Parameters | Setup |
|---|---|
| Epochs | 500 |
| Input image size | 640×640 |
| Batch size | 16 |
| Optimizer | SGD |
| Initial learning rate | 0.01 |
| Model | Params(M) | GFLOPs(G) | FPS | ||
| YOLOv5s | 7.02 | 15.8 | 98.2% | 84.2% | 15 |
| YOLOv8n | 3.00 | 8.1 | 98.2% | 84.2% | 12 |
| YOLOv11n | 2.58 | 6.3 | 98.2% | 85.3% | 10 |
| ADG-YOLO | 1.77 | 5.7 | 98.4% | 85.2% | 27 |
| Experiment ID | |||
|---|---|---|---|
| 1 | 0.5 | 0.52 | 4% |
| 2 | 1 | 0.95 | 5% |
| 3 | 1.5 | 1.64 | 9.3% |
| 4 | 2 | 2.13 | 6.5% |
| 5 | 2.5 | 2.36 | 5.6% |
| 6 | 3 | 3.05 | 1.7% |
| 7 | 3.5 | 3.61 | 3.1% |
| 8 | 4 | 3.98 | 0.5% |
| 9 | 4.5 | 4.36 | 3.1% |
| 10 | 5 | 4.90 | 2% |
| Experiment ID | focal length(mm) | |||
|---|---|---|---|---|
| 1 | 12 | 2 | 2.15 | 7.50% |
| 2 | 12 | 3 | 3.37 | 12.33% |
| 3 | 12 | 4 | 4.28 | 7.00% |
| 4 | 12 | 5 | 4.99 | 0.20% |
| 5 | 12 | 6 | 5.85 | 2.50% |
| 6 | 12 | 7 | 7.58 | 8.29% |
| 7 | 12 | 8 | 8.07 | 0.88% |
| 8 | 12 | 9 | 8.81 | 2.11% |
| 9 | 16 | 11 | 10.59 | 3.73% |
| 10 | 16 | 12 | 11.62 | 3.17% |
| 11 | 16 | 13 | 13.01 | 0.08% |
| 12 | 16 | 14 | 13.73 | 1.93% |
| 13 | 16 | 15 | 14.79 | 1.40% |
| 14 | 16 | 16 | 16.35 | 2.19% |
| 15 | 16 | 17 | 16.87 | 0.76% |
| 16 | 16 | 18 | 18.02 | 0.11% |
| 17 | 16 | 19 | 18.33 | 3.53% |
| 18 | 25 | 21 | 21.06 | 0.29% |
| 19 | 25 | 22 | 22.36 | 1.64% |
| 20 | 25 | 23 | 22.59 | 1.78% |
| 21 | 25 | 24 | 23.66 | 1.42% |
| 22 | 25 | 25 | 25.39 | 1.56% |
| 23 | 25 | 26 | 26.78 | 3.00% |
| 24 | 25 | 27 | 27.66 | 2.44% |
| 25 | 25 | 28 | 28.21 | 0.75% |
| 26 | 25 | 29 | 29.33 | 1.14% |
| 27 | 35 | 31 | 30.52 | 1.55% |
| 28 | 35 | 32 | 31.78 | 0.69% |
| 29 | 35 | 33 | 33.26 | 0.79% |
| 30 | 35 | 34 | 33.70 | 0.88% |
| 31 | 35 | 35 | 34.96 | 0.11% |
| 32 | 35 | 36 | 35.78 | 0.61% |
| 33 | 35 | 37 | 38.21 | 3.27% |
| 34 | 35 | 38 | 38.62 | 1.63% |
| 35 | 35 | 39 | 39.45 | 1.15% |
| 36 | 50 | 41 | 40.73 | 0.66% |
| 37 | 50 | 42 | 42.56 | 1.33% |
| 38 | 50 | 43 | 43.34 | 0.79% |
| 39 | 50 | 44 | 43.97 | 0.07% |
| 40 | 50 | 45 | 45.26 | 0.58% |
| 41 | 50 | 46 | 45.69 | 0.67% |
| 42 | 50 | 47 | 46.91 | 0.19% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).