Submitted:
12 January 2024
Posted:
12 January 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
- The presence of multiple markers indicating the landing pad.
- The presence of obstacles near the marker.
- The inability to detect the designated visual marker or the absence of the marker altogether.
- Suggest an algorithm for safe autonomous landing of quadrotor UAVs.
- Introduction of highly likely but undesirable scenarios during vision-based autonomous landing of a quadrotor UAV and work around to deal with those scenarios.
- Integration of advanced algorithms like YOLOv5, DeepSORT, PID control, and Euclidean distance transform for better robustness of the overall system.
- Designing initial tests and simulation environment for the initial testing of the system, prior to real-life tests.
- Build and test a full-fledged hardware system with safe autonomous landing capabilities in real-life scenarios.

2. Related Works
2.1. Autonomous Landing
2.2. Obstacle Avoidance
2.3. Object Detection
3. Methods
3.1. You Only Look Once version 5 (YOLOv5)
3.2. Simple Online and Real-time Tracking with a Deep metric association (Deep SORT)
3.3. Meters per pixel calculation
3.4. Proportional Integral Derivative (PID) Controller

3.5. Finding Safe Alternate Landing Space
3.6. Undesirable Scenarios and Their Solutions
- False detection of landing pad: Object detection techniques may not always be reliable and sometimes there may be instances where false detections are encountered. The system is designed such that it only considers the detections of the landing pad where the confidence score is above 50 %, the probability of failure of the system due to such false detections are reduced.
- Absence of a marker denoting landing pad: There are possibilities that the visual marker denoting the landing pad is not detected by the object detection algorithm. This could be caused by the reasons such as presence of the landing pad in an area out of field of view of the camera, obstruction of the landing pad, absence of the helipad altogether or sometimes even the inability of the object detection algorithm to detect the visual marker. In any of such scenarios, the designed system is capable to land in a safe landing spot. After reaching the end of the mission, in case the landing pad is not recognized, the UAV is elevated slowly to 5m higher than the altitude at the end of the mission. If the landing pad is recognized at any point while raising the altitude of the UAV then the UAV proceeds towards landing in the landing pad, otherwise a nearby alternate safe landing spot is considered based on the algorithm described in section 3.5.
- Presence of multiple landing pads: The object detection algorithm can also sometimes detect two landing pads in a single frame due to actual presence of such landing pads or due to some false detection. Our system only considers only the landing detection with the highest confidence and the one closest to the location at the end of the mission. Additionally, landing pad choosen are also kept in track using the Deep Sort algorithm.
- Presence of obstacles nearby the landing target: There are possibilities of presence of obstacles nearby the landing pad, which may move or remain stationary during the landing process. Eitherway, our system can proceed with the safe autonomous landing. Upon detection of obstacles nearby landing spot the system will sound a buzzer so as to notify that the drone is proceeding with landing. While sounding the buzzer, the system waits for 10 seconds so to observe whether or not the landing pad is devoid of any obstacles. If the landing pad is cleared within 10 seconds then the system proceeds with landing. Otherwise, the system proceeds to land in an alternate safe landing spot as per the algorithm discussed in section 3.5.
3.7. Algorithm for Vision-based Autonomous Landing with Adaption to Perform Safe Landing During Undesirable Scenarios
| Algorithm 1 Algorithm to save information regarding the landing pad, obstacles and the alternate safe landing space |
|
Input: Queue storing captured camera frames ’q_frames’; YOLOv5 model loaded with pre-trained weights ’yolo’
Output: list of the landing pad detections with their position and confidence scores ’helipad’; list of obstacle detections with their position and confidence scores ’obstacles’; position of the largest open space available ’safe’
|
| Algorithm 2 Algorithm for safe autonomous landing |
|
Input: list of obstacles information ’obstacles’; list of landing pad information ’helipad’; information of largest open space available ’safe’
|
4. Setups and Experiments
4.1. Test for accuracy of pixels to meter conversion
4.2. Selection of Object Detection Model
| Method | P(%) | R(%) | (%) | mAP(%) | FPS |
|---|---|---|---|---|---|
| YOLOv5n | 0.36189 | 0.28197 | 0.26161 | 0.13335 | 12 |
| YOLOv5s | 0.45251 | 0.3377 | 0.33179 | 0.18063 | 12 |
| YOLOv5m | 0.50072 | 0.37865 | 0.37837 | 0.21837 | 10 |
| YOLOv5l | 0.51648 | 0.39725 | 0.40004 | 0.23653 | 8 |
| YOLOv5x | 0.57061 | 0.39677 | 0.41358 | 0.24901 | 6 |
| TPH-YOLOv5 | 0.66588 | 0.54958 | 0.59177 | 0.38152 | 5 |
| CMPH-YOLOv5 | 0.67406 | 0.55746 | 0.60015 | 0.38612 | 5 |
4.3. Details of YOLOv5 training
| ine S.N. | Image from UAV | Measurement taken | Altitude | Measured Distance | Calculated Distance | Error |
|---|---|---|---|---|---|---|
| 1 | ![]() |
![]() |
20.0 | 4.3 | 4.5 | 0.2 |
| 2 | ![]() |
![]() |
20.0 | 3.2 | 3.2 | 0.0 |
| 3 | ![]() |
![]() |
20.1 | 2.5 | 2.7 | 0.2 |
| 4 | ![]() |
![]() |
20.0 | 2.6 | 2.6 | 0.0 |
| 5 | ![]() |
![]() |
20.0 | 2.0 | 2.0 | 0.0 |
| 6 | ![]() |
![]() |
20.0 | 2.3 | 2.2 | 0.1 |
| 7 | ![]() |
![]() |
20.1 | 2.4 | 2.5 | 0.1 |
| 8 | ![]() |
![]() |
20.1 | 2.2 | 2.2 | 0.0 |
| 9 | ![]() |
![]() |
20.0 | 2.7 | 2.6 | 0.1 |
| 10 | ![]() |
![]() |
20.0 | 3.0 | 3.0 | 0.0 |
4.3.1. Dataset
4.3.2. Evaluation Metrics
4.3.3. Results of training
4.4. Simulation Setup
| Algorithm 3 Algorithm for simulating the camera frames based on SITL information |
|
Input: location of SITL UAV in terms of metres from a reference point ’vehicle_location’; attitude of SITL UAV ’vehicle_attitude’;landing pad image ’target’
|
4.5. Hardware Setup
- Frame: Quadcopter
- Flight Controller: Pixhawk 5x
- GPS: Pixhawk4 GPS module
- Telemetry radio: HoIybro 433MHz 100mW
- Propellers: 22-inch, pitch: 11
- Motors: MN605-S kv170
- Controller: Taranis x9d
- FRSky x8r

5. Results of real-life implementation of autonomous landing in different scenarios
- The first scenario is the most optimum landing condition when the landing pad is clearly visible once the mission is completed and there are no obstacles nearby the landing pad throughout the landing process. The results for this have been shown in Figure 8.
- The second scenario is created by not placing any visual markers that denote the landing target. The results for this have been shown in Figure 9.
- The third scenario is created by placing a plant and a human near the landing pad after it has been a while since the landing has started. The results for this have been shown in Figure 10.
- The final one is created by moving the position of the visual markers denoting the landing pad multiple times during landing procedure. The results for this have been shown in Figure 11.
6. Discussion and Conclusion
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| MDPI | Multidisciplinary Digital Publishing Institute |
| DOAJ | Directory of open access journals |
| TLA | Three letter acronym |
| LD | Linear dichroism |
References
- Mohsan, S.A.H.; Othman, N.Q.H.; Li, Y.; Alsharif, M.H.; Khan, M.A. Unmanned aerial vehicles (UAVs): Practical aspects, applications, open challenges, security issues, and future trends. Intelligent Service Robotics 2023, 16, 109–137. [Google Scholar] [CrossRef] [PubMed]
- Hassanalian, M.; Abdelkefi, A. Classifications, applications, and design challenges of drones: A review. Progress in Aerospace Sciences 2017, 91, 99–131. [Google Scholar] [CrossRef]
- Chen, H.; Wang, X.m.; Li, Y. A Survey of Autonomous Control for UAV. 2009 International Conference on Artificial Intelligence and Computational Intelligence, 2009, Vol. 2, pp. 267–271. [CrossRef]
- Tsai, A.C.; Gibbens, P.W.; Stone, R.H. Terminal phase vision-based target recognition and 3D pose estimation for a tail-sitter, vertical takeoff and landing unmanned air vehicle. Advances in Image and Video Technology: First Pacific Rim Symposium, PSIVT 2006, Hsinchu, Taiwan, December 10-13, 2006. Proceedings 1. Springer, 2006, pp. 672–681. [CrossRef]
- Saripalli, S.; Montgomery, J.F.; Sukhatme, G.S. Vision-based autonomous landing of an unmanned aerial vehicle. Proceedings 2002 IEEE international conference on robotics and automation (Cat. No. 02CH37292). IEEE, 2002, Vol. 3, pp. 2799–2804. [CrossRef]
- Fan, Y.; Haiqing, S.; Hong, W. A vision-based algorithm for landing unmanned aerial vehicles. 2008 International Conference on Computer Science and Software Engineering. IEEE, 2008, Vol. 1, pp. 993–996. [CrossRef]
- Wenzel, K.E.; Masselli, A.; Zell, A. Automatic take off, tracking and landing of a miniature UAV on a moving carrier vehicle. Journal of intelligent & robotic systems 2011, 61, 221–238. [Google Scholar] [CrossRef]
- Herissé, B.; Hamel, T.; Mahony, R.; Russotto, F.X. Landing a VTOL unmanned aerial vehicle on a moving platform using optical flow. IEEE Transactions on robotics 2011, 28, 77–89. [Google Scholar] [CrossRef]
- Lee, D.; Ryan, T.; Kim, H.J. Autonomous landing of a VTOL UAV on a moving platform using image-based visual servoing. 2012 IEEE international conference on robotics and automation. IEEE, 2012, pp. 971–976. [CrossRef]
- Kim, J.; Jung, Y.; Lee, D.; Shim, D.H. Outdoor autonomous landing on a moving platform for quadrotors using an omnidirectional camera. 2014 International Conference on Unmanned Aircraft Systems (ICUAS). IEEE, 2014, pp. 1243–1252. [CrossRef]
- Baca, T.; Stepan, P.; Saska, M. Autonomous landing on a moving car with unmanned aerial vehicle. 2017 European conference on mobile robots (ECMR). IEEE, 2017, pp. 1–6. [CrossRef]
- Araar, O.; Aouf, N.; Vitanov, I. Vision based autonomous landing of multirotor UAV on moving platform. Journal of Intelligent & Robotic Systems 2017, 85, 369–384. [Google Scholar] [CrossRef]
- Salagame, A.; Govindraj, S.; Omkar, S. Precision Landing of a UAV on a Moving Platform for Outdoor Applications. arXiv preprint arXiv:2209.14436 2022. [Google Scholar] [CrossRef]
- Chen, J.; Miao, X.; Jiang, H.; Chen, J.; Liu, X. Identification of autonomous landing sign for unmanned aerial vehicle based on faster regions with convolutional neural network. 2017 Chinese Automation Congress (CAC). IEEE, 2017, pp. 2109–2114. [CrossRef]
- Nguyen, P.H.; Arsalan, M.; Koo, J.H.; Naqvi, R.A.; Truong, N.Q.; Park, K.R. LightDenseYOLO: A fast and accurate marker tracker for autonomous UAV landing by visible light camera sensor on drone. Sensors 2018, 18, 1703. [Google Scholar] [CrossRef] [PubMed]
- Truong, N.Q.; Lee, Y.W.; Owais, M.; Nguyen, D.T.; Batchuluun, G.; Pham, T.D.; Park, K.R. SlimDeblurGAN-based motion deblurring and marker detection for autonomous drone landing. Sensors 2020, 20, 3918. [Google Scholar] [CrossRef] [PubMed]
- Mori, T.; Scherer, S. First results in detecting and avoiding frontal obstacles from a monocular camera for micro unmanned aerial vehicles. 2013 IEEE International Conference on Robotics and Automation. IEEE, 2013, pp. 1750–1757. [CrossRef]
- Souhila, K.; Karim, A. Optical flow based robot obstacle avoidance. International Journal of Advanced Robotic Systems 2007, 4, 2. [Google Scholar] [CrossRef]
- Muratet, L.; Doncieux, S.; Briere, Y.; Meyer, J.A. A contribution to vision-based autonomous helicopter flight in urban environments. Robotics and Autonomous Systems 2005, 50, 195–209. [Google Scholar] [CrossRef]
- Peng, X.Z.; Lin, H.Y.; Dai, J.M. Path planning and obstacle avoidance for vision guided quadrotor UAV navigation. 2016 12th IEEE International Conference on Control and Automation (ICCA). IEEE, 2016, pp. 984–989. [CrossRef]
- Dalal, N.; Triggs, B. Histograms of oriented gradients for human detection. 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05). Ieee, 2005, Vol. 1, pp. 886–893. [CrossRef]
- Lowe, D.G. Distinctive image features from scale-invariant keypoints. International journal of computer vision 2004, 60, 91–110. [Google Scholar] [CrossRef]
- Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition, 2014, pp. 580–587.
- He, K.; Zhang, X.; Ren, S.; Sun, J. Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE transactions on pattern analysis and machine intelligence 2015, 37, 1904–1916. [Google Scholar] [CrossRef] [PubMed]
- Girshick, R. Fast r-cnn. Proceedings of the IEEE international conference on computer vision, 2015, pp. 1440–1448. [CrossRef]
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.; Fu, C.Y.; Berg, A.C. Ssd: Single shot multibox detector. Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I 14. Springer, 2016, pp. 21–37. [CrossRef]
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You only look once: Unified, real-time object detection. Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 779–788. [CrossRef]
- Redmon, J.; Farhadi, A. YOLO9000: better, faster, stronger. Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 7263–7271. [CrossRef]
- Redmon, J.; Farhadi, A. Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767 2018. [Google Scholar] [CrossRef]
- Bochkovskiy, A.; Wang, C.Y.; Liao, H.Y.M. Yolov4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 2020. [Google Scholar] [CrossRef]
- Jocher, G.; Alex, S.; Borovec, J.; NanoCode012.; Chauarasia, A.; TaoXie.; Changyu, L.; V, A.; Laughing.; tkianai.; yxNONG.; Hogan, A.; lorenzomammana.; AlexWang1900.; Hajek, Jan amd Diaconu, L.; Marc.; Kwon, Y.; oleg.; wanghaoyang0106.; Defretin, Y.; Lohia, A.; ml5ah.; Milanko, B.; Fineran, B.; Khromov, D.; Yiwei, D.; Doug.; Durgesh.; Ingham, F. Yolov5. opensourcecode.
- Tian, Z.; Shen, C.; Chen, H.; He, T. Fcos: Fully convolutional one-stage object detection. Proceedings of the IEEE/CVF international conference on computer vision, 2019, pp. 9627–9636. [CrossRef]
- Zhu, C.; He, Y.; Savvides, M. Feature selective anchor-free module for single-shot object detection. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019, pp. 840–849. [CrossRef]
- Ge, Z.; Liu, S.; Wang, F.; Li, Z.; Sun, J. Yolox: Exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430 2021. [Google Scholar] [CrossRef]
- Ozge Unel, F.; Ozkalayci, B.O.; Cigla, C. The power of tiling for small object detection. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019, pp. 0–0. [CrossRef]
- Yang, F.; Fan, H.; Chu, P.; Blasch, E.; Ling, H. Clustered object detection in aerial images. Proceedings of the IEEE/CVF international conference on computer vision, 2019, pp. 8311–8320. [CrossRef]
- Li, C.; Yang, T.; Zhu, S.; Chen, C.; Guan, S. Density map guided object detection in aerial images. proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, 2020, pp. 190–191.
- Ding, J.; Xue, N.; Xia, G.S.; Bai, X.; Yang, W.; Yang, M.Y.; Belongie, S.; Luo, J.; Datcu, M.; Pelillo, M.; others. Object detection in aerial images: A large-scale benchmark and challenges. IEEE transactions on pattern analysis and machine intelligence 2021, 44, 7778–7796. [Google Scholar] [CrossRef] [PubMed]
- Zhu, X.; Lyu, S.; Wang, X.; Zhao, Q. TPH-YOLOv5: Improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios. Proceedings of the IEEE/CVF international conference on computer vision, 2021, pp. 2778–2788. [CrossRef]
- Baidya, R.; Jeong, H. YOLOv5 with convMixer prediction heads for precise object detection in drone imagery. Sensors 2022, 22, 8424. [Google Scholar] [CrossRef] [PubMed]
- Wang, C.Y.; Liao, H.Y.M.; Wu, Y.H.; Chen, P.Y.; Hsieh, J.W.; Yeh, I.H. CSPNet: A new backbone that can enhance learning capability of CNN. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, 2020, pp. 390–391. [CrossRef]
- Liu, S.; Qi, L.; Qin, H.; Shi, J.; Jia, J. Path aggregation network for instance segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 8759–8768. [CrossRef]
- Wojke, N.; Bewley, A.; Paulus, D. Simple online and realtime tracking with a deep association metric. 2017 IEEE international conference on image processing (ICIP). IEEE, 2017, pp. 3645–3649. [CrossRef]
- Bennett, S. The past of PID controllers. Annual Reviews in Control 2001, 25, 43–53. [Google Scholar] [CrossRef]
- Rosenfeld, A.; Pfaltz, J.L. Sequential operations in digital picture processing. Journal of the ACM (JACM) 1966, 13, 471–494. [Google Scholar] [CrossRef]
- Ramesh, A.; Dhariwal, P.; Nichol, A.; Chu, C.; Chen, M. Hierarchical text-conditional image generation with clip latents. arXiv preprint arXiv:2204.06125 2022, 1, 3. [Google Scholar] [CrossRef]








Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).



















