Submitted:
08 February 2025
Posted:
10 February 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- To match visible and thermal camera images to a remote sensing RGB map, we investigate several visual features and propose outlier filtering methods to achieve cross day and night geo-registration performance.
- To obtain accurate localization results from geo-registration, we analyze the influence of horizontal attitude on the center point, and we propose the compensation method to correct the localization error to obtain more precise geo-localization.
- We conduct actual long-duration flight experiments with different situations. The experiments include visual registration using various features on visible light and thermal images, geo-localization with horizontal attitude compensation, and integrated navigation. These experiments prove the efficiency of our methods.
2. Related Works
2.1. Visual Navigation
2.2. IR-VIS Method
3. Method
3.1. Visual-Inertial System State Construction and Propagation
3.1.1. Nominal State Propagation
3.1.2. Filter State
3.1.3. Covariance Propagation
3.2. Cross-Domain Visual Registration
3.2.1. Features Extraction and Matching Method
3.2.2. Reference Visible Map Pre-Processing
3.2.3. Camera Image Pre-Processing
- Rotation and Scaling
- Thermal Image Gamma Transformation
3.2.4. Camera-Map Registration

- Sub-map querying
- Camera-map registration method
- Anomaly match checking
3.3. Geo-Localization from Visual Registration
3.3.1. Geo-Localization
3.3.2. Horizontal Attitude Error Compensation
3.4. Filter State Updation with GeoLocalization Observations


4. Experiment
4.1. System Setup
4.2. Dataset Description and Evaluation Method
| Dataset | Dataset 1 | Dataset 2 |
|---|---|---|
| Cruising Altitude(m) | 2400 | 700 |
| Time(s) | 6892 | 5170 |
| Length(km) | 344.6 | 284.35 |
| Daytime(s) | 2504 | 5170 |
| Frequency(Hz) | 1 | 1 |
| Acceptable Matching Threshold | 50 | 50 |
| Location Error Threshold(m) | 100 | 100 |
4.3. Experimental Results
4.3.1. Registration Rate Results


4.3.2. Visual Geo-Localization Results
4.3.3. Visual-Inertial Integrated Navigation Results
5. Conclusions
Author Contributions
Funding
References
- Couturier, A.; Akhloufi, M.A. A review on absolute visual localization for UAV. Robotics and Autonomous Systems 2021, 135, 103666. [Google Scholar] [CrossRef]
- Fraundorfer, F.; Scaramuzza, D. Visual odometry: Part i: The first 30 years and fundamentals. IEEE Robotics and Automation Magazine 2011, 18, 80–92. [Google Scholar]
- Scaramuzza, D.; Zhang, Z. Visual-inertial odometry of aerial robots. arXiv 2019, arXiv:1906.03289. [Google Scholar]
- Qin, T.; Li, P.; Shen, S. Vins-mono: A robust and versatile monocular visual-inertial state estimator. IEEE transactions on robotics 2018, 34, 1004–1020. [Google Scholar] [CrossRef]
- Campos, C.; Elvira, R.; Rodríguez, J.J.G.; Montiel, J.M.; Tardós, J.D. Orb-slam3: An accurate open-source library for visual, visual–inertial, and multimap slam. IEEE Transactions on Robotics 2021, 37, 1874–1890. [Google Scholar] [CrossRef]
- Geneva, P.; Eckenhoff, K.; Lee, W.; Yang, Y.; Huang, G. Openvins: A research platform for visual-inertial estimation. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA); 2020; pp. 4666–4672. [Google Scholar]
- Bianchi, M.; Barfoot, T.D. UAV localization using autoencoded satellite images. IEEE Robotics and Automation Letters 2021, 6, 1761–1768. [Google Scholar] [CrossRef]
- Patel, B.; Barfoot, T.D.; Schoellig, A.P. Visual localization with google earth images for robust global pose estimation of uavs. In Proceedings of the 2020 IEEE international conference on robotics and automation (ICRA); 2020; pp. 6491–6497. [Google Scholar]
- Conte, G.; Doherty, P. Vision-based unmanned aerial vehicle navigation using geo-referenced information. EURASIP Journal on Advances in Signal Processing 2009, 2009, 1–18. [Google Scholar] [CrossRef]
- Mantelli, M.; Pittol, D.; Neuland, R.; Ribacki, A.; Maffei, R.; Jorge, V.; Prestes, E.; Kolberg, M. A novel measurement model based on abBRIEF for global localization of a UAV over satellite images. Robotics and Autonomous Systems 2019, 112, 304–319. [Google Scholar] [CrossRef]
- Jiang, X.; Ma, J.; Xiao, G.; Shao, Z.; Guo, X. A review of multimodal image matching: Methods and applications. Information Fusion 2021, 73, 22–71. [Google Scholar] [CrossRef]
- Dwith Chenna, Y.N.; Ghassemi, P.; Pfefer, T.J.; Casamento, J.; Wang, Q. Free-form deformation approach for registration of visible and infrared facial images in fever screening. Sensors 2018, 18, 125. [Google Scholar] [CrossRef] [PubMed]
- Yu, K.; Ma, J.; Hu, F.; Ma, T.; Quan, S.; Fang, B. A grayscale weight with window algorithm for infrared and visible image registration. Infrared Physics & Technology 2019, 99, 178–186. [Google Scholar]
- Hrkać, T.; Kalafatić, Z.; Krapac, J. Infrared-visual image registration based on corners and hausdorff distance. In Proceedings of the Image Analysis: 15th Scandinavian Conference, SCIA 2007, Aalborg, Denmark, 10-14 June 2007; pp. 383–392. [Google Scholar]
- Ma, J.; Zhao, J.; Ma, Y.; Tian, J. Non-rigid visible and infrared face registration via regularized Gaussian fields criterion. Pattern Recognition 2015, 48, 772–784. [Google Scholar] [CrossRef]
- Du, Q.; Fan, A.; Ma, Y.; Fan, F.; Huang, J.; Mei, X. Infrared and visible image registration based on scale-invariant piifd feature and locality preserving matching. IEEE Access 2018, 6, 64107–64121. [Google Scholar] [CrossRef]
- Zeng, Q.; Adu, J.; Liu, J.; Yang, J.; Xu, Y.; Gong, M. Real-time adaptive visible and infrared image registration based on morphological gradient and C_SIFT. Journal of Real-Time Image Processing 2020, 17, 1103–1115. [Google Scholar] [CrossRef]
- Forster, C.; Zhang, Z.; Gassner, M.; Werlberger, M.; Scaramuzza, D. SVO: Semidirect visual odometry for monocular and multicamera systems. IEEE Transactions on Robotics 2016, 33, 249–265. [Google Scholar] [CrossRef]
- Engel, J.; Koltun, V.; Cremers, D. Direct sparse odometry. IEEE transactions on pattern analysis and machine intelligence 2017, 40, 611–625. [Google Scholar] [CrossRef] [PubMed]
- Mur-Artal, R.; Montiel, J.M.M.; Tardos, J.D. ORB-SLAM: a versatile and accurate monocular SLAM system. IEEE transactions on robotics 2015, 31, 1147–1163. [Google Scholar] [CrossRef]
- Van Dalen, G.J.; Magree, D.P.; Johnson, E.N. Absolute localization using image alignment and particle filtering. In Proceedings of the AIAA Guidance, Navigation, and Control Conference; 2016; p. 0647. [Google Scholar]
- Yol, A.; Delabarre, B.; Dame, A.; Dartois, J.-E.; Marchand, E. Vision-based absolute localization for unmanned aerial vehicles. In Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems; 2014; pp. 3429–3434. [Google Scholar]
- Conte, G.; Doherty, P. An integrated UAV navigation system based on aerial image matching. In Proceedings of the 2008 IEEE Aerospace Conference; 2008; pp. 1–10. [Google Scholar]
- Calonder, M.; Lepetit, V.; Strecha, C.; Fua, P. Brief: Binary robust independent elementary features. In Proceedings of the Computer Vision–ECCV 2010: 11th European Conference on Computer Vision, Heraklion, Crete, Greece, September 5-11, 2010, Proceedings, Part IV 11, 2010; pp. 778–792.
- Surber, J.; Teixeira, L.; Chli, M. Robust visual-inertial localization with weak GPS priors for repetitive UAV flights. In Proceedings of the 2017 IEEE International Conference on Robotics and Automation (ICRA); 2017; pp. 6300–6306. [Google Scholar]
- Shan, M.; Wang, F.; Lin, F.; Gao, Z.; Tang, Y.Z.; Chen, B.M. Google map aided visual navigation for UAVs in GPS-denied environment. In Proceedings of the 2015 IEEE international conference on robotics and biomimetics (ROBIO); 2015; pp. 114–119. [Google Scholar]
- Nassar, A.; Amer, K.; ElHakim, R.; ElHelw, M. A deep CNN-based framework for enhanced aerial imagery registration with applications to UAV geolocalization. In Proceedings of the Proceedings of the IEEE conference on computer vision and pattern recognition workshops, 2018; pp. 1513–1523.
- Wang, L.; Gao, C.; Zhao, Y.; Song, T.; Feng, Q. Infrared and visible image registration using transformer adversarial network. In Proceedings of the 2018 25th IEEE International Conference on Image Processing (ICIP); 2018; pp. 1248–1252. [Google Scholar]
- Baruch, E.B.; Keller, Y. Multimodal matching using a hybrid convolutional neural network. Ben-Gurion University of the Negev, 2018.
- Wang, M.; Wu, W.; He, X.; Li, Y.; Pan, X. Consistent ST-EKF for long distance land vehicle navigation based on SINS/OD integration. IEEE Transactions on Vehicular Technology 2019, 68, 10525–10534. [Google Scholar] [CrossRef]
- Wang, M.; Wu, W.; Zhou, P.; He, X. State transformation extended Kalman filter for GPS/SINS tightly coupled integration. Gps Solutions 2018, 22, 1–12. [Google Scholar] [CrossRef]
- Robert, E.; Perrot, T. Invariant filtering versus other robust filtering methods applied to integrated navigation. In Proceedings of the 2017 24th Saint Petersburg International Conference on Integrated Navigation Systems (ICINS); 2017; pp. 1–7. [Google Scholar]
- Ng, P.C.; Henikoff, S. SIFT: Predicting amino acid changes that affect protein function. Nucleic acids research 2003, 31, 3812–3814. [Google Scholar] [CrossRef] [PubMed]
- Potje, G.; Cadar, F.; Araujo, A.; Martins, R.; Nascimento, E.R. XFeat: Accelerated Features for Lightweight Image Matching. In Proceedings of the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024; pp. 2682–2691.
- DeTone, D.; Malisiewicz, T.; Rabinovich, A. Superpoint: Self-supervised interest point detection and description. In Proceedings of the Proceedings of the IEEE conference on computer vision and pattern recognition workshops, 2018; pp. 224–236.
- Zhang, Z. Flexible camera calibration by viewing a plane from unknown orientations. In Proceedings of the Proceedings of the seventh ieee international conference on computer vision, 1999; pp. 666–673.
- Sarlin, P.-E.; DeTone, D.; Malisiewicz, T.; Rabinovich, A. Superglue: Learning feature matching with graph neural networks. In Proceedings of the Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020; pp. 4938–4947.
- Chen, H.; Luo, Z.; Zhang, J.; Zhou, L.; Bai, X.; Hu, Z.; Tai, C.-L.; Quan, L. Learning to match features with seeded graph matching network. In Proceedings of the Proceedings of the IEEE/CVF international conference on computer vision, 2021; pp. 6301–6310.
- Liu, K.; He, X.; Mao, J.; Zhang, L.; Zhou, W.; Qu, H.; Luo, K. Map Aided Visual-Inertial Integrated Navigation for Long Range UAVs. In Proceedings of the International Conference on Guidance, Navigation and Control, 2022; pp. 6043–6052.










| Sub-map | Center of sub-map (longitude, latitude) |
Descriptors |
|---|---|---|
| … | … | … |
| Visible Light Camera (Flir BFS-U3-51S5C-C) |
Thermal Camera (Guide Sensmart Plug617) |
|
|---|---|---|
| Resolution | ||
| Frequency | 1Hz | 1Hz |
| RMSE(m) | Max Linear Bias(m) | End Point Error(m) | |
|---|---|---|---|
| PINS | 450.933 | 890.55 | 627.49 |
|
Without Gaussian Elliptic Constraint |
112.65 | 423.65 | 75.10 |
| Proposed Method | 42.38 | 261.23 | 65.21 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).