Submitted:
18 March 2023
Posted:
20 March 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction
- The proposed SLAM system combines multiple sensors, including panoramic cameras, LiDAR sensors, and IMUs, to achieve high-accuracy and robust performance.
- The early fusion of LiDAR range-maps and visual features enables our SLAM system to produce results with absolute scale, without relying on external data sources such as GPS or ground control points.
- Our middle fusion using a pose graph formulation allows for the seamless integration of data from different sensors, enabling our SLAM system to provide accurate and consistent localization and mapping results.
- We conducted extensive experiments in challenging outdoor scenarios to demonstrate the effectiveness and robustness of our proposed system, even in conditions where only a few features exist. Overall, our work contributes to the development of more accurate and robust SLAM systems for various real-world applications.
2. Related Work
2.1. Visual SLAM
2.2. Panoramic Visual SLAM
2.3. LiDAR SLAM
2.4. Sensor-fusion-based SLAM
3. Methodology
3.1. Maverick Mobile Mapping System and Notation
3.2. Google Cartographer
3.2.1. Local Map Construction
3.2.2. Ceres Scan Matching
3.3. RPV-SLAM with Early Fusion
3.3.1. Feature and Range Module
3.3.2. Tracking Module
3.4. PVL Cartographer SLAM with Pose-Graph-based Middle Fusion
3.5. Global Map Optimization and Loop Closure for PVL Cartographer
4. Experiments
4.1. Dataset
4.2. Results
4.3. Discussion
5. Conclusion
- Firstly, by adopting advanced depth estimation or completion methods, denser range-maps can be created which would enable more visual features to be augmented with the ranges;
- Secondly, incorporating range measurements in both the local and global bundle adjustment would enhance the accuracy of the system;
- Thirdly, efforts are underway to improve the current PVL-Cartographer SLAM to a more tightly coupled visual-LiDAR-IMU SLAM system through pose graph or factor graph;
- Fourthly, the system can be further extended by developing a SLAM pipeline that combines the visual features and LiDAR features;
- Finally, applying deep neural network techniques for feature classification and pose correction would likely improve the system’s overall performance.
References
- Huai, J.; Zhang, Y.; Yilmaz, A. REAL-TIME LARGE SCALE 3D RECONSTRUCTION BY FUSING KINECT AND IMU DATA. ISPRS Annals of Photogrammetry, Remote Sensing & Spatial Information Sciences 2015, 2. [CrossRef]
- Alsadik, B. Ideal angular orientation of selected 64-channel multi beam lidars for mobile mapping systems. Remote sensing 2020, 12, 510. [CrossRef]
- Lin, M.; Cao, Q.; Zhang, H. PVO: Panoramic visual odometry. 2018 3rd International Conference on Advanced Robotics and Mechatronics (ICARM). IEEE, 2018, pp. 491–496. [CrossRef]
- Tardif, J.P.; Pavlidis, Y.; Daniilidis, K. Monocular visual odometry in urban environments using an omnidirectional camera. 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, 2008, pp. 2531–2538.
- Shi, Y.; Ji, S.; Shi, Z.; Duan, Y.; Shibasaki, R. GPS-supported visual SLAM with a rigorous sensor model for a panoramic camera in outdoor environments. Sensors 2012, 13, 119–136. [CrossRef]
- Ji, S.; Qin, Z.; Shan, J.; Lu, M. Panoramic SLAM from a multiple fisheye camera rig. ISPRS Journal of Photogrammetry and Remote Sensing 2020, 159, 169–183. [CrossRef]
- Sumikura, S.; Shibuya, M.; Sakurada, K. OpenVSLAM: A versatile visual SLAM framework. Proceedings of the 27th ACM International Conference on Multimedia, 2019, pp. 2292–2295.
- Zhang, J.; Kaess, M.; Singh, S. Real-time depth enhanced monocular odometry. 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, 2014, pp. 4973–4980. [CrossRef]
- Graeter, J.; Wilczynski, A.; Lauer, M. Limo: Lidar-monocular visual odometry. 2018 IEEE/RSJ international conference on intelligent robots and systems (IROS). IEEE, 2018, pp. 7872–7879. [CrossRef]
- Grisetti, G.; Kümmerle, R.; Stachniss, C.; Burgard, W. A tutorial on graph-based SLAM. IEEE Intelligent Transportation Systems Magazine 2010, 2, 31–43. [CrossRef]
- Lowe, D.G. Distinctive image features from scale-invariant keypoints. International journal of computer vision 2004, 60, 91–110. [CrossRef]
- Bay, H.; Ess, A.; Tuytelaars, T.; Van Gool, L. Speeded-up robust features (SURF). Computer vision and image understanding 2008, 110, 346–359. [CrossRef]
- Mur-Artal, R.; Montiel, J.M.M.; Tardos, J.D. ORB-SLAM: a versatile and accurate monocular SLAM system. IEEE transactions on robotics 2015, 31, 1147–1163. [CrossRef]
- Mur-Artal, R.; Tardós, J.D. Orb-slam2: An open-source slam system for monocular, stereo, and rgb-d cameras. IEEE transactions on robotics 2017, 33, 1255–1262. [CrossRef]
- Campos, C.; Elvira, R.; Rodríguez, J.J.G.; Montiel, J.M.; Tardós, J.D. Orb-slam3: An accurate open-source library for visual, visual–inertial, and multimap slam. IEEE Transactions on Robotics 2021, 37, 1874–1890. [CrossRef]
- Engel, J.; Schöps, T.; Cremers, D. LSD-SLAM: Large-scale direct monocular SLAM. Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part II 13. Springer, 2014, pp. 834–849.
- Engel, J.; Koltun, V.; Cremers, D. Direct sparse odometry. IEEE transactions on pattern analysis and machine intelligence 2017, 40, 611–625. [CrossRef]
- Kerl, C.; Sturm, J.; Cremers, D. Robust odometry estimation for RGB-D cameras. 2013 IEEE international conference on robotics and automation. IEEE, 2013, pp. 3748–3754. [CrossRef]
- Endres, F.; Hess, J.; Sturm, J.; Cremers, D.; Burgard, W. 3-D mapping with an RGB-D camera. IEEE transactions on robotics 2013, 30, 177–187. [CrossRef]
- Henry, P.; Krainin, M.; Herbst, E.; Ren, X.; Fox, D. RGB-D mapping: Using Kinect-style depth cameras for dense 3D modeling of indoor environments. The international journal of Robotics Research 2012, 31, 647–663. [CrossRef]
- Newcombe, R.A.; Izadi, S.; Hilliges, O.; Molyneaux, D.; Kim, D.; Davison, A.J.; Kohi, P.; Shotton, J.; Hodges, S.; Fitzgibbon, A. Kinectfusion: Real-time dense surface mapping and tracking. 2011 10th IEEE international symposium on mixed and augmented reality. Ieee, 2011, pp. 127–136. [CrossRef]
- Nießner, M.; Dai, A.; Fisher, M. Combining Inertial Navigation and ICP for Real-time 3D Surface Reconstruction. Eurographics (Short Papers), 2014, pp. 13–16.
- Kang, J.; Zhang, Y.; Liu, Z.; Sit, A.; Sohn, G. RPV-SLAM: Range-augmented panoramic visual SLAM for mobile mapping system with panoramic camera and tilted LiDAR. 2021 20th International Conference on Advanced Robotics (ICAR). IEEE, 2021, pp. 1066–1072.
- Zhang, J.; Singh, S. LOAM: Lidar odometry and mapping in real-time. Robotics: Science and Systems. Berkeley, CA, 2014, Vol. 2, pp. 1–9.
- Geiger, A.; Lenz, P.; Stiller, C.; Urtasun, R. Vision meets robotics: The kitti dataset. The International Journal of Robotics Research 2013, 32, 1231–1237. [CrossRef]
- Zhang, J.; Singh, S. Visual-lidar odometry and mapping: Low-drift, robust, and fast. 2015 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2015, pp. 2174–2181. [CrossRef]
- Lin, J.; Zhang, F. Loam livox: A fast, robust, high-precision LiDAR odometry and mapping package for LiDARs of small FoV. 2020 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2020, pp. 3126–3131.
- Li, L.; Kong, X.; Zhao, X.; Li, W.; Wen, F.; Zhang, H.; Liu, Y. SA-LOAM: Semantic-aided LiDAR SLAM with loop closure. 2021 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2021, pp. 7627–7634. [CrossRef]
- Mendes, E.; Koch, P.; Lacroix, S. ICP-based pose-graph SLAM. 2016 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR). IEEE, 2016, pp. 195–200. [CrossRef]
- Behley, J.; Stachniss, C. Efficient Surfel-Based SLAM using 3D Laser Range Data in Urban Environments. Robotics: Science and Systems, 2018, Vol. 2018, p. 59.
- Chen, X.; Milioto, A.; Palazzolo, E.; Giguere, P.; Behley, J.; Stachniss, C. Suma++: Efficient lidar-based semantic slam. 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2019, pp. 4530–4537. [CrossRef]
- Shan, T.; Englot, B. Lego-loam: Lightweight and ground-optimized lidar odometry and mapping on variable terrain. 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2018, pp. 4758–4765. [CrossRef]
- Shan, T.; Englot, B.; Meyers, D.; Wang, W.; Ratti, C.; Rus, D. Lio-sam: Tightly-coupled lidar inertial odometry via smoothing and mapping. 2020 IEEE/RSJ international conference on intelligent robots and systems (IROS). IEEE, 2020, pp. 5135–5142. [CrossRef]
- Shan, T.; Englot, B.; Ratti, C.; Daniela, R. LVI-SAM: Tightly-coupled Lidar-Visual-Inertial Odometry via Smoothing and Mapping. IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2021, pp. 5692–5698.
- Hess, W.; Kohler, D.; Rapp, H.; Andor, D. Real-time loop closure in 2D LIDAR SLAM. 2016 IEEE international conference on robotics and automation (ICRA). IEEE, 2016, pp. 1271–1278. [CrossRef]
- Dwijotomo, A.; Abdul Rahman, M.A.; Mohammed Ariff, M.H.; Zamzuri, H.; Wan Azree, W.M.H. Cartographer slam method for optimization with an adaptive multi-distance scan scheduler. Applied Sciences 2020, 10, 347. [CrossRef]
- Nüchter, A.; Bleier, M.; Schauer, J.; Janotta, P. Continuous-time slam—improving google’s cartographer 3d mapping. Latest Developments in Reality-Based 3D Surveying and Modelling 2018, pp. 53–73.
- Elhashash, M.; Albanwan, H.; Qin, R. A Review of Mobile Mapping Systems: From Sensors to Applications. Sensors 2022, 22, 4262. [CrossRef]
- Qin, T.; Li, P.; Shen, S. Vins-mono: A robust and versatile monocular visual-inertial state estimator. IEEE Transactions on Robotics 2018, 34, 1004–1020. [CrossRef]
- Zhang, Z.; Scaramuzza, D. A Tutorial on Quantitative Trajectory Evaluation for Visual(-Inertial) Odometry. IEEE/RSJ Int. Conf. Intell. Robot. Syst. (IROS), 2018.









| Sequence A | Sequence B | Sequence C | Sequence D | |
|---|---|---|---|---|
| Sensors | Maverick MMS: Ladybug-5 + Velodyne HDL-32 + IMU | |||
| Region | Parking lot | Campus area | Residential area | Residential area |
| Camera frames | 717 | 8382 | 10778 | 4500 |
| Image size | 4096 x 2048 | 8000 x 4000 | 8000 x 4000 | 8000 x 4000 |
| LiDAR frames | 1432 | 17395 | 22992 | 9615 |
| Distance travelled | 324 meters | 7035 meters | 7965 meters | 3634 meters |
| Running time | 94 seconds | 19 minutes | 22 minutes | 10 minutes |
| Ground truth | GNSS/IMU | GNSS/IMU | GNSS/IMU | GNSS/IMU |
| Loop | One small loop | One large loop + a few small loops | Many medium-size loops | A few loops |
| Dynamic objects | Parking, barrier and person | Car, bus and person | Car, bus and person | Car, bus and person |
| Compared methods | ORB-SLAM2 (camera-only) | |||
| VINS-Mono-SLAM (camera + IMU) | ||||
| LOAM (LiDAR) | ||||
| Google-Cartographer-SLAM (LiDAR + IMU) | ||||
| RPV-SLAM (Panoramic camera + LiDAR) | ||||
| Our PVL-SLAM (Panoramic camera + LiDAR + IMU) | ||||
| ORB SLAM2 | VINS-Mono | LOAM | Cartographer | RPV-SLAM | PVL-SLAM | |
|---|---|---|---|---|---|---|
| Sequence A | 5.894 | 3.9974 | Fail | 4.023 | 1.618 | 0.766 |
| Sequence B | 100.870 | 86.897 | Fail | 152.230 | 12.910 | 2.599 |
| Sequence C | 155.908 | 160.765 | Fail | 183.619 | 30.661 | 3.739 |
| Sequence D | 10.665 | 12.875 | Fail | 58.576 | 5.673 | 2.204 |
| Overall | 68.3343 | 66.1336 | Fail | 99.612 | 12.7155 | 2.327 |
| ORB SLAM2 | VINS-Mono | LOAM | Cartographer | RPV-SLAM | PVL-SLAM | |
|---|---|---|---|---|---|---|
| Sequence A | 7.769 | 4.685 | Fail | 6.789 | 3.934 | 3.027 |
| 0.0677 | 0.0410 | 0.0507 | 0.0040 | 0.0236 | ||
| Sequence B | 13.770 | 10.779 | Fail | 15.047 | 3.096 | 1.273 |
| 0.0099 | 0.0109 | 0.0090 | 0.0009 | 0.0019 | ||
| Sequence C | 4.879 | 3.987 | Fail | 5.764 | 3.752 | 0.853 |
| 0.0289 | 0.0301 | 0.0133 | 0.0057 | 0.0018 | ||
| Sequence D | 2.878 | 3.085 | Fail | 4.650 | 1.347 | 2.555 |
| 0.0148 | 0.0178 | 0.0137 | 0.0017 | 0.0035 | ||
| Overall | 7.324 | 5.634 | Fail | 9.843 | 2.393 | 1.069 |
| 0.030 | 0.025 | 0.059 | 0.002 | 0.003 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).