Submitted:
10 June 2026
Posted:
11 June 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
2.1. Heliostat Calibration and Vision-Based Measurement
2.2. Deep Learning-Based Keypoint Detection
2.3. Geometry-Constrained Keypoint Detection
3. Materials and Methods
3.1. Overall Pipeline
3.2. Problem Formulation and Optical Challenges
3.3. Resolution-Preserving Feature Fusion and Energy Analysis
3.4. Geometry Consistency Regularization Under Perspective Projection
4. Results
4.1. Dataset and Implementation Protocols
4.2. Quantitative Analysis
4.3. Comparison with a Generic Pose Estimator
4.4. Qualitative Analysis
5. Discussion
5.1. Error Justification for Vision-Based Feedback Control
| Performance Metric | Measured Value |
| End-to-end FPS (wall-clock) ↑ | 25.14 |
| Inference-only FPS ↑ | 27.51 |
| Preprocess time (ms) ↓ | 1.996 |
| Inference time (ms) ↓ | 36.349 |
| Postprocess time (ms) ↓ | 0.784 |
5.2. Deployment Considerations
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Islam, M.T.; Huda, N.; Abdullah, A.B.; Saidur, R. A Comprehensive Review of State-of-the-Art Concentrating Solar Power (CSP) Technologies: Current Status and Research Trends. Renew. Sustain. Energy Rev. 2018, 91, 987–1018. [Google Scholar] [CrossRef]
- Ho, C.K. Advances in Central Receivers for Concentrating Solar Applications. Sol. Energy 2017, 152, 38–56. [Google Scholar] [CrossRef]
- Noone, C.J.; Torrilhon, M.; Mitsos, A. Heliostat Field Optimization: A New Computationally Efficient Model and Biomimetic Layout. Sol. Energy 2012, 86, 792–803. [Google Scholar] [CrossRef]
- Maiga, M.; N’Tsoukpoe, K.E.; Gomna, A.; Fiagbe, Y.A.K. Sources of Solar Tracking Errors and Correction Strategies for Heliostats. Renew. Sustain. Energy Rev. 2024, 203, 114770. [Google Scholar] [CrossRef]
- Berenguel, M.; Rubio, F.R.; Valverde, A.; Lara, P.J.; Arahal, M.R.; Camacho, E.F.; López, M. An Artificial Vision-Based Control System for Automatic Heliostat Positioning Offset. Correction in a Central Receiver Solar Power Plant. Solar Energy 2004, 76, 563–575. https://doi.org/10.1016/j.solener.2003.12.006..
- Phipps, G.S. Heliostat Beam Characterization System–Calibration Technique. Technical Report SAND78-8038, Sandia Laboratories, 1978.
- Röger, M.; Herrmann, P.; Ulmer, S.; Ebert, M.; Prahl, C.; Göhring, F. Techniques to Measure Solar Flux Density Distribution on Large-Scale Receivers. J. Sol. Energy Eng. 2014, 136, 031013. [Google Scholar] [CrossRef]
- Mehos, M.; Price, H.; Cable, R.; Kearney, D.; Kelly, B.; Kolb, G.; Morse, F. Concentrating Solar Power Best Practices Study. Technical Report NREL/TP-5500-75763, National Renewable Energy Laboratory, Golden, CO, USA, 2020. [CrossRef]
- Prahl, C.; Röger, M.; Hilgert, C. Air-Borne Shape Measurement of Parabolic Trough Collector Fields. Sol. Energy 2013, 91, 68–78. [Google Scholar] [CrossRef]
- Chaumette, F.; Hutchinson, S. Visual Servo Control. I. Basic Approaches. IEEE Robot. Autom. Mag. 2006, 13, 82–90. [Google Scholar] [CrossRef]
- Steger, C.; Ulrich, M.; Wiedemann, C. Machine Vision Algorithms and Applications, 2 ed.; Wiley-VCH: Weinheim, Germany, 2018. [Google Scholar]
- Zhang, Z. A Flexible New Technique for Camera Calibration. IEEE Trans. Pattern Anal. Mach. Intell. 2000, 22, 1330–1334. [Google Scholar] [CrossRef]
- Hartley, R.; Zisserman, A. Multiple View Geometry in Computer Vision, 2 ed.; Cambridge University Press: Cambridge, UK, 2004. [Google Scholar] [CrossRef]
- Xu, F.; Li, C.; Sun, F. On-Line Measurement of Tracking Poses of Heliostats in Concentrated Solar Power Plants. Sensors 2024, 24, 6373. [Google Scholar] [CrossRef]
- Pargmann, M.; Quinto, D.M.; Schwarzbözl, P.; Pitz-Paal, R. Automatic Heliostat Learning for In Situ Concentrating Solar Power Plant Metrology with Differentiable Ray Tracing. Nat. Commun. 2024, 15, 6997. [Google Scholar] [CrossRef] [PubMed]
- Lewen, J.; Pargmann, M.; Löwe, M.C.; Maldonado Quinto, D.; Pitz-Paal, R. Inverse Deep Learning Raytracing for Heliostat Surface Prediction. Sol. Energy 2025, 289, 113312. [Google Scholar] [CrossRef]
- Carballo, J.A.; Bonanos, A.M.; Fernández-Reche, J.; Vasallo, M.J.; Marugán-Cruz, C.; Santana, D. Artificial Intelligence in Heliostat Control and Optimization for CSP Plants: A Critical Review. Renew. Sustain. Energy Rev. 2025, 214, 115530. [Google Scholar] [CrossRef]
- Nayar, S.K.; Fang, X.S.; Boult, T. Separation of Reflection Components Using Color and Polarization. Int. J. Comput. Vis. 1997, 21, 163–186. [Google Scholar] [CrossRef]
- Debevec, P.E.; Malik, J. Recovering High Dynamic Range Radiance Maps from Photographs. In Proceedings of the Proceedings of SIGGRAPH 1997, 1997; pp. 369–378. [Google Scholar] [CrossRef]
- Qiu, J.M.; Jiang, P.S.; Zhu, Y.; Yin, Z.X.; Cheng, M.M.; Mu, T.J. Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections. In Proceedings of the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023; pp. 20823–20833. [Google Scholar] [CrossRef]
- Han, Y.; Yan, H.; Liu, Y.; Zhao, J.; Wang, K.; Wang, Y.; Wang, Y.S. NeRSP: Neural 3D Reconstruction for Reflective Objects with Sparse Polarized Images. In Proceedings of the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024; pp. 21001–21010. [Google Scholar] [CrossRef]
- Tang, J.; Zhang, C.; Yu, Y.T.; Wang, P.; Zhang, Y.; Liu, L.; Yang, Y.L.; Theobalt, C. SpecTRe-GS: Modeling Highly Specular Surfaces with Reflected Nearby Objects by Tracing Rays in 3D Gaussian Splatting. In Proceedings of the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025. [Google Scholar]
- Newell, A.; Yang, K.; Deng, J. Newell, A.; Yang, K.; Deng, J. Stacked Hourglass Networks for Human Pose Estimation. In Proceedings of the Computer Vision – ECCV 2016. Springer, 2016, Vol. 9912, Lecture Notes in Computer Science, pp. 483–499. [CrossRef]
- Toshev, A.; Szegedy, C. DeepPose: Human Pose Estimation via Deep Neural Networks. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014; pp. 1653–1660. [Google Scholar] [CrossRef]
- Cao, Z.; Hidalgo, G.; Simon, T.; Wei, S.E.; Sheikh, Y. OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 43, 172–186. [Google Scholar] [CrossRef]
- Xiao, B.; Wu, H.; Wei, Y. Simple Baselines for Human Pose Estimation and Tracking. In Proceedings of the Computer Vision – ECCV 2018. Springer, 2018, Vol. 11210, Lecture Notes in Computer Science, pp. 472–487. 2018; Vol. 11210, pp. 472–487. [Google Scholar] [CrossRef]
- Maji, D.; Nagori, S.; Mathew, M.; Poddar, D. YOLO-Pose: Enhancing YOLO for Multi Person Pose Estimation Using Object Keypoint Similarity Loss. In Proceedings of the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2022; pp. 2637–2646. [Google Scholar] [CrossRef]
- Jiang, T.; Lu, P.; Zhang, L.; Ma, N.; Han, R.; Lyu, C.; Li, Y.; Chen, K. RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose, 2023. arXiv arXiv:cs. [CrossRef]
- Kendall, A.; Cipolla, R. Geometric Loss Functions for Camera Pose Regression with Deep Learning. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017; pp. 6555–6564. [Google Scholar] [CrossRef]
- Jocher, G.; Chaurasia, A.; Qiu, J. Ultralytics YOLOv8, 2023.
- Wang, J.; Sun, K.; Cheng, T.; Jiang, B.; Deng, C.; Zhao, Y.; Liu, D.; Mu, Y.; Tan, M.; Wang, X.; et al. Deep High-Resolution Representation Learning for Visual Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 43, 3349–3364. [Google Scholar] [CrossRef]
- Lin, T.Y.; Dollár, P.; Girshick, R.; He, K.; Hariharan, B.; Belongie, S. Feature Pyramid Networks for Object Detection. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017; pp. 936–944. [Google Scholar] [CrossRef]
- Li, Y.; Yang, S.; Liu, P.; Zhang, S.; Wang, Y.; Wang, Z.; Yang, W.; Xia, S.T. SimCC: A Simple Coordinate Classification Perspective for Human Pose Estimation. In Proceedings of the Computer Vision – ECCV 2022.Springer, 2022, Vol. 13666, Lecture Notes in Computer Science, pp. 89–106. [CrossRef]
- Bolton, W. Programmable Logic Controllers, 6 ed.; Newnes: Oxford, UK, 2015. [Google Scholar]
- Lepetit, V.; Moreno-Noguer, F.; Fua, P. EPnP: An Accurate O(n) Solution to the PnP Problem. Int. J. Comput. Vis. 2009, 81, 155–166. [Google Scholar] [CrossRef]
- Shi, W.; Cao, J.; Zhang, Q.; Li, Y.; Xu, L. Edge Computing: Vision and Challenges. IEEE Internet Things J. 2016, 3, 637–646. [Google Scholar] [CrossRef]
- Lin, T.Y.; Maire, M.; Belongie, S.; Hays, J.; Perona, P.; Ramanan, D.; Dollár, P.; Zitnick, C.L. Microsoft COCO: Common Objects in Context. In Proceedings of the Computer Vision – ECCV 2014. Springer, 2014, Vol. 8693, Lecture Notes in Computer Science, pp. 740–755. [CrossRef]
- Loshchilov, I.; Hutter, F. Decoupled Weight Decay Regularization. In Proceedings of the International Conference on Learning Representations, 2019. [Google Scholar]
- MMPose Contributors. OpenMMLab Pose Estimation Toolbox and Benchmark. 2020. [Google Scholar]
- Kalman, R.E. A New Approach to Linear Filtering and Prediction Problems. J. Basic Eng. 1960, 82, 35–45. [Google Scholar] [CrossRef]




| Model Configuration | mAP@0.5 ↑ | mAP@0.5:0.95 ↑ | EPE (px) ↓ |
| Baseline YOLOv8-Pose | 0.9903 | 0.9671 | 2.58 |
| Only_Geo | 0.9883 | 0.9657 | 2.67 |
| Only_P2 | 0.9936 | 0.9823 | 1.89 |
| Ours (P2_Geo) | 0.9927 | 0.9847 | 1.89 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).