PreprintReviewVersion 1Preserved in Portico This version is not peer-reviewed
Advanced Machine Learning Methods for Learning from Sparse Data in High-Dimensional Spaces: A Perspective on Uses in The Upstream of Development of Novel Energy Technologies
Manzhos, S.; Ihara, M. Advanced Machine Learning Methods for Learning from Sparse Data in High-Dimensional Spaces: A Perspective on Uses in the Upstream of Development of Novel Energy Technologies. Physchem2022, 2, 72-95.
Manzhos, S.; Ihara, M. Advanced Machine Learning Methods for Learning from Sparse Data in High-Dimensional Spaces: A Perspective on Uses in the Upstream of Development of Novel Energy Technologies. Physchem 2022, 2, 72-95.
Manzhos, S.; Ihara, M. Advanced Machine Learning Methods for Learning from Sparse Data in High-Dimensional Spaces: A Perspective on Uses in the Upstream of Development of Novel Energy Technologies. Physchem2022, 2, 72-95.
Manzhos, S.; Ihara, M. Advanced Machine Learning Methods for Learning from Sparse Data in High-Dimensional Spaces: A Perspective on Uses in the Upstream of Development of Novel Energy Technologies. Physchem 2022, 2, 72-95.
Abstract
Machine learning (ML) has found increasing use in research on energy conversion and storage technologies, in particular, so-called sustainable technologies. While often ML is used to directly optimize the parameters or phenomena of interest in the space of features, in this perspective, we focus on using ML to construct objects and methods that help in or enable the modeling of the underlying phenomena. We highlight the need for machine learning from very sparse and unevenly distributed numeric data in multidimensional spaces in these applications. After a brief introduction of some common regression-type machine learning techniques, we will focus on more advanced ML techniques which use these known methods as building blocks of more complex schemes and thereby allow working with extremely sparse data and also allow generating insight. Specifically, we will highlight the utility of using representations with sub-dimensional functions by combining the high-dimensional model representation ansatz with machine learning methods like neural networks or Gaussian process regressions in applications ranging from heterogeneous catalysis to nuclear energy.
Keywords
machine learning; neural network; Gaussian process regression; curse of dimensionality; high-dimensional model representation; energy conversion and storage; heterogeneous catalysis.
Subject
CHEMISTRY, Physical Chemistry
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.