Brief Report
Version 1
Preserved in Portico This version is not peer-reviewed
Approximate Dynamic Programming And The Krylov Subspace Methods
Version 1
: Received: 28 June 2023 / Approved: 29 June 2023 / Online: 29 June 2023 (09:56:49 CEST)
How to cite: Zhang, E. Approximate Dynamic Programming And The Krylov Subspace Methods. Preprints 2023, 2023062088. https://doi.org/10.20944/preprints202306.2088.v1 Zhang, E. Approximate Dynamic Programming And The Krylov Subspace Methods. Preprints 2023, 2023062088. https://doi.org/10.20944/preprints202306.2088.v1
Abstract
This report discusses the application of approximate dynamic programming (ADP) and Krylov subspace methods in solving sequential decision-making problems in machine learning. ADP is used when dealing with large state spaces or when the exact dynamics are unknown. The paper explores various ADP methods such as temporal difference learning and least-squares policy evaluation. The authors also focus on the use of Krylov subspace methods for solving the Bellman equation in ADP. The report provides insights into linear approximation, stochastic algorithms, and the Arnoldi algorithm for orthonormal basis. Overall, the paper highlights the theoretical foundation provided by ADP and its connection with Krylov subspace methods, shedding light on their application in computer science.
Keywords
Approximate Dynamic Programming; Krylov subspace method
Subject
Computer Science and Mathematics, Computer Science
Copyright: This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Comments (0)
We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.
Leave a public commentSend a private comment to the author(s)
* All users must log in before leaving a comment