Preprint
Article

This version is not peer-reviewed.

Undiscounted Semi-Markov Decision Processes With Countably Infinite Action Spaces

Submitted:

13 May 2026

Posted:

15 May 2026

You are already at the latest version

Abstract
In this article we study semi-Markov decision processes (SMDPs) where the pay-off criterion is limiting ratio average, generally known as undiscounted pay-off. Here we consider the action space of the decision maker to be possibly countably infinite. However, we do not put any restriction on the reward function. We prove the existence of a near-optimal or ϵ-optimal strategy of the decision maker which turns out to be a deterministic semi-stationary. An efficient algorithm is discussed to compute a near-optimal pure semi-stationary strategy for such SMDP model. Also under some standard ergodicity conditions, we propose an optimality equation of these SMDP models.
Keywords: 
;  ;  ;  ;  ;  
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated