Undiscounted Semi-Markov Decision Processes With Countably Infinite Action Spaces

Kushal Guha Bakshi; Sagnik Sinha; Ramakant Bhardwaj; Purvee Bhardwaj; Satyendra Narayan

doi:10.20944/preprints202605.1015.v1

Submitted:

13 May 2026

Posted:

15 May 2026

You are already at the latest version

Abstract

In this article we study semi-Markov decision processes (SMDPs) where the pay-off criterion is limiting ratio average, generally known as undiscounted pay-off. Here we consider the action space of the decision maker to be possibly countably infinite. However, we do not put any restriction on the reward function. We prove the existence of a near-optimal or ϵ-optimal strategy of the decision maker which turns out to be a deterministic semi-stationary. An efficient algorithm is discussed to compute a near-optimal pure semi-stationary strategy for such SMDP model. Also under some standard ergodicity conditions, we propose an optimality equation of these SMDP models.

Keywords:

semi-Markov decision processes

;

semi-stationary strategies

;

stochastic games

;

semi-markov games

;

learning in games

;

linear programming

Subject:

Computer Science and Mathematics - Mathematics

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Undiscounted Semi-Markov Decision Processes With Countably Infinite Action Spaces

Abstract

Keywords:

Subject:

MDPI Initiatives

Important Links

Subscribe