Preprint
Article

M-Learning: A Computationally Efficient Heuristic for Reinforcement Learning with Delayed Rewards

This version is not peer-reviewed.

Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2025 MDPI (Basel, Switzerland) unless otherwise stated