Preprint Review Version 1 Preserved in Portico This version is not peer-reviewed

Learning to Communicate through Multi-Agent Reinforcement Learning (MARL): A Systematic Literature Review

Version 1 : Received: 6 April 2024 / Approved: 11 April 2024 / Online: 17 April 2024 (03:29:46 CEST)

How to cite: Beikmohammadi, A. Learning to Communicate through Multi-Agent Reinforcement Learning (MARL): A Systematic Literature Review. Preprints 2024, 2024040813. https://doi.org/10.20944/preprints202404.0813.v1 Beikmohammadi, A. Learning to Communicate through Multi-Agent Reinforcement Learning (MARL): A Systematic Literature Review. Preprints 2024, 2024040813. https://doi.org/10.20944/preprints202404.0813.v1

Abstract

Recent years have witnessed significant advances in reinforcement learning (RL), which has registered remarkable success in solving various sequential decision-making problems in machine learning. Most of the successful RL applications, e.g., the games of Go and Poker, robotics, and autonomous driving, involve the participation of more than one single agent, which naturally fall into the realm of multi-agent RL (MARL), a domain with a relatively long history, and has recently re-emerged due to advances in single-agent RL techniques. Although, typically, the communication protocol between agents is manually specified and not altered during training, recently, some papers have shown signs of trying to emerge a communication between agents on the one hand and, on the other hand, to understand what is exchanged between agents. So, there is a growing body of literature on this topic which includes qualitative and quantitative studies and the ones that apply mixed methods. This study presents the scoping review of the methodological strategies undertaken in a total of 16 research articles. The results present the critical appraisal of quantitative methods in terms of validity and reliability and for qualitative methods considering four trustworthiness factors. In the end, relevant insights are further explored with implications and reflections on how they can benefit one's research in the field.

Keywords

Reinforcement Learning; POMDP; Learn to Communicate; Systematic Literature Review; Quantitative Methods; Qualitative Methods

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.