Submitted:
25 February 2024
Posted:
26 February 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Structure of this paper
2. The imperative for trustworthy agents: applying Theory of Mind
3. The role of intention in explainability
4. Intentions and value alignment
5. Can we genuinely infer intentions from traces of opaque agents?
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
References
- Balke, T.; Gilbert, N. How do agents make decisions? A survey. Journal of Artificial Societies and Social Simulation 2014, 17, 13. [Google Scholar] [CrossRef]
- Minsky, M. Society of mind; Simon and Schuster, 1988.
- Malle, B.F. Attribution theories: How people make sense of behavior. Theories in social psychology 2011, 23, 72–95, Publisher: New York. [Google Scholar]
- Rao, A.S.; Georgeff, M.P. Modeling Rational Agents within a BDI-Architecture. Proceedings of the 2nd International Conference on Principles of Knowledge Representation and Reasoning; Allen, J., Fikes, R., Sandewall, E., Eds.; Morgan Kaufmann publishers Inc.: San Mateo, CA, USA, 1991; pp. 473–484. [Google Scholar]
- Shvo, M.; Klassen, T.Q.; Sohrabi, S.; McIlraith, S.A. Epistemic plan recognition. Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems, 2020, pp. 1251–1259.
- Broome, J. Are intentions reasons? And how should we cope with incommensurable values. Practical rationality and preference: Essays for David Gauthier 2001, pp. 98–120. Publisher: Cambridge University Press Cambridge.
- Baker, C.; Saxe, R.; Tenenbaum, J. Bayesian theory of mind: Modeling joint belief-desire attribution. Proceedings of the annual meeting of the cognitive science society, 2011, Vol. 33. Issue: 33.
- Schuwerk, T.; Kampis, D.; Baillargeon, R.; Biro, S.; Bohn, M.; Byers-Heinlein, K.; Dörrenberg, S.; Fisher, C.; Franchin, L.; Fulcher, T.; others. Action anticipation based on an agent’s epistemic state in toddlers and adults. PsyArXiv 2021.
- Saxe, R.; Baron-Cohen, S. The neuroscience of theory of mind, 2006.
- Richardson, H.; Saxe, R. Early signatures of and developmental change in brain regions for theory of mind. In Neural circuit and cognitive development; Elsevier, 2020; pp. 467–484.
- Baker, C.L.; Jara-Ettinger, J.; Saxe, R.; Tenenbaum, J.B. Rational quantitative attribution of beliefs, desires and percepts in human mentalizing. Nature Human Behaviour 2017, 1, 0064. [Google Scholar] [CrossRef]
- Ho, M.K.; Saxe, R.; Cushman, F. Planning with Theory of Mind. Trends in Cognitive Sciences 2022, 26, 959–971. [Google Scholar] [CrossRef] [PubMed]
- Rao, A.S.; Georgeff, M.P.; others. BDI agents: from theory to practice. Icmas, 1995, Vol. 95, pp. 312–319.
- Winikoff, M.; Sidorenko, G. Evaluating a Mechanism for Explaining BDI Agent Behaviour. Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems; International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC, 2023; AAMAS ’23, pp. 2283–2285. event-place: London, United Kingdom.
- Grice, H.P. Logic and conversation. In Speech acts; Brill, 1975; pp. 41–58.
- Bratman, M. Intention, Plans, and Practical Reason; Cambridge, MA: Harvard University Press: Cambridge, 1987. [Google Scholar]
- Cohen, P.R.; Levesque, H.J. Intention is choice with commitment. Artificial intelligence 1990, 42, 213–261, Publisher: Elsevier. [Google Scholar] [CrossRef]
- Cranefield, S.; Winikoff, M.; Dignum, V.; Dignum, F. No Pizza for You: Value-based Plan Selection in BDI Agents. IJCAI, 2017, pp. 178–184.
- Van de Poel, I. Embedding values in artificial intelligence (AI) systems. Minds and Machines 2020, 30, 385–409. [Google Scholar] [CrossRef]
- van der Weide, T.L.; others. Arguing to motivate decisions. PhD Thesis, Utrecht University, 2011.
- Schwartz, S.H. An overview of the Schwartz theory of basic values. Online readings in Psychology and Culture 2012, 2, 11. [Google Scholar] [CrossRef]
- Godin, G.; Conner, M.; Sheeran, P. Bridging the intention-behaviour gap: The role of moral norm. British Journal of Social Psychology 2005, 44, 497–512. [Google Scholar] [CrossRef] [PubMed]
- Young, L.; Saxe, R. When ignorance is no excuse: Different roles for intent across moral domains. Cognition 2011, 120, 202–214. [Google Scholar] [CrossRef] [PubMed]
- Yoder, K.J.; Decety, J. The neuroscience of morality and social decision-making. Psychology, Crime & Law 2018, 24, 279–295. [Google Scholar] [CrossRef]
- Kautz, H.A.; Allen, J.F.; others. Generalized plan recognition. AAAI. Philadelphia, PA, 1986, Vol. 86, p. 5. Issue: 3237.
- Albrecht, S.V.; Stone, P. Autonomous agents modelling other agents: A comprehensive survey and open problems. Artificial Intelligence 2018, 258, 66–95. [Google Scholar] [CrossRef]
- Sukthankar, G.; Geib, C.; Bui, H.H.; Pynadath, D.; Goldman, R.P. Plan, activity, and intent recognition: Theory and practice; Newnes, 2014.
- Vered, M.; Kaminka, G.A.; Biham, S. Online goal recognition through mirroring: Humans and agents. The Fourth Annual Conference on Advances in Cognitive Systems, 2016, Vol. 4.
- Tambe, M.; Rosenbloom, P.S. Event tracking in a dynamic multiagent environment. Computational Intelligence 1996, 12, 499–522, Publisher: Wiley Online Library. [Google Scholar] [CrossRef]
- Taha, T.; Miró, J.V.; Dissanayake, G. POMDP-based long-term user intention prediction for wheelchair navigation. 2008 IEEE International Conference on Robotics and Automation. IEEE, 2008, pp. 3920–3925.
- Bandyopadhyay, T.; Won, K.S.; Frazzoli, E.; Hsu, D.; Lee, W.S.; Rus, D. Intention-aware motion planning. Algorithmic Foundations of Robotics X: Proceedings of the Tenth Workshop on the Algorithmic Foundations of Robotics. Springer, 2013, pp. 475–491.
- Ramırez, M.; Geffner, H. Goal recognition over POMDPs: Inferring the intention of a POMDP agent. IJCAI. IJCAI/AAAI, 2011, pp. 2009–2014.
- Kerkez, B.; Cox, M.T. Incremental case-based plan recognition with local predictions. International Journal on Artificial Intelligence Tools 2003, 12, 413–463, Publisher: World Scientific. [Google Scholar] [CrossRef]
- Rabkina, I.; Forbus, K.D. Analogical reasoning for intent recognition and action prediction in multi-agent systems. Proceedings of the Seventh Annual Conference on Advances in Cognitive Systems. Cognitive Systems Foundation Cambridge, 2019, pp. 504–517.
- Markovitch, S.; Reger, R. Learning and exploiting relative weaknesses of opponent agents. Autonomous Agents and Multi-Agent Systems 2005, 10, 103–130, Publisher: Springer. [Google Scholar] [CrossRef]
- Hayes, B.; Shah, J.A. Improving Robot Controller Transparency Through Autonomous Policy Explanation. Proceedings of the 2017 ACM/IEEE International Conference on Human-Robot Interaction; ACM: Vienna Austria, 2017; pp. 303–312. [Google Scholar] [CrossRef]
- Liu, T.; McCalmon, J.; Le, T.; Rahman, M.A.; Lee, D.; Alqahtani, S. A novel policy-graph approach with natural language and counterfactual abstractions for explaining reinforcement learning agents. Autonomous Agents and Multi-Agent Systems 2023, 37, 34. [Google Scholar] [CrossRef]
- Domènech i Vila, M.; Gnatyshak, D.; Tormos, A.; Alvarez-Napagao, S. Testing Reinforcement Learning Explainability Methods in a Multi-Agent Cooperative Environment. In Frontiers in Artificial Intelligence and Applications; Cortés, A.; Grimaldo, F.; Flaminio, T., Eds.; IOS Press, 2022. [CrossRef]
| 1 | Another important topic to cover when dealing with explanations is that not all are understandable by humans. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).