Submitted:
22 May 2026
Posted:
26 May 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Humans and Machines
1.2. Theory of Mind
1.3. Theoretical Approaches to ToM
1.4. Paper Contributions
1.5. Paper Outline
2. State of the Art
3. A New Cognitive Model of Theory of Mind
3.1. Generation and Pre-Activation of Hypotheses
3.2. Bayesian Learning
3.3. Hierarchical Predictive Structures
3.4. Prediction of Actions and Mental States
3.4.1. Intentional Actions
3.4.2. Interaction Between Layers
3.4.3. Mental States Are Represented at Multiple Levels of the Hierarchy
3.4.4. Desire Versus Intention in the Hierarchical Model
3.5. Stereotypes and Personality Traits
3.5.1. Stereotypes and Theory of Mind
3.5.2. Stereotype and Predictive Hierarchy
3.5.3. Stereotypes as an Accurate and Efficient Mechanism
3.6. Prediction Errors and Weight of the Error Signal
3.6.1. Contrarian Evidence
3.6.2. Pace of Change of Stereotype Hypotheses
3.6.3. Weight of the Error Signal
3.7. Internal Simulation of Mental States
3.7.1. Simulation by Perspective Taking
3.7.2. Simulation in Other Mental Functions
3.8. From the Simulation of the Self to the Prediction of Others
4. Neuroscientific Evidence Supporting the Model
4.1. Hierarchical Predictive Structure and Neural Correlates
4.2. Actions Prediction and Neural Correlates
4.3. Mental States Prediction and Neural Correlates
4.4. Influence of Stereotypes and Neural Correlates
4.5. Simulative Processes and Neural Correlates
5. Input Stimuli to the Predictive System
6. Theory of Mind Simulations via Reconfigurable Deep Neural Networks
6.1. Artificial Neural Networks
6.2. Deep Neural Networks
6.3. Logical Inferences via Artificial Neural Networks
6.4. Reconfigurable Simulations
7. Conclusions
Funding
References
- Ames, D. R. Inside the mind reader's tool kit: projection and stereotyping in mental state inference. J. Personal. Soc. Psychol. 2004a, 87(3), 340–353. [Google Scholar] [CrossRef] [PubMed]
- Ames, D. R. Strategies for social inference: a similarity contingency model of projection and stereotyping in attribute prevalence estimates. J. Personal. Soc. Psychol. 2004b, 87(5), 573–585. [Google Scholar] [CrossRef]
- Andreas, J. Learning to Compose Neural Networks for Question Answering. arXiv 2016a, arXiv:1601.01705v4. [Google Scholar]
- Andreas, J. Neural module networks. arXiv 2016b, arXiv:1511.02799v3. [Google Scholar]
- Angelucci, A.; Levitt, J. B.; Walton, E. J.; Hupe, J. M.; Bullier, J.; Lund, J. S. Circuits for local and global signal integration in primary visual cortex. J. Neurosci. 2002, 22(19), 8633–8646. [Google Scholar] [CrossRef]
- Apperly, I. A.; Butterfill, S. A. Do humans have two systems to track beliefs and belief-like states? Psychol. Rev. 2009, 116(4), 953. [Google Scholar] [CrossRef] [PubMed]
- Baker, C.; Saxe, R.; Tenenbaum, J. Bayesian theory of mind: Modeling joint belief-desire attribution. Proc. Annu. Meet. Cogn. Sci. Soc. 2011, 33(33). [Google Scholar]
- Bello, P. Cognitive Foundations for a Computational Theory of Mindreading. Adv. Cogn. Syst. 2012, 1, 59–72. [Google Scholar]
- Berthoz, S.; Armony, J. L.; Blair, R. J. R.; Dolan, R. J. An fMRI study of intentional and unintentional (embarrassing) violations of social norms. Brain 2002, 125(8), 1696–1708. [Google Scholar] [CrossRef] [PubMed]
- Bosse, T.; Memon, Z. A.; Treur, J. A two-level BDI-agent model for theory of mind and its use in social manipulation. Proc. AISB 2007 Workshop Mindful Environ. 2007, 4, 335–342. [Google Scholar]
- Brass, M.; Schmitt, R. M.; Spengler, S.; Gergely, G. Investigating action understanding: inferential processes versus action simulation. Curr. Biol. 2007, 17(24), 2117–2121. [Google Scholar] [CrossRef]
- Burnham, D. K.; Harris, M. B. Effects of real gender and labeled gender on adults' perceptions of infants. J. Genet. Psychol. 1992, 153(2), 165–183. [Google Scholar] [CrossRef] [PubMed]
- Clements, W. A.; Perner, J. Implicit understanding of belief. Cogn. Dev. 1994, 9(4), 377–395. [Google Scholar] [CrossRef]
- Clements, W. A.; Rustin, C. L.; McCallum, S. Promoting the transition from implicit to explicit understanding: A training study of false belief. Dev. Sci. 2000, 3(1), 81–92. [Google Scholar] [CrossRef]
- Cloutier, J.; Gabrieli, J. D.; O'young, D.; Ambady, N. An fMRI study of violations of social expectations: when people are not who we expect them to be. NeuroImage 2011, 57(2), 583–588. [Google Scholar] [CrossRef]
- Cohen, W.W.; Yang, F.; Mazaitis, K. R. Tensorlog: Deep learning meets probabilistic dbs. arXiv 2017, arXiv:1707.05390. [Google Scholar] [CrossRef]
- Condry, J. C.; Ross, D. F. Sex and aggression: The influence of gender label on the perception of aggression in children. Child. Dev. 1985, 56(1), 225–233. [Google Scholar] [CrossRef]
- Costantini, M.; Galati, G.; Ferretti, A.; Caulo, M.; Tartaro, A.; Romani, G. L.; Aglioti, S. M. Neural systems underlying observation of humanly impossible movements: an fMRI study. Cereb. Cortex 2005, 15(11), 1761–1767. [Google Scholar] [CrossRef]
- de Weerd, H.; Verbrugge, R.; Verheij, B. Agent-based models for higher-order theory of mind. In Advances in Social Simulation, 229; Kamiński, B., Koloch, G., Eds.; Springer: Berlin, Heidelberg, 2014; pp. 213–224. [Google Scholar]
- Doris, J. M. Lack of character: Personality and moral behavior; Cambridge University Press, 2002. [Google Scholar]
- Fadja, A.N.; Lamma, E.; Riguzzi, F. Deep Probabilistic Logic Programming. PLP@ ILP 2017, 3–14. [Google Scholar]
- Felleman, D. J.; Van Essen, D. C. Distributed hierarchical processing in the primate cerebral cortex. Cereb. Cortex 1991, 1(1), 1–47. [Google Scholar] [CrossRef]
- Fritzke, B. A growing neural gas network learns topologies. Proceedings of NIPS'94, 1994. [Google Scholar]
- Gmytrasiewicz, P.; Moe, G.; Moreno, A. MARTHA Speaks: Implementing Theory of Mind for More Intuitive Communicative Acts. 2015 AAAI Fall Symposium Series, 2015. [Google Scholar]
- Godfrey-Smith, P. Signal, decision, action. J. Philos. 1991, 88(12), 709–722. [Google Scholar] [CrossRef]
- Goodman, G. S.; Golding, J. M.; Helgeson, V. S.; Haith, M. M.; Michelli, J. When a child takes the stand: Jurors' perceptions of children's eyewitness testimony. Law. Hum. Behav. 1987, 11(1), 27. [Google Scholar] [CrossRef]
- Gopnik, A. The theory theory as an alternative to the innateness hypothesis; Chomsky and his critics, L. M., Antony, Hornstein, N., Eds.; Blackwell Publishing: Oxford, 2003; pp. 238–254. [Google Scholar]
- Gopnik, A.; Wellman, H. M. The theory theory. In Mapping the mind: Domain specificity in cognition and culture; Hirschfeld, L., Gelman, S., Eds.; Cambridge University Press: Cambridge, 1994; pp. 257–293. [Google Scholar]
- Gopnik, A.; Wellman, H. M. Reconstructing constructivism: Causal models, Bayesian learning mechanisms, and the theory theory. Psychol. Bull. 2012, 138(6), 1085. [Google Scholar] [CrossRef]
- Grossman, E. D.; Battelli, L.; Pascual-Leone, A. Repetitive TMS over posterior STS disrupts perception of biological motion. Vis. Res. 2005, 45(22), 2847–2853. [Google Scholar] [CrossRef]
- Grossman, E. D.; Jardine, N. L.; Pyles, J. A. fMR-adaptation reveals invariant coding of biological motion on human STS. Front. Hum. Neurosci. 2010, 4, 15. [Google Scholar] [CrossRef]
- Harbers, M.; Van den Bosch, K.; Meyer, J. J. Modeling agents with a theory of mind: Theory-theory versus simulation theory. Web Intell. Agent Syst. An. Int. J. 2012, 10(3), 331–343. [Google Scholar] [CrossRef]
- Harris, P. L. From simulation to folk psychology: The case for development. Mind Lang. 1992, 7(1-2), 120–144. [Google Scholar] [CrossRef]
- Hesslow, G. Conscious thought as simulation of behaviour and perception. Trends Cogn. Sci. 2002, 6(6), 242–247. [Google Scholar] [CrossRef] [PubMed]
- Hiatt, L. M.; Trafton, J. G. A cognitive model of theory of mind. In Proceedings of the 10th International Conference on Cognitive Modeling, 2010; pp. 91–96. [Google Scholar]
- Hiatt, L. M.; Harrison, A. M.; Trafton, J. G. Accommodating human variability in human-robot teams through theory of mind. IJCAI Proc.-Int. Jt. Conf. Artif. Intell. 2011, 22(3), 2066. [Google Scholar]
- Hochstein, S.; Ahissar, M. View from the top: Hierarchies and reverse hierarchies in the visual system. Neuron 2002, 36(5), 791–804. [Google Scholar] [CrossRef]
- Hohwy, J.; Palmer, C. Social cognition as causal inference: Implications for common knowledge and autism. In Perspectives on social ontology and social cognition; Gallotti, M., Michael, J., Eds.; Dordrecht: Springer, 2014; pp. 167–189. [Google Scholar]
- Hudson, M.; Nicholson, T.; Ellis, R.; Bach, P. I see what you say: Prior knowledge of other’s goals automatically biases the perception of their actions. Cognition 2016, 146, 245–250. [Google Scholar] [CrossRef]
- Jastorff, J.; Clavagnier, S.; Gergely, G.; Orban, G. A. Neural mechanisms of understanding rational actions: middle temporal gyrus activation by contextual violation. Cereb. Cortex 2010, 21(2), 318–329. [Google Scholar] [CrossRef]
- Juang, C. F.; Lin, C. T. An On-Line Self-Constructing Neural Fuzzy Inference Network and Its Applications. IEEE Fuzzy Syst. 1998, 6(1), 12–32. [Google Scholar] [CrossRef]
- Kable, J. W.; Chatterjee, A. Specificity of action representations in the lateral occipitotemporal cortex. J. Cogn. Neurosci. 2006, 18(9), 1498–1517. [Google Scholar] [CrossRef] [PubMed]
- Kaelbling, L. P.; Littman, M. L.; Moore, A. W. Reinforcement Learning: A Survey. J. Artif. Intell. Res. 1996, 4, 237–285. [Google Scholar] [CrossRef]
- Kilner, J. M.; Friston, K. J.; Frith, C. D. Predictive coding: an account of the mirror neuron system. Cogn. Process. 2007, 8(3), 159–166. [Google Scholar] [CrossRef]
- Kim, K. J.; Lipson, H. Towards a theory of mind in simulated robots. In Proceedings of the 11th Annual Conference Companion on Genetic and Evolutionary Computation Conference: Late Breaking Papers, 2009; pp. 2071–2076. [Google Scholar]
- Koster-Hale, J.; Saxe, R. Theory of mind: a neural prediction problem. Neuron 2013, 79(5), 836–848. [Google Scholar] [CrossRef]
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
- Liu, Q.; Jiang, H.; Evdokimov, A.; Ling, Z. H.; Zhu, X.; Wei, S.; Hu, Y. Probabilistic reasoning via deep learning: Neural association models. arXiv 2016, arXiv:1603.07704. [Google Scholar] [CrossRef]
- Low, J.; Perner, J. Implicit and explicit theory of mind: State of the art. Br. J. Dev. Psychol. 2012, 30(1), 1–13. [Google Scholar] [CrossRef]
- Malle, B. F. How people explain behavior: A new theoretical framework. Personal. Soc. Psychol. Rev. 1999, 3(1), 23–48. [Google Scholar] [CrossRef]
- Mason, M. F.; Cloutier, J.; Macrae, C. N. On construing others: Category and stereotype activation from facial cues. Soc. Cogn. 2006, 24(5), 540–562. [Google Scholar] [CrossRef]
- Mitchell, J. P.; Banaji, M. R.; MacRae, C. N. The link between social cognition and self-referential thought in the medial prefrontal cortex. J. Cogn. Neurosci. 2005, 17(8), 1306–1315. [Google Scholar] [CrossRef]
- Mitchell, J. P.; Macrae, C. N.; Banaji, M. R. Dissociable medial prefrontal contributions to judgments of similar and dissimilar others. Neuron 2006, 50(4), 655–663. [Google Scholar] [CrossRef]
- Mueller-Johnson, K.; Toglia, M. P.; Sweeney, C. D.; Ceci, S. J. The perceived credibility of older adults as witnesses and its relation to ageism. Behav. Sci. Law. 2007, 25(3), 355–375. [Google Scholar] [CrossRef] [PubMed]
- Murphy, P. C.; Sillito, A. M. Corticofugal feedback influences the generation of length tuning in the visual pathway. Nature 1987, 329(6141), 727. [Google Scholar] [CrossRef] [PubMed]
- Oberman, L. M.; Ramachandran, V. S. The simulating social mind: the role of the mirror neuron system and simulation in the social and communicative deficits of autism spectrum disorders. Psychol. Bull. 2007, 133(2), 310. [Google Scholar] [CrossRef]
- Ondobaka, S.; Kilner, J.; Friston, K. The role of interoceptive inference in theory of mind. Brain Cogn. 2017, 112, 64–68. [Google Scholar] [CrossRef]
- Palmer, C. J.; Seth, A. K.; Hohwy, J. The felt presence of other minds: Predictive processing, counterfactual predictions, and mentalising in autism. Conscious. Cogn. 2015, 36, 376–389. [Google Scholar] [CrossRef] [PubMed]
- Pelphrey, K.A.; Vander Wyk, B.C. Functional and neural mechanisms for eye gaze processing. In Oxford Handbook of Face Perception; Calder, A., Rhodes, G., Johnson, M., Haxby, J., Eds.; Oxford University Press: Oxford, UK, 2011; pp. 591–604. [Google Scholar]
- Pelphrey, K. A.; Mitchell, T. V.; McKeown, M. J.; Goldstein, J.; Allison, T.; McCarthy, G. Brain activity evoked by the perception of human walking: controlling for meaningful coherent motion. J. Neurosci. 2003, 23(17), 6819–6825. [Google Scholar] [CrossRef]
- Pelphrey, K. A.; Morris, J. P.; Mccarthy, G. Grasping the intentions of others: the perceived intentionality of an action influences activity in the superior temporal sulcus during social perception. J. Cogn. Neurosci. 2004, 16(10), 1706–1716. [Google Scholar] [CrossRef]
- Pfeifer, J. H.; Lieberman, M. D.; Dapretto, M. “I know you are but what am I?!”: neural bases of self-and social knowledge retrieval in children and adults. J. Cogn. Neurosci. 2007, 19(8), 1323–1337. [Google Scholar] [CrossRef] [PubMed]
- Polceanu, M.; Buche, C. Towards a theory-of-mind-inspired generic decision-making framework. arXiv 2014, arXiv:1405.5048. [Google Scholar]
- Pynadath, D. V.; Marsella, S. C. PsychSim: Modeling theory of mind with decision-theoretic agents. IJCAI 2005, 5, 1181–1186. [Google Scholar]
- Pynadath, D.V.; Rosenbloom, P.S.; nad Marsella, S.C. Reinforcement Learning for Adaptive Theory of Mind in the Sigma Cognitive Architecture. In Artificial General Intelligence; Lecture Notes in Computer Science; Goertzel, B., Orseau, L., Snaider AGI 2014, J., Eds.; Springer: Cham, 2014; vol 8598. [Google Scholar]
- Rosch, E. Principles of categorization. In Foundations of cognitive psychology: Core readings; Levitin, D. J., Ed.; MIT Press, 2002; pp. 251–270. [Google Scholar]
- Rosenblatt, F. Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms; Spartan Books: Washington DC, 1961. [Google Scholar]
- Rumelhart, D. E.; Hintonm, G. E.; Williams, R. J. Learning Internal Representations by Error Propagation. In Parallel distributed processing: Explorations in the microstructure of cognition, 1: Foundation; and the PDP research group; Rumelhart, David E., McClelland, James L., Eds.; MIT Press: Cambridge, MA, United States, 1986. [Google Scholar]
- Ruffman, T.; Garnham, W.; Import, A.; Connolly, D. Does eye gaze indicate implicit knowledge of false belief? Charting transitions in knowledge. J. Exp. Child. Psychol. 2001, 80(3), 201–224. [Google Scholar] [CrossRef] [PubMed]
- Sagar, H. A.; Schofield, J. W. Racial and behavioral cues in black and white children's perceptions of ambiguously aggressive acts. J. Personal. Soc. Psychol. 1980, 39(4), 590. [Google Scholar] [CrossRef] [PubMed]
- Saha, S.; Singh, G.; Sapienza, M.; Torr, P.; Cuzzolin, F. Deep Learning for Detecting Multiple Space-Time Action Tubes in Videos. Proceeding of the British Machine Vision Conference (BMVC’16), 2016. [Google Scholar]
- Saha, S.; Singh, G.; Cuzzolin, F. AMTnet: Action-Micro-Tube regression by end-to-end trainable deep architecture. Proceeding of the International Conference on Computer Vision (ICCV’17), 2017. [Google Scholar]
- Saxe, R.; Wexler, A. Making sense of another mind: the role of the right temporo-parietal junction. Neuropsychologia 2005, 43(10), 1391–1399. [Google Scholar] [CrossRef]
- Saygin, A. P.; Chaminade, T.; Ishiguro, H.; Driver, J.; Frith, C. The thing that should not be: predictive coding and the uncanny valley in perceiving human and humanoid robot actions. Soc. Cogn. Affect. Neurosci. 2011, 7(4), 413–422. [Google Scholar] [CrossRef]
- Scassellati, B. Theory of mind for a humanoid robot. Auton. Robot. 2002, 12(1), 13–24. [Google Scholar] [CrossRef]
- Schacter, D. L.; Addis, D. R.; Buckner, R. L. Episodic simulation of future events. Ann. New York Acad. Sci. 2008, 1124(1), 39–60. [Google Scholar] [CrossRef]
- Scholl, B. J.; Leslie, A. M. Modularity, development and ‘theory of mind’. Mind Lang. 1999, 14(1), 131–153. [Google Scholar] [CrossRef]
- She, Y.; He, Y.; Wu, D. Learning Topology and Dynamics of Large Recurrent Neural Networks. arXiv 2014, arXiv:1410.1174. [Google Scholar] [CrossRef]
- Sherman, S. M.; Guillery, R. W. On the actions that one nerve cell can have on another: distinguishing “drivers” from “modulators”. Proc. Natl. Acad. Sci. 1998, 95(12), 7121–7126. [Google Scholar] [CrossRef]
- Shibata, H.; Inui, T.; Ogawa, K. Understanding interpersonal action coordination: an fMRI study. Exp. Brain Res. 2011, 211(3-4), 569–579. [Google Scholar] [CrossRef]
- Shultz, S.; Lee, S. M.; Pelphrey, K.; McCarthy, G. The posterior superior temporal sulcus is sensitive to the outcome of human and non-human goal-directed actions. Soc. Cogn. Affect. Neurosci. 2010, 6(5), 602–611. [Google Scholar] [CrossRef] [PubMed]
- Si, M.; Marsella, S. C. Encoding theory of mind in character design for pedagogical interactive narrative. Adv. Hum.-Comput. Interact. 2014, 10. [Google Scholar] [CrossRef]
- Singh, G.; Saha, S.; Sapienza, M.; Torr, P.; Cuzzolin, F. Online Real-time Multiple Spatiotemporal Action Localisation and Prediction on a Single Platform. Proceeding of the International Conference on Computer Vision (ICCV’17), 2017. [Google Scholar]
- Singh, G.; Saha, S.; Cuzzolin, F. TraMNet - Transition Matrix Network for Efficient Action Tube Proposals. Proceeding of the Asian Conference on Computer Vision (ACCV’18), 2018. [Google Scholar]
- Thoermer, C.; Sodian, B.; Vuori, M.; Perst, H.; Kristen, S. Continuity from an implicit to an explicit understanding of false belief from infancy to preschool age. Br. J. Dev. Psychol. 2012, 30(1), 172–187. [Google Scholar] [CrossRef]
- Wang, B.; Low, J.; Jing, Z.; Qinghua, Q. Chinese preschoolers’ implicit and explicit false-belief understanding. Br. J. Dev. Psychol. 2012, 30(1), 123–140. [Google Scholar] [CrossRef]
- Westra, E. Stereotypes, theory of mind, and the action–prediction hierarchy. Synthese 2019, 196, 2821–2846. [Google Scholar] [CrossRef]
- Yoshida, W.; Dolan, R. J.; Friston, K. J. Game theory of mind. PLoS Comput. Biol. 2008, 4(12), e1000254. [Google Scholar] [CrossRef]
- Young, L.; Dodell-Feder, D.; Saxe, R. What gets the attention of the temporo-parietal junction? An fMRI investigation of attention and theory of mind. Neuropsychologia 2010, 48(9), 2658–2664. [Google Scholar] [CrossRef]





Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.