Submitted:
12 June 2026
Posted:
15 June 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Core Principles
2.1. Prediction
2.2. Intrinsic Motivation
3. New Directions
3.1. Foundation Models
3.2. Basal Cognition
4. Conclusions
Funding
Declaration of Generative AI and AI-Assisted Technologies
References
- Cangelosi, A.; Asada, M. Cognitive robotics; MIT Press, 2022. [Google Scholar]
- Pezzulo, G.; Barsalou, L. W.; Cangelosi, A.; Fischer, M. H.; McRae, K.; Spivey, M. J. Computational grounded cognition: a new alliance between grounded cognition and computational modeling. Front. Psychol. 2013, 3, 612. [Google Scholar] [CrossRef] [PubMed]
- Vernon, D.; Sandini, G. The importance of being humanoid. Int. J. Humanoid Robot. 2024, 21, 2350022. [Google Scholar]
- O’regan, J. K.; Noë, A. A sensorimotor account of vision and visual consciousness. Behav. Brain Sci. 2001, 24, 939–973. [Google Scholar] [CrossRef] [PubMed]
- Stefanini, E.; Lentini, G.; Grioli, G.; Catalano, M. G.; Bicchi, A. Exploring saliency for learning sensory-motor contingencies in loco-manipulation tasks. Robotics 2024, 13, 58. [Google Scholar]
- Wolpert, D. M.; Ghahramani, Z.; Jordan, M. I. An internal model for sensorimotor integration. Science 1995, 269, 1880–1882. [Google Scholar] [CrossRef] [PubMed]
- Kawato, M. Internal models for motor control and trajectory planning. Curr. Opin. Neurobiol. 1999, 9, 718–727. [Google Scholar] [CrossRef] [PubMed]
- Wolpert, D. M.; Diedrichsen, J.; Flanagan, J. R. Principles of sensorimotor learning. Nat. Rev. Neurosci. 2011, 12, 739–751. [Google Scholar] [CrossRef] [PubMed]
- Demiris, Y.; Khadhouri, B. Hierarchical attentive multiple models for execution and recognition of actions. Robot. Auton. Syst. 2006, 54, 361–369. [Google Scholar] [CrossRef]
- Möller, R.; Schenck, W. Bootstrapping cognition from behavior—a computerized thought experiment. Cogn. Sci. 2008, 32, 504–542. [Google Scholar] [CrossRef] [PubMed]
- Rolf, M. Goal Babbling for an Efficient Bootstrapping of Inverse Models in High Dimensions. Phd thesis, Bielefeld University, Bielefeld, Germany, 2012. [Google Scholar]
- Fedozzi, M. G.; Rea, F.; Sandini, G.; Triesch, J.; Sciutti, A. Canalizing babbling: Development-inspired goal sampling for visuo-motor learning. In 2025 IEEE International Conference on Development and Learning (ICDL); IEEE, 2025; pp. 1–6. [Google Scholar] [CrossRef]
- T. J. Prescott, K. Vogeley, A. Wykowska, Understanding the sense of self through robotics, Science robotics 9 95 (2024) eadn2733. ** The paper proposes that artificial agents can serve as experimental models to study the emergence of the sense of self by implementing embodied, predictive mechanisms. It shows that aspects of the self arise from sensorimotor interaction and layered predictive processes, linking bodily experience to higher-level cognition. Importantly, it lays a developmental line for the emergence of the self, in natural and artificial agents.
- G. Schillaci, C.-N. Ritter, V. V. Hafner, B. Lara, Body representations for robot ego-noise modelling and prediction. towards the development of a sense of agency in artificial agents, in: Proceedings of the Artificial Life Conference 2016, MIT Press One Rogers Street, Cambridge, MA 02142-1209, USA journals-info . . . , 2016, pp. 390–397. [CrossRef]
- W. Ohata, J. Tani, Characterizing the sense of agency in human–robot interaction based on the free energy principle, npj Complexity 2 (2025) 12. **Within the free-energy framework, Sense of Agency in human–robot interaction is modeled as emerging from the balance between top-down predictions and bottom-up sensory input. Prioritizing top-down processes leads to a more self-driven behavior and higher perceived agency, while prioritizing sensory input leads to more adaptive behavior and reduced agency. [CrossRef]
- Friston, K. J. A theory of cortical responses. Philos. Trans. R. Soc. B Biol. Sci. 2005, 360, 815–836. [Google Scholar] [CrossRef] [PubMed]
- Friston, K. J.; Stephan, K. Free-energy and the brain. Synthese 2007, 159, 417–458. [Google Scholar] [CrossRef] [PubMed]
- Clark, A. Whatever next? predictive brains, situated agents, and the future of cognitive science. Behav. Brain Sci. 2013, 36, 181–204. [Google Scholar] [PubMed]
- Ciria, A.; Schillaci, G.; Pezzulo, G.; Hafner, V. V.; Lara, B. Predictive processing in cognitive robotics: a review. Neural Comput. 2021, 33, 1402–1432. [Google Scholar] [CrossRef] [PubMed]
- Oliver, G.; Lanillos, P.; Cheng, G. An empirical study of active inference on a humanoid robot. IEEE Trans. Cogn. Dev. Syst. 2021, 14, 462–471. [Google Scholar] [CrossRef]
- Szadkowski, R. J.; Faigl, J. Lifelong active inference of gait control. IEEE Trans. Neural Netw. Learn. Syst. 2025, 36, 19133–19144. [Google Scholar] [CrossRef] [PubMed]
- D. Kim, H. Kanazawa, Y. Kuniyoshi, Active inference with a self-prior in the mirror-mark task, arXiv preprint arXiv:2604.09673 (2026). ** Under the framework of active inference and the free energy principle, this paper proposes the learning of a self-prior as a model of the self. This is then used to discover a mark in the self in the mirror test. The proposal is implemented on a simulated artificial agent building a body-schema that guides action planning. [CrossRef]
- Taniguchi, T.; Murata, S.; Suzuki, M.; Ognibene, D.; Lanillos, P.; Ugur, E.; Jamone, L.; Nakamura, T.; Ciria, A.; Lara, B.; et al. World models and predictive coding for cognitive and developmental robotics: frontiers and challenges. Adv. Robot. 2023, 37, 780–806. [Google Scholar] [CrossRef]
- Rayyes, R. Intrinsic motivation learning for real robot applications. Front. Robot. AI 2023, 10, 1102438. [Google Scholar] [CrossRef] [PubMed]
- Oudeyer, P.-Y.; Kaplan, F.; Hafner, V. V. Intrinsic motivation systems for autonomous mental development. IEEE Trans. Evol. Comput. 2007, 11, 265–286. [Google Scholar] [CrossRef]
- Baranes, A.; Oudeyer, P.-Y. Active learning of inverse models with intrinsically motivated goal exploration in robots. Robot. Auton. Syst. 2013, 61, 49–73. [Google Scholar] [CrossRef]
- R. Rayyes, H. Donat, J. Steil, Efficient online interest-driven exploration for developmental robots, IEEE Transactions on Cognitive and Developmental Systems 14 (2022) 1367–1377. ** Pursuing open-ended learning, the authors propose an interest measure in which the system selects goals driven by maximizing expected progress, resulting in an adaptive sampling policy that improves sample efficiency and convergence of learned inverse/forward models in high-dimensional sensorimotor spaces. [CrossRef]
- Mahajan, P.; Tang, M.; Li, T. E.; Havoutis, I.; Seymour, B. Neural associative skill memories for safer robotics and modelling human sensorimotor repertoires. Neural Comput. 2025, 1–27. [Google Scholar] [CrossRef] [PubMed]
- D. de Tinguy, T. Verbelen, E. Gamba, B. Dhoedt, Zero-shot structure learning and planning for autonomous robot navigation using active inference, ArXiv abs/2510.09574 (2025). [CrossRef]
- Priorelli, M.; Stoianov, I. P. Dynamic planning in hierarchical active inference. Neural Netw. Off. J. Int. Neural Netw. Soc. 2024, 185, 107075. [Google Scholar]
- C. Schwarke, V. Klemm, M. v. d. Boon, M. Bjelonic, M. Hutter, Curiosity-driven learning of joint locomotion and manipulation tasks, in: J. Tan, M. Toussaint, K. Darvish (Eds.), Proceedings of The 7th Conference on Robot Learning, volume 229 of Proceedings of Machine Learning Research, PMLR, 2023, pp. 2594–2610. URL: https://proceedings.mlr.press/v229/schwarke23a.html.
- Zarifis, S.; Chalkiadakis, I.; Chardouveli, A.; Moutzouri, V.; Sotirchos, A.; Papadimitriou, K.; Filntisis, P.; Efthymiou, N.; Maragos, P.; Pastra, K. Baby sophia: A developmental approach to self-exploration through self-touch and hand regard. arXiv 2025, arXiv:2511.09727. [Google Scholar] [CrossRef]
- Fu, H.; Liu, W.; Zhou, S. Intrinsic-motivation multi-robot social formation navigation with coordinated exploration. Eng. Appl. Artif. Intell. 2025, 159, 111740. [Google Scholar]
- A. Augello, S. Gaglio, I. Infantino, U. Maniscalco, G. Pilato, F. Vella, Roboception and adaptation in a cognitive robot, Robotics and Autonomous Systems 164 (2023) 104400. * The paper introduces an artificial interoceptive system that encodes internal physical states of an agent (e.g., battery state, motor load) as perceptual signals integrated into the control loop. Using reinforcement learning, the robot adapts its behavior by optimizing task performance while regulating these internal variables, enabling self-preservation and context-aware action selection. [CrossRef]
- M. Asada, A. Cangelosi, Reevaluating development and embodiment in robotics, Device 2 (2024). ** This perspective revisits development and embodiment in cognitive robotics in light of recent AI advances, proposing the “starting small” principle as a developmental alternative to big-data training paradigms and outlining key challenges for integrating foundation models with incremental, embodied, and socially grounded learning.
- Kiverstein, J.; Miller, M.; Rietveld, E. The feeling of grip: novelty, error dynamics, and the predictive brain. Synthese 2019, 196, 2847–2869. [Google Scholar]
- Joffily, M.; Coricelli, G. Emotional valence and the free-energy principle. PLoS Comput. Biol. 2013, 9, e1003094. [Google Scholar] [CrossRef] [PubMed]
- Schillaci, G.; Pico Villalpando, A.; Hafner, V. V.; Hanappe, P.; Colliaux, D.; Wintz, T. Intrinsic motivation and episodic memories for robot exploration of high-dimensional sensory spaces. Adapt. Behav. 2021, 29, 549–566. [Google Scholar]
- Hiruma, H.; Ito, H.; Mori, H.; Ogata, T. Deep active visual attention for real-time robot motion generation: Emergence of tool-body assimilation and adaptive tool-use. IEEE Robot. Autom. Lett. 2022, 7, 8550–8557. [Google Scholar]
- López, F. M.; Lenz, M.; Fedozzi, M. G.; Aubret, A.; Triesch, J. Mimo grows! simulating body and sensory development in a multimodal infant model. In 2025 IEEE International Conference on Development and Learning (ICDL); IEEE, 2025; pp. 1–6. [Google Scholar] [CrossRef]
- Y. Ma, Z. Song, Y. Zhuang, J. Hao, I. King, A survey on vision-language-action models for embodied ai, arXiv preprint arXiv:2405.14093 (2024). * This survey offers a structured taxonomy of Vision-Language-Action (VLA) models in embodied AI, highlighting their architectural components and key limitations in generalization and real-world adaptability, and serving as a central reference on the integration of foundation models into robotics. [CrossRef]
- O. M. Team; Ghosh, D.; Walke, H.; Pertsch, K.; Black, K.; Mees, O.; Dasari, S.; Hejna, J.; Kreiman, T.; Xu, C.; et al. Octo: An open-source generalist robot policy. arXiv 2024. [Google Scholar] [CrossRef]
- Kim, M. J.; Pertsch, K.; Karamcheti, S.; Xiao, T.; Balakrishna, A.; Nair, S.; Rafailov, R.; Foster, E.; Lam, G.; Sanketi, P.; et al. Openvla: An open-source vision-language-action model. arXiv 2024. [Google Scholar] [CrossRef]
- O’Neill, A.; Rehman, A.; Maddukuri, A.; Gupta, A.; Padalkar, A.; Lee, A.; Pooley, A.; Gupta, A.; Mandlekar, A.; Jain, A.; et al. Open x-embodiment: Robotic learning datasets and rt-x models: Open x-embodiment collaboration 0. In 2024 IEEE International Conference on Robotics and Automation (ICRA); IEEE, 2024; pp. 6892–6903. [Google Scholar] [CrossRef]
- K. Zhang, R. Xu, P. Ren, J. Lin, H. Wu, L. Lin, X. Liang, Robridge: A hierarchical architecture bridging cognition and execution for general robotic manipulation, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025, pp. 14590–14601. [CrossRef]
- Jin, C.; Tan, W.; Yang, J.; Liu, B.; Song, R.; Wang, L.; Fu, J. Alphablock: Embodied finetuning for vision-language reasoning in robot manipulation. arXiv 2023, arXiv:2305.18898. [Google Scholar] [CrossRef]
- Lykov, A.; Konenkov, M.; Gbagbe, K. F.; Litvinov, M.; Davletshin, D.; Fedoseev, A.; Cabrera, M. A.; Peter, R.; Tsetserukou, D. Cognitiveos: Large multimodal model based system to endow any type of robot with generative ai. In in: 2025 IEEE International Conference on Robotics and Automation (ICRA); IEEE; Volume 2025, pp. 16256–16261. [CrossRef]
- Vernon, D. The future of research in cognitive robotics: Foundation models or developmental cognitive models? Adv. Robot. Res. 2025, e202500066. [Google Scholar]
- L. Chen, S. M. Nguyen, Foundational models for robotics need to be made bio-inspired, 2025 IEEE International Conference on Advanced Robotics and its Social Impacts (ARSO) (2025) 126–133. * This work outlines a biologically inspired framework for advancing foundation models in robotics, emphasizing the integration of structured memory systems, grounded reasoning (e.g., embodied chain-of-thought), multimodal sensorimotor feedback, and self-motivated learning to support scalable, generalizable, and goal-directed robotic behavior.
- Teufel, C.; Fletcher, P. C. Forms of prediction in the nervous system. Nat. Rev. Neurosci. 2020, 21, 231–242. [Google Scholar] [CrossRef] [PubMed]
- Dominici, N.; Ivanenko, Y.; Cappellini, G.; d’Avella, A.; Mondì, V.; Cicchese, M.; Fabiano, A.; Silei, T.; Paolo, A. D.; Giannini, C.; Poppele, R.; Lacquaniti, F. Locomotor primitives in newborn babies and their development. Science 2011, 334, 997–999. [Google Scholar] [CrossRef] [PubMed]
- Kuniyoshi, Y.; Yorozu, Y.; Suzuki, S.; Sangawa, S.; Ohmura, Y.; Terada, K.; Nagakubo, A. Emergence and development of embodied cognition: A constructivist approach using robots. Prog. Brain Res. 2007, 164, 425–445. [Google Scholar] [CrossRef] [PubMed]
- Sandini, G.; Sciutti, A.; Morasso, P. Mutual human-robot understanding for a robot-enhanced society: the crucial development of shared embodied cognition. Front. Artif. Intell. 2025, 8, 1608014. [Google Scholar] [PubMed]
- Levin, M. Technological approach to mind everywhere: An experimentally-grounded framework for understanding diverse bodies and minds. Front. Syst. Neurosci. 2021, 16. [Google Scholar]
- Mordvintsev, A.; Randazzo, E.; Niklasson, E.; Levin, M. Growing neural cellular automata. Distill 2020, 5, e23. [Google Scholar] [CrossRef]
- Hansali, S.; Pio-Lopez, L.; Lapalme, J. V.; Levin, M. The role of bioelectrical patterns in regulative morphogenesis: An evolutionary simulation and validation in planarian regeneration. IEEE Trans. Mol. Biol. Multi-Scale Commun. 2025, 11, 305–331. [Google Scholar] [CrossRef]
- B. Hartl, M. Levin, L. Pio-Lopez, Neural cellular automata: Applications to biology and beyond classical ai, Physics of Life Reviews 56 (2026) 94–108. ** This review surveys applications of neural cellular automata (NCAs) to biological modeling, including morphogenesis, regeneration, aging, bioelectricity, and molecular design, and argues for their relevance as models of multiscale biological systems composed of agential materials. It highlights NCAs as a form of collective, self-organizing AI and discusses their potential beyond classical approaches, along with current challenges and limitations. [PubMed]
- K. Xu, R. Miikkulainen, Neural cellular automata for arc-agi, in: Artificial Life Conference Proceedings 37, volume 2025, MIT Press One Rogers Street, Cambridge, MA 02142-1209, USA journals-info . . . , 2025, p. 16. [CrossRef]
- P. Miotti, E. Niklasson, E. Randazzo, A. Mordvintsev, Differentiable logic cellular automata: From game of life to pattern generation, in: Artificial Life Conference Proceedings 37, volume 2025, MIT Press One Rogers Street, Cambridge, MA 02142-1209, USA journals-info . . . , 2025, p. 54. [CrossRef]
- Pajouheshgar, E.; Xu, Y.; Abbasi, A.; Mordvintsev, A.; Jakob, W.; Süsstrunk, S. Neural cellular automata: From cells to pixels. arXiv 2025, arXiv:2506.22899. [Google Scholar] [CrossRef]
- Hartl, B.; Pio-Lopez, L.; Fields, C.; Levin, M. Remapping and navigation of an embedding space via error minimization: a fundamental organizational principle of cognition in natural and artificial systems. arXiv 2026, arXiv:2601.14096. [Google Scholar] [CrossRef]
- J. Bongard, M. Levin, There’s plenty of room right here: Biological systems as evolved, overloaded, multiscale machines, Biomimetics 8 (2023) 110. ** This work argues for an observer-dependent, continuous view of biological and artificial systems, emphasizing that cognition and function emerge from multi-scale, entangled processes. It introduces the concept of “polycomputing,” where the same substrate simultaneously performs multiple computations, highlighting the tight coupling of form and function and the importance of understanding and controlling behavior across scales in both living and engineered systems. [PubMed]
- Zador, A.; Fellous, J.-M.; Sejnowski, T.; Adam, G.; Aimone, J. B.; Akwaboah, A.; Aloimonos, Y.; Alonso, C. A.; Bartolozzi, C.; Bennington, M. J.; et al. Neuroai and beyond: Bridging between advances in neuroscience and artificialintelligence. arXiv 2026, arXiv:2604.18637. [Google Scholar] [CrossRef]
- Dodig-Crnkovic, G. De-anthropomorphizing the mind: life as a cognitive spectrum in a unified framework for biological minds. Front. Syst. Neurosci. 2026, 20, 1730097. [Google Scholar] [CrossRef] [PubMed]


| Dimension | Classical CR | CR + Foundation Models | CR + Basal Cognition |
|---|---|---|---|
| Embodiment | Morphology and sensorimotor capabilities shaped by and grounded during interaction with the environment | Perceptual and motor priors that provide the basic body skills | Single to multicellular interactions across scales, substrates, and environments |
| Learning | Internal models grounded in sensorimotor interactions | Structured starting conditions that constrain the search space | Multiscale self-organization and distributed feedback-driven dynamics |
| Prediction | Sensorimotor contingencies and internal models | Embedded predictions based on pretrained goal-image associations | Self expected states and distributed, multiscale expected dynamics |
| Intrinsic motivation | Driven by learning progress, curiosity, and novelty search | Added as an auxiliary mechanism to support goal-driven associations | Emerges from viability and self-maintenance |
| Cognition | Rooted in sensorimotor interactions with the environment, driving adaptive behavior | Perceptual and motor priors that scaffold sensorimotor grounding | Emergent property of fundamental regulatory and adaptive dynamics across scales |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).