Submitted:
05 April 2026
Posted:
06 April 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. The Baldwin Effect
1.2. The Structure of Behaviour
1.3. Aims and Premises
2. The Model
2.1. Heterogeneous Steps and Greedy Assimilation
- Start with no innate steps ()
- Compute for each step
- If all , stop (no further assimilation is favoured)
- Otherwise, fix the next step with the greatest positive (assimilate it), updating
- Repeat step 2 until no steps remain beneficial or all are innate
2.2. Learning and Instincts as Substitutes and Complements
2.3. Compositional Structure and Differential Genetic Assimilation
3. Discussion
3.1. Evidence and Implications
3.1.1. Hypercycle Generation: Recombination and Behavioural Spillover
3.1.2. Genetic Specificity in Neural Wiring
3.1.3. Hierarchical Composition of Control
3.2. Predictions
3.2.1. Learning as Scaffold, not a Substitute
3.2.2. The Re-Emergence of Instinct at Large Brain Size
- A U-shaped or mixed relationship between brain size and proportion of instinctive behaviour.
- Large-brained taxa (e.g., corvids, parrots, cetaceans, primates, cephalopods) should exhibit both high learning ability and extensive suites of species-typical routines.
- These routines should appear modular and recombinable rather than globally rigid.
- Comparative neuroanatomy should show increased structural modularity and specialized circuits rather than uniform “general intelligence” scaling.
3.2.3. Bottleneck-First Assimilation
3.2.3. Ontogeny Partially Recapitulates Assimilation Order
3.2.3. Combinatory Exaptation
3.3. Limitations and Future Directions
3.3.1. The "Automaticity" Force Multiplier
3.3.2. Extended Phenotypes
4. Summary and Conclusions
Appendix A. The Baldwin Buffer Against Error Catastrophe
Appendix B. The Rotten Kid as a Quasi-Species
References
- Aiello, L. C.; Wheeler, P. The expensive-tissue hypothesis: the brain and the digestive system in human and primate evolution. Current anthropology 1995, 36(2), 199–221. [Google Scholar] [CrossRef]
- Amsel, A. Frustration Theory; Cambridge University Press; Cambridge, 1992. [Google Scholar]
- Amsel, A. Précis of Frustration theory: An analysis of dispositional learning and memory. Psychonomic Bulletin & Review 1994, 1, 280–296. [Google Scholar] [CrossRef]
- Azim, E.; Jiang, J.; Alstermark, B.; Jessell, T. M. Skilled reaching relies on a V2a propriospinal internal copy circuit. Nature 2014, 508(7496), 357–363. [Google Scholar] [CrossRef]
- Balleine, B. W.; Dickinson, A. Goal-directed instrumental action: contingency and incentive learning and their cortical substrates. Neuropharmacology 1998, 37(4-5), 407–419. [Google Scholar] [CrossRef]
- Balleine, B. W.; O'doherty, J. P. Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology 2010, 35(1), 48–69. [Google Scholar] [CrossRef] [PubMed]
- Barabási, D. L.; Ferreira Castro, A.; Engert, F. Three systems of circuit formation: assembly, updating and tuning. Nature Reviews Neuroscience 2025, 1–12. [Google Scholar] [CrossRef] [PubMed]
- Bavelier, D.; Levi, D. M.; Li, R. W.; Dan, Y.; Hensch, T. K. Removing brakes on adult brain plasticity: from molecular to behavioral interventions. Journal of Neuroscience 2010, 30(45), 14964–14971. [Google Scholar] [CrossRef]
- Becker, G. S. A theory of social interactions. Journal of political economy 1974, 82(6), 1063–1093. [Google Scholar] [CrossRef]
- Bitterman, M.E.; Menzel, R.; Fietz, A.; Schader, S. Classical conditioning of proboscis-extension in honey bees. J. Comp. Physiol. Psychol. 1983, 97, 107–119. [Google Scholar] [CrossRef]
- Brooks, R. A. Intelligence without representation. Artificial intelligence 1991, 47(1-3), 139–159. [Google Scholar] [CrossRef]
- Changizi, M. A. Relationship between number of muscles, behavioral repertoire size, and encephalization in mammals. Journal of Theoretical Biology 2003, 220(2), 157–168. [Google Scholar] [CrossRef]
- Chen, W. V.; Maniatis, T. Clustered protocadherins. Development 2013, 140(16), 3297–3302. [Google Scholar] [CrossRef] [PubMed]
- Chowdhury, M. A. R.; An, J.; Jeong, S. The pleiotropic face of CREB family transcription factors. Molecules and cells 2023, 46(7), 399–413. [Google Scholar] [CrossRef]
- Corbit, L. H.; Balleine, B. W. Double dissociation of basolateral and central amygdala lesions on the general and outcome-specific forms of pavlovian-instrumental transfer. Journal of Neuroscience 2005, 25(4), 962–970. [Google Scholar] [CrossRef]
- Dawkins, R. The Extended Phenotype; Oxford; W. H. Freeman, 1982. [Google Scholar]
- De Boer, S. F.; Koolhaas, J. M. Defensive burying in rodents: ethology, neurobiology and psychopharmacology. European journal of pharmacology 2003, 463(1-3), 145–161. [Google Scholar] [CrossRef]
- Deacon, T. W. The symbolic species: The co-evolution of language and the brain; WW Norton & Company, 1998. [Google Scholar]
- Dennett, D. C. Elbow Room; M. I. T. Press, 1984. [Google Scholar]
- Dennett, D.C. Darwin’s Dangerous Idea: Evolution and the Meaning of Life; New York; Simon and Schuster, 1995. [Google Scholar]
- Dezfouli, A.; Balleine, B. W. Habits, action sequences and reinforcement learning. European Journal of Neuroscience 2012, 35(7), 1036–1051. [Google Scholar] [CrossRef] [PubMed]
- Dezfouli, A.; Balleine, B. W. Actions, action sequences and habits: evidence that goal-directed and habitual action control are hierarchically organized. PLoS computational biology 2013, 9(12), e1003364. [Google Scholar] [CrossRef]
- Dong, H.; Wu, L.; Wang, Z.; Qu, H.; Xie, J.; Liu, Y.; Jin, Y. The modern expansion of Dscam1 isoform diversity in Drosophila is linked to fitness and immunity. PLoS Biology 2025, 23(9), e3003383. [Google Scholar] [CrossRef] [PubMed]
- Dutrow, E. V.; Serpell, J. A.; Ostrander, E. A. Domestic dog lineages reveal genetic drivers of behavioral diversification. Cell 2022, 185(25), 4737–4755. [Google Scholar] [CrossRef]
- Eaton, R.L. The predatory sequence, with emphasis on killing behavior and its ontogeny, in the cheetah (Acinonyx jubatus Schreber). Zeitschrift für Tierpsychologie 1968, 27, 492–504. [Google Scholar] [CrossRef]
- Eigen, M.; Schuster, P. The hypercycle. A principle of natural self-organization. Part A: Emergence of the hypercycle. Naturwissenschaften 1977, 64, 541–565. [Google Scholar] [CrossRef]
- Eigen, M.; Schuster, P. The hypercycle: A principle of natural self-organisation part B: the abstract hypercycle. Naturwissenschaften 65 1978, 7–41. [Google Scholar] [CrossRef]
- Epstein, E. R. The spontaneous interconnection of four repertoires of behaviour in a pigeon (Columba livia). Journal of Comparative Psychology 1987, 2, 197–201. [Google Scholar] [CrossRef]
- Flash, T.; Hochner, B. Motor primitives in vertebrates and invertebrates. Current opinion in neurobiology 2005, 15(6), 660–666. [Google Scholar] [CrossRef]
- Gahnstrom, C. J.; Spiers, H. J. Striatal and hippocampal contributions to flexible navigation in rats and humans. Brain and Neuroscience Advances 2020, 4, 2398212820979772. [Google Scholar] [CrossRef]
- Giszter, S. F. Motor primitives—new data and future questions. Current opinion in neurobiology 2015, 33, 156–165. [Google Scholar] [CrossRef]
- Gould, S. J. The Hedgehog, The Fox, and the Magister's Pox; Three Rivers Press, 2003. [Google Scholar]
- Gould, J. L.; Gould, C.G. Animal Architects: Building and the Evolution of Intelligence; Basic books, 2007. [Google Scholar]
- Gray, J.A.; McNaughton, N. The Neuropsychology of Anxiety, 2nd ed.; Oxford University Press; Oxford, 2000. [Google Scholar]
- Graybiel, A. M. Habits, rituals, and the evaluative brain. Annu. Rev. Neurosci. 2008, 31(1), 359–387. [Google Scholar] [CrossRef]
- Haig, D. Genomic Imprinting and Kinship; Rutgers University Press; New Brunswick, 2001. [Google Scholar]
- Hazlett, B.A. Conditioned reinforcement in the crayfish Ocronectes rusticus. Behaviour 2007, 144, 847–859. [Google Scholar] [CrossRef]
- Hinton, G.E.; Nowlan, S.J. How learning can guide evolution. Complex Systems 1 1987, 495–502. [Google Scholar]
- Hernández, D. G.; Rivera, C.; Cande, J.; Zhou, B.; Stern, D. L.; Berman, G. J. A framework for studying behavioral evolution by reconstructing ancestral repertoires. Elife 2021, 10, e61806. [Google Scholar] [CrossRef]
- Hull, C. Principles of Behavior; New York; Appleton-Century-Crofts, 1943. [Google Scholar]
- Jeong, H.; Ostrander, E. A.; Kim, J. Genomic evidence for behavioral adaptation of herding dogs. Science Advances 2025, 11(18), eadp4591. [Google Scholar] [CrossRef]
- Jerison, H. Evolution of the brain and intelligence; Elsevier, 2012. [Google Scholar]
- Jerome, J.; Frantino, E.P.; Sturmey, P. Backward chaining on the acquisition of internet skills in adults with developmental disabilities. Journal of Applied Behavior Analysis 2007, 40, 185–189. [Google Scholar] [CrossRef]
- Jin, Y.; Li, H. Revisiting Dscam diversity: lessons from clustered protocadherins. Cellular and Molecular Life Sciences 2019, 76(4), 667–680. [Google Scholar] [CrossRef]
- Kiehn, O. Decoding the organization of spinal circuits that control locomotion. Nature Reviews Neuroscience 2016, 17(4), 224–238. [Google Scholar] [CrossRef] [PubMed]
- Kremer, M. The O-ring theory of economic development. The quarterly journal of economics 1993, 108(3), 551–575. [Google Scholar] [CrossRef]
- Lubow, R.E. Latent inhibition and conditioned attention theory; Cambridge; Cambridge University Press, 1989. [Google Scholar]
- MacPhail, E.M. Brain and Intelligence in Vertebrates; Clarendon Press, 1982. [Google Scholar]
- Mattfeld, A. T.; Stark, C. E. Functional contributions and interactions between the human hippocampus and subregions of the striatum during arbitrary associative learning and memory. Hippocampus 2015, 25(8), 900–911. [Google Scholar] [CrossRef] [PubMed]
- Maye, A.; Hsieh, C; Sugihara, G.; Brembs, B. Order in spontaneous behavior. PLoS one 2007, 2, e443. [Google Scholar] [CrossRef]
- Maynard Smith, J. Natural selection: when learning guides evolution. Nature 1987, 329, 761–762. [Google Scholar] [CrossRef]
- McConnell, P. B. Acoustic structure and receiver response in domestic dogs, Canis familiaris. Animal Behaviour 1990, 39(5), 897–904. [Google Scholar] [CrossRef]
- McDowell, K. A.; Hutchinson, A. N.; Wong-Goodrich, S. J.; Presby, M. M.; Su, D.; Rodriguiz, R. M.; West, A. E. Reduced cortical BDNF expression and aberrant memory in Carf knock-out mice. Journal of Neuroscience 2010, 30(22), 7453–7465. [Google Scholar] [CrossRef]
- McNaughton, N. Biology and emotion; Cambridge University Press; Cambridge, 1989. [Google Scholar]
- Ostlund, S. B.; Winterbauer, N. E.; Balleine, B. W. Evidence of action sequence chunking in goal-directed instrumental conditioning and its dependence on the dorsomedial prefrontal cortex. Journal of Neuroscience 2009, 29(25), 8280–8287. [Google Scholar] [CrossRef]
- Owings, D. H.; Coss, R. G. Snake mobbing by California ground squirrels: adaptive variation and ontogeny. Behaviour 1977, 50–69. [Google Scholar] [CrossRef]
- Pfeifer, R.; Bongard, J. How the body shapes the way we think: a new view of intelligence; MIT press, 2006. [Google Scholar]
- Pinker, S. The Blank Slate: The Modern Denial of Human Nature; London; Penguin, 2002. [Google Scholar]
- Pinker, S.; Jackendoff, R. The faculty of language: what's special about it? Cognition 2005, 95(2), 201–236. [Google Scholar] [CrossRef]
- Poldrack, R. A.; Sabb, F. W.; Foerde, K.; Tom, S. M.; Asarnow, R. F.; Bookheimer, S. Y.; Knowlton, B. J. The neural correlates of motor skill automaticity. Journal of Neuroscience 2005, 25(22), 5356–5364. [Google Scholar] [CrossRef]
- Prescott, T. J.; Redgrave, P.; Gurney, K. Layered control architectures in robots and vertebrates. Adaptive Behavior 1999, 7(1), 99–127. [Google Scholar] [CrossRef]
- Punzo, F. The effect of encounter experience on hunting behavior in the spider wasp, Pepsis cerberus Lucas (Hymenoptera; Pompilidae). Texas Journal of Science 2005, 57, 165–174. [Google Scholar]
- Reznikova, Z.; Panteleeva, S.; Danzanov, Z. A new method for evaluating the complexity of animal behavioral patterns based on the notion of Kolmogorov complexity, with ants’ hunting behavior as an example. Neurocomputing 2012, 84, 58–64. [Google Scholar] [CrossRef]
- Roy, A. D. Some thoughts on the distribution of earnings. Oxford Economic Papers 1951, 3, 135–156. [Google Scholar] [CrossRef]
- Roy, D. Myths about memes. Journal of Bioeconomics 2017, 19(3), 281–305. [Google Scholar] [CrossRef]
- Rundus, A. S.; Owings, D. H.; Joshi, S. S.; Chinn, E.; Giannini, N. Ground squirrels use an infrared signal to deter rattlesnake predation. Proceedings of the National Academy of Sciences 2007, 104(36), 14372–14376. [Google Scholar] [CrossRef]
- Sahley, C.; Rudy, J.W.; Gelperin, A. An analysis of associative learning in a terrestrial mollusc. I. Higher-order conditioning, blocking, and a transient US pre-exposure effect. J. Comp. Physiol. 1981, 144, 1–8. [Google Scholar] [CrossRef]
- Sakamoto, K.; Karelina, K.; Obrietan, K. CREB: a multifaceted regulator of neuronal plasticity and protection. Journal of neurochemistry 2011, 116(1), 1–9. [Google Scholar] [CrossRef] [PubMed]
- Shen, W.; Liu, H. H.; Schiapparelli, L.; McClatchy, D.; He, H. Y.; Yates, J. R.; Cline, H. T. Acute synthesis of CPEB is required for plasticity of visual avoidance behavior in Xenopus. Cell reports 2014, 6(4), 737–747. [Google Scholar] [CrossRef]
- Shine, J. M.; Shine, R. Delegation to automaticity: the driving force for cognitive evolution? Frontiers in Neuroscience 2014, 8, 90. [Google Scholar] [CrossRef]
- Skinner, B.F. The Behaviour of Organisms.; New York; Appleton-Century-Crofts, 1938. [Google Scholar]
- Sperry, R. W. Chemoaffinity in the orderly growth of nerve fiber patterns and connections. Proceedings of the National Academy of Sciences 1963, 50(4), 703–710. [Google Scholar] [CrossRef] [PubMed]
- Stagkourakis, S.; Spigolon, G.; Marks, M.; Feyder, M.; Kim, J.; Perona, P.; Anderson, D. J. Anatomically distributed neural representations of instincts in the hypothalamus; bioRxiv, 2023. [Google Scholar]
- Stephens, D. W. Change, regularity, and value in the evolution of animal learning. Behavioral Ecology 1991, 2(1), 77–89. [Google Scholar] [CrossRef]
- Stephenson-Jones, M.; Samuelsson, E.; Ericsson, J.; Robertson, B.; Grillner, S. Evolutionary conservation of the basal ganglia as a common vertebrate mechanism for action selection. Current biology 2011, 21(13), 1081–1091. [Google Scholar] [CrossRef]
- Styfhals, R.; Seuntjens, E.; Simakov, O.; Sanges, R.; Fiorito, G. In silico identification and expression of protocadherin gene family in Octopus vulgaris. Frontiers in physiology 2019, 9, 1905. [Google Scholar] [CrossRef]
- Sumbre, G.; Gutfreund, Y.; Fiorito, G.; Flash, T.; Hochner, B. Control of octopus arm extension by a peripheral motor program. Science 2001, 293(5536), 1845–1848. [Google Scholar] [CrossRef]
- Thorpe, T.W. Learning and Instinct in Animals; Harvard University Press; Cambridge, 1956. [Google Scholar]
- Tinbergen, N. The Study of Instinct.; London; Oxford University Press, 1951. [Google Scholar]
- Van Iersel, J. J. A.; Tinbergen, N. Displacement Reactions" in the Three-Spined Stickleback. Behaviour 1948, 1(1), 56–63. [Google Scholar] [CrossRef]
- Wagner, G. P.; Altenberg, L. Perspective: complex adaptations and the evolution of evolvability. Evolution 1996, 50(3), 967–976. [Google Scholar] [CrossRef] [PubMed]
- Weir, A. A.; Chappell, J.; Kacelnik, A. Shaping of hooks in New Caledonian crows. Science 2002, 297(5583), 981–981. [Google Scholar] [CrossRef] [PubMed]
- Werkhoven, Z.; Bravin, A.; Skutt-Kakaria, K.; Reimers, P.; Pallares, L. F.; Ayroles, J.; De Bivort, B. L. The structure of behavioral variation within a genotype. Elife 2021, 10, e64988. [Google Scholar] [CrossRef]
- Winterbauer, N. E.; Bouton, M. E. Mechanisms of resurgence of an extinguished instrumental behavior. Journal of Experimental Psychology: Animal Behavior Processes 2010, 36(3), 343. [Google Scholar] [CrossRef]
- Wu, Q.; Maniatis, T. A striking organization of a large family of human neural cadherin-like cell adhesion genes. Cell 1999, 97(6), 779–790. [Google Scholar] [CrossRef]
- Yang, Q.; Logan, D.; Giszter, S. F. Motor primitives are determined in early development and are then robustly conserved into adulthood. Proceedings of the National Academy of Sciences 2019, 116(24), 12025–12034. [Google Scholar] [CrossRef]
- Yin, H. H.; Knowlton, B. J. The role of the basal ganglia in habit formation. Nature reviews neuroscience 2006, 7(6), 464–476. [Google Scholar] [CrossRef] [PubMed]
- Yue, Y.; Meng, Y.; Ma, H.; Hou, S.; Cao, G.; Hong, W.; Jin, Y. A large family of Dscam genes with tandemly arrayed 5′ cassettes in Chelicerata. Nature communications 2016, 7(1), 11252. [Google Scholar] [CrossRef]




| 1 | However, note that some sorts of stimuli important for this sort of learning might even be internal feedback signals produced by the very performance of that behaviour--the sort of “fractional anticipatory goal-responses” of Hullian terminology (Amsel, 1994). For instance, even perception of one’s skeletal responses can feedback as stimuli into conditioned associations (McNaughton, 1989). |
| 2 | We have used the ∩ symbol here because it captures the requirement that some aspects of the effects caused by each behaviour in the chain must overlap with the perceptual stimuli necessary to stimulate the next. |
| 3 | For tractability we assume that learning and encoding costs are additive across steps and that learning success probabilities are independent. This deliberately removes interference among components. Such effects would primarily rescale marginal costs or benefits and therefore shift quantitative thresholds without altering the qualitative regimes identified below. If anything, realistic interference in learning would further favour genetic assimilation, making our results conservative. |
| 4 | For simplicity we treat learning difficulty (qk) and genetic encoding costs (gk) as independent across steps. Correlation between these quantities would primarily rescale marginal gains ΔWk and thus alter the ordering or speed of assimilation rather than the qualitative dynamics. Positive correlation would dampen assimilation by offsetting the benefits of hard-to-learn steps with higher encoding costs, whereas negative correlation would accelerate the snowballing pattern. The greedy, path-dependent structure of the process remains unchanged. |
| 5 | Greedy fixation need not be globally optimal when there are strong superadditive complementarities among particular pairs of steps; however, natural selection’s local, gradient-climbing nature makes the greedy approximation biologically apt in many contexts. |
| 6 | Note that this formulation treats learning probabilities qi as fixed. A realistic extension would incorporate that genetic assimilation of some steps frees cognitive capacity (attention, memory, practice time), potentially increasing qj for remaining learned steps — a form of learning-instinct complementarity via automaticity. This positive feedback would accelerate subsequent assimilations and potentially alter optimal ordering, making hard-to-learn steps even more urgent early targets. |
| 7 | Note that this compositional model collapses to the sequential model in Section 2.1, if L = 1 (no compositional links or higher levels), or if qℓ = q and cℓ = c for all ℓ, or if no reuse advantage exists (𝛼 = 0). In those cases, the distinction between primitives and links is unnecessary, and assimilation proceeds on the entire behavioural chain as a single unit. |
| 8 | Such as an innate 'grammar' of nest building, where the overarching structure is hard-wired but the specific 'fill-ins' (materials used) are learned according to local availability. |
| 9 | The "Octopus Principle" of distributed control (innate primitives governed by learned links) described in our model enjoys a sort of "existence proof" in modern robotics. Early AI attempted to solve movement through "General Intelligence" (brute-force calculation of every joint angle), which proved too slow and brittle, while contemporary robotics has moved toward Subsumption Architecture and Morphological Computation, where “intelligence” is offloaded to the physical design of the limb (innate primitives) while the central processor focuses on high-level sequencing (learned links) (Brooks, 1991). The morphological computation principle is an especially good example of an “innate primitive”, in that the “intelligence” is built into the physical mechanics of the robotic limb so that it “knows” how to move automatically (Pfeifer & Bongard, 2006). This is very much the same sort of decomposition into primitives and linking control that our model formalizes. That engineers are converging on the same hierarchical solution as octopuses and primates suggests that our model describes a universal logic applicable to both carbon and silicon-based systems. |
| 10 | Supportive evidence for this “income effect” can be found already for mammals (see Changizi, 2003). |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).