Submitted:
25 November 2025
Posted:
26 November 2025
You are already at the latest version
Abstract
Keywords:
Introduction
- Combinatorial paradox: the probability of spontaneous formation of long, functional RNA or protein sequences is vanishingly small.
- Chemical instability: nucleotides and their polymers are highly susceptible to hydrolysis under prebiotic conditions (Hernández & Piccirilli, 2013; Szilágyi et al., 2019).
- Coordination paradox: the translation system in modern biology requires a pre-existing code, but such a code could not have arisen without the system itself.
2. Nucleotides as an Archive
- Stabilization: nucleotides persisted longer than they would have in isolation, enabling accumulation in sufficient concentrations.
- Encoding: the spatial arrangement of nucleotides on peptide surfaces created a primitive mapping between peptide conformations and nucleotide sequences.
- Function first (peptides)
- Archive second (nucleotides)
- Stabilization throughout (membranes)
-
Coevolutionary Dipeptide Model (Zhou et al., 2025)This model reconstructs early protein motifs and correlates them with codon assignments, suggesting a coevolutionary trajectory between dipeptides and the genetic code. However, it does not propose a chemical mechanism for nucleotide fixation or symbolic transition.
-
Phylogenetic Emergence of Amino Acids (Miller et al., 2024)Based on statistical analysis of ancient proteins, this model outlines the temporal order of amino acid incorporation. It supports the early appearance of thermodynamically stable residues but lacks any framework for peptide–nucleotide interaction or coding logic.
-
Flipon-Based Structural Dynamics (Chistyakov et al., 2024)This model explores noncanonical DNA structures such as Z-DNA and G-quadruplexes, proposing that flipon dynamics influence coding regions. While structurally innovative, it does not address the peptide-first scenario or the emergence of symbolic representation.
| Criterion | RNA World (orthodoxy) | Peptide Protocode (hypothesis) |
| Primary carrier | RNA — information storage and catalysis | Short peptides — functional structures via conformational selection |
| Problem of chance | Requires random emergence of long sequences | Function arises immediately: stable peptides are naturally retained |
| Role of nucleotides | Primary molecules, source of the code | Secondary molecules, “archive” and replicators of successful forms |
| Origin of the code | Code → proteins | Proteins → code |
| Weak point | Combinatorial paradox, RNA instability | Limited direct experimental confirmation to date |
| Philosophical conclusion | Life as a product of random selection | Life as a manifestation of intrinsic molecular order |
Discussion
- Combinatorial challenge: mitigated because short peptides can exhibit catalytic activity without requiring long, improbable sequences.
- Chemical fragility of nucleotides: alleviated by their stabilization through peptide associations.
- Coordination problem: reframed, as the genetic code did not appear fully formed but crystallized gradually from peptide–nucleotide interactions.
- Lipid membranes: Amphiphilic molecules spontaneously assemble into bilayers due to the hydrophobic effect, creating compartments essential for protocell stability (Lombard et al., 2012).
- Biomolecular condensates: Proteins and RNAs undergo liquid–liquid phase separation, forming dynamic droplets without membranes. These condensates exemplify how phase transitions generate functional order in living cells (Saha & Galic, 2018).
- Oscillatory reactions: The Belousov–Zhabotinsky reaction demonstrates how simple chemical systems can self-organize into spatiotemporal patterns, providing a model for rhythmic processes in metabolism (De la Fuente et al., 2021).
- Protein and nucleic acid folding: Secondary structures such as α-helices and β-sheets emerge from local energetic constraints, showing that stable motifs are natural attractors in conformational space (Karimi, 2018). Protein folding has also been described as an autowave process of self-organization in active media (Sidorova et al., 2019).
- Relic motifs: conserved short peptide sequences in modern enzymes may represent vestiges of pre-coding functional motifs. Comparative structural biology could identify such relics.
- Peptide–nucleotide interactions: laboratory experiments can test whether peptides stabilize specific nucleotides and promote their ordered assembly.
- Hybrid systems: synthetic biology can explore peptide–nucleotide co-assemblies to determine whether emergent properties arise that are absent in isolated systems.
- Computational modeling: molecular dynamics simulations can probe the conformational landscapes of short peptides under prebiotic conditions, revealing whether stable motifs emerge preferentially.
Purpose
Definitions
- ℓ— protein length (number of amino acids).
- m— block size (amino acids per motif).
- k=ℓ/m— number of blocks.
- r— catalog size (motifs per block).
- v— generation rate (per year).
- P— number of parallel matrices.
- u— reuse factor.
- a— attrition fraction (here a=0.5).
- s=1−a— survival fraction per block (here s=0.5).
Core Formulas
Parameters
- r = 50, m = 45
- ν·P·u = year−1
- a = 0.5 ⇒ s = 0.5
- For blind search, v = 10^30 year−1 (upper-bound enumeration rate).
- H4: ℓ=102 ⇒ k≈2.27
- H3: ℓ=135 ⇒ k=3.00
Results
- Blind search:
- H4: ,
- H3: ,
- Compressed (no attrition):
- H4: years
- H3: years
- Attrition penalty:
- H4: ≈ 9.7
- H3: = 16
- Corrected times:
- H4: T_eff,corr ≈ 6.8 years
- H3: T_eff,corr ≈ 202 years
| Protein | ℓ | N_blind | T_blind (years) | r, m, k | N_eff | T_eff (years) | Attrition penalty | T_eff,corr (years) |
| H4 | 102 | >> age of universe | r=50, m=45, k≈2.27 | ~0.7 | ~9.7 | ~6.8 | ||
| H3 | 135 | >> age of universe | r=50, m=45, k=3.00 | ~12.6 | 16 | ~202 |
Conclusion and Implications
Future Directions
-
Short peptide functionalityBuilding on prior demonstrations of catalytic activity in minimal peptides, systematic studies of di-, tri-, and tetrapeptides under simulated prebiotic conditions could clarify how widespread such properties are and how they scale with sequence length.
-
Peptide–nucleotide interactionsExisting evidence of peptide–nucleotide binding can be expanded by targeted experiments that test whether peptides stabilize nucleotides, promote ordered assembly, or bias polymerization. These studies would refine the proposed archive function rather than merely establish it.
-
Compartmentalization studiesResearch on lipid vesicles and microvesicles already shows their ability to encapsulate biomolecules. Future work should focus on whether peptide–nucleotide complexes are preferentially retained and stabilized inside such compartments, quantifying the protective effect against environmental stochasticity.
-
Computational modelingMolecular dynamics and statistical models can complement experimental data by mapping conformational landscapes of short peptides and exploring peptide–nucleotide affinities. Modeling should be guided by motifs already observed in laboratory systems.
-
Relic motifs in modern biologyComparative structural biology may reveal conserved short peptide motifs embedded in contemporary enzymes. Identifying such relics would provide indirect evidence for a peptide-first stage, extending current bioinformatic surveys.
-
Astrobiological implicationsGiven existing demonstrations of peptide and vesicle chemistry under diverse conditions, future missions could incorporate assays designed to detect peptide-driven prebiotic chemistry in environments rich in amino acids and capable of compartmentalization.
References
- Bar-Peled, L.; Kory, N. Principles and functions of metabolic compartmentalization. Nature Metabolism 2022, 4(5), 635–648. [Google Scholar] [CrossRef]
- Chen, I. A.; Walde, P. From self-assembled vesicles to protocells. Cold Spring Harbor Perspectives in Biology 2010, 2(7), a002170. [Google Scholar] [CrossRef]
- Chistyakov, A. V.; et al. Flipon dynamics and molecular tinkers in genetic code evolution. Biology Letters 2024, 20(12), 20240635. [Google Scholar] [CrossRef]
- Dalal, A.; Mansy, S. S. Origins of life: the molecules that could have unlocked peptide synthesis. Nature 2025, 626, 45–47. [Google Scholar] [CrossRef]
- De la Fuente, I. M.; Martínez, L.; Carrasco-Pujante, J.; Fedetz, M.; López, J. I. Self-organization and information processing: From basic enzymatic activities to complex adaptive cellular behavior. Frontiers in Genetics 2021, 12, 644615. [Google Scholar] [CrossRef] [PubMed]
- Dill, K. A.; Riek, R.; Greenwald, J.; Fried, S. D. Origins of life: First came evolutionary dynamics. Proceedings of the National Academy of Sciences USA 2023, 120(12), e2301234. [Google Scholar]
- Erastova, V.; Degiacomi, M. T.; Fraser, D. G.; Greenwell, H. C.; Sutherland, J. D. Mineral surface chemistry control for origin of prebiotic peptides. Nature Communications 2017, 8, 2033. [Google Scholar] [CrossRef]
- Frenkel-Pinter, M.; Fisher, S.; Ezerzer, Y. Why nature selected today’s proteins: assembly-driven selection of α-amino acids. Proceedings of the National Academy of Sciences USA 2025, 122(9), e2509876. [Google Scholar]
- Hargreaves, W. R.; Deamer, D. W. Lipids as a source of membranes in prebiotic chemical evolution. Nature 1978, 271(5645), 31–35. [Google Scholar] [CrossRef]
- Hernández, A. R.; Piccirilli, J. S. Chemical origins of life: Prebiotic RNA unstuck. Nature Chemistry 2013, 5(5), 375–376. [Google Scholar] [CrossRef]
- Karimi, T. Molecular mechanism of self-organization in biological systems. In Molecular Mechanisms of Autonomy in Biological Systems; Springer, 2018; pp. 51–70. [Google Scholar]
- Lombard, J.; López-García, P.; Moreira, D. The early evolution of lipid membranes and the three domains of life. Nature Reviews Microbiology 2012, 10(7), 507–515. [Google Scholar] [CrossRef]
- Miller, K. J.; et al. Phylogenetic reconstruction of amino acid emergence in early proteins. PNAS 2024, 121(42), e2410311121. [Google Scholar] [CrossRef]
- Nanda, J.; et al. Emergence of native peptide sequences in prebiotic replication networks. Nature Communications 2017, 8, 434. [Google Scholar] [CrossRef]
- Prosdocimi, F. Origin of life: Drawing the big picture. Progress in Biophysics & Molecular Biology 2023, 183, 1–15. [Google Scholar]
- Royal Society Editorial Board. Amino acid analogues provide multiple plausible pathways to prebiotic peptides. Journal of the Royal Society Interface 2025, 22(195), 20250234. [Google Scholar]
- Saha, T.; Galic, M. Self-organization across scales: from molecules to organisms. Philosophical Transactions of the Royal Society B 2018, 373(1747), 20170113. [Google Scholar] [CrossRef] [PubMed]
- Sidorova, A. E.; Malyshko, E. V.; Kotov, A. R.; Levashova, N. T.; Ustinin, M. N.; Tverdislov, V. A. Protein folding as an autowave process of self-organization in active media. Bulletin of the Russian Academy of Sciences: Physics 2019, 83, 85–90. [Google Scholar] [CrossRef]
- Singh, J.; Thoma, B.; Whitaker, D.; Webley, M. S.; Yao, Y.; Powner, M. V. Thioester-mediated aminoacylation of RNA and peptidyl-RNA synthesis in water. Nature 2025, 644, 933–944. [Google Scholar] [CrossRef]
- Szilágyi, A.; Kun, Á.; Szathmáry, E. Dynamics and stability in prebiotic information integration: an RNA World model from first principles. Scientific Reports 2019, 9, 6942. [Google Scholar] [CrossRef]
- Zhang, L.; Zhang, M.; Guo, X.; Gan, D.; Ye, Y.; Zhao, Y.; Ying, J. A model for N-to-C direction in prebiotic peptide synthesis. Chemical Communications 2024, 60(20), 1234–1238. [Google Scholar] [CrossRef]
- Zhou, Y.; et al. Coevolution of early protein motifs and the genetic code. Journal of Molecular Biology 2025, 437(15), 169396. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).