A. Prebiotic Replication and Transcription of DNA
In all living organisms, enzyme-driven, template-directed DNA synthesis proceeds exclusively by 3′ extension of an existing primer through the addition of one nucleotide at a time, whereas 5′ extension is catalyzed by an entirely different enzyme, DNA ligase. In both reactions, formation of the 3′–5′ phosphodiester linkage is driven by the high-energy phosphate stored at the 5′ end of each incoming nucleotide or at the activated 5′ end of an existing nucleotide strand.
In the ancient prebiotic era, however, activation of the 5′ phosphate termini of both existing sequences and incoming nucleotides is believed to have been mediated by chemical activating agents such as N-cyanoimidazole [
7,
8]. Under such conditions, DNA synthesis is expected to proceed in both the 5′ and 3′ directions, resulting invariably in the completion of double-stranded products. This contrasts sharply with the unidirectional 3′ extension characteristic of modern enzyme-driven DNA synthesis. This inferred bidirectionality is particularly important for abiotic replication of DNA and RNA, as discussed below. When ribonucleotides rather than deoxyribonucleotides are supplied, transcription occurs instead of complementary DNA strand synthesis.
Figure 2 illustrates the inferred replication and transcription pathways of a simple single-stranded DNA and of an inverted-repeat-containing single-stranded DNA. In the latter case, the molecule is first converted into a stem–loop structure through gap-filling DNA synthesis. A short complementary DNA fragment, together with a phosphate-activating chemical, then binds to the single-stranded loop region and primes bidirectional DNA synthesis. Elongation proceeds through consecutive ligation of nucleotide blocks 3–4 nucleotides in length [
8], stabilized by π-stacking interactions characteristic of nucleotide oligomers of this size. The resultant products are double-stranded linear DNA and double-stranded stem–loop DNA.
Double-stranded linear DNA must subsequently undergo heat-induced denaturation to separate its strands, whereas double-stranded stem–loop DNA spontaneously separates into two mutually complementary single-stranded stem–loop DNAs at only slightly elevated temperatures. Repetition of this cycle leads to proliferation of DNA molecules.
In ribonucleotide-rich environments, the products formed are double-stranded DNA–RNA hybrids. Upon heating, however, the linear hybrids lose the nascent RNA strand, whereas stem–loop hybrids similarly dissociate into complementary stem–loop RNA and DNA strands without requiring heat treatment. Thus, linear DNA can replicate in environments where heat is intermittently available, such as volcanic ocean floors or hot springs. In contrast, RNA transcripts derived from linear DNA templates—unlike those from stem–loop DNA—cannot be recovered without sustaining strand breaks under such conditions. Consequently, while linear DNA can be replicated through repeated cycles, it cannot be transcribed without damage in intermittently hot environments.
The next adaptive measure taken by problem-solving entities involves heterogeneity in AT content along synthesized double-stranded DNA molecules. Certain regions possess higher AT content and therefore melt more readily than GC-rich regions in calm environments (
Figure 3). If a short CT-rich sequence is present within such a region and a complementary short RNA happens to be available, the RNA binds to the CT-rich sequence and primes DNA synthesis. DNA synthesis on the opposite strand is then initiated in a similar manner. Because synthesis proceeds bidirectionally on both strands, complete DNA replication can be achieved in a calm environment.
In this process, the primer is invariably RNA, because only RNA can outcompete the opposing DNA strand for binding to the CT-rich sequence in partially melted double-stranded DNA. It is known experimentally that purine-rich RNA binds more strongly to complementary DNA than purine-rich DNA binds to the same complementary DNA sequence [
14,
15]. This fact provides the mechanistic basis not only for the inferred use of RNA primers in abiotic DNA replication but also for the strict requirement for RNA primers in modern enzyme-catalyzed DNA replication. Supporting this view, the first ribonucleotides incorporated by DNA primase and by transcriptional RNA polymerases are predominantly A or G [
16,
17], consistent with the purine-rich RNAs postulated here for prebiotic DNA replication.
In a ribonucleotide-rich calm environment, RNA synthesis (transcription) can also occur. Unlike DNA replication, however, RNA synthesis is strongly influenced by interactions between template DNA and nascent RNA, allowing it to be tightly regulated even in a nonenzymatic prebiotic context. In contrast to CT-rich sequences, A-rich sequences on the DNA template form extremely weak interactions with the complementary U-rich RNA transcript [
14,
15], effectively terminating elongation at that position. In modern biology, runs of four consecutive adenines on template DNA are known to function as transcription termination signals [
18]. Thus, the terminator function of A-rich sequences proposed here for abiotic transcription is fully consistent with known experimental observations.
As illustrated in
Figure 3, if an A-rich sequence is located immediately at the 5′ side of a CT-rich sequence, transcription initiated at the CT-rich site is aborted in the 5′ direction. Similarly, if an A-rich sequence occurs somewhere downstream, transcription terminates upon reaching that site. In this manner, both the direction and extent of transcription can be specified by DNA sequence features alone. This property is critically important for the construction of a reliable, highly complex, information-directed protein synthesis system. The RNA transcripts produced are ultimately released from double-stranded DNA via strand-displacement by the complementary DNA strand.
As discussed, double-stranded linear DNA can both replicate and be transcribed in calm environments. What, then, of double-stranded linear RNA? The answer is negative. No nucleotide can bind to partially melted double-stranded RNA while successfully competing with the opposing RNA strand. Thus, RNA cannot serve as a primer for replication of double-stranded linear RNA, just as DNA cannot replicate double-stranded linear DNA unaided with RNA. Even in DNA–RNA hybrid duplexes, replication of the RNA strand cannot occur. The only viable reaction is RNA-primed RNA synthesis on a single-stranded RNA template, which produces double-stranded RNA—a replicative dead end.
B. Replication and Reverse Transcription of Stem-Loop RNA
In modern biology, all RNA is generated by transcription from DNA and lacks intrinsic self-replicability. In the prebiotic world, however, certain RNA species likely possessed the ability to replicate autonomously (
Figure 4), specifically stem-loop RNA structures analogous to those described for DNA. In general, double-stranded RNA is more difficult to dissociate than double-stranded DNA [
14,
15], raising concerns about the feasibility of strand-displacement synthesis in the absence of enzymes.
Unlike linear double-stranded RNA, however, stem-loop RNA already contains a single-stranded priming site from which bidirectional extension is possible. Consequently, toehold-mediated strand displacement can occur simultaneously on both strands, effectively doubling the force available to separate the stem structure [
19]. Although direct experimental evidence for abiotic replication of stem-loop RNA is lacking, the existence of viroids provides compelling indirect support. Viroids are circular, multi-stem-loop single-stranded RNAs that infect plants and replicate despite lacking protein-coding capacity [
20]. It is therefore reasonable to propose that stem-loop RNA replicated abiotically and played a central role in the emergence of the translation system.
A calm environment rich in ribonucleotides and suitable linking chemicals would favor self-replication of stem-loop RNA. Amplified stem-loop RNAs could subsequently ligate to one another via 5′–3′ phosphodiester bonds, forming multi-stem-loop RNA structures with increased numbers of priming sites and enhanced replicative potential. Such RNAs likely gave rise to transfer RNA (tRNA) and ribosomal RNA (rRNA). These molecules can perform amino-acid charging and peptide-bond formation, respectively, without enzymatic assistance [
21,
22], making them well suited for function in the prebiotic era.
If this scenario is correct, the question arises of how independently evolved multi-stem-loop RNA-based translation systems became encoded within modern DNA. A plausible answer lies in reverse transcription of these RNAs. Although double-stranded RNA is more resistant to strand separation than DNA–RNA hybrids [
14,
15], simultaneous toehold-mediated strand displacement on both strands could permit reverse transcription provided DNA primers, deoxyribonucleotides, and appropriate phosphodiester-bond-forming chemicals were available. The resulting DNA sequences could then be integrated into DNA-based problem-solving entities, which can store information in unlimited quantity and with long-term stability.
C. The Translation System
The most enigmatic aspect of the early development of nucleotide-based problem-solving entities is how they constructed the extraordinarily challenging megacomplex translation system using only nonenzymatic, stochastic chemical reactions available in the prebiotic era. The function of this system was to assemble proteins from a pool of amino acids according to information stored in DNA. Its essential components were tRNAs, which sequentially read nucleotide triplets and deliver specified amino acids; ribosomal RNA, which catalyzes the incorporation of tRNA-specified amino acids into the growing polypeptide; and mRNAs, which encode the order and identity of amino acids.
In modern biology, twenty amino acids are used universally. However, only approximately ten amino acids are thought to have been available in the prebiotic era, as they can be synthesized by simple laboratory-based chemical reactions or have been detected in meteorites [
23,
24]. If information stored in DNA as sequences of four distinct nucleobases were read strictly in triplets, as in today’s organisms, up to 61 distinct tRNA species (4
3 minus three termination codons) would be required. However, in present-day cells, a minimum of approximately 31 distinct tRNA
s is necessary for complete translation, owing to wobble recognition, whereby a single tRNA can recognize two or more codons through relaxed base-pairing at the 5′ anticodon position.
If the translation system had to function flawlessly from the outset in the prebiotic world, a similar number of distinct, fully functional tRNAs would have been required—even if wobble recognition operated then. This raises a fundamental question: could such many precisely functioning tRNAs have arisen stochastically and simultaneously in an abiotic environment?
Insight into this problem is provided by the standard codon table (
Figure 5A). In this representation, codon-specified amino acids are categorized by hydrophobicity, with strongly hydrophobic amino acids shown in orange, weakly hydrophobic ones in yellow, and hydrophilic amino acids in blue [
25]. Amino acids plausibly present in the prebiotic era are indicated by dotted symbols. A striking pattern emerges: codons specifying hydrophobic amino acids predominantly contain U or C at the middle codon position, whereas codons specifying hydrophilic amino acids typically contain A or G in this position. Such a pronounced bias would be highly unlikely if codons were assigned purely stochastically.
Unlike the strict Watson–Crick base-pairing that governs DNA–DNA or DNA–RNA interactions, codon recognition by tRNAs is inherently less specific. It involves conventional Watson–Crick pairing supplemented by wobble interactions at the 3′ base of the mRNA codon, as proposed by Crick [
26] and refined subsequently [
27] (
Figure 5B). These wobble interactions substantially reduce the number of distinct tRNAs required for efficient translation.
Could this number have been reduced even further in the prebiotic era? The answer is
yes, if one additional assumption is made. If codon recognition by tRNAs were sufficiently relaxed such that wobble interactions operated not only at the mRNA 3′ codon base but also at the 5′ codon base—and further extended to the middle codon position—the number of required tRNAs would be reduced dramatically to only two: one recognizing the codons specifying hydrophobic amino acids and the other recognizing the codons specifying hydrophilic amino acids (
Figure 5C).
Under this model, the anticodon of the hydrophobic-amino-acid-assigning tRNA would be 3′(A/U)(G)(A/U)5′, whereas that of the hydrophilic-amino-acid-assigning tRNA would be 3′(A/U)(U)(A/U)5′. This scheme provides a compelling explanation for the observed codon table bias: the predominance of U or C at the middle position of hydrophobic codons and A or G at that of hydrophilic codons.
Apparent anomalies in the modern codon table—such as the positioning of hydrophilic serine and threonine within the middle C column, or the assignment of hydrophobic tryptophan within a predominantly hydrophilic region—can be plausibly attributed to later evolutionary reassignments as two serine codons outlined in red are positioned in the hydrophilic region. In particular, the incorporation of tryptophan likely reflects the reassignment of a stop codon due to spatial constraints within the codon table.
If the two-tRNA hypothesis is correct, amino acids incorporated into abiotically synthesized proteins were determined primarily by the middle base of each codon. Thus, each residue would correspond to one of five hydrophobic or one of five hydrophilic amino acids (
Figure 5A, dotted amino acids). In addition to intrinsic charging affinities between tRNAs and amino acids, local amino-acid concentrations would have strongly influenced aminoacylation. As a result, proteins synthesized at different locations would vary in composition. Nevertheless, proteins translated from the same mRNA are expected to have exhibited similar functional properties.
Taken together, these considerations indicate that a translation system built upon only two primitive tRNAs would have been sufficient to support the emergence of prebiotic protein synthesis.
D. Prebiotic Evolution of Nucleotide-Based Problem-Solving Entities
As argued above, nucleotide-based problem-solving entities founded on DNA and RNA continuously evolved by seeking increased prevalence using available nucleobases, sugars, phosphate, and chemical energy sources (
Figure 6A;
Table 1). The fundamental role of DNA is to replicate and stably store all information required for protein production, whereas RNA—transcribed from DNA—provides the functional machinery for protein synthesis and supplies the short primers essential for the replication of double-stranded DNA.
At the earliest stage of the prebiotic era, proliferation of linear DNA was limited to environments where high temperature or extreme alkalinity was intermittently available (
Figure 2). However, in calm environments containing both ribonucleotides and deoxyribonucleotides, unlimited information-storable double-stranded linear DNA could replicate efficiently with the assistance of short RNA primers (
Figure 3). Such calm environments were not only advantageous for DNA and stem-loop RNA proliferation, but also optimal for the long-term preservation of stored information.
Strand nicking represents the most common form of damage to long DNA molecules. Unlike nicked single-stranded DNA, however, nicked double-stranded DNA is readily repairable, making double-stranded DNA a particularly robust medium for information storage. Consequently, calm environments constituted preferable ecological niches in which nucleotide-based problem-solving entities could persist, proliferate, and evolve while preserving their accumulated informational assets.
In environments that were calm yet rich in ribonucleotides, stem-loop RNA structures could replicate autonomously, as argued earlier (
Figure 4). If this occurred, the time required to discover functional combinations of ribosomal RNA and tRNA through stochastic processes would have been dramatically shorter than via transcription-dependent pathways alone. Such RNAs, once formed, could be reverse-transcribed and subsequently integrated into DNA-based entities upon transfer to deoxyribonucleotide-rich environments, as previously discussed.
In the modern biotic world, all hereditary information is stored in DNA, transcribed unidirectionally in the 3′ direction by RNA polymerases and translated into proteins. During prebiotic evolution, however, RNA was poorly suited for durable information storage, owing both to its chemical instability and to its inability to replicate in the linear double-stranded form. This inherent limitation is directly reflected in the organization of life today, where RNA functions primarily as a transient information carrier and catalytic molecule rather than as a long-term genetic archive.
To clarify the relationship between intelligence, problem-solving logic, and life, a set-theoretical conceptualization is instructive (
Figure 6B). Problem-solving logic represents the most fundamental component of intelligence and operates not only in living organisms but also in lifeless chemical systems such as DNA and RNA. However, this logic alone is insufficient for the emergence of life. The origin of life additionally requires self-replicating, information-storing molecular entities, and it is the extraordinary information-handling capacity of DNA that provides the substantive foundation for biological evolution.