Introduction
Major evolutionary innovations, from multicellularity to powered flight have repeatedly transformed life on Earth. Understanding how and why these novelties emerge and persist is therefore central to explaining life’s history. These innovations, however, often follow two very different evolutionary paths: while some innovations, like multicellularity[
1] and C4 photosynthesis[
2], have evolved dozens of times across diverse lineages, others remain restricted to just one or a few groups – occasionally appearing as true evolutionary singularities (e.g., the origin of life or eukaryogenesis)[
3]. One potential explanation for this pattern is priority effects: When an innovation first appears, the pioneering lineage may saturate the available ecological niche space, creating competitive barriers that prevent the same innovation from being successful if it evolves in other groups[
4,
5]. Despite the recognized importance of such niche incumbency in shaping macroevolutionary patterns[
6,
7,
8], the mechanistic basis of priority effects often remains poorly understood[
4]. This mechanistic knowledge gap exemplifies a broader challenge in evolutionary biology: bridging the process-pattern divide between ecological interactions and large-scale evolutionary diversification[
9,
10].
Here, we study how priority effects may manifest mechanistically by studying the physiological bioenergetics of phototrophy. Phototrophy, the ability to use light for metabolic energy, is a major biological innovation that led to an explosion of biomass and biodiversity. However, despite first appearing at least 3.5Ga ago[
11,
12,
13] and being responsible for the vast majority of biomass on Earth[
14,
15], it is unclear why phototrophy has only ever evolved independently in two forms: chlorophototrophy and retinalophototrophy. This low, but nonzero, diversity of phototrophic origins represents a unique opportunity for study – while not a
common evolutionary innovation with sufficient barriers to its evolution to have only appeared twice, phototrophy nonetheless exhibits two separate origins that can be compared to determine the eco-evolutionary relationship between them. Could evolutionary priority effects explain their relationship, and what can we learn about the ecological interactions that engender such priority effects?
To address this question, we develop a mathematical analysis of energy transduction through these modern extant phototrophic metabolisms, and a biophysical model of the trade-offs inherent in their adaptation to different phototrophic ecological niches. We test the hypothesis that these two extant systems are optimized for different ecological conditions and find that they indeed efficiently partition phototrophic niche space between themselves, limiting the opportunity for novel phototrophic lineages to establish themselves. This suggests that the 'dual singularity' of phototrophy exemplifies a fundamental principle: evolutionary innovations can be historically contingent, with early-arising forms constraining the evolutionary possibilities available to later lineages.
Chlorophototrophic and retinalophototrophic pathways originated independently early in the history of life on Earth. They are highly divergent in their structures, compositions, and the forms of chemical energy they make available to cellular metabolism (
Figure 1A). Before describing our computational approach for investigating their evolution and interactions, we provide a brief overview of these two phototrophic systems.
Chlorophototrophy drives both energy metabolism and redox chemistry using chlorophyll and bacteriochlorophyll photochemistry. Found in bacteria and in eukaryotic algae in which cyanobacteria have been taken up as chloroplasts, chlorophototrophy is responsible for the vast majority of primary production on the planet[
14,
15]. The functional unit of chlorophototrophy is the photochemical reaction center (RC;
Figure 1A), which are large transmembrane protein complexes descended from an ancestral homodimer[
13] that contain a minimum of eight organic pigment and redox cofactor molecules[
16,
17]. In addition to their defining pigments, all reaction centers contain iron in the form of iron-sulfur clusters, single coordinated iron atoms, or hemes bound to cytochromes[
18,
19,
20,
21].
All chlorophototrophic reaction centers generate reducing power via chlorophyll and bacteriochlorophyll photochemistry, which can then be used for either energy metabolism by allowing these reduced electrons to flow through a modified electron transport chain or for carbon/nitrogen fixation via redox chemistry. In the case of energy metabolism, protons are pumped across the active membrane much as in a respiratory electron transport chain. In the case of carbon fixation, a reducing equivalent is conveyed to NADPH while the reaction center is ‘reset’ by pulling an electron from a variety of environmental electron sources (such as dissolved Fe
2+ in photoferrotrophs, H
2S in green sulfur bacteria, or water in the case of oxygenic photosynthesis). Absorption of a single photon typically pumps two to four protons across the membrane[
22,
23]. All reaction centers couple a structurally conserved central dimeric core of at least 150 kilodaltons[
16] to additional ‘core antenna’ complexes of a hundred kDa or more which increase the absorption cross section per functional unit by funneling light into the reaction center from additional pigments via Förster resonance transfer. These in turn may be coupled to a diverse variety of additional antenna complexes which vary significantly from lineage to lineage[
24].
Retinalophototrophy, in contrast, is mediated by a single simple 26-28 kDa transmembrane protein, known as a microbial or ‘type-1’ rhodopsin[
25,
26,
27] (
Figure 1A). It is covalently bound to a single pigment molecule known as retinal, derived from the oxidative splitting of a carotenoid via a dioxygenase[
28]. In a few cases, such as the xanthorhodopsins, a single additional carotenoid pigment molecule is bound to the exterior of the protein and functions as a miniature integral antenna[
29], but rhodopsins are not known to be associated with accessory antenna proteins. Rather than participating in redox chemistry, they directly pump protons across the cell membrane driven by light-induced isomerization of the retinal pigment molecule. Thus unlike chlorophototrophic systems they involve only organic molecules with no iron-containing redox cofactors[
30]. Rhodopsins are not known to be used to fix inorganic carbon into biomass in nature because they fail to push membranes to a high enough voltage to reverse electron flow through electron transport chains[
31]. However, while these two phototrophic systems are quite different, the total energy transduced by them is similar. The quantity of light absorbed by retinalophototrophs in the ocean is thought to be at least as large as that absorbed by chlorophototrophs[
32].
Results
We first examined the physiological trade-offs embodied in chlorophototrophic and retinalophototrophic machinery using data derived from physiological studies of extant taxa based on a literature review[
16,
20,
26,
31,
33,
34,
35,
36,
37,
38,
39,
40,
41,
42], calculating their machinery’s energy flux per kDa and energy yield per unit light intensity at different light intensities and in the limits of bright and dim light (see Supplement S1 and
Table S1). Despite possessing both a faster photocycle and higher energy yield per photon absorbed, chlorophototrophic machinery has a substantially lower maximum specific energy flux per unit mass than microbial rhodopsins (
Figure 2A;
Table S1). This is principally because of the much larger mass of chlorophototrophic reaction centers and their associated antennas.
Upon analysis of the function of RCs and rhodopsins at a variety of light intensities, we find that the small protein mass and presence of only a single retinal pigment in a rhodopsin ensures a small light absorption cross section which requires intense ambient light for the machinery to be used effectively. The larger size of a reaction center comes with more pigment molecules and a larger light absorption cross section which allows a higher specific energy flux at low light intensities, but this rapidly saturates and is overtaken by rhodopsins at high light intensities (
Figure 2B).
A clear trade-off between these two pathways exists: while rhodopsins are capable of much higher maximum energy fluxes per unit protein infrastructure at high light levels, RCs are capable of much higher maximum energy yields per unit incident light than retinalophototrophs at low light levels (
Figure 2C,D).
Moving on from these well-studied phototrophic organisms, we next examined the fundamental theoretical physiological capabilities of chlorophototrophy and retinalophototrophy using an integrated physiological model of phototrophic energy transduction (
Figure 1B, Supplement S1,
Table S2). This model takes into account the properties of the conserved catalytic cores at the heart of rhodopsins and reaction centers, variable quantities of associated antenna pigments increasing the light-gathering cross section of the machinery, and both the photodegradation and growth/recycling-mediated dilution of the machinery with time. Light levels were allowed to vary between 0.01 to 4000 micromoles of photons per square meter per second in logarithmic intervals, and the optimal quantity of antenna pigment per catalytic core to optimize specific energy flux of the protein machinery at each light intensity was calculated. The properties of each of these optimal systems was analyzed in terms of specific energy flux per unit protein (protons pumped per second per kDa of protein) and yield per unit incident light (protons pumped per second per kDa of protein per unit of incoming light intensity; see Materials and Methods).
Yield per unit incident light falls nonlinearly as light intensity increases (
Figure 3A) due to both saturation of the central catalytic core and drastic decreases in optimal antenna quantity (see
Figures S2 and S3). Efficiency per unit protein machinery, conversely, rises nonlinearly as light intensity increases (
Figure 3B) due to increased utilization of the catalytic core and a smaller antenna mass, with chlorophototrophs exhibiting a slightly decreased efficiency at the highest light intensities due to photodegradation. Our analysis identifies a crossover point at ca. 186 micromoles of photons per square meter per second (
Figure 3A-B). Above this light intensity, both the energy flux per unit infrastructure and energy yield per unit incident light are superior for retinalophototrophy. Below this light intensity, chlorophototrophy instead dominates for both metrics. Light intensity thus determines which phototrophic system will be more efficient, with two defined ‘niches’ – a low light niche with high yield per unit light, and a high light niche with high flux per unit infrastructure.
Our model recapitulates several key aspects of phototrophic metabolism observed in nature. It indicates a theoretical minimum optimal antenna mass of approximately 95 kDa for optimal chlorophototrophs at the crossover point light level (
Figure S2), with antenna mass being able to increase without limit at low light levels. The lower chlorophototrophic antenna limit is within approximately 20% of the mass of the smallest intrinsic antennas observed in nature in type I reaction centers, while other chlorophototrophs can indeed exhibit extremely large antenna pigments up to and including ‘chlorosome’ antennas the size of organelles[
43]. While we calculate a maximum antenna size of 61 kDa for theoretical antennas associated with rhodopsins at the crossover point, this predicted mass is below the mass of any observed antenna complex and rapidly falls to negligible optimal mass as light intensity increases towards that of full sunlight (see
Figure S2). In addition to this, the crossover point at which higher light intensities are dominated by retinalophototrophy corresponds closely with the empirically observed light intensities above which retinal is over-represented in the ocean and chlorophyll is under-represented[
32] (see File S1 and
Figure S5). Our simple model thus recapitulates key physiological trade-offs underlying variation in organisms filling phototrophic niche space, able to correctly predict the environmental conditions under which either chlorophototrophs or retinalophototrophs should have a physiological and ecological advantage.
Discussion
Phototrophic organisms face a fundamental trade-off between efficiency per unit light and flux per unit infrastructure. The two phototrophic metabolisms that have evolved on Earth have each optimized a different side of this trade-off: chlorophototrophy requires a significant fraction of the proteome to be invested in sophisticated protein machinery which results in a lower energy flux per unit infrastructure but high yield per unit light, while retinalophototrophy results in a high energy flux per unit of simple infrastructure but lower yield per unit light.
As a pathway with high yield per unit light, chlorophototrophy allows the capture of dilute light resources for obligate phototrophs, even in difficult conditions, at the expense of high protein expression and specialization of more of the proteome. As a pathway with a higher flux per unit protein, retinalophototrophy can instead provide a benefit in high light with less total expression of a simple protein, allowing it to be used as a backstop to prevent starvation and increase metabolic flexibility and biomass yield of otherwise heterotrophic organisms without specialization. These two forms of photosystems effectively partition the metabolic niche space available to phototrophs, both in terms of resource availability and in terms of the degree of phototrophic specialization.
By graphing flux per unit protein versus yield per unit resource of optimal phototrophic systems for every tested light intensity, we can create a diagram of phototrophic niche space. It becomes apparent that the optimal physiological machinery for a given environment lies along a Pareto front[
44,
45] (
Figure 3C) with the position along this front dictated by light intensity. Evolution should push novel phototrophic lineages towards this front from any suboptimal starting point they originate from (
Figure 3D). Each phototrophic machinery type exhibits its own Pareto front with differing slopes; the point at which they cross indicates the light intensity separating niches in which one or the other is favored for driving phototrophic energy metabolism.
Why has neither phototrophic strategy completely dominated all available niche space? Fundamental architectural limitations of the catalytic cores of each phototrophic system ensure that the extremes of the investment/resource efficiency tradeoff are only accessible to one or the other. Microbial rhodopsins responsible for retinalophototrophy are a small, light-driven proton pump driven by isomerization of a single organic molecule, which only allows a single proton to be pumped per photocycle. Without any sophisticated redox-active cofactors in its structure, rhodopsin cannot be recruited to interact with electron transport chains or redox metabolism. Retinalophototrophy therefore appears to be incapable of evolving to pump more than one proton per photon and efficiently using available light resources, although its small mass means it enjoys a high maximum specific energy flux. Conversely, the chlorophototrophic reaction center appears to be constrained such that it cannot be reduced below a relatively large minimum size. While proteobacterial type II RCs have either lost or never acquired the integrated antenna domains common to other RCs, the core catalytic subunit appears to never mass under approximately 150 kilodaltons or contain fewer than a minimum of eight cofactor molecules. This minimal unit likely cannot be shrunk further while retaining its function in redox metabolism, limiting its maximum energy flux per unit mass even as it enjoys a high efficiency per unit light captured. Due to these architectural constraints, chlorophototrophy and retinalophototrophy coexist via divergent ecological and physiological trade-offs.
This fundamental flux versus yield tradeoff, manifesting in ecological niches of differing resource availability, appears to exist in other non-phototrophic metabolic contexts as well. The difference between respiration and fermentation is an example – respiration can produce several times the ATP per unit substrate consumed while producing less than half the energy flux per unit protein mass[
46]. The two most common glycolytic pathways, the Etner-Doudoroff (ED) and Embden-Meyerhof-Parnas (EMP) pathway, share this relationship as well with the EMP pathway producing twice the ATP per unit carbohydrate consumed as the ED pathway, but requiring 5-fold more protein mass[
47]. Much as in phototrophy, the lower-flux higher-yield EMP pathway is seen more frequently than the ED pathway in obligate anaerobes which must use glycolysis for energy and obtain higher yield from limited available substrate. This tradeoff can have important effects on growth rates and ecology, with cells requiring less proteome to be devoted to high-flux metabolic machinery able to put more productive capacity towards growth rather than metabolism resulting in a series of widespread ecologically relevant ‘bacterial growth laws’[
46,
48,
49].
As chlorophototrophy and retinalophototrophy dominate the available phototrophic niche space, there exists little opportunity for the establishment of additional phototrophic forms
de novo. All modern phototrophs will be distributed along the combined Pareto front[
44,
45] (
Figure 3C); any newly-evolved inefficient phototrophic system will inevitably be strictly inferior to both extant established forms (
Figure 3D). As such, chloro- and retinalophototrophy together exhibit strong priority effects[
5,
7,
50,
51] – otherwise known as niche incumbency[
6] – which limits the establishment and success of novel photosystems which use the same resources as the established systems[
7,
52] even if such systems were simple to evolve. This suggests a fundamental continuity between local ecological interactions involved niche colonization and succession and patterns of evolutionary innovation at the largest of scales, as has been suggested by Baum et al., 2023[
53], and implies that local ecological interactions can drastically shape the long-term history of major evolutionary innovations.
What was the sequence of evolution of these two systems, and why did one that appeared first not suppress the origin of the other in deep time due to this phenomenon of niche incumbency? It is difficult to determine from the paleontological record whether chlorophototrophs or retinalophototrophs evolved first. However, it is possible to deduce a likely sequence of evolution from physiological analyses. While chlorophototrophs are able to use light energy for either pumping protons via an electron transport chain for energy metabolism or the reduction of CO
2 into biomass, retinalophototrophs are only capable of using light for energy. They are mechanistically incapable of driving redox reactions directly and fail to push the membrane to a high enough voltage to counteract natural electron transport chains[
31], a precondition for driving carbon fixation via reverse electron flow. This means that there is, effectively, a third dimension to the ecological niche of phototrophy representing the ability to fix carbon using light energy. Assuming that the earliest chlorophototrophs engaged in redox chemistry, even a very slow and poorly adapted proto-chlorophototroph would be superior to a retinalophototroph in its ability to fix carbon without geologically provisioned electron sources. This would allow a way around any energy-based priority effect engendered by an established retinalophototroph via a novel carbon fixation method, and an opportunity to evolve until it too became well-optimized and filled its own area of the energy transduction trade-off curve. Our model therefore implies that retinalophototrophy likely evolved before chlorophototrophy, given that carbon fixation allows a proto-chlorophototroph to avoid evolutionary priority effects from retinalophototrophs but not the reverse.
There has been great debate about why some major evolutionary innovations exist only as singularities, while others independently evolve multiple times[
3,
54,
55,
56]. Our results suggest that priority effects may play a pivotal role in suppressing the repeated evolution of phototrophy; a phenomenon which may extend to the evolutionary patterns of some other major innovations. The evolution of complex cellular architecture, for example, is considered to have occurred only once on Earth in the form of eukaryogenesis. However, it is unclear if that is because the evolutionary pathway is complex and contingent or because eukaryotes have competitively excluded any secondary origins of such cellular complexity. A similar argument applies to the origin of life itself – is this a difficult process, or did modern life simply rapidly advance to a level of sophistication that scavenged all resources that could otherwise go towards simple novel replicators or protocells? As evidence for priority effects appears replete across biological, spatial, and temporal scales[
4], it is possible that such niche incumbency could play a pivotal role in the macroevolutionary landscape of multiple major innovations.
Phototrophy is among the most important innovations in the history of life, fundamentally changing the biosphere. It is unique among major biological innovations in that it has evolved not once, and not many times, but exactly twice. Here we show that the two origins of phototrophy are mechanistically and ecologically complementary, having partitioned phototrophic niche space along a set of trade-offs that prevent either mechanism from becoming dominant. Deep architectural limitations and functional trade-offs inherent to the evolution of metabolic pathways appear to have prevented either chlorophototrophs or retinalophototrophs from occupying all phototrophic niche space individually, creating the opportunity for their stable coexistence. The remarkable fact that phototrophy evolved just twice, producing two ecologically complementary forms, however, reveals how priority effects can shape major innovations. While each established system likely prevented new phototrophic pathways from emerging through competitive exclusion, their fundamental differences in design and function meant neither could eliminate the other, allowing both to persist throughout Earth's history.
It is tempting to take evolutionary rarity as a sign of intrinsic difficulty. Yet this interpretation assumes that evolutionary innovations represent independent rolls of the dice, each with the same low probability of success. Our results challenge this view. Phototrophy may not be intrinsically difficult to evolve, as evidenced by its dual origins through fundamentally disparate routes early in the history of life. Instead, we argue that the rarity of phototrophic innovations reflects the power of evolutionary priority effects: once chlorophototrophy and retinalophototrophy saturated the available metabolic niche space, they created insurmountable competitive barriers for any subsequent phototrophic systems, regardless of how readily such systems might have evolved in their absence. This perspective has profound implications for understanding other major transitions in life's history. The singularity of events like abiogenesis or eukaryogenesis may not reflect vanishingly small probabilities or extraordinary confluences of unlikely circumstances. Instead, these innovations may have evolved with relatively high probability given appropriate conditions, but their first appearance triggered feedback processes that fundamentally altered the selective landscape, preventing future parallel evolution. A trait that evolves easily can still arise only once if its emergence forecloses the ecological opportunities necessary for its repeated evolution. In this light, life's apparent evolutionary singularities may be less contingent than they appear, their rarity stemming not from intrinsic difficulty but from the consequences of priority effects operating across billions of years of Earth history.
Materials and Methods
We first calculated an effective specific energy flux per unit protein investment of different phototrophic systems based on a literature review of vital parameters for anoxygenic chlorophototrophic RCs, oxygenic RCs, and two different microbial rhodopsins (proteorhodopsin and bacteriorhodopsin) [
16,
20,
26,
31,
33,
34,
35,
36,
37,
38,
39,
40,
41,
42]. Vital parameters included total protein mass per functional unit
Mtotal in kDa, the maximum rate
Rmax in cycles per second, protons pumped per cycle
Np, and the light level at which absorption is at half-maximum
Km, in units of micromoles of photons per square meter per second. Using equation 1 we calculated the maximum flux per unit protein at saturating light levels
Vmax (protons per second per kDa) (See
Table S1, and supplemental equation S1).
We extended this analysis from the maximal energy flux per unit mass to the flux per unit mass at differing light levels based on the absorption cross section and maximum photocycle rate
Vmax of different phototrophic machineries (see supplementary file S1 and supplemental equation S2 for a detailed description). We treated light absorption and conversion as a Michaelis-Menten process resulting in equation 2, describing the energy flux per unit protein
FP at a given light intensity
L.
By dividing the function of the return per unit investment of each phototrophic system by the level of ambient light, we produced equation 3 describing the efficiency per unit ambient light
FL, in units of protons pumped per kDa per second per micromole of photons per square meter per second and equation 4 describing the maximum energy flux per kDa per unit ambient light
Ymax (Figure 4C, D and equations S3-S4).
In order to examine the theoretical capabilities of retinalophototrophs and chlorophototrophs, and the implications of the trade-offs embodied in either phototrophic system on organismal physiology, evolutionary history, and ecological interactions, we constructed an analytical model of an arbitrary phototrophic system and its photodegradation, recycling, and growth-based dilution[
57,
58,
59,
60] (
Figure 1B, Supplement S1). This model consists of the central catalytic core of the rhodopsin or reaction center with a constant yield, maximum reaction velocity, and absorption cross section. It is parameterized in terms of yield per cycle
Y (protons per cycle), maximum rate
Vmax (photocycles per second), mass of catalytic core
k (kDa), absorption cross section per catalytic core
b (Å
2), mass of antenna complexes
x (kDa), absorption cross section per unit mass of antenna
a (Å
2 kDa
-1) photodegradation constant
D (photon
-1), and recycling/dilution rate
R (s
-1).
The properties of a chlorophototrophic catalytic core were imputed by examining an anoxygenic bacterial type II reaction center, and those of microbial rhodopsins by examining bacteriorhodopsin. The catalytic core is paired with antenna complexes of variable size, whose absorption cross section per unit mass was imputed by examination of the LH2 antenna of a purple nonsulfur bacterium. Both chlorophototrophic reaction centers and rhodopsins were allowed to be paired or not paired with antenna complexes so as to not limit the model to configurations seen today, where only chlorophototrophs have antenna complexes larger than single carotenoid molecules, which could conceivably be due to historical contingency.
Photodegradation properties are taken from Faizi et al., 2018[
57]. All parameters were constrained by previous literature and this photodegradation model except for the rate of recycling and dilution of phototrophic machinery by cell growth and division (R) which is highly dynamic depending on cellular doubling time and metabolic state. We took this value to be 0.1 hr
-1 in this analysis, corresponding to a protein half-life or cellular doubling time of 6.93 hours, as this growth rate is comparable to those observed for rapidly growing chlorophototrophic algae and close to the upper growth rates modeled in Faizi et al., 2018[
57]. Varying this recycling and dilution timescale did not qualitatively impact our results - see supplement S1 and supplemental
Figure S4 for sensitivity analysis. See
Table S2 for all variables used in this analysis and supplement S1 for calculations of all numerical variables used from a literature review.
The behavior of this model follows Michaelis–Menten kinetics with regards to light absorption by its pigment cross-section and its conversion by the catalytic core - see supplement S1 and equations S5-S10 for a detailed description of this behavior. Both the antenna and the core are also subject to photodegradation. We modified the methods of Faizi et al., 2018[
57] to determine the rate of photodegradation of the phototrophic machinery at different light intensities. By combining this rate of photodegradation with a rate of dilution of degraded protein by growth and recycling, we derived the fraction of functional protein - see equations S11-S14. By applying this photodegradation and dilution correction to the flux of energy through the catalytic core, we arrive at equation 5 (see supplemental equation S15) and equation 6 (see supplemental equation S16) describing the flux per unit protein
FP (protons per kDa per second) and the yield per unit incident light
FL (protons per kDa per second per micromole of photons per square meter per second) of a phototrophic system.
We varied light intensity
L the model was exposed to from 0.01 to 4000 micromoles of photons per square meter per second in logarithmic intervals (the upper range is similar to the radiation intensity of full sunlight at the equator). At each light intensity, we numerically optimized the chlorophototrophic and retinalophototrophic model to determine the quantity of antenna
x required to maximize energy flux per unit mass, as this optimization likely maximizes growth rate and thus fitness[
46,
48,
49]. The mass of antenna complexes associated with each catalytic core were allowed to vary freely, reflecting the widely varying antenna complexes and antenna stoichiometry to reaction centers observed across the tree of life. We calculated this optimal flux per unit protein
FP and energy flux per unit light
FL for these optimized systems at all light intensities. We then plotted the efficiency per unit incident light and flux per unit protein infrastructure of optimal chlorophototrophs and retinalophototrophs at all light intensities. See supplemental
Figure S4 for a sensitivity analysis of this model to the free parameter of recycling rate, and S6 for an analysis of the relationship between optimal and suboptimal antenna stoichiometries at a given light intensity.