Preprint
Article

This version is not peer-reviewed.

Sequence characterization of eccDNA content in multiple blackgrass (Alopecurus myosuroides) populations

A peer-reviewed article of this preprint also exists.

Submitted:

19 September 2023

Posted:

20 September 2023

You are already at the latest version

Abstract
Alopecurus myosuroides (blackgrass) is a problematic weed of Western European winter wheat and its success is largely due to widespread multiple-herbicide resistance. Previous analysis of F2 seed families derived from two distinct blackgrass populations exhibiting equivalent non-target site resistance (NTSR) phenotypes shows resistance is polygenic and evolves from standing genetic variation. Using a CIDER-seq pipeline we show that herbicide resistant (HR) and herbicide sensitive (HS) F3 plants from these F2 seed families as well as the parent populations they were derived from carry extra-chromosomal circular DNA (eccDNA). We identify the similarities and differences in the coding structures within and between resistant and sensitive populations. Although the numbers and size of detected eccDNAs varied between the populations, comparisons between HR and HS blackgrass populations identified shared and unique coding content, predicted genes, and functional protein domains. These include genes related to herbicide detoxification such as Cytochrome P450s, ATP-binding cassette transporters and glutathione transferases including AmGSTF1. eccDNA content was mapped to the Alopecurus myosuroides reference genome, revealing genomic regions at distal end of chromosome 5 and the near center of chromosome 1 and 7 as regions with high number of mapped eccDNA gene density. Mapping to 15 known herbicide-resistant QTL regions showed the eccDNA coding sequences match 12 with four QLT matching HS coding sequences one region only contained HR coding sequences. These findings establish that like other pernicious weeds, blackgrass has eccDNAs that contain homologs of chromosomal genes, and these may contribute genetic heterogeneity and evolutionary innovation to rapidly adapt to abiotic stresses including herbicide treatment.
Keywords: 
;  ;  ;  

1. Introduction

Extra-chromosomal circular DNAs (eccDNA) are ring-shaped amplifications of linear genomic segments that exist separate from the autosomal genome. These structures have been observed in various species, including plant and higher eukaryote species [1,2,3,4,5,6]. Bassel and Hoota first discovered eccDNA in 1964 [7], but limited progress in understanding their function or genesis was made until the past decade. The formation of eccDNA can occur through multiple mechanisms, including non-allelic homologous recombination, replication slippage and DNA repair [8,9], and intramolecular recombination and retrotransposon activity [1,10,11,12]. Historically, the detection of eccDNA has been challenging due to their complex and highly repetitive structures. However, recent advancements in hybrid sequencing technologies (Illumina, PacBio, Oxford Nanopore) and computational algorithms (AmpliconArchitect [13], CiderSeq [14], and Circle-Seq [15]) have improved the ability to detect eccDNA and define tissue-specific "DNA circulome" [13,16].
EccDNAs have recently garnered significant attention in scientific communities as a unique hallmark of somatic mutations. These entities are capable of amplifying gene copies, related transcripts, and regulatory elements. eccDNAs can range in size from a few hundred base pairs [1] to much larger structures near a megabase in size [11,17], with a majority being smaller than 10 kb [1,18]. EccDNAs accumulate in cells through either known mechanisms of autonomous replication mediated by a replication origin [11,12,19,20], rolling circle amplification [21], DNA replication and mitosis [22,23] or some elusive mechanism. Notably, the accumulation of many replicate copies of a large eccDNA in Palmer amaranth has been shown to significantly increase the c-value of cells, seemingly without fitness penalty [24].
EccDNAs have been identified in various human physiological and pathological states, including normal cells (blood and muscle) [25], the central nervous system and neurodegeneration [26], aging and telomere maintenance [27,28], and cancer malignancy and progression [16]. eccDNAs have been observed in Drosophilia [4], and are believed to play a role in stress adaptation in yeast [29,30] and herbicide resistance in weeds [19]. The eccDNAs present in the noxious weed Palmer amaranth have been well characterized. This eccDNA, which is the largest gene-containing eccDNA reported thus far in a plant species, confers genetically transmissible resistance to the herbicide glyphosate [11,19]. This eccDNA replicon has multiple copies of the gene encoding for 5-enolpyruvylshikimate-3-phosphate (EPSP) synthase (EPSPS) and serves as the basis for gene copy amplification and therefore resistance to the EPSPS-inhibiting herbicide glyphosate. A study comparing eccDNA from glyphosate-sensitive and resistant populations revealed a significant content of eccDNA in glyphosate-sensitive populations [11]. The EPSPS gene was only present on eccDNA in glyphosate-resistant populations, while glyphosate-sensitive eccDNA displayed the genomic disposition for recombination events that may lead to the formation of the eccDNA replicon [1]. High-resolution cytological analysis of the eccDNA replicon revealed that its persistence in the germline is facilitated through a chromosomal tethering mechanism, rather than genomic re-integration [19]. This observation provides an explanation for the uneven segregation reported in previous studies [19]. Sequence analysis of the eccDNA replicon in Palmer amaranth revealed that it contains 58 genes in addition to the EPSPS gene, as well as a complex distribution of repeat sequences and gene functions that encompass detoxification, transport, replication, and stress response [11]. A synteny analysis of the other genes aligned to a neighboring species revealed that they originated from multiple chromosomes, suggesting a complex biogenesis event [11]. This study suggests that eccDNA serve as a rapid response mechanism and facilitate rapid adaptation [1].
Blackgrass (Alopecurues myosuroides) is a pernicious weed that has rapidly evolved to infest cereal crops across Northern and Western Europe and is now widely regarded as the top weed problem in several regions [31,32]. In response to the extensive use of herbicides, black-grass has evolved resistance to seven herbicide modes-of-action, exacerbating this species’ impact [33]. Blackgrass has been estimated to cause an annual £0.4 billion lost gross profit in the UK alone, due to wheat yield losses of 0.8 million tonnes [34]. Growers are now increasingly relying upon mixtures and sequences of (particularly pre-emergent) herbicides from a range of modes-of-action to manage this species, alongside physical or cultural practices. Nevertheless, further herbicidal control is complicated by the extensive presence of both gene mutations and multiple mechanisms that inactivate herbicides in this species [35], evolving independently within separate populations [36].
Advancements in comprehending the genetic basis of herbicide resistance have been limited. Nevertheless, recent studies have uncovered the genome of blackgrass, revealing independent and semi-overlapping genetic mechanisms underlying Non-Target Site Resistance (NTSR) in blackgrass at both DNA and RNA levels [36]. We used the same genetic material in this study to perform sequence characterization of extra-chromosomal circular DNA (eccDNA) content in multiple blackgrass populations with well-characterized levels of heritable NTSR-based herbicide resistance. We compare the genomic similarities and differences among individuals from these distinct populations. Additionally, we determine the gene content of the eccDNA and identify genomic hotspots for eccDNA biogenesis, evaluating their role in the rapid evolution of herbicide resistance.

2. Materials and Methods

2.1. Blackgrass populations, plant material, and genomic DNA extraction

Seven Blackgrass seed populations were grown for extraction of eccDNA, representing both herbicide sensitive (HS) and herbicide resistant (HR) lines. The Roth (HS) population originates from the Rothamsted ‘Broadbalk’ long-term field experiment and represents a wild-type population which has never experienced herbicide. The ‘Peldon’ and ‘Lola91’ are field collected HR populations confirmed to have non-target-site resistance to a range of herbicide chemistries. For these seed populations, we have selected against any known target-site mutations [37]. Further HR and HS samples were derived from segregating families created through biparental crossing of Roth and Peldon (CC2) and Roth and Lola 91 (CC5). The CC2R and CC2S are F3 generation lines, caused by identifying and bulk-crossing the most and least resistant CC2 individuals within the F2 generation. The CC5R and CC5S were produced in an identical way, but from the CC5 seed family. In total this provides three herbicide sensitive (Roth, CC2S, CC5S) populations, and four herbicide resistant (Peldon, Lola91, CC2R, CC5R) populations with which to examine eccDNA.
Seeds from these seven populations were broadcast sown onto a standard seed tray (37.3 x 24.5 x 5.5 cm) filled with Rothamsted Prescription Mix (75% Medium grade (L&P) peat, 12% Screened sterilized loam, 3% Medium grade vermiculite, 10% Grit (5mm screened, lime free, with 3.5kg Osmocote Exact 3/4 month per m3 - Supplier: Scotts UK Professional, Ipswich, Suffolk.). Plants were grown in standard glasshouse (GH43 101 t) at 10°C/5°C 16-hour day length without supplementary lighting for 40 days (between 15 December 2021 to 25 January 2022) until they were at the 2-3 leaf stage. From each population, two bulk harvests of aerial tissue from these plants totaling 0.5 g were cut, flash frozen in liquid nitrogen, and ground completely to a fine powder in a mortar and pestle. 100 mg of this ground material was then used for DNA extraction.
DNA was extracted from 100 mg of finely ground flash frozen bulked leaf material from 40-day old seedlings using a modified CTAB method. More specifically, 20μl of beta-mercaptoethanol was added to the samples along with 800μl of 2X CTAB buffer (2% Cetyltrimethylammonium Bromide (Merck 219374 – Calbiochem), 1% PVPP, 1.4M NaCl, 20mM EDTA pH 8.0, 100mM Tris pH 8.0) that had been heated to 80-90˚C. Samples were vortexed to mix. 800μl of chloroform was added, vortexed until completely mixed, and tubes were incubated at 65˚C for 10-15min allowing the phases to separate. Samples were centrifuged at max speed for 30min. The supernatant was transferred to new, labeled 2ml tube. A second aliquot of 800μl chloroform was added, tubes were inverted to mix, and centrifuged at maximum speed for 20min. 700μl of the supernatant was transferred to new labeled 1.5ml microfuge tubes. 450μl of cold (4˚C) isopropanol was added and tubes inverted gently. Samples were left to incubate at room temperature for exactly 5min, then centrifuged at maximum speed for 15min. Supernatant was removed, and the pellet was washed with 1ml of 70% ethanol at -20˚C. 70% Ethanol was removed and pellets dried using a speedvac for 7 minutes with medium heat. Pellet was resuspended in 100μl TE buffer with a 1:1000 dilution of RNase A Solution at 10mg/ml (Merck 70856-3) and incubated at 37˚C for 45min to resuspend the DNA into solution. Samples were quantified by nanodrop analysis then frozen and stored at -20˚C until being shipped on dry ice for eccDNA enrichment and sequencing.

2.2. EccDNA sequence enrichment, sequencing, and analysis

Circular DNA enrichment and sequencing was performed according to the CIDER-seq methods [14]. Circular DNA sequencing was performed on a Pacific Biosciences Sequel II instrument (Pacific Biosciences) with a 30-hour movie time and HiFi/CCS read generation. Sequence reads were deconcatenated into circular DNA using the CIDER-seq software [14]. The circular DNA sequences were then aligned to the reference genome of Alopecurus myosuroides [14] to predict the extra-chromosomal circular DNA (eccDNA) content. To reduce redundant sequences, the eccDNA sequences were clustered and collapsed into non-redundant representatives using the CD-hit program [38] with a sequence identity threshold of 0.9. The representative eccDNA sequences were masked for repetitive elements using the Repeat-Masker [39] with default settings and annotated using the MAKER annotation pipeline [40].
GO enrichment analysis was conducted on the predicted coding elements of the eccDNA using the GOseq 1.42.0 R package [41]. GO terms were considered significantly enriched at a False Discovery Rate (FDR) of less than 0.05. The enriched GO terms were visualized using the ggplot2 package [42]. tRNAs were exclusively predicted for the eccDNA sequences without coding sequences (CDS) using the tRNAscan-SE program [43] with default settings. To determine the genomic origin of the eccDNA, the predicted sequences were first aligned to the reference genome using the Minimap2 program [44]. The number of eccDNAs that overlapped with the reference genome in each 500 kb interval was counted using the BEDtools software [45]. The number of genes and transposable elements (TEs) were also counted within 200 kb downstream and upstream of each gene using BEDtools [45]. A permutation test was performed to compare the mean number of genes and TEs between the eccDNA-present and absent regions.

3. Results

3.1. EccDNA content and coding structure in multiple blackgrass populations

The CIDER-seq pipeline was used to identify a high abundance of extra-chromosomal circular DNA (eccDNA) in all blackgrass samples (Table 1). The number of non-redundant representative eccDNAs detected in the blackgrass samples ranged from 4233 in sample CC2S to 5663 in sample Roth, with an average of 4886 (Table 1). The size of the detected eccDNAs showed high variation within the samples, as evidenced by the length distributions that ranged from 31 bp to 29,081 bp and had mean lengths of around 6,900 bp (Table 1 and Figure 1). The length distributions of the eccDNAs were similar among all sequenced samples (Table 1 and Figure 1).
We further analyzed the coding content of the predicted eccDNA sequences. The number of eccDNAs with predicted genes ranged from 958 in sample CC5S to 1291 in sample Roth (average of 1079 per sample) and accounted for up to approximately 22% of the total predicted eccDNAs (Table 1). The number of predicted genes per eccDNA ranged from 1 to 15, with an average of 2 genes per eccDNA across all analyzed samples (Table S1 and S2). Transfer RNA (tRNA) prediction was performed for the eccDNAs and revealed a relatively low number of predicted tRNAs in all samples, ranging from 45 in sample CC5S to 77 in sample CC2S.

3.2. Coding content of eccDNAs in herbicide resistant and sensitive blackgrass populations

To identify common and unique eccDNA-encoded genes with predicted functional protein domains among different blackgrass populations, we compared the predicted gene content from both herbicide resistant (HR) (CC2R, CC5R, Lola91 and Peldon) and sensitive (HS) (CC2S, CC5S and Roth) blackgrass samples. A comparison of the predicted protein domains (PFAM id) identified a total of 80 protein coding domains that were shared by all HR samples (Figure 2 and Table S3). Among these functional protein domains, the number of eccDNAs annotated to them ranged from 6 to 416. There was a range of 7 to 25 functional annotations that were shared by at least two HR samples. The different HR samples had varying numbers of unique functional protein domains, ranging from 68 (Peldon) to 111 (CC5R) (Table S3). Overall, the three HS samples shared a total of 91 functional protein domains (Figure 3 and Table S4). Pairwise comparisons between the HS samples revealed 18 to 38 protein domains were shared by each of the two samples. In addition, 96 (CC2S) to 115 (Roth) unique functional domains were identified in different HS samples. By comparing the 91 functional protein domains shared by the three HS samples to the 80 protein coding domains shared by all four HR samples, a list of 69 functional protein domains that are common to all measured samples was obtained. Additionally, there are 11 functional protein domains specific to all the HR samples and 22 functional protein domains specific to all the HS samples (Table S5). Counts of these core set of functional protein domains found in all samples range from 416 to 6 in the HR samples and 5 in the HS samples and include putative gypsy type transposons, Proton-conducting membrane transporters, ribosomal protein domains, and domains associated with the respiratory chain, such as NADH-ubiquinone/plastoquinone oxidoreductase or NADH dehydrogenase, and the photosynthetic machinery, such as Photosystem I psaA/psaB protein and Photosystem II protein (Table S5). Counts for the functional protein domains that were unique to either HR (14 to 6) or HS (15 to 3) were comparatively low and are discussed below.
In the HR samples, the functional domains with the highest abundance comprised proteins associated with photosystems, gypsy type transposons, NADH dehydrogenase, PPR repeat, and Cytochrome C assembly protein (Table 2). Moreover, we observed the presence of eccDNA genes in the analyzed HR samples, which were previously reported to be associated with herbicide detoxification. Cytochrome P450 was detected in all HR samples, whereas ATP-binding cassette transporters (ABC transporters) and glutathione transferases were exclusively detected in the CC2R, CC5R, and Peldon populations (Table S3). Furthermore, within the HR samples, various stress response domains were identified. Specifically, ribosomal proteins and leucine-rich repeats were present in all HR samples. The WRKY DNA-binding domain was shared among the CC2R, CC5R, and Lola91 samples. Moreover, the Myb-like DNA-binding domain and WD domain were shared by the CC5R, Lola91, and Peldon samples. Finally, peroxidase was detected in the CC2R, Lola91, and Peldon samples.
The functional domains detected in herbicide sensitive (HS) samples were found to be abundant and largely overlapping with those detected in herbicide-resistant (HR) samples, as demonstrated in Tables 2, 3 and S3. Three functional domains, namely the Jacalin-like lectin domain, the GDSL/SGNH-like Acyl-Esterase family, and the PMR5 N-terminal domain, were present exclusively in the three HS samples. Additionally, annotations related to herbicide detoxification were identified in the HS samples, including the presence of cytochrome P450 and glutathione transferases in all HS samples and an ABC transporter annotated only to predicted eccDNA genes in CC5S and Roth (Table S4).
To better understand the inheritance of eccDNA in blackgrass, we further compared the functional domains separately in the parents (Figure 4), CC2 (Figure 5) and CC5 (Figure 6) populations. In total, 99 functional domains were shared by the three parents, and 26 functional domains were shared by the two HR parents (Peldon and Lola91), while 123 functional domains were only detected in the HS parent (Roth) (Figure 4 and Table S6). Some defense related functional domains such as DnaJ domain, glycosyl hydrolase and response regulator receiver domains were uniquely identified in the HR parents.
Upon comparing the two distinct biotypes within the CC2 population, we found that functional annotations associated with ABC transporters were exclusively identified in the CC2R (HR) samples (Table S7). In contrast, within the CC5 population, functional annotations linked to ABC transporters were detected in both the CC5R and CC5S samples.
To gain insights into the functions of the predicted extra-chromosomal circular DNA (eccDNA) genes, we conducted a homology-based analysis with previously identified blackgrass herbicide-resistant candidate genes [46]. Our results showed that eccDNAs homologous to the crucial herbicide-resistant gene, GSTF1 (ALOMY3G11302 [36]), were detected exclusively in the herbicide-resistant (HR) samples CC2R and CC5R (Table 4). Of particular interest, this sequence is 39 amino acids shorter than the other four gene sequences previously identified [47] but otherwise most closely aligns with the GST2c sequence. ALOMY3G11302 was differentially expressed in both CC2R and CC5R populations, as reported by [36]. EccDNAs homologous to GSTU2 were present in both HR and HS samples. The presence of eccDNAs homologous to OPR1 and GSTF2 was restricted to Peldon and CC5S, respectively. We also found sequences that associated with PF00662 (NADH-Ubiquinone oxidoreductase) common all the samples (Supplementary Table S5) as well as those associated with PF01370 the NAD-dependent epimerase/dehydratase in the CC5R, Peldon and Roth samples (Supplementary Table S1).

3.3. Gene ontology enrichment of blackgrass eccDNA

Gene ontology (GO) enrichment analysis of predicted coding contents in eccDNA samples showed a list of GO terms associated with biological processes, cellular components, and molecular function were enriched in HR and HS samples. In HR samples, the enriched biological processes include translation, transcription, protein neddylation, photosynthesis, cytochrome complex assembly, ATP-related processes, and galactose metabolic process (Figure 7a and Supplementary Table S9). For the cellular component category, GO terms that related to photosystem, thylakoid, and ribosome were enriched in HR samples (Figure 7b and Supplementary Table S9). Representative molecular functions for HR eccDNAs include zinc ion binding, UDP-glucose, structure constituent of ribosome and chlorophyll binding (Figure 7c and Supplementary Table S9).
Compared to the HR samples, similar but slightly different GO terms were enriched in HS samples. Biological processes such as chlorophyll catabolic process, glucose catabolic process, RNA metabolic process, heme oxidation, phosphate ion transport and proteolysis were only enriched in HS samples (Figures 7a, 8a and 9 and Supplementary Table S9). While biological process such as ATP synthesis, protein neddylation, transcription, tryptophan metabolic process and cellular manganese ion homeostasis were uniquely enriched in HR samples. All GO terms related to cellular components that were enriched in HR samples were also enriched in HS samples except for DNA replication factor, exosome and H4/H2A histone acetyltransferase complex (Figures 7b, 8b and 9 and Supplementary Table S9). Molecular functions such as chlorophyllase activity, enzyme inhibitor activity, phosphoglycerate mutase activity and double-stranded DNA binding were only identified in HS samples (Figures 7c, 8c and 9 and Supplementary Table S9). While GO terms related oxidoreductase activity, ADP binding, exonuclease activity and manganese ion transmembrane transporter activity were uniquely enriched in HR samples.
Figure 9. Circos plot showing the count of eccDNA identified in herbicide resistant (blue) and sensitive (orange) blackgrass population. Outer ring makes the number of genes (purple) and transposable elements (green) in the reference genome.
Figure 9. Circos plot showing the count of eccDNA identified in herbicide resistant (blue) and sensitive (orange) blackgrass population. Outer ring makes the number of genes (purple) and transposable elements (green) in the reference genome.
Preprints 85604 g009

3.4. Genomic origins of blackgrass eccDNA

To determine the genomic origins of eccDNA, the predicted eccDNAs were mapped to the Alopecurus myosuroides reference genome [48]. Numbers of mapped eccDNAs were counted within the non-overlapped windows of 500kb through the whole genome (Supplementary Table S11 and Figure 6). Over 90% of the detected were successfully mapped to the established chromosomes, and around 8.5% of the predicted eccDNAs were mapped to the unanchored sequences. Several genomic segments were identified with a high frequency of eccDNAs being mapped to these regions. These regions include the distal end of chromosome 5 and the near center of chromosome 1 and 7 (Figure 6 and Table 5). The 500kb window localized at the near center of chromosome 1 contained 83 eccDNAs from HR samples and 48 eccDNA eccDNAs from HS samples. The two 500kb window located at the distal end of chromosome 5 contained 237 (HR 133, HS 104) and 137 (HR 82, HS 55) predicted eccDNAs, respectively. And the 500kb window on chromosome 7 mapped 130 and 80 eccDNAs from HS and HR samples, respectively. Stress-related genes, such as ribosomal family, lysine decarboxylase and HPPK, were located in these regions. However, no eccDNA coding contents were mapped to these regions.
In order to gain further insight into the genomic features of the regions enriched with eccDNA-associated genes, we analyzed whether the eccDNA-associated genes originated from regions with a higher gene density than expected by chance (Figure 6). We observed that eccDNA present regions tend to have higher number of genes in comparison to eccDNA absent region in both HR and HS samples. In both HR and HS sample, we observed significant differences (permutation test for difference of means, n = 10,000 replicates) between the number of genes located at eccDNA present region and eccDNA absent region. The mean number of observed genes in eccDNA present region was increased by 0.24 (p = 0.0019) and 0.26 (p = 0.0044) in comparison to eccDNA absent region in HR and HS samples, respectively. Comparing to eccDNA absent region, eccDNA present region tend to have less TE (HR = 3.16, HS = 1.37), however, the difference of number of TEs in eccDNA present region and absent region was only significant in HR samples (p = 0.026). Notably, no significant differences were observed between the number of observed genes or TE in eccDNA present region and eccDNA absent region when all samples were analyzed together.
A prior study identified a list of quantitative trait loci (QTL) regions that are associated with herbicide resistance in various blackgrass populations [48]. To investigate the mapping of eccDNAs to these regions, we found that 496 HR eccDNAs and 352 HS eccDNAs were mapped to the blackgrass herbicide resistant QTL regions (Supplementary Table S11). Our analysis revealed that within the QTL regions, two non-overlapping windows of 500kb located on chromosome 3 (230.0–230.5 Mb and 312.5–313.0 Mb) were uniquely mapped to at least three of the HR samples (Supplementary Table S11). EccDNA coding sequences were mapped to 12 QTL regions (Supplementary Table S13), where the qtl-cc2-2-1, qtl-cc2-2-3, qtl-cc2-5-1, and qtl-cc2-5-2 regions only contained HS eccDNA coding sequences (CDS), while the qtl-cc5-3-3 region only contained HR eccDNA CDS. Additionally, we observed that among the eccDNA CDS mapped to the QTL regions, only HR eccDNA genes were successfully annotated with functional predictions. Most of these genes were predicted to be related to photosystem and gypsy-type transposon (Supplementary Table S13).

4. Discussion

Extra-chromosomal circular DNA (eccDNA) are DNA molecules that are separate from the main chromatin body, or chromosomes, within a cell and are gaining significant attention with their role in trait biology [1,5,9,11,16,20,27]. This form of DNA has been demonstrated to play a crucial role in various physiological processes. One of the most prominent functions of eccDNA is its capacity to harbor and amplify segments of transcriptionally active chromatin, resulting in gene focal amplifications. Furthermore, eccDNA acts as a reservoir of genetic diversity that the cell can draw upon during exposure to stress, whether it be biotic or abiotic, facilitating rapid responses and evolutionary adaptations [1,49]. The extensive use of herbicides over the years has led to the evolution of blackgrass populations that are resistant to multiple herbicides, including those from different chemical classes [35]. Aside from the rapid evolution of herbicide resistance, adaptive traits such as a lifecycle that occurs within a standard winter-cropping time, high fecundity, rapid growth, and ability to compete with the crop for nutrients, light and water, have enabled blackgrass to thrive in agricultural landscapes across Northern and Western Europe and become a major challenge for these farmers.
In this research, we have uncovered a wide array of eccDNAs within several black-grass populations, showcasing varying levels of herbicide resistance. Our findings in black-grass, concerning size, prevalence, and coding content, align closely with Amaranthus palmeri [1]. In general, we identified a suite of encoded genes and transposable elements associated with signaling, transport, and DNA mobility in both herbicide resistant and sensitive populations indicating a possible role in adaptability. For example, a number of gypsy retroelements were found on eccDNA which have been previously implicated with a role in creating genetic diversity [50]. We identified a significant number of proton conducting membrane transporter genes (Table S3) which function in the regulation of proton movement across cell membranes and have been associated with enabling plants to modulate their responses to various environmental stressors such as salinity, drought, and metal toxicity, which enhance their adaptability [51,52]. We also found an abundance of NADH dehydrogenase functional domains generally in the eccDNA coding dataset. This gene class is part of the mitochondrial electron transport chain and is involved in energy production, redox balance, and respiratory metabolism. Presence on an eccDNA may influence expression amplitudes of these genes and contribute with the plant’s ability to cope with stress through maintenance of cellular energy levels and redox homeostasis during adverse environmental conditions [53,54]. Ribosomal proteins were also highly abundant indicating a need for the ability for protein turnover, which has also been reported in [1]. Transcription factors such as WRKY, MYB, and other classes (Table S3) were also generally found in the blackgrass populations on eccDNA. These classes of transcription factors are pivotal regulators in plants that enable the activation of key processes such as stress responses, defense mechanisms, and various developmental process which are critical in allowing plants to integrate these signals and fine-tune the adaptative potential [55,56,57]. Presence of these transcription factors on eccDNA indicate their relevance as rapid response elements.
A principal goal of our study was to examine signatures of adaptation in the eccDNA content between blackgrass populations that have rapidly evolved metabolic herbicide resistance with those that have not. Interestingly, gene ontology enrichment analysis between the contrasting populations did not directly identify detoxification as being enriched exclusively in HR populations (Figure 7). Many of the same processes were enriched in both biotypes that include chlorophyll and glucose catabolism, RNA metabolism, heme oxidation, phosphate transport, and proteolysis. However, these processes are fundamental to plant growth and serve as an antioxidant in some capacities.
Investigation at the individual gene level did identify genes previously implicated with herbicide resistance [46]. A key gene found on an eccDNA in our study exclusive to the two resistant populations (CC2R and CC5R) was the GSTF1 (Glutathione S-Transferase Phi 1 - ALOMY3G11302 [36]). This gene belongs to the GST gene family which encodes enzymes involved in detoxifying herbicides by catalyzing the conjugation of glutathione to herbicide molecules making them less toxic and more water-soluble for excretion or transport out of plant cells [47]. This gene has been associated with resistance to multiple classes of herbicides, including triazine and atrazine, indicating a role in multiple herbicide resistance [47]. Alternate alleles for GSTF1 have been discovered where some alleles may confer higher herbicide resistance than other. Notably, the GSTF1 in this study is 39 amino acids shorter than previously reported isoforms [47]. The presence of this gene on an eccDNA with novel genetic variation in our study suggests a possible role in herbicide resistance and that eccDNA based gene amplification and/or ultra-expression may also be critical factors to consider. Our study also identified GSTU2, OPR1, and GSTF2 as present of eccDNA, but not exclusively to HR populations (Table 4). These genes have also been implicated in herbicide resistance; however these results may also suggest that perhaps detoxification may not fully support the explanation for the resistant phenotype observed in our HR populations.
The genomic mechanisms that contribute to the biogenesis of eccDNA are still not well understood. Some explanations include illegitimate replication (replication slippage), non-homologous end joining, homologous recombination, microhomology-mediated break induced replication [58,59,60], and likely some organized or directed mechanisms, such as the EPSPS gene amplification that confers glyphosate resistance [11]. It is speculated that eccDNA can exist in multimeric forms and undergo recombination as a mechanism of compounded building into larger complex structures [1,19]. Analysis of eccDNA origins revealed eccDNA biogenesis on every chromosome with hotspots (high frequency of eccDNA formation) on chromosomes 1 and 7 (Figure 6). The identification of eccDNA originating from all over the genome is consistent with Amaranthus palmeri [1], as well as recent studies in mammalian cells [61,62]. These data provide further evidence that eccDNA biogenesis contributes to evolutionary innovation by contributing global genetic heterogeneity, plasticity, and a critical element to a plant’s trajectory to rapid adaptation. To further expand on this thought, we considered the concept of a two-speed genome recently proposed by eccDNA analysis of the rice blast fungal pathogen [63]. This concept considers that segments or regions within a genome evolve at different rates, and that perhaps, this concept can correlate with genes found as genomic focal amplifications. The study considered genes found on eccDNA and their origin as within a gene rich/poor region of the genome, in addition to proximity to repetitive elements and found that genes on eccDNAs were in fact under faster evolutionary rates when compared to genes not found as eccDNA [63]. We examined this concept with eccDNA in this study and only found a statistically significant result when comparing the HR and HS populations together, suggesting alternate evolutionary pressures between HR and HS populations presented here. Perhaps the selective pressure and adaptative signatures that correspond to herbicide resistance have broader implications genome-wide and further support mechanisms that provide resistance beyond metabolic genes. It has been proposed that plants that have rapidly evolved to a xenobiotic pressure also have a new disposition to withstand other abiotic stresses [64]. The compartmentalized genome architectures of the pathogens could also be organized in a way that these evolutionary signatures are more pronounced when compared to large plant genomes like blackgrass (3.2Gb).

5. Conclusions

Our analysis provides evidence of evolutionary innovation and useful insights into the abundance, coding content, functional domains, and biogenesis patterns of eccDNA in various blackgrass populations, with a focus on differences between herbicide-resistant and herbicide-sensitive populations. These findings demonstrate that the pernicious weed blackgrass (Alopecurus myosuroides) carries eccDNA with functional domains associated with genes previously reported to be associated with herbicide detoxification. As the eccDNA content is not identical between herbicide resistant plants descending from different families exhibiting equivalent NTSR phenotypes, these differences may help to explain the polygenic enhanced metabolic resistance seen in these plants. It is interesting to note that unlike EPSP synthase in Palmer amaranth, as blackgrass plants exhibiting sensitivity to herbicides also carry eccDNA with protein domains associated with herbicide detoxification, differences in eccDNA content cannot be fully responsible for the different phenotypes observed. Our data add blackgrass to the expanding list of organisms including human cells, yeast, Drosophilia, and Palmer amaranth and add to the mechanistic knowledge of how they confer herbicide resistance, alongside acting in stress adaptation and other physiological and pathological states.

Supplementary Materials

The following supporting information can be downloaded at: Preprints.org, Table S1: Summary and charaterization of eccDNAs detected in multiple herbicide resistant blackgrass populations; Table S2: Summary and charaterization of eccDNAs detected in multiple herbicide sensitive blackgrass populations; Table S3: Venn diagram result summary for herbicide resistant eccDNAs with annotation; Table S4: Venn diagram result summary for herbicide sensitive eccDNAs with annotation; Table S5: Summary and charaterization of eccDNAs detected in all samples; Table S6: Venn diagram result summary for eccDNAs with annoation among parents; Table S7: Venn diagram result summary for eccDNAs with annoation in CC2 population; Table S8: Venn diagram result summary for eccDNAs with annoation in CC5 population; Table S9: Gene ontology enrichment of all herbicide resistant eccDNAs genes classified as biological process (BP), cellular component (CC), and molecular function (MF); Table S10: Gene ontology enrichment of all non-target site herbicide susceptible eccDNAs genes classified as biological process (BP), cellular component (CC), and molecular function (MF); Table S11: Counts of eccDNAs mapping to the blackgrass genome - 500kb; Table S12: Counts of eccDNAs mapping to the blackgrass genome - 1Mbp; Table S13: Summary of eccDNA genes located in previously detected QTL regions associated with blackgrass herbicide resistance.

Author Contributions

Conceptualization, Dana MacGregor, David Comont and Christopher Saski; Data curation, Wanfang Fu; Formal analysis, Wanfang Fu, Dana MacGregor and Christopher Saski; Funding acquisition, Dana MacGregor; Methodology, Wanfang Fu and David Comont; Project administration, Christopher Saski; Writing – original draft, Wanfang Fu, Dana MacGregor and Christopher Saski; Writing – review & editing, Dana MacGregor and David Comont. All authors have read and agreed to the final manuscript.

Funding

Bayer Crop Sciences Grant # 2019001099 to CS and DM. Rothamsted Research receives strategic funding from the Biotechnology and Biological Sciences Research Council of the United Kingdom (BBSRC). We acknowledge support for DM and DC from the Growing Health Institute Strategic Programme (BB/X010953/1).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

The corresponding author confirms that all authors listed on the manuscript have reviewed and approved the final version of the manuscript, including its title, contents, and authorship. Each author has contributed significantly to the work, and all authors are in agreement with the submission. Additionally, the corresponding author confirms that, where applicable, appropriate permissions for copyrighted materials, or proprietary data have been obtained. Any relevant ethical considerations, conflicts of interest, and financial disclosures have been appropriately disclosed within the manuscript. By submitting this manuscript, the corresponding author certifies that they have the authority to act on behalf of all co-authors in all matters pertaining to the submission and publication of this manuscript.

Data Availability Statement

All data presentd in this study is publicly available at the Sequence Read Archive (SRA) in Genbank under Bioproject # PRJNA1017797 and sample accessions SAMN37408031-SAMN37408037.

Acknowledgments

The authors would like to acknowledge Richard Hull and Laura Crook alongside the Rothamsted Horticultural and Controlled Environment Team for their help in growing and harvesting the plant material used herein.

Conflicts of Interest

The research outcomes presented in this work were supported in part by industrial funding from Bayer AG. While efforts have been made to ensure the integrity and objectivity of the research, the authors acknowledge the potential for a conflict of interest that could arise due to the involvement of industry funding. The research findings and conclusions presented here are based on rigorous scientific investigation, and any potential influence of industry funding on the research process has been managed in accordance with ethical and transparency standards.

References

  1. Spier Camposano H, Molin WT, Saski CA: Sequence characterization of eccDNA content in glyphosate sensitive and resistant Palmer amaranth from geographically distant populations. PLoS One 2022, 17, e0260906.
  2. Gaubatz JW: Extrachromosomal circular DNAs and genomic sequence plasticity in eukaryotic cells. Mutat Res 1990, 23, 271–292.
  3. Cohen S, Menut S, Mechali M: Regulated formation of extrachromosomal circular DNA molecules during development in Xenopus laevis. Mol Cell Biol 1999, 19, 6682–6689. [CrossRef] [PubMed]
  4. Cohen S, Yacobi K, Segal D: Extrachromosomal circular DNA of tandemly repeated genomic sequences in Drosophila. Genome Res 2003, 13, 1133–1145. [CrossRef] [PubMed]
  5. Moller HD, Parsons L, Jorgensen TS, Botstein D, Regenberg B: Extrachromosomal circular DNA is common in yeast. P Natl Acad Sci USA 2015, 112, E3114–E3122.
  6. Peng HR, Mirouze M, Bucher E: Extrachromosomal circular DNA: A neglected nucleic acid molecule in plants. Curr Opin Plant Biol 2022, 69.
  7. Hotta Y, Bassel A: Molecular Size and Circularity of DNA in Cells of Mammals and Higher Plants. Proc Natl Acad Sci U S A 1965, 53, 356–362. [CrossRef]
  8. Mansisidor A, Molinar T, Srivastava P, Dartis DD, Delgado AP, Blitzblau HG, Klein H, Hochwagen A: Genomic Copy-Number Loss Is Rescued by Self-Limiting Production of DNA Circles. Mol Cell 2018, 72, 583–+. [CrossRef]
  9. Paulsen T, Kumar P, Koseoglu MM, Dutta A: Discoveries of Extrachromosomal Circles of DNA in Normal and Tumor Cells. Trends Genet 2018, 34, 270–278. [CrossRef]
  10. Kilzer JM, Stracker T, Beitzel B, Meek K, Weitzman M, Bushman FD: Roles of host cell factors in circularization of retroviral DNA. Virology 2003, 314, 460–467. [CrossRef]
  11. Molin WT, Yaguchi A, Blenner M, Saski CA: The EccDNA Replicon: A Heritable, Extranuclear Vehicle That Enables Gene Amplification and Glyphosate Resistance in Amaranthus palmeri. Plant Cell 2020, 32, 2132–2140. [CrossRef] [PubMed]
  12. Garfinkel DJ, Stefanisko KM, Nyswaner KM, Moore SP, Oh J, Hughes SH: Retrotransposon suicide: Formation of Ty1 circles and autointegration via a central DNA flap. J Virol 2006, 80, 11920–11934. [CrossRef] [PubMed]
  13. Deshpande V, Luebeck J, Nguyen ND, Bakhtiari M, Turner KM, Schwab R, Carter H, Mischel PS, Bafna V: Exploring the landscape of focal amplifications in cancer using AmpliconArchitect. Nat Commun 2019, 10, 392. [CrossRef] [PubMed]
  14. Mehta D, Cornet L, Hirsch-Hoffmann M, Zaidi SS, Vanderschuren H: Full-length sequencing of circular DNA viruses and extrachromosomal circular DNA using CIDER-Seq. Nat Protoc 2020, 15, 1673–1689. [CrossRef]
  15. Moller HD: Circle-Seq: Isolation and Sequencing of Chromosome-Derived Circular DNA Elements in Cells. Methods Mol Biol 2020, 2119, 165–181.
  16. Turner KM, Deshpande V, Beyter D, Koga T, Rusert J, Lee C, Li B, Arden K, Ren B, Nathanson DA et al: Extrachromosomal oncogene amplification drives tumour evolution and genetic heterogeneity. Nature 2017, 543, 122–+. [CrossRef]
  17. Wu S, Turner KM, Nguyen N, Raviram R, Erb M, Santini J, Luebeck J, Rajkumar U, Diao Y, Li B et al: Circular ecDNA promotes accessible chromatin and high oncogene expression. Nature 2019, 575, 699–703. [CrossRef]
  18. Shibata Y, Kumar P, Layer R, Willcox S, Gagan JR, Griffith JD, Dutta A: Extrachromosomal microDNAs and chromosomal microdeletions in normal tissues. Science 2012, 336, 82–86. [CrossRef]
  19. Koo DH, Molin WT, Saski CA, Jiang J, Putta K, Jugulam M, Friebe B, Gill BS: Extrachromosomal circular DNA-based amplification and transmission of herbicide resistance in crop weed Amaranthus palmeri. P Natl Acad Sci USA 2018, 115, 3332–3337. [CrossRef]
  20. Molin WT, Yaguchi A, Blenner M, Saski CA: Autonomous replication sequences from the Amaranthus palmeri eccDNA replicon enable replication in yeast. BMC Res Notes 2020, 13, 330.
  21. Cohen S, Segal D: Extrachromosomal circular DNA in eukaryotes: possible involvement in the plasticity of tandem repeats. Cytogenet Genome Res 2009, 124, 327–338. [CrossRef] [PubMed]
  22. Baker SP, Grant PA: The SAGA continues: expanding the cellular role of a transcriptional co-activator complex. Oncogene 2007, 26, 5329–5340. [CrossRef] [PubMed]
  23. Herrup K, Yang Y: Cell cycle regulation in the postmitotic neuron: oxymoron or new biology? Nat Rev Neurosci 2007, 8, 368–378. [CrossRef]
  24. Molin WT, Wright AA, Lawton-Rauh A, Saski CA: The unique genomic landscape surrounding the EPSPS gene in glyphosate resistant Amaranthus palmeri: a repetitive path to resistance. BMC Genomics 2017, 18, 91.
  25. Moller HD, Mohiyuddin M, Prada-Luengo I, Sailani MR, Halling JF, Plomgaard P, Maretty L, Hansen AJ, Snyder MP, Pilegaard H et al: Circular DNA elements of chromosomal origin are common in healthy human somatic tissue. Nat Commun 2018, 9, 1069. [CrossRef] [PubMed]
  26. Ain Q, Schmeer C, Wengerodt D, Witte OW, Kretz A: Extrachromosomal Circular DNA: Current Knowledge and Implications for CNS Aging and Neurodegeneration. Int J Mol Sci 2020, 21.
  27. Tomaska L, Nosek J, Kramara J, Griffith JD: Telomeric circles: universal players in telomere maintenance? Nat Struct Mol Biol 2009, 16, 1010–1015. [CrossRef]
  28. Mazzucco G, Huda A, Galli M, Piccini D, Giannattasio M, Pessina F, Doksani Y: Telomere damage induces internal loops that generate telomeric circles. Nature Communications 2020, 11.
  29. Hull RM, King M, Pizza G, Krueger F, Vergara X, Houseley J: Transcription-induced formation of extrachromosomal DNA during yeast ageing. Plos Biol 2019, 17.
  30. Gresham D, Usaite R, Germann SM, Lisby M, Botstein D, Regenberg B: Adaptation to diverse nitrogen-limited environments by deletion or extrachromosomal element formation of the GAP1 locus. Proc Natl Acad Sci U S A 2010, 107, 18551–18556. [CrossRef]
  31. Moss SR, Perryman SAM, Tatnell LV: Managing herbicide-resistant blackgrass (Alopecurus myosuroides): Theory and practice. Weed Technol 2007, 21, 300–309. [CrossRef]
  32. Hicks HL, Comont D, Coutts SR, Crook L, Hull R, Norris K, Neve P, Childs DZ, Freckleton RP: The factors driving evolved herbicide resistance at a national scale. Nat Ecol Evol 2018, 2, 529–536. [CrossRef] [PubMed]
  33. Heap I: The International Herbicide-Resistant Weed Database. 2023.
  34. Varah A, Ahodo K, Coutts SR, Hicks HL, Comont D, Crook L, Hull R, Neve P, Childs DZ, Freckleton RP et al: The costs of human-induced evolution in an agricultural system. Nat Sustain 2020, 3, 63–71.
  35. Comont D, Lowe C, Hull R, Crook L, Hicks HL, Onkokesung N, Beffa R, Childs DZ, Edwards R, Freckleton RP et al: Evolution of generalist resistance to herbicide mixtures reveals a trade-off in resistance management. Nature Communications 2020, 11.
  36. Cai L, Comont D, MacGregor D, Lowe C, Beffa R, Neve P, Saski C: The blackgrass genome reveals patterns of non-parallel evolution of polygenic herbicide resistance. New Phytologist 2023, 237, 1891–1907. [CrossRef]
  37. Mellado-Sánchez M, McDiarmid F, Cardoso V, Kanyuka K, MacGregor DR: Virus-mediated transient expression techniques enable gene function studies in black-grass. Plant Physiology 2020, 183, 455–459. [CrossRef] [PubMed]
  38. Li W, Godzik A: Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 2006, 22, 1658–1659. [CrossRef] [PubMed]
  39. RepeatMasker Open-4.0. Available online: http://www.repeatmasker.org.
  40. Cantarel BL, Korf I, Robb SM, Parra G, Ross E, Moore B, Holt C, Sanchez Alvarado A, Yandell M: MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res 2008, 18, 188–196. [CrossRef]
  41. Young MD, Wakefield MJ, Smyth GK, Oshlack A: Gene ontology analysis for RNA-seq: accounting for selection bias. Genome Biol 2010, 11.
  42. Villanueva RAM, Chen ZJ: ggplot2: Elegant Graphics for Data Analysis, 2nd edition. Meas-Interdiscip Res 2019, 17, 160–167.
  43. Chan PP, Lowe TM: GtRNAdb: a database of transfer RNA genes detected in genomic sequence. Nucleic Acids Res 2009, 37, D93–D97. [CrossRef] [PubMed]
  44. Li H: Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 2018, 34, 3094–3100. [CrossRef] [PubMed]
  45. 45. Quinlan AR: BEDTools: The Swiss-Army Tool for Genome Feature Analysis. Curr Protoc Bioinformatics 2014, 11, 11–34.
  46. Tetard-Jones C, Sabbadin F, Moss S, Hull R, Neve P, Edwards R: Changes in the proteome of the problem weed blackgrass correlating with multiple-herbicide resistance. Plant J 2018, 94, 709–720. [CrossRef] [PubMed]
  47. Cummins I, Cole DJ, Edwards R: A role for glutathione transferases functioning as glutathione peroxidases in resistance to multiple herbicides in black-grass. The Plant Journal 1999, 18, 285–292. [CrossRef]
  48. Cai L, Comont D, MacGregor D, Lowe C, Beffa R, Neve P, Saski C: The blackgrass genome reveals patterns of non-parallel evolution of polygenic herbicide resistance. New Phytol 2022.
  49. Li R, Wang Y, Li J, Zhou X: Extrachromosomal circular DNA (eccDNA): an emerging star in cancer. Biomark Res 2022, 10, 53.
  50. Fan W, Wang L, Chu J, Li H, Kim EY, Cho J: Tracing Mobile DNAs: From Molecular to Population Scales. Front Plant Sci 2022, 13, 837378. [CrossRef]
  51. Conde A, Chaves MM, Geros H: Membrane transport, sensing and signaling in plant adaptation to environmental stress. Plant Cell Physiol 2011, 52, 1583–1602. [CrossRef]
  52. Zhou Y, Wang B, Yuan F: The Role of Transmembrane Proteins in Plant Growth, Development, and Stress Responses. Int J Mol Sci 2022, 23.
  53. Pu XJ, Lv X, Tan TH, Fu FQ, Qin GW, Lin HH: Roles of mitochondrial energy dissipation systems in plant development and acclimation to stress. Ann Bot-London 2015, 116, 583–600. [CrossRef] [PubMed]
  54. Barreto P, Koltun A, Nonato J, Yassitepe J, Maia IG, Arruda P: Metabolism and Signaling of Plant Mitochondria in Adaptation to Environmental Stresses. Int J Mol Sci 2022, 23.
  55. Phukan UJ, Jeena GS, Shukla RK: WRKY Transcription Factors: Molecular Regulation and Stress Responses in Plants. Front Plant Sci 2016, 7, 760.
  56. Wang H, Chen W, Xu Z, Chen M, Yu D: Functions of WRKYs in plant growth and development. Trends Plant Sci 2023, 28, 630–645. [CrossRef] [PubMed]
  57. Li J, Han G, Sun C, Sui N: Research advances of MYB transcription factors in plant stress resistance and breeding. Plant Signal Behav 2019, 14, 1613131. [CrossRef] [PubMed]
  58. Yang LD, Jia RB, Ge TX, Ge SF, Zhuang A, Chai PW, Fan XQ: Extrachromosomal circular DNA: biogenesis, structure, functions and diseases. Signal Transduct Tar 2022, 7.
  59. Cao X, Wang S, Ge L, Zhang W, Huang J, Sun W: Corrigendum: Extrachromosomal Circular DNA: Category, Biogenesis, Recognition, and Functions. Front Vet Sci 2021, 8, 784611. [CrossRef]
  60. Wang M, Chen X, Yu F, Ding H, Zhang Y, Wang K: Extrachromosomal Circular DNAs: Origin, formation and emerging function in Cancer. Int J Biol Sci 2021, 17, 1010–1025. [CrossRef]
  61. Chitwood DG, Wang Q, Klaubert SR, Green K, Wu CH, Harcum SW, Saski CA: Microevolutionary dynamics of eccDNA in Chinese hamster ovary cells grown in fed-batch cultures under control and lactate-stressed conditions. Sci Rep 2023, 13, 1200. [CrossRef]
  62. Chitwood DG, Uy L, Fu W, Klaubert SR, Harcum SW, Saski CA: Dynamics of Amino Acid Metabolism, Gene Expression, and Circulomics in a Recombinant Chinese Hamster Ovary Cell Line Adapted to Moderate and High Levels of Extracellular Lactate. Genes (Basel) 2023, 14.
  63. Joubert PM, Krasileva KV: The extrachromosomal circular DNAs of the rice blast pathogen Magnaporthe oryzae contain a wide variety of LTR retrotransposons, genes, and effectors. BMC Biol 2022, 20, 260.
  64. Sen MK, Hamouzova K, Kosnarova P, Roy A, Soukup J: Herbicide resistance in grass weeds: Epigenetic regulation matters too. Frontiers in Plant Science 2022, 13.
Figure 1. EccDNA length distribution in herbicide resistant (a) and sensitive (b) blackgrass populations.
Figure 1. EccDNA length distribution in herbicide resistant (a) and sensitive (b) blackgrass populations.
Preprints 85604 g001
Figure 2. Venn diagram of PFAM elements shared by herbicide resistant eccDNA samples.
Figure 2. Venn diagram of PFAM elements shared by herbicide resistant eccDNA samples.
Preprints 85604 g002
Figure 3. Venn diagram of PFAM elements shared by herbicide sensitive eccDNA samples.
Figure 3. Venn diagram of PFAM elements shared by herbicide sensitive eccDNA samples.
Preprints 85604 g003
Figure 4. Venn diagram of PFAM elements shared by parents.
Figure 4. Venn diagram of PFAM elements shared by parents.
Preprints 85604 g004
Figure 5. Venn diagram of PFAM elements shared by CC2 population.
Figure 5. Venn diagram of PFAM elements shared by CC2 population.
Preprints 85604 g005
Figure 6. Venn diagram of PFAM elements shared by CC5 population.
Figure 6. Venn diagram of PFAM elements shared by CC5 population.
Preprints 85604 g006
Figure 7. Gene ontology enrichment terms and their prevalence among herbicide resistant blackgrass eccDNA samples. a) Biological process, b) Cellular components, c) Molecular functions.
Figure 7. Gene ontology enrichment terms and their prevalence among herbicide resistant blackgrass eccDNA samples. a) Biological process, b) Cellular components, c) Molecular functions.
Preprints 85604 g007
Figure 8. Gene ontology enrichment terms and their prevalence among herbicide resistant blackgrass eccDNA samples. a) Biological process, b) Cellular components, c) Molecular functions.
Figure 8. Gene ontology enrichment terms and their prevalence among herbicide resistant blackgrass eccDNA samples. a) Biological process, b) Cellular components, c) Molecular functions.
Preprints 85604 g008
Table 1. EccDNA characterization of herbicide resistant and sensitive blackgrass populations.
Table 1. EccDNA characterization of herbicide resistant and sensitive blackgrass populations.
Sample ID # eccDNA Mean length Length range # eccDNA with gene # eccDNA with tRNA % eccDNA with CDS
CC2R 4812 7058 31-28980 1031 53 21.43
CC2S 4233 6994 60-25087 1010 77 23.86
CC5R 5288 6946 51-29081 1145 74 21.65
CC5S 4332 7002 54-28260 958 45 22.11
Peldon 4443 6868 36-26368 962 59 21.65
Lola91 5426 6918 49-27814 1153 70 21.25
Roth 5663 7040 51-27090 1291 70 22.8
Table 2. Functional protein domains shared by all herbicide resistant and herbicide sensitive eccDNA samples with at least 30 counts in one set or that were only found in all HS or HR samples. N.D. designates those functional protein domains that were not detected in all the samples of that grouping.
Table 2. Functional protein domains shared by all herbicide resistant and herbicide sensitive eccDNA samples with at least 30 counts in one set or that were only found in all HS or HR samples. N.D. designates those functional protein domains that were not detected in all the samples of that grouping.
Pfam Accession Annotation # eccDNA
PF04195 Putative gypsy type transposon 416
PF00361 Proton-conducting membrane transporter 348
PF00146 NADH dehydrogenase 148
PF00223 Photosystem I psaA/psaB protein 139
PF00499 NADH-ubiquinone/plastoquinone oxidoreductase chain 6 133
PF00420 NADH-ubiquinone/plastoquinone oxidoreductase chain 4L 112
PF13237 4Fe-4S dicluster domain 103
PF00124 Photosynthetic reaction centre protein 93
PF01578 Cytochrome C assembly protein 91
PF00346 Respiratory-chain NADH dehydrogenase, 49 Kd subunit 87
PF00662 NADH-Ubiquinone oxidoreductase (complex I), chain 5 N-terminus 81
PF01010 NADH-dehyrogenase subunit F, TMs, (complex I) C-terminus 78
PF00421 Photosystem II protein 48
PF00623 RNA polymerase Rpb1, domain 2 42
PF01535 PPR repeat 41
PF00006 ATP synthase alpha/beta family, nucleotide-binding domain 38
PF13456 Reverse transcriptase-like 36
PF00276 Ribosomal protein L23 32
PF13041 PPR repeat family 31
PF01824 MatK/TrnK amino terminal region 30
Table 3. Top twenty functional protein domains shared by herbicide sensitive eccDNA samples.
Table 3. Top twenty functional protein domains shared by herbicide sensitive eccDNA samples.
Pfam Accession Annotation # eccDNA
PF04195 Putative gypsy type transposon 361
PF00361 Proton-conducting membrane transporter 208
PF00223 Photosystem I psaA/psaB protein 107
PF00146 NADH dehydrogenase 91
PF00499 NADH-ubiquinone/plastoquinone oxidoreductase chain 6 81
PF13237 4Fe-4S dicluster domain 65
PF00124 Photosynthetic reaction centre protein 63
PF00346 Respiratory-chain NADH dehydrogenase, 49 Kd subunit 62
PF00662 NADH-Ubiquinone oxidoreductase (complex I), chain 5 N-terminus 56
PF01578 Cytochrome C assembly protein 55
PF01010 NADH-dehyrogenase subunit F, TMs, (complex I) C-terminus 51
PF00420 NADH-ubiquinone/plastoquinone oxidoreductase chain 4L 43
PF13456 Reverse transcriptase-like 36
PF00421 Photosystem II protein 35
PF01535 PPR repeat 35
PF00006 ATP synthase alpha/beta family, nucleotide-binding domain 34
PF00069 Protein kinase domain 32
PF00646 F-box domain 24
PF02874 ATP synthase alpha/beta family, beta-barrel domain 24
PF03040 CemA family 24
Table 4. Homologous Alopecurus myosuroides candidate genes of predicted eccDNA
Table 4. Homologous Alopecurus myosuroides candidate genes of predicted eccDNA
Category eccDNA gene ID Homologous candidate gene Identity (%) E-value Bit score
Resistant CC2R_00001032 GSTF1 95.434 4.95E-162 436
CC5R_00001033 GSTF1 31.25 1.07E-14 64.7
CC5R_00001372 GSTF1 55 2.22E-91 257
Lola91_00001333 GSTU2 61.364 1.70E-42 128
Lola91_00001340 GSTU2 38.009 1.69E-42 133
Peldon_00000910 OPR1 59.6 7.35E-108 306
Peldon_00000911 OPR1 64.646 6.10E-44 137
Sensitive CC2S_00001307 GSTU2 65.741 8.99E-53 157
CC5S_00001012 GSTF2 61.176 1.37E-33 105
Roth_00001016 GSTU2 31.429 2.42E-24 85.1
Table 5. Alopecurus myosuroides genes located in the genomic region with high number of mapped eccDNA
Table 5. Alopecurus myosuroides genes located in the genomic region with high number of mapped eccDNA
Chr Start (Mb) End(Mb) # Herbicide resistant eccDNA # Herbicide sensitive eccDNA Genes located Functional annotation
1 251.0 251.5 83 48 ALOMY1G03512
ALOMY1G03513 PF00646 F-box domain
ALOMY1G03514 PF02485 Core-2/I-Branching enzyme
ALOMY1G03515
ALOMY1G03516 PF01288 7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase (HPPK)
ALOMY1G03517 PF00809 Pterin binding enzyme
5 8.5 9.0 133 104 ALOMY5G31542 PF01015 Ribosomal S3Ae family
ALOMY5G31543 PF03641 Possible lysine decarboxylase
ALOMY5G31544
ALOMY5G31545 PF00294 pfkB family carbohydrate kinase
ALOMY5G31546
ALOMY5G31547
ALOMY5G31548
ALOMY5G31549
ALOMY5G31550
ALOMY5G31551 PF00069 Protein kinase domain
ALOMY5G31552 PF00069 Protein kinase domain
ALOMY5G31553 PF00069 Protein kinase domain
ALOMY5G31554
ALOMY5G31555
5 45.0 45.5 82 55 ALOMY5G32133 PF06747 CHCH domain
ALOMY5G32134
ALOMY5G32135
ALOMY5G32136
ALOMY5G32137
ALOMY5G32138 PF00179 Ubiquitin-conjugating enzyme
ALOMY5G32139 PF00829 Ribosomal prokaryotic L21 protein
ALOMY5G32140
ALOMY5G32141
7 243.5 244.0 130 80 ALOMY7G40250 PF02493 MORN repeat
ALOMY7G40251
ALOMY7G40252
ALOMY7G40253 PF00023 Ankyrin repeat
ALOMY7G40254 PF01694 Rhomboid family
ALOMY7G40255
ALOMY7G40256 PF12937 F-box-like
ALOMY7G40257 PF00194 Eukaryotic-type carbonic anhydrase
ALOMY7G40258
ALOMY7G40259 PF00153 Mitochondrial carrier protein
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated