Submitted:
28 May 2025
Posted:
28 May 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
3. Results
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| aa | amino acid |
| LAGLIDADG | one letter abbreviations of amino acids in a particular homing endonuclease motif |
| MCM | minichromosome maintenance |
| PDB | Protein Databank |
| pLDDT | predicted local distance difference test |
| taxid | Taxonomy ID |
References
- Hirata, R.; Ohsumk, Y.; Nakano, A.; Kawasaki, H.; Suzuki, K.; Anraku, Y. Molecular structure of a gene, VMA1, encoding the catalytic subunit of H(+)-translocating adenosine triphosphatase from vacuolar membranes of Saccharomyces cerevisiae. J. Biol. Chem. 1990, 265, 6726–6733. [Google Scholar] [CrossRef]
- Kane, P.M.; Yamashiro, C.T.; Wolczyk, D.F.; Neff, N.; Goebl, M.; Stevens, T.H. Protein Splicing Converts the Yeast TFP1 Gene Product to the 69-kdDSubunit of the Vacuolar H + -Adenosine Triphosphatase. Science 1990, 250, 651–657. [Google Scholar] [CrossRef]
- Wang, H.; Wang, L.; Zhong, B.; Dai, Z. Protein Splicing of Inteins: A Powerful Tool in Synthetic Biology. Front. Bioeng. Biotechnol. 2022, 10, 810180. [Google Scholar] [CrossRef] [PubMed]
- Gosselin, S.P.; Arsenault, D.; Gogarten, J.P. Actinobacteriophage Inteins: Host Diversity, Local Dissemination, and Non-Canonical Architecture. bioRxiv 2025. [CrossRef]
- Goddard, M.R.; Burt, A. Recurrent invasion and extinction of a selfish gene. Proc. Natl. Acad. Sci. 1999, 96, 13880–13885. [Google Scholar] [CrossRef] [PubMed]
- Naor, A.; Altman-Price, N.; Soucy, S.M.; Green, A.G.; Mitiagin, Y.; Turgeman-Grott, I.; Davidovich, N.; Gogarten, J.P.; Gophna, U. Impact of a homing intein on recombination frequency and organismal fitness. Proc. Natl. Acad. Sci. 2016, 113, E4654–E4661. [Google Scholar] [CrossRef]
- Barzel, A.; Obolski, U.; Gogarten, J.P.; Kupiec, M.; Hadany, L. Home and away- the evolutionary dynamics of homing endonucleases. BMC Evol. Biol. 2011, 11, 324–324. [Google Scholar] [CrossRef]
- Yahara, K.; Fukuyo, M.; Sasaki, A.; Kobayashi, I. Evolutionary maintenance of selfish homing endonuclease genes in the absence of horizontal transfer. Proc. Natl. Acad. Sci. 2009, 106, 18861–18866. [Google Scholar] [CrossRef] [PubMed]
- Gogarten, J.P.; Hilario, E. Inteins, introns, and homing endonucleases: recent revelations about the life cycle of parasitic genetic elements. BMC Evol. Biol. 2006, 6, 94–94. [Google Scholar] [CrossRef]
- Novikova, O.; Jayachandran, P.; Kelley, D.S.; Morton, Z.; Merwin, S.; Topilina, N.I.; Belfort, M. Intein Clustering Suggests Functional Importance in Different Domains of Life. Mol. Biol. Evol. 2015, 33, 783–799. [Google Scholar] [CrossRef]
- Naor, A.; Lazary, R.; Barzel, A.; Papke, R.T.; Gophna, U. In Vivo Characterization of the Homing Endonuclease within the polB Gene in the Halophilic Archaeon Haloferax volcanii. PLOS ONE 2011, 6, e15833. [Google Scholar] [CrossRef]
- Turgeman-Grott, I.; Arsenault, D.; Yahav, D.; Feng, Y.; Miezner, G.; Naki, D.; Peri, O.; Papke, R.T.; Gogarten, J.P.; Gophna, U. Neighboring inteins interfere with one another's homing capacity. PNAS Nexus 2023, 2, pgad354. [Google Scholar] [CrossRef]
- Brewster, A.S.; Chen, X.S. Insights into the MCM functional mechanism: lessons learned from the archaeal MCM complex. Crit. Rev. Biochem. Mol. Biol. 2010, 45, 243–256. [Google Scholar] [CrossRef] [PubMed]
- Maine, G.T.; Sinha, P.; Tye, B.-K. Mutants of S. cerevisiae defective in the maintenance of minichromosomes. Genetics 1984, 106, 365–385. [Google Scholar] [CrossRef]
- Yalala, V.R.; Lynch, A.K.; Mills, K.V. Conditional Alternative Protein Splicing Promoted by Inteins from Haloquadratum walsbyi. Biochemistry 2022, 61, 294–302. [Google Scholar] [CrossRef] [PubMed]
- InBase2.0. Available online: https://inbase.ligsciss.com/index.php?r=site/index (accessed on 23 May 2025).
- Perler, F.B. InBase: the Intein Database. Nucleic Acids Res. 2002, 30, 383–384. [Google Scholar] [CrossRef] [PubMed]
- Finstad, K.M.; Probst, A.J.; Thomas, B.C.; Andersen, G.L.; Demergasso, C.; Echeverría, A.; Amundson, R.G.; Banfield, J.F. Microbial Community Structure and the Persistence of Cyanobacterial Populations in Salt Crusts of the Hyperarid Atacama Desert from Genome-Resolved Metagenomics. Front. Microbiol. 2017, 8, 1435. [Google Scholar] [CrossRef]
- Altschul, S.F.; Madden, T.L.; Schäffer, A.A.; Zhang, J.; Zhang, Z.; Miller, W.; Lipman, D.J. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 1997, 25, 3389–3402. [Google Scholar] [CrossRef]
- Edgar, R.C. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinform. 2004, 5, 113. [Google Scholar] [CrossRef]
- Gouy, M.; Tannier, E.; Comte, N.; Parsons, D.P. Seaview Version 5: A Multiplatform Software for Multiple Sequence Alignment, Molecular Phylogenetic Analyses, and Tree Reconciliation. Methods Mol. Biol. 2021, 2231, 241–260. [Google Scholar] [CrossRef]
- Sievers, F.; Higgins, D.G. Clustal Omega for making accurate alignments of many protein sequences. Protein Sci. 2017, 27, 135–145. [Google Scholar] [CrossRef]
- Kazutaka, K.; Misakwa, K.; Kei-ichi, K.; Miyata, T. MAFFT: A novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002, 30, 3059–3066. [Google Scholar] [CrossRef]
- Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef] [PubMed]
- Pellegrini-Calace, M. Detecting DNA-binding helix-turn-helix structural motifs using sequence and structure information. Nucleic Acids Res. 2005, 33, 2129–2140. [Google Scholar] [CrossRef] [PubMed]
- Minh, B.Q.; Schmidt, H.A.; Chernomor, O.; Schrempf, D.; Woodhams, M.D.; von Haeseler, A.; Lanfear, R. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol. 2020, 37, 1530–1534. [Google Scholar] [CrossRef] [PubMed]
- Kalyaanamoorthy, S.; Minh, B.Q.; Wong, T.K.F.; Von Haeseler, A.; Jermiin, L.S. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat. Methods 2017, 14, 587–589. [Google Scholar] [CrossRef]
- Hoang, D.T.; Chernomor, O.; Von Haeseler, A.; Minh, B.Q.; Vinh, L.S. UFBoot2: Improving the Ultrafast Bootstrap Approximation. Mol. Biol. Evol. 2018, 35, 518–522. [Google Scholar] [CrossRef]
- Swithers, K.S.; Senejani, A.G.; Fournier, G.P.; Gogarten, J.P. Conservation of intron and intein insertion sites: implications for life histories of parasitic genetic elements. BMC Evol. Biol. 2009, 9, 303–303. [Google Scholar] [CrossRef]
- Brewster, A.S.; Wang, G.; Yu, X.; Greenleaf, W.B.; Carazo, J.M.; Tjajadi, M.; Klein, M.G.; Chen, X.S. Crystal structure of a near-full-length archaeal MCM: Functional insights for an AAA+ hexameric helicase. Proc. Natl. Acad. Sci. 2008, 105, 20191–20196. [Google Scholar] [CrossRef]
- Meagher, M.; Epling, L.B.; Enemark, E.J. DNA translocation mechanism of the MCM complex and implications for replication initiation. Nat. Commun. 2019, 10, 3117. [Google Scholar] [CrossRef]
- Mills, K.V.; Johnson, M.A.; Perler, F.B. Protein Splicing: How Inteins Escape from Precursor Proteins. J. Biol. Chem. 2014, 289, 14498–14505. [Google Scholar] [CrossRef] [PubMed]
- Tori, K.; Dassa, B.; Johnson, M.A.; Southworth, M.W.; Brace, L.E.; Ishino, Y.; Pietrokovski, S.; Perler, F.B. Splicing of the Mycobacteriophage Bethlehem DnaB Intein. Journal of Biological Chemistry 2010, 285, 2515–2526. [Google Scholar] [CrossRef] [PubMed]
- Liu, X.-Q.; Yang, J.; Meng, Q. Four Inteins and Three Group II Introns Encoded in a Bacterial Ribonucleotide Reductase Gene. J. Biol. Chem. 2003, 278, 46826–46831. [Google Scholar] [CrossRef] [PubMed]
- Aravind, L.; Koonin, E.V. DNA-binding proteins and evolution of transcription regulation in the archaea. Nucleic Acids Res. 1999, 27, 4658–4670. [Google Scholar] [CrossRef]
- Moure, C.M.; Gimble, F.S.; Quiocho, F.A. Crystal structure of the intein homing endonuclease PI-SceI bound to its recognition sequence. Nat. Struct. Mol. Biol. 2002, 9, 764–770. [Google Scholar] [CrossRef]
- Christ, F.; Steuer, S.; Thole, H.; Wende, W.; Pingoud, A.; Pingoud, V. A Model for the PI-SceI×DNA Complex Based on Multiple Base and Phosphate Backbone-specific Photocross-links. J. Mol. Biol. 2000, 300, 841–849. [Google Scholar] [CrossRef]
- Hu, D.; Crist, M.; Duan, X.; Quiocho, F.A.; Gimble, F.S. Probing the Structure of the PI-SceI-DNA Complex by Affinity Cleavage and Affinity Photocross-linking. J. Biol. Chem. 2000, 275, 2705–2712. [Google Scholar] [CrossRef]






| MCM Intein Invasion Status | Total Homologs with Invasion Status |
| Empty | 3125 |
| Single | 709 |
| Double | 305 |
| Triple | 79 |
| Quadruple | 25 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).