Submitted:
11 July 2025
Posted:
11 July 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Results
2.1. Chromosome-Scale Genome Assembly and Annotation
2.2. Transposable Element Accumulation and Whole Genome Duplication
2.3. Phylogenetic Reconstruction
2.4. Expansion of Disease Resistance-Related Gene Families
2.5. Biosynthesis of Isofraxidin
2.5.1. Characteristics of Coumarin Biosynthetic Pathways in Early Angiosperms
2.5.2. Integrated Transcriptomic-Metabolomic Elucidation of the Isofraxidin Biosynthetic Pathway
3. Discussion
4. Materials and Methods
4.1. Materials and Sequencing
4.2. Genome Assembly
4.3. Repeat Annotation
4.4. Protein-Coding Gene Prediction and Functional Annotation
4.5. Construction of Gene Families
4.6. Phylogenetic Analyses
4.7. Identification of Whole-Genome Duplication
4.8. UPLC/QTRAP-MS Metabolomic Analysis
4.9. Identification of Gene Families Involved in Isofraxidin Biosynthesis
4.10. Integrated Transcriptome-Metabolome Analysis
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Kong, H.Z. Karyotypes of Sarcandra Gardn. and Chloranthus Swartz (Chloranthaceae) from China. Botanical Journal of the Linnean Society 2000, 133, 327–342. [Google Scholar] [CrossRef]
- Hughes, N.F.; Ge, D.; Laing, J.F. Barremian earliest angiosperm pollen. Palaeontology 1979, 22, 513–535. [Google Scholar]
- Hughes, N.F. The enigma of angiosperm origins; Cambridge University Press: 1994; Volume 1.
- Taylor, D.W.; Hickey, L.J. Phylogenetic evidence for the herbaceous origin of angiosperms. Plant Systematics and Evolution 1992, 180, 137–156. [Google Scholar] [CrossRef]
- Doyle, J.A.; Endress, P.K. Integrating Early Cretaceous fossils into the phylogeny of living angiosperms: ANITA lines and relatives of Chloranthaceae. International Journal of Plant Sciences 2014, 175, 555–600. [Google Scholar] [CrossRef]
- Guo, X.; Fang, D.; Sahu, S.K.; Yang, S.; Guang, X.; Folk, R.; Smith, S.A.; Chanderbali, A.S.; Chen, S.; Liu, M.; et al. Chloranthus genome provides insights into the early diversification of angiosperms. Nature communications 2021, 12, 6930. [Google Scholar] [CrossRef] [PubMed]
- Zhang, M.; Liu, D.; Fan, G.; Wang, R.; Lu, X.; Gu, Y.; Shi, Q. Constituents from Chloranthaceae plants and their biological activities. Heterocyclic Communications 2016, 22, 175–220. [Google Scholar] [CrossRef]
- Chen, Y.C.; Li, Z.; Zhao, Y.X.; Gao, M.; Wang, J.Y.; Liu, K.W.; Wang, X.; Wu, L.W.; Jiao, Y.L.; Xu, Z.L. The Litsea genome and the evolution of the laurel family. Nature communications 2020, 11, 1675. [Google Scholar] [CrossRef] [PubMed]
- Robe, K.; Izquierdo, E.; Vignols, F.; Rouached, H.; Dubos, C. The coumarins: secondary metabolites playing a primary role in plant nutrition and health. Trends in Plant Science 2021, 26, 248–259. [Google Scholar] [CrossRef]
- Sharifi-Rad, J.; Cruz-Martins, N.; López-Jornet, P.; Lopez, E.P.-F.; Harun, N.; Yeskaliyeva, B.; Beyatli, A.; Sytar, O.; Shaheen, S.; Sharopov, F. Natural coumarins: exploring the pharmacological complexity and underlying molecular mechanisms. Oxidative Medicine and Cellular Longevity 2021, 2021, 6492346. [Google Scholar] [CrossRef]
- Durmaz, L.; Gulçin, İ.; Taslimi, P.; Tüzün, B. Isofraxidin: Antioxidant, Anti-carbonic Anhydrase, Anti-cholinesterase, Anti-diabetic, and in Silico Properties. ChemistrySelect 2023, 8, e202300170. [Google Scholar] [CrossRef]
- He, S.; Zhang, T.; Wang, Y.; Yuan, W.; Li, L.; Li, J.; Yang, Y.; Wu, D.; Xu, Y. Isofraxidin attenuates dextran sulfate sodium-induced ulcerative colitis through inhibiting pyroptosis by upregulating Nrf2 and reducing reactive oxidative species. International Immunopharmacology 2024, 128, 111570. [Google Scholar] [CrossRef]
- Manni, M.; Berkeley, M.R.; Seppey, M.; Simão, F.A.; Zdobnov, E.M. BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Molecular Biology and Evolution 2021, 38, 4647–4654. [Google Scholar] [CrossRef]
- Li, X.; Yu, S.; Cheng, Z.; Chang, X.; Yun, Y.; Jiang, M.; Chen, X.; Wen, X.; Li, H.; Zhu, W. Origin and evolution of the triploid cultivated banana genome. Nature genetics 2024, 56, 136–142. [Google Scholar] [CrossRef]
- Wendel, J.F.; Jackson, S.A.; Meyers, B.C.; Wing, R.A. Evolution of plant genome architecture. Genome Biology 2016, 17, 37. [Google Scholar] [CrossRef]
- Liu, H.; Wang, X.; Wang, G.; Cui, P.; Wu, S.; Ai, C.; Hu, N.; Li, A.; He, B.; Shao, X.; et al. The nearly complete genome of Ginkgo biloba illuminates gymnosperm evolution. Nature Plants 2021, 7, 748–756. [Google Scholar] [CrossRef] [PubMed]
- Niu, S.; Li, J.; Bo, W.; Yang, W.; Zuccolo, A.; Giacomello, S.; Chen, X.; Han, F.; Yang, J.; Song, Y.; et al. The Chinese pine genome and methylome unveil key features of conifer evolution. Cell 2022, 185, 1–14. [Google Scholar] [CrossRef] [PubMed]
- Albert, V.A.; Barbazuk, W.B.; Depamphilis, C.W.; Der, J.P.; Leebens-Mack, J.; Ma, H.; Palmer, J.D.; Rounsley, S.; Sankoff, D.; Schuster, S.C. The Amborella genome and the evolution of flowering plants. Science 2013, 342, 1241089. [Google Scholar] [CrossRef] [PubMed]
- Hu, L.; Xu, Z.; Wang, M.; Fan, R.; Yuan, D.; Wu, B.; Wu, H.; Qin, X.; Yan, L.; Tan, L.; et al. The chromosome-scale reference genome of black pepper provides insight into piperine biosynthesis. Nature communications 2019, 10, 4702. [Google Scholar] [CrossRef]
- Chaw, S.M.; Liu, Y.C.; Wu, Y.W.; Wang, H.Y.; Lin, C.Y.I.; Wu, C.S.; Ke, H.M.; Chang, L.Y.; Hsu, C.Y.; Yang, H.T. Stout camphor tree genome fills gaps in understanding of flowering plant genome evolution. Nature plants 2019, 5, 63–73. [Google Scholar] [CrossRef]
- Wang, P.; Fan, Z.; Wei, W.; Yang, C.; Wang, Y.; Shen, X.; Yan, X.; Zhou, Z. Biosynthesis of the plant coumarin osthole by engineered Saccharomyces cerevisiae. ACS Synthetic Biology 2023, 12, 2455–2462. [Google Scholar] [CrossRef]
- Huang, X.; Tang, H.; Wei, X.; He, Y.; Hu, S.; Wu, J.; Xu, D.; Qiao, F.; Xue, J.; Zhao, Y. The gradual establishment of complex coumarin biosynthetic pathway in Apiaceae. Nature Communications 2024, 15, 6864. [Google Scholar] [CrossRef]
- Liu, Y.-y.; Li, Y.-z.; Huang, S.-q.; Zhang, H.-w.; Deng, C.; Song, X.-m.; Zhang, D.-d.; Wang, W. Genus Chloranthus: A comprehensive review of its phytochemistry, pharmacology, and uses. Arabian Journal of Chemistry 2022, 15, 104260. [Google Scholar] [CrossRef]
- Leng, L.; Xu, Z.; Hong, B.; Zhao, B.; Tian, Y.; Wang, C.; Yang, L.; Zou, Z.; Li, L.; Liu, K. Cepharanthine analogs mining and genomes of Stephania accelerate anti-coronavirus drug discovery. Nature Communications 2024, 15, 1537. [Google Scholar] [CrossRef] [PubMed]
- Ouadi, S.; Sierro, N.; Goepfert, S.; Bovet, L.; Glauser, G.; Vallat, A.; Peitsch, M.C.; Kessler, F.; Ivanov, N.V. The clove (Syzygium aromaticum) genome provides insights into the eugenol biosynthesis pathway. Communications biology 2022, 5, 684. [Google Scholar] [CrossRef] [PubMed]
- Peng, Z.; Song, L.; Chen, M.; Liu, Z.; Yuan, Z.; Wen, H.; Zhang, H.; Huang, Y.; Peng, Z.; Yang, H. Neofunctionalization of an OMT cluster dominates polymethoxyflavone biosynthesis associated with the domestication of citrus. Proceedings of the National Academy of Sciences 2024, 121, e2321615121. [Google Scholar] [CrossRef]
- Carocha, V.; Soler, M.; Hefer, C.; Cassan-Wang, H.; Fevereiro, P.; Myburg, A.A.; Paiva, J.A.; Grima-Pettenati, J. Genome-wide analysis of the lignin toolbox of E ucalyptus grandis. New Phytologist 2015, 206, 1297–1313. [Google Scholar] [CrossRef]
- Marçais, G.; Kingsford, C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics 2011, 27, 764–770. [Google Scholar] [CrossRef]
- Cheng, H.; Concepcion, G.T.; Feng, X.W.; Zhang, H.W.; Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nature Methods 2021, 18, 170–175. [Google Scholar] [CrossRef]
- Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef]
- Parra, G.; Bradnam, K.; Korf, I. CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics 2007, 23, 1061–1067. [Google Scholar] [CrossRef]
- Simão, F.A.; Waterhouse, R.M.; Ioannidis, P.; Kriventseva, E.V.; Zdobnov, E.M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 2015, 31, 3210–3212. [Google Scholar] [CrossRef]
- Zhang, X.; Zhang, S.; Zhao, Q.; Ming, R.; Tang, H. Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on Hi-C data. Nature plants 2019, 5, 833–845. [Google Scholar] [CrossRef]
- Durand, N.C.; Shamim, M.S.; Machol, I.; Rao, S.S.; Huntley, M.H.; Lander, E.S.; Aiden, E.L. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell systems 2016, 3, 95–98. [Google Scholar] [CrossRef] [PubMed]
- Chen, N.S. Using Repeat Masker to identify repetitive elements in genomic sequences. Current Protocols in Bioinformatics 2004, 5, 4–10. [Google Scholar] [CrossRef] [PubMed]
- Flynn, J.M.; Hubley, R.; Goubert, C.; Rosen, J.; Clark, A.G.; Feschotte, C.; Smit, A.F. RepeatModeler2 for automated genomic discovery of transposable element families. Proceedings of the National Academy of Sciences 2020, 117, 9451–9457. [Google Scholar] [CrossRef] [PubMed]
- Price, A.L.; Jones, N.C.; Pevzner, P.A. De novo identification of repeat families in large genomes. Bioinformatics 2005, 21, i351–i358. [Google Scholar] [CrossRef]
- Edgar, R.C.; Myers, E.W. PILER: identification and classification of genomic repeats. Bioinformatics 2005, 21, i152–i158. [Google Scholar] [CrossRef]
- Xu, Z.; Wang, H. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic acids research 2007, 35, W265–W268. [Google Scholar] [CrossRef]
- Stanke, M.; Schöffmann, O.; Morgenstern, B.; Waack, S. Gene prediction in eukaryotes with a generalized hidden Markov model that uses hints from external sources. BMC Bioinformatics 2006, 7, 62. [Google Scholar] [CrossRef]
- Li, R.; Zhu, H.; Ruan, J.; Qian, W.; Fang, X.; Shi, Z.; Li, Y.; Li, S.; Shan, G.; Kristiansen, K. De novo assembly of human genomes with massively parallel short read sequencing. Genome Research 2010, 20, 265–272. [Google Scholar] [CrossRef]
- Parra, G.; Blanco, E.; Guigó, R. Geneid in drosophila. Genome Research 2000, 10, 511–515. [Google Scholar] [CrossRef] [PubMed]
- Majoros, W.H.; Pertea, M.; Salzberg, S.L. TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics 2004, 20, 2878–2879. [Google Scholar] [CrossRef] [PubMed]
- Korf, I. Gene finding in novel genomes. BMC Bioinformatics 2004, 5, 59. [Google Scholar] [CrossRef] [PubMed]
- Birney, E.; Durbin, R. Using GeneWise in the Drosophila annotation experiment. Genome Research 2000, 10, 547–548. [Google Scholar] [CrossRef]
- Trapnell, C.; Williams, B.A.; Pertea, G.; Mortazavi, A.; Kwan, G.; Van Baren, M.J.; Salzberg, S.L.; Wold, B.J.; Pachter, L. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nature Biotechnology 2010, 28, 511–515. [Google Scholar] [CrossRef]
- Haas, B.J.; Delcher, A.L.; Mount, S.M.; Wortman, J.R.; Smith Jr, R.K.; Hannick, L.I.; Maiti, R.; Ronning, C.M.; Rusch, D.B.; Town, C.D. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic acids research 2003, 31, 5654–5666. [Google Scholar] [CrossRef]
- Haas, B.J.; Salzberg, S.L.; Zhu, W.; Pertea, M.; Allen, J.E.; Orvis, J.; White, O.; Buell, C.R.; Wortman, J.R. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome biology 2008, 9, R7. [Google Scholar] [CrossRef]
- Li, L.; Stoeckert, C.J.; Roos, D.S. OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome research 2003, 13, 2178–2189. [Google Scholar] [CrossRef]
- Han, M.V.; Thomas, G.W.; Lugo-Martinez, J.; Hahn, M.W. Estimating gene gain and loss rates in the presence of error in genome assembly and annotation using CAFE 3. Molecular biology and evolution 2013, 30, 1987–1997. [Google Scholar] [CrossRef]
- Cosentino, S.; Iwasaki, W. SonicParanoid: fast, accurate and easy orthology inference. Bioinformatics 2019, 35, 149–151. [Google Scholar] [CrossRef]
- Edgar, R.C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic acids research 2004, 32, 1792–1797. [Google Scholar] [CrossRef] [PubMed]
- Kalyaanamoorthy, S.; Minh, B.Q.; Wong, T.K.F.; Von Haeseler, A.; Jermiin, L.S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nature methods 2017, 14, 587–589. [Google Scholar] [CrossRef] [PubMed]
- Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 2014, 30, 1312–1313. [Google Scholar] [CrossRef] [PubMed]
- Nguyen, L.-T.; Schmidt, H.A.; Von Haeseler, A.; Minh, B.Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Molecular biology and evolution 2015, 32, 268–274. [Google Scholar] [CrossRef]
- Mirarab, S.; Reaz, R.; Bayzid, M.S.; Zimmermann, T.; Swenson, M.S.; Warnow, T. ASTRAL: genome-scale coalescent-based species tree estimation. Bioinformatics 2014, 30, i541–i548. [Google Scholar] [CrossRef]
- Reis, M.d.; Yang, Z. Approximate likelihood calculation on a phylogeny for Bayesian estimation of divergence times. Molecular biology and evolution 2011, 28, 2161–2172. [Google Scholar] [CrossRef]
- Yang, Z. PAML 4: phylogenetic analysis by maximum likelihood. Molecular biology and evolution 2007, 24, 1586–1591. [Google Scholar] [CrossRef]
- Chen, J.; Hao, Z.; Guang, X.; Zhao, C.; Wang, P.; Xue, L.; Zhu, Q.; Yang, L.; Sheng, Y.; Zhou, Y. Liriodendron genome sheds light on angiosperm phylogeny and species–pair differentiation. Nature plants 2019, 5, 18–25. [Google Scholar] [CrossRef]
- Wang, Y.; Tang, H.; DeBarry, J.D.; Tan, X.; Li, J.; Wang, X.; Lee, T.-h.; Jin, H.; Marler, B.; Guo, H. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic acids research 2012, 40, e49. [Google Scholar] [CrossRef]
- Potter, S.C.; Luciani, A.; Eddy, S.R.; Park, Y.; Lopez, R.; Finn, R.D. HMMER web server: 2018 update. Nucleic acids research 2018, 46, W200–W204. [Google Scholar] [CrossRef]





| Genome assembly | No. of sequences | Total length (bp) | N50 (bp) | N90 (bp) | Longest (bp) |
| Contigs | 8, 503 | 8, 660, 104, 190 | 8, 762, 697 | 1, 628, 957 | 67, 315, 735 |
| Hi-C assembly | 1, 784 | 8, 569, 334, 221 | 178, 915, 312 | 136, 432, 957 | 336, 677, 673 |
| Unplaced | 1, 739 | 85, 436, 445 | |||
| Chromosomes | 45 | 8, 483, 897, 776 | 178, 915, 312 | 136, 432, 957 | 336, 677, 673 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).