Submitted:
24 September 2024
Posted:
24 September 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Materials and DNA Extraction
2.2. Data Quality Control and Genotyping
2.3. Core SNP Selection
2.4. Simplification of SNP Markers Based on Random Forest Model
2.5. Phylogenetic Analysis and Genetic Diversity Analysis of Validation Population
2.6. Validation of Candidate SNPs Based on Random Forest Model and Construction of DNA Fingerprinting
3. Results
3.1. Screening and Simplification of Core SNPs
3.2. Population Structure of Validation Population
3.3. Effectiveness of Identifying Improved Varieties with Candidate SNPs
3.4. Population Structure of Validation Population
4. Discussion
4.1. An Application of the 51K Liquid-Phased Probes for Slash Pine
4.2. The Construction Methods of DNA Fingerprinting for the Improved Varieties of Slash Pines
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| SNP: | single nucleotide polymorphism; |
| SSR: | simple sequence repeats; |
| AFLP: | amplified fragment length polymorphism; |
| RAPD: | random amplified polymorphic DNA; |
| RFLP: | restriction fragment length polymorphism; |
| MAS: | molecular marker-assisted selection; |
| MAF: | minor allele frequency; |
| PIC: | polymorphism information content; |
| LD: | linked disequilibrium; |
| hdw: | Hardy-Weinberg; |
| Het: | The expected heterozygosity |
References
- Xianyin, D.; Xueyu, T. a. O.; Shu, D.; Qifu, L.; Jingmin, J. Estimation of Wood Basic Density in a Pinus Elliottii Stand Using Pilodyn and Resistograph Measurements. JOURNAL OF NANJING FORESTRY UNIVERSITY 2020, 44, 142. [Google Scholar] [CrossRef]
- Zhao, D.; Bullock, B.P.; Montes, C.R.; Wang, M.; Greene, D.; Sutter, L. Loblolly Pine Outperforms Slash Pine in the Southeastern United States – A Long-Term Experimental Comparison Study. Forest Ecology and Management 2019, 450, 117532. [Google Scholar] [CrossRef]
- Ding, X.; Li, Y.; Zhang, Y.; Diao, S.; Luan, Q.; Jiang, J. Genetic Analysis and Elite Tree Selection of the Main Resin Components of Slash Pine. Front. Plant Sci. 2023, 14. [Google Scholar] [CrossRef]
- Nybom, H.; Weising, K.; Rotter, B. DNA Fingerprinting in Botany: Past, Present, Future. Investig Genet 2014, 5, 1. [Google Scholar] [CrossRef]
- Wang, X.-R.; Szmidt, A.E. Molecular Markers in Population Genetics of Forest Trees. Scandinavian Journal of Forest Research 2001, 16, 199–220. [Google Scholar] [CrossRef]
- Nadeem, M.A.; Nawaz, M.A.; Shahid, M.Q.; Doğan, Y.; Comertpay, G.; Yıldız, M.; Hatipoğlu, R.; Ahmad, F.; Alsaleh, A.; Labhane, N.; et al. DNA Molecular Markers in Plant Breeding: Current Status and Recent Advancements in Genomic Selection and Genome Editing. Biotechnology & Biotechnological Equipment 2018, 32, 261–285. [Google Scholar] [CrossRef]
- Kumar, P.; Gupta, V.K.; Misra, A.K.; Modi, D.R.; Pandey, B.K. Potential of Molecular Markers in Plant Biotechnology. Plant Omics 2020, 2, 141–162. [Google Scholar] [CrossRef]
- Tsumura, Y.; Yoshimura, K.; Tomaru, N.; Ohba, K. Molecular Phytogeny of Conifers Using RFLP Analysis of PCR-Amplified Specific Chloroplast Genes. Theoret. Appl. Genetics 1995, 91, 1222–1236. [Google Scholar] [CrossRef] [PubMed]
- De La Torre, A.R.; Birol, I.; Bousquet, J.; Ingvarsson, P.K.; Jansson, S.; Jones, S.J.M.; Keeling, C.I.; MacKay, J.; Nilsson, O.; Ritland, K.; et al. Insights into Conifer Giga-Genomes. Plant Physiology 2014, 166, 1724–1732. [Google Scholar] [CrossRef]
- Nesbitt, K.A.; Potts, B.M.; Vaillancourt, R.E.; West, A.K.; Reid, J.B. Partitioning and Distribution of RAPD Variation in a Forest Tree Species, Eucalyptus Globulus (Myrtaceae). Heredity 1995, 74, 628–637. [Google Scholar] [CrossRef]
- DeVerno, L.L.; Mosseler, A. Genetic Variation in Red Pine (Pinus Resinosa) Revealed by RAPD and RAPD-RFLP Analysis. Can. J. For. Res. 1997, 27, 1316–1320. [Google Scholar] [CrossRef]
- Rawat, A.; Barthwal, S.; Ginwal, H.S. Comparative Assessment of SSR, ISSR and AFLP Markers for Characterization of Selected Genotypes of Himalayan Chir Pine (Pinus Roxburghii Sarg.) Based on Resin Yield. Silvae Genetica 2014, 63, 94–108. [Google Scholar] [CrossRef]
- Ganea, S.; Ranade, S.S.; Hall, D.; Abrahamsson, S.; García-Gil, M.R. Development and Transferability of Two Multiplexes nSSR in Scots Pine (Pinus Sylvestris L.). J. For. Res. 2015, 26, 361–368. [Google Scholar] [CrossRef]
- Guo, Z.; Yang, Q.; Huang, F.; Zheng, H.; Sang, Z.; Xu, Y.; Zhang, C.; Wu, K.; Tao, J.; Prasanna, B.M.; et al. Development of High-Resolution Multiple-SNP Arrays for Genetic Analyses and Molecular Breeding through Genotyping by Target Sequencing and Liquid Chip. Plant Communications 2021, 2, 100230. [Google Scholar] [CrossRef] [PubMed]
- Batley, J.; Edwards, D. SNP Applications in Plants. In Association Mapping in Plants; Oraguzie, N.C., Rikkerink, E.H.A., Gardiner, S.E., De Silva, H.N., Eds.; Springer New York: New York, NY, 2007; pp. 95–102. ISBN 978-0-387-35844-4. [Google Scholar]
- Mammadov, J.; Aggarwal, R.; Buyyarapu, R.; Kumpatla, S. SNP Markers and Their Impact on Plant Breeding. International Journal of Plant Genomics 2012, 2012, 728398. [Google Scholar] [CrossRef]
- Jehan, T.; Lakhanpaul, S. Single Nucleotide Polymorphism (SNP)–Methods and Applications in Plant Genetics: A Review. IJBT Vol.5(4) [October 2006] 2006.
- Pavy, N.; Gagnon, F.; Rigault, P.; Blais, S.; Deschênes, A.; Boyle, B.; Pelgas, B.; Deslauriers, M.; Clément, S.; Lavigne, P.; et al. Development of High-Density SNP Genotyping Arrays for White Spruce (Picea Glauca) and Transferability to Subtropical and Nordic Congeners. Molecular Ecology Resources 2013, 13, 324–336. [Google Scholar] [CrossRef]
- Ganal, M.W.; Wieseke, R.; Luerssen, H.; Durstewitz, G.; Graner, E.-M.; Plieske, J.; Polley, A. High-Throughput SNP Profiling of Genetic Resources in Crop Plants Using Genotyping Arrays. In Genomics of Plant Genetic Resources: Volume 1. Managing, sequencing and mining genetic resources; Tuberosa, R., Graner, A., Frison, E., Eds.; Springer Netherlands: Dordrecht, 2014; pp. 113–130. ISBN 978-94-007-7572-5. [Google Scholar]
- Montanari, S.; Deng, C.; Koot, E.; Bassil, N.V.; Zurn, J.D.; Morrison-Whittle, P.; Worthington, M.L.; Aryal, R.; Ashrafi, H.; Pradelles, J.; et al. A Multiplexed Plant–Animal SNP Array for Selective Breeding and Species Conservation Applications. G3 Genes|Genomes|Genetics 2023, 13, jkad170. [Google Scholar] [CrossRef]
- Jackson, C.; Christie, N.; Reynolds, S.M.; Marais, G.C.; Tii-kuzu, Y.; Caballero, M.; Kampman, T.; Visser, E.A.; Naidoo, S.; Kain, D.; et al. A Genome-Wide SNP Genotyping Resource for Tropical Pine Tree Species. Molecular Ecology Resources 2022, 22, 695–710. [Google Scholar] [CrossRef] [PubMed]
- Graham, N.; Telfer, E.; Frickey, T.; Slavov, G.; Ismael, A.; Klápště, J.; Dungey, H. Development and Validation of a 36K SNP Array for Radiata Pine (Pinus Radiata D.Don). Forests 2022, 13, 176. [Google Scholar] [CrossRef]
- Diao, S.; Ding, X.; Luan, Q.; Chen, Z.-Q.; Wu, H.X.; Li, X.; Zhang, Y.; Sun, J.; Wu, Y.; Zou, L.-H.; et al. Development of 51 K Liquid-Phased Probe Array for Loblolly and Slash Pines and Its Application to GWAS of Slash Pine Breeding Population. Industrial Crops and Products 2024, 216, 118777. [Google Scholar] [CrossRef]
- Wingett, S.W.; Andrews, S. FastQ Screen: A Tool for Multi-Genome Mapping and Quality Control. F1000Res 2018, 7, 1338. [Google Scholar] [CrossRef]
- Carvalho, A.; Matos, M.; Lima-Brito, J.; Guedes-Pinto, H.; Benito, C. DNA Fingerprint of F1 Interspecific Hybrids from the Triticeae Tribe Using ISSRs. Euphytica 2005, 143, 93–99. [Google Scholar] [CrossRef]
- Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.R.; Bender, D.; Maller, J.; Sklar, P.; De Bakker, P.I.W.; Daly, M.J.; et al. PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. The American Journal of Human Genetics 2007, 81, 559–575. [Google Scholar] [CrossRef] [PubMed]
- Boban, S.; Maurya, S.; Jha, Z. DNA Fingerprinting: An Overview on Genetic Diversity Studies in the Botanical Taxa of Indian Bamboo. Genet Resour Crop Evol 2022, 69, 469–498. [Google Scholar] [CrossRef]
- Danecek, P.; Auton, A.; Abecasis, G.; Albers, C.A.; Banks, E.; DePristo, M.A.; Handsaker, R.E.; Lunter, G.; Marth, G.T.; Sherry, S.T.; et al. The Variant Call Format and VCFtools. Bioinformatics 2011, 27, 2156–2158. [Google Scholar] [CrossRef]
- Danecek, P.; Bonfield, J.K.; Liddle, J.; Marshall, J.; Ohan, V.; Pollard, M.O.; Whitwham, A.; Keane, T.; McCarthy, S.A.; Davies, R.M.; et al. Twelve Years of SAMtools and BCFtools. GigaScience 2021, 10, giab008. [Google Scholar] [CrossRef] [PubMed]
- Zhang, Y.; Luan, Q.; Jiang, J.; Li, Y. Prediction and Utilization of Malondialdehyde in Exotic Pine Under Drought Stress Using Near-Infrared Spectroscopy. Front. Plant Sci. 2021, 12. [Google Scholar] [CrossRef]
- Lai, M.; Dong, L.; Su, R.; Zhang, L.; Jia, T.; Chen, T.; Yi, M. Needle Functional Features in Contrasting Yield Phenotypes of Slash Pine at Three Locations in Southern China. Industrial Crops and Products 2023, 206, 117613. [Google Scholar] [CrossRef]
- Jiang, G.-L. Molecular Marker-Assisted Breeding: A Plant Breeder’s Review. In Advances in Plant Breeding Strategies: Breeding, Biotechnology and Molecular Tools; Al-Khayri, J.M., Jain, S.M., Johnson, D.V., Eds.; Springer International Publishing: Cham, 2015; pp. 431–472. ISBN 978-3-319-22521-0. [Google Scholar]
- Prunier, J.; Verta, J.-P.; MacKay, J.J. Conifer Genomics and Adaptation: At the Crossroads of Genetic Diversity and Genome Function. New Phytologist 2016, 209, 44–62. [Google Scholar] [CrossRef]
- Rodríguez, S.M.; Ordás, R.J.; Alvarez, J.M. Conifer Biotechnology: An Overview. Forests 2022, 13, 1061. [Google Scholar] [CrossRef]
- Singh, N.; Choudhury, D.R.; Singh, A.K.; Kumar, S.; Srinivasan, K.; Tyagi, R.K.; Singh, N.K.; Singh, R. Comparison of SSR and SNP Markers in Estimation of Genetic Diversity and Population Structure of Indian Rice Varieties. PLOS ONE 2013, 8, e84136. [Google Scholar] [CrossRef]
- Liu, C. SSR Fingerprinting of Poplar Clones and Construction of Core Germplasm Bank. Master’s Thesis, Central South University of forestry and technology, 2022.
- Shen, Z. Evaluation and Selection of Xanthoceras Sorbifolia Germplasm Resources Based on Morphological Characteristics and SSR Molecular Markers. PhD Thesis, Beijing Forestry University, 2017.
- Lu, Y. Genetic Diversity Evaluation and Molecular Identification of Clones of Sophora Japonica. PhD Thesis, Chinese Academy of Forestry, 2019.
- Zhang, L. Genetic Variation Analysis of the plus Tree Population of Pinus Lumenensis and Construction of Core Germplasm. PhD Thesis, Beijing Forestry University, 2020.
- Wang, Y.; Lv, H.; Xiang, X.; Yang, A.; Feng, Q.; Dai, P.; Li, Y.; Jiang, X.; Liu, G.; Zhang, X. Construction of a SNP Fingerprinting Database and Population Genetic Analysis of Cigar Tobacco Germplasm Resources in China. Front. Plant Sci. 2021, 12. [Google Scholar] [CrossRef]
- Yang, Y.; Lyu, M.; Liu, J.; Wu, J.; Wang, Q.; Xie, T.; Li, H.; Chen, R.; Sun, D.; Yang, Y.; et al. Construction of an SNP Fingerprinting Database and Population Genetic Analysis of 329 Cauliflower Cultivars. BMC Plant Biol 2022, 22, 522. [Google Scholar] [CrossRef] [PubMed]
- Lin, P.; Wang, K.; Yao, X.; Ren, H. Construction of Molecular ID Cards for Main Camellia Oleifera Germplasm Resources Based on Transcriptome SNPs. China Agricultural Science 2023, 56, 217–235. [Google Scholar]
- Fan, X.; Yu, W.; Cai, C.; Lin, Y.; Wang, Z.; Fang, W.; Zhang, J.; Ye, N. Construction of Molecular ID Card for Tea Variety Resources Using SNP Markers. China Agricultural Science 2021, 54, 1751–1772. [Google Scholar]
- Zhang, L.; Liu, Z.; Ren, T.; Liu, D.; Ma, Z.; Tong, L.; Zhang, C.; Zhou, T.; Zhang, X.; Li, S. Identification of Seed Maize Fields With High Spatial Resolution and Multiple Spectral Remote Sensing Using Random Forest Classifier. Remote Sensing 2020, 12, 362. [Google Scholar] [CrossRef]
- Pankaja, K.; Suma, V. Plant Leaf Recognition and Classification Based on the Whale Optimization Algorithm (WOA) and Random Forest (RF). J. Inst. Eng. India Ser. B 2020, 101, 597–607. [Google Scholar] [CrossRef]
- Qiu; Shen, B.; Li, T.; Guo, J.; Wang, J.; Sun L.; Chen X.; Hu S. Fragrant Osmanthus Varieties Identification Method Based on Random Forest Algorithm and SRAP Molecular Markers. Forestry Science 2018, 54, 32–45.
- Zhang, M.; He, Z.; Ma, J.; Sang, Z.; Zhu, Z.; Zhang, D.; Ma, L.; Chen, F. Genetic Relationship Analysis and Molecular Identification of Magnolia japonica Varieties Based on SSR and SRAP Markers. Journal of Beijing Forestry University 2019, 41, 69–80. [Google Scholar] [CrossRef]
- Zhang, Y.; Wang Establishment of Alfalfa Forage Variety Database Based on Terahertz Technology and Discussion on Color Mechanism. Master Thesis, 10.27643/d.cnki.gsybu.2021.001585, 2021.
- Guo, M.; Gu, W.; Xie, X.; Mao, Y. ; Chen; Xiong Variety Identification Method Based on Single Nucleotide Polymorphism Screening. Computer Application 2024, 44, 369–373. [Google Scholar]
- Everingham, Y.; Sexton, J.; Skocaj, D.; Inman-Bamber, G. Accurate Prediction of Sugarcane Yield Using a Random Forest Algorithm. Agron. Sustain. Dev. 2016, 36, 27. [Google Scholar] [CrossRef]
- Prasad, N.R.; Patel, N.R.; Danodia, A. Crop Yield Prediction in Cotton for Regional Level Using Random Forest Approach. Spat. Inf. Res. 2021, 29, 195–206. [Google Scholar] [CrossRef]






| Screening Step | The number of remaining SNPs |
|---|---|
| Raw data | 1,83,849 |
| MAF>0.1, miss rate=0 | 64,193 |
| hdw>0.01 | 51,413 |
| ld>0.2 | 30,394 |
| PIC>0.35, | 12,841 |
| Het<0.4 | 3,502 |
| PopⅠ | PopⅡ | PopⅢ | |
|---|---|---|---|
| Pop1Ⅰ | 0.022 | 0.033 | |
| PopⅡ | 0.099 | 0.032 | |
| PopⅢ | 0.154 | 0.15 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).