Submitted:
14 May 2025
Posted:
15 May 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Leaf Sample Collection
2.2. Structuring of Samples for Varietal, Technical and Biological Library Preparation
2.3. DNA Extraction
2.4. DNA Sequencing and Sequence Data Output
2.5. Data Analysis
2.6. Raw Sequence Data Processing and Data Output
2.7. Stage-M1: Optimisation of Read Mapping Parameters
2.8. Stage-M2: Use of Optimized Read Mapping Parameters and Correction of Mapped Reads
2.9. Stage-V1: Optimisation of Variant Detection Parameters
2.10. Stage-V2: Use of Optimized Variant Detection Parameters on All Samples
2.11. Stage-V3: Paired Variant Comparison Analysis with the Unknown and a Reference Sample
2.12. Stage-V4: Subclassification of Not-Common (Unique?) Hom-SNPs Post PVC-Analysis
2.13. Stage-V5: Combining PVC Hom-SNPs Files of All Unknown & Reference Pairs and Filtering to Remove Data-Noise
2.14. Stage-V6: Variety identification Using the Combined-PVC Hom-SNPs with/Without Data-Noise
3. Results
3.1. DNA Quality
3.2. Consideration of Genome Coverage for Detecting High Confidence Homozygous SNPs
3.3. Raw Sequence Data Quality and Size as Times Genome Size of 600 Mb
3.4. Optimal Read Mapping and Variant Calling Parameters
3.5. Mapping Coverage at Optimal Read Mapping Settings
3.6. Identification of Variants
3.7. Variability in Hom-SNPs Identified Within Technical and Biological Replicate Samples
3.8. Variability in PVC-Analysis Generated Hom-SNPs Between Biological Replicates
3.9. Common and Not-Common Hom-SNPs Between Masena and Between Masena and Eureka Samples


3.10. Sub-Classification of Not-Common Hom-SNPs Between the Same and Dissimilar Varieties
3.11. Variety Identification After Removal of Data-Noise Hom-SNPs
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| Tec-rep | Technical replicate |
| Bio-rep | Biological replicate |
| PVC | Paired variant comparison |
| SNP | Single nucleotide polymorphism |
| LF | Length fraction |
| SF | Similarity fraction |
| MF | Minimum allele frequence |
| MC | Minimum read count |
| Hom-SNP | Homozygous SNP |
| Het-SNP | Heterozygous SNP |
| MFAA | Most frequent alternative allele |
References
- Lawson, C. Propagating materials and harvested materials: clarifying the scope of plant variety or breeder’s rights. J. Intellect. Prop. Law Pract. 2023;18:655-672. [CrossRef]
- Henry R.J., ed. Plant genotyping II: SNP technology. CAB International: Oxfordshire, UK. 2008, pp. 1-285.
- Henry R.J., ed. Plant genotyping: The DNA fingerprinting of plants. CABI Publishing: New York, USA. 2001, pp. 321.
- Poets, A., Silverstein K., Pardey P., Hearne S., Stevenson J. DNA Fingerprinting for Crop Varietal Identification: Fit-for-Purpose Protocols, their Costs and Analytical Implications. CGIAR Advisory Services (CAS) Secretariat 2020.
- Ko, H.L., Henry R.J. Specific 5S ribosomal RNA primers for plant species identification in admixtures 3. Plant Mol Biol Rep 1996;14:33-43. [CrossRef]
- Henry, R.J., Chopra V.L., Malik V.S., Bhat S.R., Molecular techniques for the identification of plants, in Applied plant biotechnology, Science Publishers: Enfield, NH, United Kingdom. 1999, pp. 269-283.
- Vignal, A., Milan D., SanCristobal M., Eggen A. A review on SNP and other types of molecular markers and their use in animal genetics. Genet. Sel. Evol. 2002;34:275. [CrossRef]
- Brookes, A.J. The essence of SNPs. Gene 1999;234:177-186. [CrossRef]
- Edward, K.J., Poole R.L., Barker G.L., SNP discovery in plants, in Plant genotyping II: SNP technology, Henry, R J, Editor, CAB International: Oxfordshire, UK. 2008, pp. 1-29.
- Mammadov, J., Aggarwal R., Buyyarapu R., Kumpatla S. SNP markers and their impact on plant breeding. Int J Plant Genomics 2012;2012:728398. [CrossRef]
- Sandal N.S.H.a.S.S. DNA fingerprinting and its applications in crop improvement: A review Pharma Innov. J. 2022;SP-11:792-797.
- Thomas, W.J.W., Zhang Y., Amas J.C., Cantila A.Y., Zandberg J.D., Harvie S.L., Batley J., Innovative Advances in Plant Genotyping, in Plant Genotyping: Methods and Protocols, Shavrukov, Y, Editor, Springer US: New York, NY. 2023, pp. 451-465.
- Shen, Y., Wang J., Shaw R., Yu H., Sheng X., Zhenqing Z., Li S., Gu H. Development of GBTS and KASP Panels for Genetic Diversity, Population Structure, and Fingerprinting of a Large Collection of Broccoli (Brassica oleracea L. var. italica) in China. Front Plant Sci 2021;12. [CrossRef]
- Zhang, J., Yang J., Zhang L., Luo J., Zhao H., Zhang J., Wen C. A new SNP genotyping technology Target SNP-seq and its application in genetic analysis of cucumber varieties. Sci. Rep. 2020;10:5623. [CrossRef]
- Lawrie, R.D., Massey S.E. Agrigenomic Diversity Unleashed: Current Single Nucleotide Polymorphism Genotyping Methods for the Agricultural Sciences. Applied Biosciences 2023;2:565-585.
- Pootakham, W. Genotyping by Sequencing (GBS) for Genome-Wide SNP Identification in Plants. Methods in Molecular Biology 2023;2638:1-8. [CrossRef]
- Kockum, I., Huang J., Stridh P. Overview of Genotyping Technologies and Methods. Current Protocols 2023;3:e727. [CrossRef]
- Henry, R.J. Progress in Plant Genome Sequencing. Applied Biosciences 2022;1:113-128. [CrossRef]
- Furtado, A., DNA Extraction from Vegetative Tissue for Next-Generation Sequencing, in Cereal Genomics, Henry, R J, Furtado, A, Editors, Humana Press. 2014, pp. 1-5.
- Kumar, R., Das S.P., Choudhury B.U., Kumar A., Prakash N.R., Verma R., Chakraborti M., Devi A.G., Bhattacharjee B., Das R., Das B., Devi H.L., Das B., Rawat S., Mishra V.K. Advances in genomic tools for plant breeding: harnessing DNA molecular markers, genomic selection, and genome editing. Biol. Res. 2024;57:80. [CrossRef]
- Varala, K., Swaminathan K., Li Y., Hudson M.E. Rapid Genotyping of Soybean Cultivars Using High Throughput Sequencing. PLOS ONE 2011;6:e24811. [CrossRef]
- de Boer, H., Rydmark M.O., Verstraete B., Gravendeel B. Molecular identification of plants: from sequence to species. Advanced Books 2022;1:396. [CrossRef]
- Wijesundara, U.K., Masouleh A.K., Furtado A., Dillon N.L., Henry R.J. A chromosome-level genome of mango exclusively from long-read sequence data. The Plant Genome 2024;n/a:e20441. [CrossRef]
- Al-Dossary, O., Alsubaie B., Kharabian-Masouleh A., Al-Mssallem I., Furtado A., Henry R.J. The jojoba genome reveals wide divergence of the sex chromosomes in a dioecious plant. Plant J 2021;n/a. [CrossRef]
- Healey, A., Furtado A., Cooper T., Henry R.J. Protocol: a simple method for extracting next-generation sequencing quality genomic DNA from recalcitrant plant species. Plant Methods 2014;10:21. [CrossRef]
- Tran, H.T.M., Furtado A., Vargas C.A.C., Smyth H., Slade Lee L., Henry R. SNP in the Coffea arabica genome associated with coffee quality. Tree Genet. Genom. 2018;14:72. [CrossRef]
- Kilby, N.J., Furner I.J., University of Cambridge, Cambridge, England. Another CTAB Protocol: Isolation of High Molecular Weight DNA from small quantities of arabidopsis tissue 2022 (Unpublished work).
- Rai, S. (The University of Queensland, Brisbane, Australia). High Molecular Weight (HMW) Total DNA Extraction from Non-model Species, 2023. (gih.uq.edu.au). Standard Operating Procedure: SOP204-01 2023.
- Lyrene, P.M., Vorsa N., Ballington J.R. Polyploidy and sexual polyploidization in the genus Vaccinium. Euphytica 2003;133:27-36. [CrossRef]
- Ortiz, R., Bruederle L.P., Laverty T., Vorsa N. The origin of polyploids via 2n gametes in Vaccinium section Cyanococcus. Euphytica 1991;61:241-246. [CrossRef]
- Cui, F., Ye X., Li X., Yang Y., Hu Z., Overmyer K., Brosché M., Yu H., Salojärvi J. Chromosome-level genome assembly of the diploid blueberry Vaccinium darrowii provides insights into its subtropical adaptation and cuticle synthesis. Plant Comm. 2022;3:100307. [CrossRef]
- Griffin, P.C., Robin C., Hoffmann A.A. A next-generation sequencing method for overcoming the multiple gene copy problem in polyploid phylogenetics, applied to Poa grasses. BMC Biol. 2011;9:19. [CrossRef]
- Campa, A., Ferreira J.J. Genetic diversity assessed by genotyping by sequencing (GBS) and for phenological traits in blueberry cultivars. PLoS One 2018;13:e0206361. [CrossRef]
- Swarts, K., Li H., Romero Navarro J.A., An D., Romay M.C., Hearne S., Acharya C., Glaubitz J.C., Mitchell S., Elshire R.J., Buckler E.S., Bradbury P.J. Novel Methods to Optimize Genotypic Imputation for Low-Coverage, Next-Generation Sequence Data in Crop Plants. The Plant Genome 2014;7:23. [CrossRef]
- Bradbury, P.J., Casstevens T., Jensen S.E., Johnson L.C., Miller Z.R., Monier B., Romay M.C., Song B., Buckler E.S. The Practical Haplotype Graph, a platform for storing and using pangenomes for imputation. Bioinformatics 2022;38:3698-3702. [CrossRef]
- Wang, N., Yuan Y., Wang H., Yu D., Liu Y., Zhang A., Gowda M., Nair S.K., Hao Z., Lu Y., San Vicente F., Prasanna B.M., Li X., Zhang X. Applications of genotyping-by-sequencing (GBS) in maize genetics and breeding. Sci. Rep. 2020;10:16308. [CrossRef]
- McCallum, S., Graham J., Jorgensen L., Rowland L.J., Bassil N.V., Hancock J.F., Wheeler E.J., Vining K., Poland J.A., Olmstead J.W., Buck E., Wiedow C., Jackson E., Brown A., Hackett C.A. Construction of a SNP and SSR linkage map in autotetraploid blueberry using genotyping by sequencing. Mol. Breed. 2016;36:41. [CrossRef]
- Rowland, L.J., Ogden E.L., Ehlenfeldt M.K. EST-PCR markers developed for highbush blueberry are also useful for genetic fingerprinting and relationship studies in rabbiteye blueberry. Sci. Hortic. 2010;125:779-784. [CrossRef]
- Boches, P., Bassil NV., Rowland L. Genetic diversity in the highbush blueberry evaluated with microsatellite markers. J. Am. Soc. Hort. Sci. 2006;131:674-686. [CrossRef]
- Bassil, N.V., Bunch T., Nyberg A., Hummer K., Zee F.T. Microsatellite markers distinguish Hawaiian Ohelo from other Vaccinium L. section Myrtillus species. International Society for Horticultural Science (ISHS), Leuven, Belgium, 2010.
- Carvalho, M., Matos M., Crespí A., Lopes V.R., Carnide V. Genetic Diversity and Identification of Vaccinium Species Through Microsatellite Analysis. Plants 2024;13:3488. [CrossRef]
- Miteca, H., Castro M.H., Meneses M., Prat L., Muñoz C., Hinrichsen P. Validation of a minimal panel of microsatellite markers for blueberry cultivar identification and frequency of spontaneous mutations. Chil. J. Agric. Res. 2024;84:28-42. [CrossRef]
- Ganal, M.W., Altmann T., Röder M.S. SNP identification in crop plants. Curr Opin Plant Biol 2009;12:211-217. [CrossRef]
- Elshire, R.J., Glaubitz J.C., Sun Q., Poland J.A., Kawamoto K., Buckler E.S., Mitchell S.E. A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species. PLOS ONE 2011;6:e19379. [CrossRef]
- Miller, M.R., Dunham J.P., Amores A., Cresko W.A., Johnson E.A. Rapid and cost-effective polymorphism identification and genotyping using restriction site associated DNA (RAD) markers. Genome Res. 2007;17:240-248. [CrossRef]


| Genome size | Heterozygosity | Sequence coverage | ||
| 10X | 20X | 25X | ||
| Small genome (500 Mbp) | 1% | 4,883 | 5 | 0 |
| 3% | 14,648 | 15 | 0 | |
| Large genome (2 Gbp) | 1% | 19,531 | 19 | 1 |
| 3% | 58,593 | 57 | 2 | |
![]() |
![]() |
![]() |
![]() |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).



