Submitted:
30 April 2025
Posted:
30 April 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Genotypic Data and Preprocessing
2.2. Phenotype Simulation
2.3. Genomic Prediction Models
2.4. Evaluation Metrics
2.5. Statistical Analysis and Visualization
3. Results
3.1. Prediction Accuracy

3.2. Model Stability and Consistency
3.3. Trait Association and Functional Relevance
4. Discussion
4.1. Prediction Accuracy in Genomic Selection
4.2. Stability and Consistency of Genomic Predictions
4.3. Biological Relevance of SNP Selection
4.4. Advantages of GSI-Based Selection Across MAF Ranges
4.5. Limitations and Future Directions
5. Conclusion
Acknowledgement
References
- Bernardo, R. (2014). Genomewide selection when major genes are known. Crop Science, 54(1), 68–75. [CrossRef]
- Chen, Z.-Q., Klingberg, A., Hallingbäck, H. R., & Wu, H. X. (2023). Preselection of QTL markers enhances accuracy of genomic selection in Norway spruce. BMC Genomics, 24(1), 147. [CrossRef]
- Crossa, J., Pérez-Rodríguez, P., Cuevas, J., Montesinos-López, O., Jarquín, D., de los Campos, G., Burgueño, J., González-Camacho, J. M., Pérez-Elizalde, S., Beyene, Y., Dreisigacker, S., Singh, R., Zhang, X., Gowda, M., Roorkiwal, M., Rutkoski, J., & Varshney, R. K. (2017). Genomic selection in plant breeding: Methods, models, and perspectives. Trends in Plant Science, 22(11), 961–975. [CrossRef]
- Dowle, M., & Srinivasan, A. (2019). data. table: Extension of ‘data. frame’. R package version 1.12.8. Computer Software. Retrieved from https://CRAN. R-project. org/package= data.table.
- Endelman, J. B. (2011). Ridge regression and other kernels for genomic selection with R package rrBLUP. The Plant Genome, 4(3), 250–255. [CrossRef]
- Falconer, D. S., & Mackay, T. F. C. (1996). Introduction to Quantitative Genetics (4th ed.). Longman.
- Gao, Y., Stein, M., Oshana, L., Zhao, W., Matsubara, S., & Stich, B. (2024). Exploring natural genetic variation in photosynthesis-related traits of barley in the field. Journal of Experimental Botany, 75(16), 4904–4925. [CrossRef]
- Heffner, E. L., Sorrells, M. E., & Jannink, J.-L. (2009). Genomic selection for crop improvement. Crop Science, 49(1), 1–12. [CrossRef]
- Hemstrom, W., Grummer, J. A., Luikart, G., & Christie, M. R. (2024). Next-generation data filtering in the genomics era. Nature Reviews Genetics, 25(11), 750–767. [CrossRef]
- Jannink, J.-L., Lorenz, A. J., & Iwata, H. (2010). Genomic selection in plant breeding: From theory to practice. Briefings in Functional Genomics, 9(2), 166–177. [CrossRef]
- Kolde, R. (2019). pheatmap: Pretty Heatmaps. R package version 1.0.12. https://CRAN.R-project.org/package=pheatmap.
- Kuhn, M. (2022). caret: Classification and Regression Training. R package version 6.0-94. https://CRAN.R-project.org/package=caret.
- Lipka, A. E., Tian, F., Wang, Q., Peiffer, J., Li, M., Bradbury, P. J., Gore, M. A., Buckler, E. S., & Zhang, Z. (2012). GAPIT: Genome association and prediction integrated tool. Bioinformatics, 28(18), 2397–2399. [CrossRef]
- Li, B., & Leal, S. M. (2008). Methods for detecting associations with rare variants for common diseases: Application to analysis of sequence data. American Journal of Human Genetics, 83(3), 311–321. [CrossRef]
- Mascher, M., Wicker, T., Jenkins, J., Plott, C., Lux, T., Koh, C. S., Ens, J., Gundlach, H., Boston, L. B., Tulpová, Z., Holden, S., Hernández-Pinzón, I., Scholz, U., Mayer, K. F. X., Spannagl, M., Pozniak, C. J., Sharpe, A. G., Šimková, H., Moscou, M. J., … Stein, N. (2021). Long-read sequence assembly: A technical evaluation in barley. The Plant Cell, 33(6), 1888–1906. [CrossRef]
- Meuwissen, T. H. E., Hayes, B. J., & Goddard, M. E. (2001). Prediction of total genetic value using genome-wide dense marker maps. Genetics, 157(4), 1819–1829. [CrossRef]
- Norman, A., Taylor, J., Edwards, J., & Kuchel, H. (2018). Optimising genomic selection in wheat: effect of marker density, population size and population structure on prediction accuracy. G3: Genes, Genomes, Genetics, 8(9), 2889-2899.
- Panagiotou, O. A., Evangelou, E., & Ioannidis, J. P. A. (2010). Genome-wide significant associations for variants with minor allele frequency of 5% or less—An Overview: A HuGE Review. American Journal of Epidemiology, 172(8), 869–889.
- Pongpanich, M., Sullivan, P. F., & Tzeng, J.-Y. (2010). A quality control algorithm for filtering SNPs in genome-wide association studies. Bioinformatics, 26(14), 1731–1737. [CrossRef]
- R Core Team. (2023). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. https://www.R-project.org/.
- Spindel, J., Begum, H., Akdemir, D., Virk, P., Collard, B., Redoña, E., et al. (2015). Genomic selection and association mapping in rice (Oryza sativa): effect of trait genetic architecture, training population composition, marker number and statistical model on accuracy of rice genomic selection in elite, tropical rice breeding lines. PLoS Genetics, 11(2), e1004982. [CrossRef]
- Tang, Y., Liu, X., Wang, J., Li, M., Chen, Q., Xie, F., Zhang, Z., & Zhang, G. (2016). GAPIT Version 3: An interactive analytical tool for genomic association and prediction. The Plant Genome, 9(3), 1–12. [CrossRef]
- Venables, W. N., & Ripley, B. D. (2002). Modern Applied Statistics with S (4th ed.). Springer. [CrossRef]
- Wang, Q., Tian, F., Pan, Y., Buckler, E. S., & Zhang, Z. (2014). A SUPER Powerful Method for Genome Wide Association Study. PLOS ONE, 9(9), e107684. [CrossRef]
- Wang, J., & Zhang, Z. (2021). GAPIT Version 3: Boosting Power and Accuracy for Genomic Association and Prediction. Genomics, Proteomics & Bioinformatics, 19(4), 629–640. [CrossRef]
- Wickham, H. (2016). ggplot2: Elegant Graphics for Data Analysis (2nd ed.). Springer. [CrossRef]
- Wilke, C. O. (2024). cowplot: Streamlined Plot Theme and Plot Annotations for “ggplot2” (Version 1.1.3) [Computer software].

| Model | R² ± SE | RMSE ± SE | CV_R2 |
|---|---|---|---|
| GS (rrBLUP) | 0.483 ± 0.023 | 16.988 ± 0.208 | 0.185 |
| GWAS-Assisted GS | 0.526 ± 0.028 | 17.003 ± 0.187 | 0.203 |
| GSI-GS | 0.514 ± 0.022 | 16.847 ± 0.145 | 0.165 |
| MAF-GS | 0.341 ± 0.025 | 17.408 ± 0.242 | 0.279 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).