Preprint
Review

This version is not peer-reviewed.

Integrating Envirotyping and Phenomics for AI-Enabled Multi-Environment Genomic Prediction in Crop Breeding

Submitted:

13 April 2026

Posted:

14 April 2026

You are already at the latest version

Abstract
Genomic prediction is now routine in crop improvement, but its main bottleneck has shifted from marker density to environmental complexity. Breeders rarely need predictions for one fixed environment; they need to rank genotypes across target populations of environments that differ in weather, soils, management, and stress timing. This makes genotype-by-environment interaction a primary breeding problem rather than a secondary statistical nuisance. This review examines how genomic, environmental, and phenomic information can be integrated to improve multi-environment prediction in crop breeding pipelines. The review is narrative rather than PRISMA-style, but the literature search and selection logic were structured and explicitly defined. Peer-reviewed English-language studies were identified through structured searches of Web of Science Core Collection and Scopus, supplemented by backward citation screening, with emphasis on literature published from January 2023 to March 2026. Four conclusions emerge. First, environmental information is most useful when it is developmentally aligned, biologically interpretable, and matched to the target population of environments. Second, strong structured statistical baselines remain highly competitive, especially in moderate-sized or highly unbalanced datasets, whereas gains from more flexible machine-learning and deep-learning approaches are most evident in large, sparse, heterogeneous, and multimodal settings. Third, phenomic markers often improve prediction for complex traits, especially yield, because they capture realized crop responses not fully represented by markers alone. Fourth, practical value depends less on isolated gains in predictive accuracy than on evaluation under realistic deployment scenarios, including untested genotype and untested environment settings. Progress therefore requires transparent reporting, benchmark design, stage-aware envirotyping, multimodal integration, uncertainty reporting, and cost-aware deployment.
Keywords: 
;  ;  ;  ;  ;  ;  ;  ;  ;  
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated