Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Linking Phenotypes to Protein Characteristics in 3D Structures Predicted by Alphafold

Version 1 : Received: 13 May 2023 / Approved: 16 May 2023 / Online: 16 May 2023 (05:32:22 CEST)

How to cite: Parajuli, A.; Brueggeman, R.; Wagner, S.; Warburton, M.; Peel, M.; Yu, L.; See, D.; Zhang, Z. Linking Phenotypes to Protein Characteristics in 3D Structures Predicted by Alphafold. Preprints 2023, 2023051102. https://doi.org/10.20944/preprints202305.1102.v1 Parajuli, A.; Brueggeman, R.; Wagner, S.; Warburton, M.; Peel, M.; Yu, L.; See, D.; Zhang, Z. Linking Phenotypes to Protein Characteristics in 3D Structures Predicted by Alphafold. Preprints 2023, 2023051102. https://doi.org/10.20944/preprints202305.1102.v1

Abstract

Plant breeding aims to develop elite crop varieties appropriate for various environments with higher quality and quantity of production. Researchers use quantitative trait loci (QTL) mapping and association studies to identify regions in the genome responsible for the variation of the quantitative traits of interest. However, mapped regions do not always translate to functional proteins, which makes it challenging to identify genes associated with traits of interest. The biological functions of proteins are strongly dependent on their 3D structure. Alternatively, if proteins can be directly linked with the phenotypes, the effect of mutations on phenotypic changes can be assessed. Innovation of deep learning models in biology opens new avenues of exploration. AlphaFold is an AI system that predicts the 3D structure of a protein from its amino acid sequence with near experimental accuracy and was used in this study. Point mutations with a significant influence on the 3D structure of a protein can capture the effect on phenotypes through association study, and this provides insights into the regions that are of functional importance. In the current study, 534 plants were selected based on plant vigor, and 154 missense variants that change amino acid sequences, including 5 significant hits from previous study, were included. The changes in protein 3D structure were assessed by association with the phenotype. The analysis identified five significant associations, four of which were also identified in previous study of SNPs GWAS, however, a new fifth association was also identified which was annotated as disease resistance gene in Medicago truncatula. This study helps to associate SNPs that could be missed by GWAS due to stringent Bonferroni corrected p-values by providing a more robust filter for SNPs using features from predicted protein 3D structures.

Keywords

Alfalfa; Plant Growth Vigor; Alphafold; Protein 3D Structure; Association Study

Subject

Biology and Life Sciences, Agricultural Science and Agronomy

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.