Submitted:
07 September 2024
Posted:
09 September 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Methods
2.1 Preliminary Dataset and Selection of Targets for Analysis
2.2. Molecular Dynamics Simulation
2.3. Local Physicochemical Variation Analysis
2.4. Physicochemical Property Correlation Analysis
3. Results
3.1. Local Physicochemical Variation Analysis
3.1.1. Magnitude and Spread of Variation in Tensor Space
3.1.2. Relationship between Mean Inter-Tensor Euclidean Distance, Residual Mechanical Stiffness, ΔΔG and Decrease in Residual Hydrophobicity
3.1.3. Relationships between Inter-Tensor Euclidean Distance Standard Deviations, Residual Mechanical Stiffness, ΔΔG and Decrease in Residual Hydrophobicity
3.2. Physicochemical Property Correlation Analysis
4. Discussion
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Anfinsen C. B. (1973). Principles that govern the folding of protein chains. Science (New York, N.Y.), 181(4096), 223–230. [CrossRef]
- Choudhury, A., Mohammad, T., Anjum, F., Shafie, A., Singh, I. K., Abdullaev, B., Pasupuleti, V. R., Adnan, M., Yadav, D. K., & Hassan, M. I. (2022). Comparative analysis of web-based programs for single amino acid substitutions in proteins. PloS one, 17(5), e0267084. [CrossRef]
- Goldenzweig, A., & Fleishman, S. J. (2018). Principles of Protein Stability and Their Application in Computational Design. Annual review of biochemistry, 87, 105–129. [CrossRef]
- Jankovic, B., & Polovic, N. (2017). The protein folding problem. 39, 105–111. [CrossRef]
- Thusberg, J., & Vihinen, M. (2009). Pathogenic or not? And if so, then how? Studying the effects of missense mutations using bioinformatics methods. Human mutation, 30(5), 703–714. [CrossRef]
- Torrisi, M., Pollastri, G., & Le, Q. (2020). Deep learning methods in protein structure prediction. Computational and structural biotechnology journal, 18, 1301–1310. [CrossRef]
- Bakkers, M. J. G., Ritschel, T., Tiemessen, M., Dijkman, J., Zuffianò, A. A., Yu, X., van Overveld, D., Le, L., Voorzaat, R., van Haaren, M. M., de Man, M., Tamara, S., van der Fits, L., Zahn, R., Juraszek, J., & Langedijk, J. P. M. (2024). Efficacious human metapneumovirus vaccine based on AI-guided engineering of a closed prefusion trimer. Nature communications, 15(1), 6270. [CrossRef]
- Diaz, D. J., Kulikova, A. V., Ellington, A. D., & Wilke, C. O. (2023). Using machine learning to predict the effects and consequences of mutations in proteins. Current opinion in structural biology, 78, 102518. [CrossRef]
- Li, B., Yang, Y. T., Capra, J. A., & Gerstein, M. B. (2020). Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks. PLoS computational biology, 16(11), e1008291. [CrossRef]
- Wen, T., & Altman, R. B. (2017). 3D deep convolutional neural networks for amino acid environment similarity analysis. BMC bioinformatics, 18(1), 302. [CrossRef]
- Betts, M. J., Lu, Q., Jiang, Y., Drusko, A., Wichmann, O., Utz, M., Valtierra-Gutiérrez, I. A., Schlesner, M., Jaeger, N., Jones, D. T., Pfister, S., Lichter, P., Eils, R., Siebert, R., Bork, P., Apic, G., Gavin, A. C., & Russell, R. B. (2015). Mechismo: predicting the mechanistic impact of mutations and modifications on molecular interactions. Nucleic acids research, 43(2), e10. [CrossRef]
- Nam, K., & Wolf-Watz, M. (2023). Protein dynamics: The future is bright and complicated!. Structural dynamics (Melville, N.Y.), 10(1), 014301. [CrossRef]
- Rueda, M., Ferrer-Costa, C., Meyer, T., Pérez, A., Camps, J., Hospital, A., Gelpí, J. L., & Orozco, M. (2007). A consensus view of protein dynamics. Proceedings of the National Academy of Sciences of the United States of America, 104(3), 796–801. [CrossRef]
- Stourac, J., Dubrava, J., Musil, M., Horackova, J., Damborsky, J., Mazurenko, S., & Bednar, D. (2021). FireProtDB: database of manually curated protein stability data. Nucleic acids research, 49(D1), D319–D324. [CrossRef]
- Nisthal, A., Wang, C. Y., Ary, M. L., & Mayo, S. L. (2019). Protein stability engineering insights revealed by domain-wide comprehensive mutagenesis. Proceedings of the National Academy of Sciences of the United States of America, 116(33), 16367–16377. [CrossRef]
- Eswar, N., Webb, B., Marti-Renom, M. A., Madhusudhan, M. S., Eramian, D., Shen, M. Y., Pieper, U., & Sali, A. (2006). Comparative protein structure modeling using Modeller. Current protocols in bioinformatics, Chapter 5, Unit–5.6. [CrossRef]
- Webb, B., & Sali, A. (2014). Protein structure modeling with MODELLER. Methods in molecular biology (Clifton, N.J.), 1137, 1–15. [CrossRef]
- Webb, B. (2022). Mutate model. Modeller Wiki. https://salilab.org/modeller/wiki/Mutate_model.
- Eastman, P., Swails, J., Chodera, J. D., McGibbon, R. T., Zhao, Y., Beauchamp, K. A., Wang, L. P., Simmonett, A. C., Harrigan, M. P., Stern, C. D., Wiewiora, R. P., Brooks, B. R., & Pande, V. S. (2017). OpenMM 7: Rapid development of high performance algorithms for molecular dynamics. PLoS computational biology, 13(7), e1005659.
- Eastman, P., Galvelis, R., Peláez, R. P., Abreu, C. R. A., Farr, S. E., Gallicchio, E., Gorenko, A., Henry, M. M., Hu, F., Huang, J., Krämer, A., Michel, J., Mitchell, J. A., Pande, V. S., Rodrigues, J. P., Rodriguez-Guerra, J., Simmonett, A. C., Singh, S., Swails, J., Turner, P., … Markland, T. E. (2024). OpenMM 8: Molecular Dynamics Simulation with Machine Learning Potentials. The journal of physical chemistry. B, 128(1), 109–116. [CrossRef]
- Orozco M. (2014). A theoretical view of protein dynamics. Chemical Society reviews, 43(14), 5051–5066. [CrossRef]
- Eyal, E., & Bahar, I. (2008). Toward a molecular understanding of the anisotropic response of proteins to external forces: insights from elastic network models. Biophysical journal, 94(9), 3424–3435. [CrossRef]
- Bakan, A., Meireles, L. M., & Bahar, I. (2011). ProDy: protein dynamics inferred from theory and experiments. Bioinformatics (Oxford, England), 27(11), 1575–1577. [CrossRef]
- Eisenberg, D., Schwarz, E., Komaromy, M., & Wall, R. (1984). Analysis of membrane and surface protein sequences with the hydrophobic moment plot. Journal of molecular biology, 179(1), 125–142. [CrossRef]
- Eisenberg, D.S., Weiss, R.M., Terwilliger, T.C., & Wilcox, W.R. (1982). Hydrophobic moments and protein structure. Faraday Symposia of The Chemical Society, 17, 109-120.
- Dill K. A. (1990). Dominant forces in protein folding. Biochemistry, 29(31), 7133–7155. [CrossRef]
- Pace, C. N., Scholtz, J. M., & Grimsley, G. R. (2014). Forces stabilizing proteins. FEBS letters, 588(14), 2177–2184. [CrossRef]
- Fang J. (2020). A critical review of five machine learning-based algorithms for predicting protein stability changes upon mutation. Briefings in bioinformatics, 21(4), 1285–1292. [CrossRef]
- Mumuni, A., & Mumuni, F. (2022). Data augmentation: A comprehensive survey of modern approaches. Array, 16, 100258. [CrossRef]



| Mean Intra-channel Euclidean distance | Residual mechanical stiffness | ΔΔG (kcal/mol) | Change in residual hydrophobicity |
|---|---|---|---|
| Whole-tensor (all channels) | 0.823 (p-value 6.729×10-245) | 0.324 (p-value 3.359×10-23) | 0.300 (p-value 4.009×10-22) |
| Hydrophobicity | 0.807 (p-value 5.300×10-228) | 0.318 (p-value 2.73 × 10-22) | 0.269 (p-value 7.058×10-18) |
| Aromaticity | 0.530 (p-value 1.026×10-72) | 0.167 (p-value 5.39 × 10-7) | 0.114 (p-value 3.312×10-4) |
| H-bond acceptor | 0.786 (p-value 8.047×10-209) | 0.353 (p-value 2.00 × 10-27) | 0.288 (p-value 2.232×10-20) |
| H-bond donor | 0.809 (p-value 1.387×10-230) | 0.301 (p-value 5.38 × 10-20) | 0.347 (p-value 2.319×10-29) |
| Positive ionisability | 0.125 (p-value 8.139×10-5) | 0.0131 (p-value 0.696) | 0.431 (p-value 4.290×10-46) |
| Negative ionisability | 0.251 (p-value 9.299×10-16) | 0.201 (p-value 1.53 × 10-9) | 0.224 (p-value 1.044×10-12) |
| Occupancy | 0.806 (p-value 2.365×10-227) | 0.301 (p-value 4.63 × 10-20) | 0.266 (p-value 1.738×10-17) |
| Intra-channel Euclidean distance standard deviation | Residual mechanical stiffness | ΔΔG (kcal/mol) | Change in residual hydrophobicity |
|---|---|---|---|
| Whole-tensor (all channels) | 0.624 (p-value 5.438×10-108) | 0.216 (p-value 7.265×10-11) | 0.196 (p-value 4.756×10-10) |
| Hydrophobicity | 0.642 (p-value 3.95 × 10-116) | 0.228 (p-value 5.97 × 10-12) | 0.198 (p-value 3.473×10-10) |
| Aromaticity | 0.479 (p-value 5.92 × 10-58) | 0.151 (p-value 5.99× 10-6) | 0.102 (p-value 0.00126) |
| H-bond acceptor | 0.670 (p-value 6.14 × 10-130) | 0.262 (p-value 1.88 × 10-15) | 0.240 (p-value 1.798×10-14) |
| H-bond donor | 0.701 (p-value 2.41 × 10-147) | 0.245 (p-value 1.21 × 10-13) | 0.244 (p-value 6.780×10-15) |
| Positive ionisability | 0.129 (p-value 4.43 × 10-5) | 0.0588 (p-value 0.0797) | 0.282 (p-value 1.428×10-19) |
| Negative ionisability | 0.081 (p-value 0.0109) | 0.163 (p-value 1.10× 10-6) | 0.193 (p-value 8.300×10-10) |
| Occupancy | 0.436 (p-value 2.37 × 10-47) | 0.115 (p-value 0.000588) | 0.114 (p-value 0.000321) |
| Residual mechanical stiffness | ΔΔG (kcal/mol) | Change in residual hydrophobicity | |
|---|---|---|---|
| Residual mechanical stiffness | |||
| ΔΔG (kcal/mol) | 0.461 (p-value 3.57 × 10-53) | ||
| Change in residual hydrophobicity | 0.204 (p-value 8.52 × 10-11) | 0.277 (p-value 5.62 × 10-19) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).