Submitted:
25 August 2025
Posted:
27 August 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Genomics and Protein Structure Prediction: A Unified Frontier Enabled by Deep Learning
1.2. Brief History and Evolution of Deep Learning
2. Advantages and Challenges of Using Deep Learning in Computational Biology
2.1. Advantages of Using Deep Learning
2.2. Challenges of Using Deep Learning
3. Interconnecting Genomics and Protein Structure Prediction through Deep Learning
3.1. Role of Deep Learning in Genomic Variant Detection and Precision Medicine
3.2. Advancements in Deep Learning for Epigenetic Data Analysis
3.3. Applications of Deep Learning in Protein Structure Prediction
4. Deep Learning Models for Prediction of Protein Structure from Sequence Data
4.1. Applications of Deep Learning in Protein-Protein Interaction Prediction and Drug Discovery
4.2. Recent Developments in Deep Learning-Based Techniques for Analyzing Protein Function and Evolution
4.3. Challenges and Future Directions
5. Key Challenges and Future Directions
5.1. Emerging Areas of Research and Potential Applications
5.2. Ethical and Social Implications
6. Future Prospects and Potential Impact of Deep Learning on Biological Research and Clinical Practice
Author Contributions
Funding
Institutional Review Board Statement
Declaration of Competing Interest
Acknowledgements
Abbreviations
| LLMs | Large Language Models |
| scGNN | single-cell analysis using graph neural networks |
| CNNs | Convolutional neural networks |
| RNNs | Recurrent neural networks |
| DNA | Deoxyribonucleic Acid |
| RNA | Ribonucleic Acid. |
| ANNs | Artificial neural networks |
| gLMs | genomic language models |
| LSTMs | Long short-term memory networks |
| GAN | Generative adversarial network |
| GNNs | Graph neural networks |
| SNPs | Single-nucleotide polymorphisms |
| DGMs | Deep generative models |
| DBN | Deep Belief Networks |
| RL | Reinforcement Learning |
| VAE | Variational Autoencoders |
| DBM | Deep Boltzmann Machines |
| DQN | Deep Q- Networks |
| GCN | Graph Convolutional Networks |
| VGAE | Variational Graph Autoencoders |
| DNN | Deep Neural Networks |
| CRNN | Convolutional Recurrent Neural Networks |
| BLSTM | Bidirectional long short-term memory |
| CASP | Critical Assessment of Structure Prediction |
| PPI | Protein-protein interaction |
| PSSM | Protein-positioning specific scoring matrices |
| QA | Quality assessment |
| AI | Artificial Intelligence |
| PLMs | Protein language models |
References
- Way, G.P.; Greene, C.S.; Carninci, P.; Carvalho, B.S.; de Hoon, M.; Finley, S.D.; Gosline, S.J.C.; Lȇ Cao, K.-A.; Lee, J.S.H.; et al. A field guide to cultivating computational biology. PLoS Biology 2021, 19, e3001419. [Google Scholar] [CrossRef] [PubMed]
- Faustine, A.; Pereira, L.; Bousbiat, H.; Kulkarni, S. UNet-NILM: A Deep Neural Network for Multi-tasks Appliances State Detection and Power Estimation in NILM. Proceedings of the 5th International Workshop on Non-Intrusive Load Monitoring 2020. [Google Scholar] [CrossRef]
- Kindel, W.F.; Christensen, E.D.; Zylberberg, J. Using deep learning to reveal the neural code for images in primary visual cortex.
- Machine Learning Industry Trends Report Data Book, 2022-2030 [Internet]. Available online: https://www.grandviewresearch.com/sector-report/machine-learning-industry-data-book (accessed on 9 May 2023).
- Wang, H.; Raj, B. On the Origin of Deep Learning. arXiv 2017. [Google Scholar] [CrossRef]
- Wang, H.; Raj, B. On the origin of deep learning. arXiv 2017, arXiv:1702.07800. [Google Scholar] [CrossRef]
- Sarker, I.H. Deep Learning: A Comprehensive Overview on Techniques, Taxonomy, Applications and Research Directions. SN Computer Science 2021, 2. [Google Scholar] [CrossRef]
- Stegle, O.; Pärnamaa, T.; Parts, L. Deep learning for computational biology. Molecular Systems Biology 2016, 12, 878. [Google Scholar] [CrossRef]
- Ferrucci, D.A.; Brown, E.D.; Chu-Carroll, J.; Fan, J.Z.; Gondek, D.C.; Kalyanpur, A.; Lally, A.; Murdock, J.W.; Nyberg, E.; Prager, J.M.; et al. Building Watson: An Overview of the DeepQA Project. Ai Magazine 2010, 31, 59. [Google Scholar] [CrossRef]
- Larrañaga, P.; Calvo, B.; Santana, R.; Bielza, C.; Galdiano, J.; Inza, I.; Lozano, J.A.; Armañanzas, R.; Santafé, G.; Pérez, A.; Robles, V. Machine learning in bioinformatics. Briefings in Bioinformatics 2006, 7, 86–112. [Google Scholar] [CrossRef] [PubMed]
- Ching, T.; Himmelstein, D.; Beaulieu-Jones, B.K.; Kalinin, A.A.; Do, B.T.; Way, G.P.; Ferrero, E.; Agapow, P.; Zietz, M.; Hoffman, M.M.; Xie, W.; et al. Opportunities and obstacles for deep learning in biology and medicine. Journal of the Royal Society Interface 2018, 15, 20170387. [Google Scholar] [CrossRef]
- Litjens, G.; Sánchez, C.I.; Timofeeva, N.; Hermsen, M.; Nagtegaal, I.D.; Kovacs, I.E.; De Kaa, C.a.H.; Bult, P.; Van Ginneken, B.; Van Der Laak, J. Deep learning as a tool for increased accuracy and efficiency of histopathological diagnosis. Scientific Reports 2016, 6. [Google Scholar] [CrossRef] [PubMed]
- Mamoshina, P.; Vieira, A.; Putin, E.; Zhavoronkov, A. Applications of Deep Learning in Biomedicine. Molecular Pharmaceutics 2016, 13, 1445–1454. [Google Scholar] [CrossRef] [PubMed]
- Carracedo-Reboredo, P.; Liñares-Blanco, J.; Rodriguez-Fernandez, N.; Cedrón, F.; Novoa, F.J.; Carballal, A.; Maojo, V.; Pazos, A.; Fernandez-Lozano, C. A review on machine learning approaches and trends in drug discovery. Computational and Structural Biotechnology Journal 2021, 19, 4538–4558. [Google Scholar] [CrossRef]
- Zhao, L.; Ciallella, H.L.; Aleksunes, L.M.; Zhu, H. Advancing computer-aided drug discovery (CADD) by big data and data-driven machine learning modeling. Drug Discovery Today 2020, 25, 1624–1638. [Google Scholar] [CrossRef]
- Dinov, I.D.; Heavner, B.; Tang, M.; Glusman, G.; Chard, K.; D’Arcy, M.; Madduri, R.; Pa, J.; Spino, C.; Kesselman, C.; et al. Predictive Big Data Analytics: A Study of Parkinson’s Disease Using Large, Complex, Heterogeneous, Incongruent, Multi-Source and Incomplete Observations. PLOS ONE 2016, 11, e0157077. [Google Scholar] [CrossRef]
- Min, S.; Lee, B.; Yoon, S. Deep learning in bioinformatics. Briefings in Bioinformatics, bbw068.
- Ma, Q.; Xu, D. Deep learning shapes single-cell data analysis. Nat Rev Mol Cell Biol 2016, 23, 303–304. [Google Scholar] [CrossRef] [PubMed]
- Romão, M.C.; Castro, N.F.; Pedro, R.; Vale, T. Transferability of deep learning models in searches for new physics at colliders. Physical Review D 2020, 101. [Google Scholar] [CrossRef]
- Prabhu, S.P. Ethical challenges of machine learning and deep learning algorithms. Lancet Oncology 2019. [Google Scholar] [CrossRef] [PubMed]
- Miller, R.F.; Singh, A.; Otaki, Y.; Tamarappoo, B.; Kavanagh, P.; Parekh, T.; Hu, L.; Gransar, H.; Sharir, T.; Einstein, A.J.; et al. Mitigating bias in deep learning for diagnosis of coronary artery disease from myocardial perfusion SPECT images. European Journal of Nuclear Medicine and Molecular Imaging 2022, 50, 387–397. [Google Scholar] [CrossRef]
- Data sharing in the age of deep learning. Nature Biotechnology 2023, 41, 433. [CrossRef]
- Alzubaidi, L.; Zhang, J.; Humaidi, A.J.; Al-Dujaili, A.Q.; Duan, Y.; Al-Shamma, O.; Santamaría, J.V.G.; Fadhel, M.A.; Al-Amidie, M.; Farhan, L. Review of deep learning: concepts, CNN architectures, challenges, applications, future directions. Journal of Big Data 2021, 8. [Google Scholar] [CrossRef] [PubMed]
- Zou, J.; Huss, M.; Abid, A.; Mohammadi, P.; Torkamani, A.; Telenti, A. A primer on deep learning in genomics. Nature Genetics 2018, 51, 12–18. [Google Scholar] [CrossRef]
- Liu, J.; Li, J.; Wang, H.; Yan, J. Application of deep learning in genomics. Science China. Life Sciences 2020, 63, 1860–1878. [Google Scholar] [CrossRef]
- Jo, T.; Nho, K.; Bice, P.; Saykin, A.J. Deep learning-based identification of genetic variants: application to Alzheimer’s disease classification. Briefings in Bioinformatics 2022, 23. [Google Scholar] [CrossRef] [PubMed]
- Liu, Y.; Qu, H.-Q.; Mentch, F.D.; Qu, J.; Chang, X.; Nguyen, K.; Tian, L.; Glessner, J.; Sleiman, P.M.A.; Hakonarson, H. Application of deep learning algorithm on whole genome sequencing data uncovers structural variants associated with multiple mental disorders in African American patients. Molecular Psychiatry 2022. [Google Scholar] [CrossRef] [PubMed]
- Montesinos-López, O.A.; Montesinos-López, A.; Pérez-Rodríguez, P.; Barrón-López, J.A.; Martini, J.W.R.; Fajardo-Flores, S.B.; Gaytan-Lugo, L.S.; Santana-Mancilla, P.C.; Crossa, J. A review of deep learning applications for genomic selection. BMC Genomics 2021, 22. [Google Scholar] [CrossRef] [PubMed]
- Avsec, Ž.; Agarwal, V.; Visentin, D.; Ledsam, J.R.; Grabska-Barwinska, A.; Taylor, K.R.; Assael, Y.; Jumper, J.; Kohli, P.; Kelley, D.R. Effective gene expression prediction from sequence by integrating long-range interactions. Nature Methods 2021, 18, 1196–1203. [Google Scholar] [CrossRef]
- Treppner, M.; Binder, H.; Hess, M. Interpretable generative deep learning: an illustration with single cell gene expression data. Human Genetics 2022. [Google Scholar] [CrossRef]
- Van Dijk, S.J.; Tellam, R.L.; Morrison, J.L.; Muhlhausler, B.S.; Molloy, P.L. Recent developments on the role of epigenetics in obesity and metabolic disease. Clinical Epigenetics 2015, 7. [Google Scholar] [CrossRef] [PubMed]
- Angermueller, C.; Lee, H.J.; Reik, W.; Stegle, O. DeepCpG: accurate prediction of single-cell DNA methylation states using deep learning. Genome Biology 2017, 18. [Google Scholar]
- Yin, Q.; Wu, M.; Liu, Q.; Lv, H.; Jiang, R. DeepHistone: a deep learning approach to predicting histone modifications. BMC Genomics 2019. [Google Scholar] [CrossRef]
- Galkin, F.; Mamoshina, P.; Kochetov, K.; Sidorenko, D.; Zhavoronkov, A. DeepMAge: A Methylation Aging Clock Developed with Deep Learning. Aging and Disease 2021, 12, 1252. [Google Scholar] [CrossRef]
- Zhang, B.; Li, J.; Lü, Q. Prediction of 8-state protein secondary structures by a novel deep learning architecture. BMC Bioinformatics 2018, 19. [Google Scholar] [CrossRef]
- AlQuraishi, M. AlphaFold at CASP13. Bioinformatics 2019, 35, 4862–4865. [Google Scholar] [CrossRef]
- Du, X.; Sun, S.; Hu, C.; Yao, Y.; Yan, Y.; Zhang, Y. DeepPPI: Boosting Prediction of Protein–Protein Interactions with Deep Neural Networks. Journal of Chemical Information Modeling 2017, 57, 1499–1510. [Google Scholar] [CrossRef] [PubMed]
- Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef] [PubMed]
- Wang, S.; Mo, B.; Zhao, J.; Zhao, J.; Zhao, J. Theory-based residual neural networks: A synergy of discrete choice models and deep neural networks. Transportation Research Part B 2021. [Google Scholar] [CrossRef]
- Cramer, P. AlphaFold2 and the future of structural biology. Nature Structural & Molecular Biology 2021, 28, 704–705. [Google Scholar] [CrossRef]
- Baek, M.; Baker, D. Deep learning and protein structure modeling. Nature methods 2022, 19, 13–14. [Google Scholar] [CrossRef]
- Zeng, H.; Edwards, M.D.; Liu, G.; Gifford, D.K. Convolutional neural network architectures for predicting DNA–protein binding. Bioinformatics 2016, 32, i121–i127. [Google Scholar] [CrossRef]
- Zhang, S.; Zhou, J.; Hu, H.; Gong, H.; Chen, L.; Cheng, C.; Zeng, J. A deep learning framework for modeling structural features of RNA-binding protein targets. Nucleic acids research 2016, 44, e32–e32. [Google Scholar] [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016; pp. 770–778. [Google Scholar] [CrossRef]
- Torrisi, M.; Pollastri, G.; Le, Q. Deep learning methods in protein structure prediction. Computational and Structural Biotechnology Journal 2020, 18. [Google Scholar] [CrossRef]
- Jha, K.; Saha, S.; Singh, H. Prediction of protein-protein interaction using graph neural networks. Scientific reports 2022, 12, 8360. [Google Scholar] [CrossRef]
- Du, X.; Sun, S.; Hu, C.; Yao, Y.; Yan, Y.; Zhang, Y. DeepPPI: Boosting Prediction of Protein- Protein Interactions with Deep Neural Networks. J Chem Inf Model. 2017, 57, 1499–1510. [Google Scholar] [CrossRef]
- Hashemifar, S.; Neyshabur, B.; Khan, A.A.; Xu, J. Predicting protein-protein interactions through sequence-based deep learning. Bioinformatics 2018, 34, i802–i810. [Google Scholar] [CrossRef]
- Yao, Y.; Du, X.; Diao, Y.; Zhu, H. An integration of deep learning with feature embedding for protein–protein interaction prediction. PeerJ 2019, 2019, e7126. [Google Scholar] [CrossRef] [PubMed]
- Yang, F.; Fan, K.; Song, D.; Lin, H. Graph-based prediction of Protein-protein interactions with attributed signed graph embedding. BMC Bioinformatics 2020, 21, 1–16. [Google Scholar] [CrossRef] [PubMed]
- Dara, S.; Dhamercherla, S.; Jadav, S.S.; Babu, C.M.; Ahsan, M.J. Machine Learning in Drug Discovery: A Review. Artificial intelligence review 2022, 55, 1947–1999. [Google Scholar] [CrossRef]
- Wen, M.; Zhang, Z.; Niu, S.; Sha, H.; Yang, R.; Yun, Y.; Lu, H. Deep-Learning- Based Drug-Target Interaction Prediction. Journal of proteome research 2017, 16, 1401–1409. [Google Scholar] [CrossRef]
- Suh, D.; Lee, J.W.; Choi, S.; Lee, Y. Recent applications of deep learning methods on evolution-and contact-based protein structure prediction. International Journal of Molecular Sciences 2021, 22, 6032. [Google Scholar] [CrossRef]
- Kuang, M.; Liu, Y.; Gao, L. DLPAlign: a deep learning based progressive alignment method for multiple protein sequences. In CSBio'20: Proceedings of the Eleventh International Conference on Computational Systems-Biology and Bioinformatics; 2020; pp. 83–92. [Google Scholar]
- Gao, M.; Zhou, H.; Skolnick, J. DESTINI: A deep-learning approach to contact- driven protein structure prediction. Scientific reports 2019, 9, 1–13. [Google Scholar] [CrossRef] [PubMed]
- Pan, X.; Yang, Y.; Xia, C.Q.; Mirza, A.H.; Shen, H.B. Recent methodology progress of deep learning for RNA–protein interaction prediction. Wiley Interdisciplinary Reviews: RNA 2019, 10, e1544. [Google Scholar] [CrossRef] [PubMed]
- Sapoval, N.; Aghazadeh, A.; Nute, M.G.; et al. Current progress and open challenges for applying deep learning across the biosciences. Nat Commun 2022, 13, 1728. [Google Scholar] [CrossRef]
- Akay, A.; Hess, H. Deep Learning: Current and Emerging Applications in Medicine and Technology. IEEE Journal of Biomedical and Health Informatics 2019, 23, 906–920. [Google Scholar] [CrossRef]
- Choudhary, K.; DeCost, B.L.; Low, W.C.; Jain, A.; Tavazza, F.; Cohn, R.; Wolverton, C. Recent advances and applications of deep learning methods in materials science. Npj Computational Materials 2022, 8. [Google Scholar] [CrossRef]
- O’Connor, O.M.; Alnahhas, R.N.; Lugagne, J.; Dunlop, M.J. DeLTA 2.0: A deep learning pipeline for quantifying single-cell spatial and temporal dynamics. PLOS Computational Biology 2022, 18, e1009797. [Google Scholar] [CrossRef]
- Fernandez-Quilez, A. Deep learning in radiology: ethics of data and on the value of algorithm transparency, interpretability and explainability. AI Ethics 2023, 3, 257–265. [Google Scholar] [CrossRef]
- Zhang, Y.; Ye, T.; Xi, H.; Juhas, M.; Li, J.J. Deep Learning Driven Drug Discovery: Tackling Severe Acute Respiratory Syndrome Coronavirus 2. Frontiers in Microbiology 2021, 12. [Google Scholar] [CrossRef]
- Chen, H.; Engkvist, O.; Wang, Y.; Olivecrona, M.; Blaschke, T. The rise of deep learning in drug discovery. Drug Discovery Today 2018, 23, 1241–1250. [Google Scholar] [CrossRef] [PubMed]
- Beck, B.R.; Shin, B.; Choi, Y.; Park, S.; Kang, K. Predicting commercially available antiviral drugs that may act on the novel coronavirus (SARS-CoV-2) through a drug- target interaction deep learning model. Computational and Structural Biotechnology Journal 2020, 18, 784–790. [Google Scholar] [CrossRef] [PubMed]
- Bai, Q.; Liu, S.; Tian, Y.; Xu, T.; Banegas-Luna, A.J.; Pérez-Sánchez, H.; Yao, X. Application advances of deep learning methods for de novo drug design and molecular dynamics simulation. Wiley Interdisciplinary Reviews: Computational Molecular Science 2021, 12. [Google Scholar] [CrossRef]
- Pan, X.; Lin, X.; Cao, D.; Zeng, X.; Yu, P.S.; He, L.; Cheng, F. Deep learning for drug repurposing: methods, databases, and applications. arXiv 2022, arXiv:10.48550/arxiv.2202.05145. [Google Scholar] [CrossRef]
- Jobin, A.; Vayena, E. The global landscape of AI ethics guidelines. Nature Machine Intelligence 2019, 1, 389–399. [Google Scholar] [CrossRef]
- Naik, N.; Tokas, T.; Shetty, D.K.; Swain, D.; Shah, M.; Paul, R.; Somani, B.K. Legal and Ethical Consideration in Artificial Intelligence in Healthcare: Who Takes Responsibility? Frontiers in Surgery 2022, 9. [Google Scholar] [CrossRef] [PubMed]
- Lo Piano, S. Ethical principles in machine learning and artificial intelligence: cases from the field and possible ways forward. Humanities & Social Sciences Communications 2020. [Google Scholar] [CrossRef]
- Ching, T.; Himmelstein, D.; Beaulieu-Jones, B.K.; Kalinin, A.A.; Do, B.T.; Way, G.P.; Greene, C.S. Opportunities and obstacles for deep learning in biology and medicine. Journal of the Royal Society Interface 2018, 15, 20170387. [Google Scholar] [CrossRef]
- Sharma, P.B.; Ramteke, P. Recommendation for Selecting Smart Village in India through Opinion Mining Using Big Data Analytics. Indian Scientific Journal of Research in Engineering and Management 2023, 07. [Google Scholar] [CrossRef]
- Naik, N.; Tokas, T.; Shetty, D.K.; Swain, D.; Shah, M.; Paul, R.; Somani, B.K. Legal and Ethical Consideration in Artificial Intelligence in Healthcare: Who Takes Responsibility? Frontiers in Surgery 2022, 9. [Google Scholar] [CrossRef]
- Braun, M.; Hummel, P.; Beck, S.C.; Dabrock, P. Primer on an ethics of AI-based decision support systems in the clinic. Journal of Medical Ethics 2021, 47, e3. [Google Scholar] [CrossRef] [PubMed]
- Jarrahi, M.H.; Askay, D.A.; Eshraghi, A.; Smith, P. Artificial intelligence and knowledge management: A partnership between human and AI. Business Horizons 2022, 66, 87–99. [Google Scholar] [CrossRef]
- Terranova, N.; Venkatakrishnan, K.; Benincosa, L.J. Application of Machine Learning in Translational Medicine: Current Status and Future Opportunities. Aaps Journal 2021, 23. [Google Scholar] [CrossRef]
- Hartl, D.; De Luca, V.; Kostikova, A.; Laramie, J.M.; Kennedy, S.D.; Ferrero, E.; Siegel, R.M.; Fink, M.; Ahmed, S.; Millholland, J.; et al. Translational precision medicine: an industry perspective. Journal of Translational Medicine 2021, 19. [Google Scholar] [CrossRef] [PubMed]
- Zitnik, M.; Nguyen, F.; Wang, B.; Leskovec, J.; Goldenberg, A.; Hoffman, M.M. Machine learning for integrating data in biology and medicine: Principles, practice, and opportunities. Information Fusion 2018, 50, 71–91. [Google Scholar] [CrossRef] [PubMed]
- Sarker, I.H. Deep Learning: A Comprehensive Overview on Techniques, Taxonomy, Applications and Research Directions. SN Computer Science 2021, 2. [Google Scholar] [CrossRef]
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–44. [Google Scholar] [CrossRef]
- Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural computation 1997, 9, 1735–80. [Google Scholar] [CrossRef] [PubMed]
- Lin, E.; Lin, C.H.; Lane, H.Y. De novo peptide and protein design using generative adversarial networks: an update. Journal of Chemical Information and Modeling 2022, 62, 761–74. [Google Scholar] [CrossRef]
- Xu, J.; Wang, S. Analysis of distance-based protein structure prediction by deep learning in CASP13. Proteins: Structure, Function, and Bioinformatics 2019, 87, 1069–81. [Google Scholar] [CrossRef]
- Tan, R.K.; Liu, Y.; Xie, L. Reinforcement learning for systems pharmacology-oriented and personalized drug design. Expert Opinion on Drug Discovery 2022, 17, 849–63. [Google Scholar] [CrossRef]
- Feng, C.; Wang, W.; Han, R.; Wang, Z.; Ye, L.; Du, Z.; Wei, H.; Zhang, F.; Peng, Z.; Yang, J. Accurate de novo prediction of RNA 3D structure with transformer network. bioRxiv 2022. [Google Scholar]
- Wang, J.; Xie, X.; Shi, J.; He, W.; Chen, Q.; Chen, L.; Gu, W.; Zhou, T. Denoising autoencoder, a deep learning algorithm, aids the identification of a novel molecular signature of lung adenocarcinoma. Genomics, proteomics & bioinformatics 2020, 18, 468–80. [Google Scholar]
- Jha, K.; Saha, S.; Singh, H. Prediction of protein–protein interaction using graph neural networks. Scientific Reports 2022, 12, 8360. [Google Scholar] [CrossRef] [PubMed]
- Svensson, V.; Gayoso, A.; Yosef, N.; Pachter, L. Interpretable factor models of single-cell RNA- seq via variational autoencoders. Bioinformatics 2020, 36, 3418–21. [Google Scholar] [CrossRef] [PubMed]
- Sarkar, C.; Das, B.; Rawat, V.S.; Wahlang, J.B.; Nongpiur, A.; Tiewsoh, I.; Lyngdoh, N.M.; Das, D.; Bidarolli, M.; Sony, H.T. Artificial Intelligence and Machine Learning Technology Driven Modern Drug Discovery and Development. International Journal of Molecular Sciences 2023, 24, 2026. [Google Scholar] [CrossRef]
- Hinton, G.E.; Sabour, S.; Frosst, N. Matrix capsules with EM routing. InInternational conference on learning representations 2018. [Google Scholar]
- Das, D.; Chakrabarty, B.; Srinivasan, R.; Roy, A. Gex2SGen: Designing Drug-like Molecules from Desired Gene Expression Signatures. Journal of Chemical Information and Modeling 2023, 63, 1882–93. [Google Scholar] [CrossRef]
- Li, Z.; Gao, E.; Zhou, J.; Han, W.; Xu, X.; Gao, X. Applications of deep learning in understanding gene regulation. Cell Reports Methods 2023, 100384. [Google Scholar] [CrossRef]
- Bansal, H.; Luthra, H.; Chaurasia, A. Impact of Machine Learning Practices on Biomedical Informatics, Its Challenges and Future Benefits. In Artificial Intelligence Technologies for Computational Biology; CRC Press, 2023; pp. 273–294. [Google Scholar]
- Feng, X.; Zhang, H.; Lin, H.; Long, H. Single-cell RNA-seq data analysis based on directed graph neural network. Methods 2023, 211, 48–60. [Google Scholar] [CrossRef]
- Dou, H.; Tan, J.; Wei, H.; Wang, F.; Yang, J.; Ma, X.G.; Wang, J.; Zhou, T. Transfer inhibitory potency prediction to binary classification: A model only needs a small training set. Computer Methods and Programs in Biomedicine 2022, 215, 106633. [Google Scholar] [CrossRef]
- Hou, Z.; Yang, Y.; Ma, Z.; Wong, K.C.; Li, X. Learning the protein language of proteome-wide protein-protein binding sites via explainable ensemble deep learning. Communications Biology 2023, 6, 73. [Google Scholar] [CrossRef] [PubMed]
- Yelmen, B.; Jay, F. An Overview of Deep Generative Models in Functional and Evolutionary Genomics. Annual Review of Biomedical Data Science 2023, 6. [Google Scholar] [CrossRef]
- Chen, P.; Zheng, H. Drug-target interaction prediction based on spatial consistency constraint and graph convolutional autoencoder. BMC bioinformatics 2023, 24, 1–21. [Google Scholar] [CrossRef]
- Zheng, S.; Guo, J.; Langendijk, J.A.; Both, S.; Veldhuis, R.N.; Oudkerk, M.; van Ooijen, P.M.; Wijsman, R.; Sijtsema, N.M. Survival prediction for stage I-IIIA non-small cell lung cancer using deep learning. Radiotherapy and Oncology 2023, 109483. [Google Scholar] [CrossRef]
- Li, Z.; Gao, E.; Zhou, J.; Han, W.; Xu, X.; Gao, X. Applications of deep learning in understanding gene regulation. Cell Rep Methods 2023, 3, 100384. [Google Scholar] [CrossRef] [PubMed]
- Li, M.; Cai, X.; Xu, S.; Ji, H. Metapath-aggregated heterogeneous graph neural network for drug– target interaction prediction. Briefings in Bioinformatics 2023, 24, bbac578. [Google Scholar] [CrossRef] [PubMed]
- Veturi, Y.A.; Woof, W.; Lazebnik, T.; Moghul, I.; Woodward-Court, P.; Wagner, S.K.; de Guimarães, T.A.; Varela, M.D.; Liefers, B.; Patel, P.J.; et al. SynthEye: Investigating the Impact of Synthetic Data on Artificial Intelligence-assisted Gene Diagnosis of Inherited Retinal Disease. Ophthalmology Science 2023, 3, 100258. [Google Scholar] [CrossRef]
- Suhartono, D.; Majiid, M.R.; Handoyo, A.T.; Wicaksono, P.; Lucky, H. Towards a more general drug target interaction prediction model using transfer learning. Procedia Computer Science 2023, 216, 370–6. [Google Scholar] [CrossRef] [PubMed]
- Kumar, V.; Sinha, D. Synthetic attack data generation model applying generative adversarial network for intrusion detection. Computers & Security 2023, 125, 103054. [Google Scholar]
- Yang, K.; Huang, H.; Vandans, O.; Murali, A.; Tian, F.; Yap, R.H.; Dai, L. Applying deep reinforcement learning to the HP model for protein structure prediction. Physica A: Statistical Mechanics and its Applications 2023, 609, 128395. [Google Scholar] [CrossRef]
- Bhatti, U.A.; Tang, H.; Wu, G.; Marjan, S.; Hussain, A. Deep learning with graph convolutional networks: An overview and latest applications in computational intelligence. International Journal of Intelligent Systems 2023, 2023, 1–28. [Google Scholar] [CrossRef]
- Mreyoud, Y.; Song, M.; Lim, J.; Ahn, T.H. MegaD: deep learning for rapid and accurate disease status prediction of metagenomic samples. Life 2022, 12, 669. [Google Scholar] [CrossRef]
- Hamdy, R.; Maghraby, F.A.; Omar, Y.M. Convchrome: Predicting gene expression based on histone modifications using deep learning techniques. Current Bioinformatics 2022, 17, 273–83. [Google Scholar] [CrossRef]
- Cai, J.; Wang, S.; Xu, C.; Guo, W. Unsupervised deep clustering via contractive feature representation and focal loss. Pattern Recognition 2022, 123, 108386. [Google Scholar] [CrossRef]
- Stepniewska-Dziubinska, M.M.; Zielenkiewicz, P.; Siedlecki, P. Development and evaluation of a deep learning model for protein–ligand binding affinity prediction. Bioinformatics 2018, 34, 3666–74. [Google Scholar] [CrossRef]
- Jukič, M.; Bren, U. Machine learning in antibacterial drug design. Frontiers in Pharmacology 2022, 13, 1284. [Google Scholar] [CrossRef] [PubMed]
- Mohamed, T.; Sayed, S.; Salah, A.; Houssein, E.H. Long short-term memory neural networks for RNA viruses mutations prediction. Mathematical Problems in Engineering 2021, 2021, 1–9. [Google Scholar] [CrossRef]
- Sharma, P.; Dahiya, S.; Kaur, P.; Kapil, A. Computational biology: Role and scope in taming antimicrobial resistance. Indian Journal of Medical Microbiology 2023, 41, 33–8. [Google Scholar] [CrossRef] [PubMed]
- Giri, S.J.; Dutta, P.; Halani, P.; Saha, S. MultiPredGO: deep multi-modal protein function prediction by amalgamating protein structure, sequence, and interaction information. IEEE Journal of Biomedical and Health Informatics 2020, 25, 1832–8. [Google Scholar] [CrossRef]
- Zhao, J.P.; Hou, T.S.; Su, Y.; Zheng, C.H. scSSA: A clustering method for single cell RNA-seq data based on semi-supervised autoencoder. Methods 2022, 208, 66–74. [Google Scholar] [CrossRef]
- Mamoshina, P.; Vieira, A.; Putin, E.; Zhavoronkov, A. Applications of deep learning in biomedicine. Molecular pharmaceutics 2016, 13, 1445–54. [Google Scholar] [CrossRef]
- Ieremie, I.; Ewing, R.M.; Niranjan, M. TransformerGO: predicting protein–protein interactions by modelling the attention between sets of gene ontology terms. Bioinformatics 2022, 38, 2269–77. [Google Scholar] [CrossRef]
- Sapoval, N.; Aghazadeh, A.; Nute, M.G.; et al. Current progress and open challenges for applying deep learning across the biosciences. Nat Commun 2022, 13, 1728. [Google Scholar] [CrossRef]
- Feng, X.; Zhang, H.; Lin, H.; Long, H. Single-cell RNA-seq data analysis based on directed graph neural network. Methods 2023, 211, 48–60. [Google Scholar] [CrossRef]
- Yelmen, B.; Jay, F. An overview of deep generative models in functional and evolutionary genomics. Annu Rev Biomed Data Sci 2023, 6, 173–189. [Google Scholar] [CrossRef]
- Hou, Z.; Yang, Y.; Ma, Z.; Wong, K.C.; Li, X. Learning the protein language of proteome-wide protein-protein binding sites via explainable ensemble deep learning. Commun Biol 2023, 6, 73. [Google Scholar] [CrossRef]
- de Lima, L.M.; de Assis, M.C.; Soares, J.P.; Grão-Velloso, T.R.; de Barros, L.A.; Camisasca, D.R.; Krohling, R.A. On the importance of complementary data to histopathological image analysis of oral leukoplakia and carcinoma using deep neural networks. Intelligent Medicine 2023. [Google Scholar] [CrossRef]
- Gupta, A.; Zou, J. Feedback GAN for DNA optimizes protein functions. Nature Machine Intelligence 2019, 1, 105–11. [Google Scholar] [CrossRef]
- Lin, P.; Yan, Y.; Huang, S.Y. DeepHomo2. 0: improved protein–protein contact prediction of homodimers by transformer-enhanced deep learning. Briefings in Bioinformatics 2023, 24, bbac499. [Google Scholar] [CrossRef]
- Xavier, R.; de Souza, K.P.; Chateau, A.; Alves, R. Genome assembly using reinforcement learning. In Advances in Bioinformatics and Computational Biology: 12th Brazilian Symposium on Bioinformatics, BSB 2019, Fortaleza, Brazil, October 7–10, 2019, Revised Selected Papers 1; Springer International Publishing, 7 October 2020; pp. 16–28. [Google Scholar]
- Wang, J.; Ma, A.; Chang, Y.; Gong, J.; Jiang, Y.; Qi, R.; Wang, C.; Fu, H.; Ma, Q.; Xu, D. scGNN is a novel graph neural network framework for single-cell RNA-Seq analyses. Nature communications 2021, 12, 1882. [Google Scholar] [CrossRef] [PubMed]
- Lee, N.K.; Azizan, F.L.; Wong, Y.S.; Omar, N. DeepFinder: An integration of feature-based and deep learning approach for DNA motif discovery. Biotechnology & Biotechnological Equipment 2018, 32, 759–68. [Google Scholar]

| Date | Developed by | Evolution |
|---|---|---|
| 1873 | A. Bain | The earliest models of neural networks, called Neural Groupings, were introduced and were inspired by the Hebbian Learning Rule. |
| 1943 | McCulloch & Pitts | The MCP Model was introduced, which is considered the precursor to Artificial Neural Models. |
| 1949 | D. Hebb | Considered as the father of neural networks, he introduced the Hebbian Learning Rule, which formed the basis for modern neural networks. |
| 1958 | F. Rosenblatt | The first perceptron, which closely resembles modern perceptron, was introduced. |
| 1969 | Minsky and Papert | Publish Perceptron, which criticizes the perceptron and limits the potential of neural networks |
| 1974 | P. Werbos | Introduced Backpropagation |
| 1980 | T. Kohonen K. Fukushima |
Introduced Self Organizing Map Neocogitron was introduced, which served as inspiration for Convolutional Neural Networks. |
| Date | Developed by | Evolution |
| 1982 | J. Hopheld | The Hopfield Network was introduced |
| 1985 | Hilton & Sejnowski | The Hopfield Network was introduced |
| 1986 | P.Smolensky M. I. Jordan |
Introduced Harmonium, which is later known as Restricted Boltzmann Machine Defined and introduced Recurrent Neural Network |
| 1990 | Y. LeCun | LeNet was introduced, demonstrating the practical potential of deep neural networks. |
| 1997 | Schuster & Paliwal Hochreiter& Schmidhuber | Introduced Bidirectional Recurrent Neural Network Long Short-Term Memory (LSTM) networks solved the vanishing gradient problem in recurrent neural networks |
| 2006 | G. Hinton | Deep Belief Networks were introduced, along with the layer-wise pretraining technique, which marked the beginning of the current deep learning era. |
| 2009 | Salakhutdinov & Hinton | Deep Boltzmann Machines were introduced. |
| 2012 | G. Hinton | Dropout, an efficient method for training neural networks, was introduced. |
| 2012 | Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton | Convolutional neural network (CNN) for Image classification |
| 2014 | Ian Goodfellow, Yoshua Bengio, and Aaron Courville | (GAN) for image generation |
| 2020 | Deep learning continues to evolve and is used for a wider range of tasks, including self-driving cars, medical diagnosis, and financial trading etc. |
| No | Deep Learning Algorithm | Application in Computational Biology | References |
|---|---|---|---|
| 01. | Convolutional Neural Networks (CNN) |
Gene expression analysis | 78 |
| 02. | Recurrent Neural Networks (RNN) | DNA sequence analysis | 79 |
| 03. | Generative Adversarial Networks (GAN) | Synthetic biology and protein design | 80 |
| 04. | Deep Belief Networks (DBN) | Protein structure prediction | 81 |
| 05. | Reinforcement Learning (RL) | Drug discovery and optimization | 82 |
| 06. | Transformer Networks | RNA structure prediction | 83 |
| 07. | Autoencoders | Disease diagnosis and prognosis | 84 |
| 08. | Graph Neural Networks (GNN) | Protein-protein interaction prediction | 85 |
| 09. | Variational Autoencoders (VAE) | Single-cell genomics analysis | 86 |
| 10. | Deep Reinforcement Learning | Drug target identification | 87 |
| 11. | Capsule Networks | Protein structure classification | 88 |
| 12. | Adversarial Autoencoders | Gene expression imputation | 89 |
| 13. | Deep Boltzmann Machines (DBM) | Epigenetic data analysis | 90 |
| 14. | Deep Learning Algorithm | Application in Computational Biology | 91 |
| 15. | Attention Mechanism | Single-cell RNA sequencing analysis | 92 |
| 16. | Deep Q- Networks (DQN) | Drug toxicity prediction | 93 |
| 17. | Capsule Networks | Protein-protein interaction prediction | 94 |
| 18. | Deep Generative Models | DNA sequence generation | 95 |
| 19. | Graph Convolutional Networks (GCN) | Drug-target interaction prediction | 96 |
| 20. | Deep Survival Analysis | Cancer survival prediction | 97 |
| 21. | Transformer Networks | Transcriptomics analysis | 98 |
| 22. | Graph Neural Networks (GNN) | Drug repurposing | 99 |
| 23. | Adversarial Networks | Image-based phenotypic screening | 100 |
| 24. | Deep Transfer Learning | Drug response prediction | 101 |
| 25. | Generative Adversarial Networks (GAN) | Synthetic data generation | 101 |
| 26. | Deep Reinforcement Learning | Protein folding | 102 |
| 27. | Variational Graph Autoencoders (VGAE) | Disease-gene prioritization | 103 |
| 28. | Deep Neural Networks (DNN) | Metagenomic analysis | 104 |
| 29. | Convolutional Recurrent Neural Networks (CRNN) | Chromatin state prediction | 105 |
| 30. | Deep Clustering | Cell type identification | 106 |
| 31. | Deep Reinforcement Learning | Protein-ligand binding affinity prediction | 107 |
| 32. | Graph Convolutional Networks (GCN) | Drug response prediction | 108 |
| 33. | Long Short- Term Memory (LSTM) | RNA splicing prediction | 109 |
| 34. | Deep Reinforcement Learning | Antibiotic resistance prediction | 110 |
| 35. | Capsule Networks | Protein function prediction | 111 |
| 36. | Autoencoders | Single-cell epigenomics analysis | 112 |
| 37. | Deep Belief Networks (DBN) | Genetic variant classification | 113 |
| 38. | Transformer Networks | Protein-protein interaction network analysis | 114 |
| 39. | Graph Convolutional Networks (GCN) |
Drug-target interaction network analysis | 115 |
| 40. | Recurrent Neural Networks (RNN) & Genome Language Models (gLMs) | Protein secondary structure prediction, Gene co-regulation prediction | 116 |
| 41. | Deep Reinforcement Learning | Gene regulatory network inference | 117 |
| 42. | Variational Autoencoders (VAE) | Metabolomics data analysis | 118 |
| 43. | Deep Belief Networks (DBN) | Drug side effect prediction | 119 |
| 44. | Capsule Networks | Cancer subtype classification | 120 |
| 45. | Convolutional Neural Networks (CNN) | Histopathology image analysis | 121 |
| 46. | Generative Adversarial Networks (GAN) | Synthetic biology and gene synthesis | 122 |
| 47. | Transformer Networks | Protein contact prediction | 123 |
| 48. | Deep Reinforcement Learning | Genome sequence assembly | 124 |
| 49. | Graph Neural Networks (GNN) | Cell type classification in single-cell transcriptomics | 125 |
| 50. | Autoencoders | DNA motif discovery | 126 |
| Model Type | Primary Application | Key Advancement (2020–2025) | Performance Metrics | Computational Requirements | Limitations | References |
|---|---|---|---|---|---|---|
| CNNs | DNA methylation analysis | DeepCpG predicts methylation states with high accuracy (2017–2024) | AUC: 0.92 | Moderate (GPU required) | Requires large datasets, risk of overfitting | [32,79] |
| RNNs | Gene expression prediction | Improved sequential modeling, but limited by long-range dependencies | Accuracy: 85% | Moderate | Struggles with long sequences | [80,118] |
| Transformers | Protein structure prediction | AlphaFold 3 predicts protein complexes and ligand interactions (2024) | RMSD: <1Å | High (TPU/GPU clusters) | Computationally intensive | [38,117] |
| GNNs | Single-cell omics | scGNN advances cell-type interaction modeling (2024) | F1 Score: 0.89 | High | Interpretability issues | [84,119] |
| gLMs | Gene co-regulation prediction | Transformer-based gLMs predict single-cell co-regulation (2024) | AUC: 0.90 | High | Limited to specific datasets | [118] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).