Submitted:
27 June 2025
Posted:
03 July 2025
You are already at the latest version
Abstract
Keywords:
Introduction
Deep Learning in Genomics
Key Applications and Advancements:
Variant Calling and Annotation
Gene Expression Analysis

Epigenomics

Single-Cell Genomics
Functional Genomics
AI in Drug Discovery
Key Applications and Advancements:
Target Identification and Validation
Lead Discovery and Optimization
- Virtual Screening: AI models can rapidly screen vast chemical libraries (millions to billions of compounds) to identify potential hits that are likely to bind to a specific target. Deep learning models, such as convolutional neural networks (CNNs) and graph neural networks (GNNs), can learn complex relationships between molecular structures and their biological activities [25]. This allows for the prediction of binding affinities and the identification of promising candidates without the need for extensive experimental testing.
- De Novo Drug Design: Generative AI models, including generative adversarial networks (GANs) and variational autoencoders (VAEs), can design novel molecular structures from scratch with desired physicochemical and biological properties. Instead of searching existing chemical space, these models can explore and generate entirely new compounds tailored to a specific target or therapeutic goal [26]. This capability accelerates the discovery of innovative drugs with optimized properties, such as improved potency, selectivity, and reduced toxicity.
- Lead Optimization: AI is instrumental in optimizing the properties of initial hit compounds to transform them into viable lead candidates. This involves predicting and improving various ADMET (Absorption, Distribution, Metabolism, Excretion, and Toxicity) properties, solubility, and synthetic accessibility. Deep learning models can learn from experimental data to guide iterative design cycles, suggesting modifications to molecular structures that enhance desired properties while minimizing undesirable ones [27]. This iterative optimization process, guided by AI, significantly reduces the time and resources required to develop drug candidates with optimal profiles.
Drug Repurposing
ADMET/Toxicity Prediction
Clinical Trial Optimization
Integration of Deep Learning in Genomics and Drug Discovery
Genomics-Guided Drug Discovery
Multi-Omics Integration for Comprehensive Insights
Accelerating Preclinical and Clinical Development
Challenges and Future Directions
Data Challenges
Model Interpretability and Explainability
Generalizability and Robustness
Integration into Existing Workflows
Ethical and Societal Implications
Future Directions
- Multi-modal and Multi-scale Integration: Further advancements in integrating diverse data types (genomics, proteomics, imaging, clinical records) at multiple biological scales (molecular, cellular, tissue, organismal) will lead to more comprehensive and predictive models of disease and drug action.
- Reinforcement Learning for Drug Design: Applying reinforcement learning, where an AI agent learns to design molecules through trial and error in a simulated environment, could revolutionize de novo drug design by optimizing for complex property profiles.
- Digital Twins and Personalized Medicine: The creation of ‘digital twins’—virtual representations of individual patients based on their comprehensive genomic and health data—could enable highly personalized drug discovery and treatment strategies, allowing for in silico testing of therapies.
- Automated Experimentation and Robotics: Integrating AI with automated laboratory systems and robotics (AI-driven labs) will accelerate the pace of experimental validation and data generation, creating a virtuous cycle of data-driven discovery.
- Quantum Computing and AI: The nascent field of quantum computing holds potential for accelerating complex simulations and optimizations in drug discovery, particularly in molecular dynamics and quantum chemistry, which could be synergistically combined with AI.
Conclusion
References
- Poplin, R. , Chang, P. C., Alexander, D., Schwartz, S., Colthurst, N., Ku, A., … & DePristo, M. A. (2018). DeepVariant: highly accurate genomic variant calling with deep neural networks. Nature biotechnology, /: https, 4235. [Google Scholar]
- Avsec Ž, Agarwal V, Visentin D, Ledsam JR, Grabska-Barwinska A, Taylor KR, Assael Y, Jumper J, Kohli P, Kelley DR. Effective gene expression prediction from sequence by integrating long-range interactions. Nat Methods. 2021 Oct;18(10):1196-1203. [CrossRef] [PubMed]
- Ching, T. , Himmelstein, D. S., Beaulieu-Jones, B. K., Kalinin, A. A., Do, B. T., Way, G. P., … & Greene, C. S. (2018). Opportunities and obstacles for deep learning in biology and medicine. Journal of The Royal Society Interface, /: https, 2017. [Google Scholar]
- Vamathevan, J. , Clark, D., Czodrowski, P., Dunham, I., Ferran, E., Lee, G., … & Zhao, Z. (2019). Applications of machine learning in drug discovery and development. Nature reviews Drug discovery, /: https, 4157. [Google Scholar]
- LeCun, Y. , Bengio, Y., & Hinton, G. (2015). Deep learning. nature, /: 436-444. https, 1453. [Google Scholar]
- Angermueller, C. , Pärnamaa, T., Parts, L., & Stegle, O. (2016). Deep learning for computational biology. Molecular systems biology, /: https, 1525. [Google Scholar]
- Esteva, A. , Robicquet, A., Ramsundar, B., Kuleshov, V., DePristo, M., Chou, K., … & Dean, J. (2019). A guide to deep learning in healthcare. Nature medicine, /: https, 4159. [Google Scholar]
- Zou, J. , Huss, M., Abid, A., Mohammadi, P., Torkamani, A., & Telenti, A. (2019). A primer on deep learning in genomics. Nature genetics, /: https, 4158. [Google Scholar]
- Eraslan, G. , Avsec, Ž., Gagneur, J., & Theis, F. J. (2019). Deep learning: new computational modelling techniques for genomics. Nature Reviews Genetics, /: https, 4157. [Google Scholar]
- Cook DE, Venkat A, Yelizarov D, Pouliot Y, Chang PC, Carroll A, De La Vega FM. A deep-learning-based RNA-seq germline variant caller. Bioinform Adv. 2023 Jun 13;3(1):vbad062. [CrossRef] [PubMed]
- Nawy, T. Short reads join hands. Nat Methods, 2014. [Google Scholar] [CrossRef]
- Quang, D. , & Xie, X. (2016). DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences. Nucleic acids research, /: https, 2468. [Google Scholar]
- Kelley, D. R. , Snoek, J., & Rinn, J. L. (2016). Basset: learning the regulatory code of the accessible genome with deep convolutional neural networks. Genome research, /: https.
- Way GP, Greene CS. Extracting a biologically relevant latent space from cancer transcriptomes with variational autoencoders. Pac Symp Biocomput. 2018;23:80-91. PMCID: PMC5728678. [PubMed]
- Schnepf M, von Reutern M, Ludwig C, Jung C, Gaul U. Transcription Factor Binding Affinities and DNA Shape Readout. iScience. 2020 Oct 15;23(11):101694. [CrossRef] [PubMed]
- Singh, R. , Lanchantin, J., & Qi, Y. (2016). DeepChrome: deep-learning for predicting gene expression from histone modifications. Bioinformatics, /: https, 2455. [Google Scholar]
- Lopez, R. , Regier, J., Cole, M. B., Jordan, M. I., & Yosef, N. (2018). Deep generative modeling for single-cell transcriptomics. Nature methods, /: https, 1058. [Google Scholar]
- Du, J. , Chen, T., Gao, M., & Wang, J. (2024). Joint trajectory inference for single-cell genomics using deep learning with a mixture prior. Proceedings of the National Academy of Sciences. [CrossRef]
- Hie, B. , Bryson, B., & Berger, B. (2019). Efficient integration of heterogeneous single-cell transcriptomes using Scanorama. Nature biotechnology, /: https, 4158. [Google Scholar]
- Mulligan, V. K. (2021). Current directions in combining simulation-based macromolecular modeling approaches with deep learning. Expert Opinion on Drug Discovery, 1025. [Google Scholar] [CrossRef]
- Fleming, N. (2018). How artificial intelligence is changing drug discovery. Nature, /: S55-S57. https, 4158. [Google Scholar]
- Kant, S. , Deepika & Roy, S. Artificial intelligence in drug discovery and development: transforming challenges into opportunities. Discov. Pharm. Sci. [CrossRef]
- 2024; 85. [CrossRef]
- Zhavoronkov, A. , Ivanenkov, Y. A., Ali, A., Johnson, M., & Ulloa, L. (2019). Deep learning for drug discovery and biomarker development. Nature biotechnology, /: https, 1018. [Google Scholar]
- Gawehn, E. , Hiss, J. A., & Schneider, G. (2016). Deep learning in drug discovery. Molecular informatics, /: https, 1002. [Google Scholar]
- Kadurin, A. , Aliper, A., Kazennov, A., Mamoshina, P., Vanhaelen, Q., Khrabrov, K., & Zhavoronkov, A. (2017). The cornucopia of meaningful leads: Applying deep adversarial autoencoders for new molecule generation in oncology. Oncotarget, /: https, 1407. [Google Scholar]
- Segler, M. H. , Kogej, T., Tyrchan, C., & Waller, M. P. (2018). Generating focused molecule libraries for drug discovery with recurrent neural networks. ACS central science, /: https, 1021. [Google Scholar]
- Pushpakom, S. , Iorio, F., Eyers, P. et al. Drug repurposing: progress, challenges and recommendations. Nat Rev Drug Discov. [CrossRef]
- Mayr, A. , Klambauer, G., Unterthiner, T., & Hochreiter, S. (2016). DeepTox: toxicity prediction using deep learning. Frontiers in environmental science, 3389. [Google Scholar]
- Saeed, H. , El Naqa, I. (2022). Artificial Intelligence in Clinical Trials. In: El Naqa, I., Murphy, M.J. (eds) Machine and Deep Learning in Oncology, Medical Physics and Radiology. Springer, Cham. [CrossRef]








Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
