Submitted:
30 May 2026
Posted:
02 June 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction

2. Target Identification
2.1. Understanding Biological Functions of Possible Targets
2.2. Protein Structure Prediction
3. Target Development
3.1. Drug-Target Interaction Prediction
3.2. Binding Site Prediction
3.3. Drug-Response Prediction
3.4. Data Biases and Method Selection
4. Lead Identification
4.1. Virtual Screening with AI
4.2. Molecule Generation
5. Lead Optimization
5.1. Molecule Optimization
5.2. Toxicity Prediction
5.3. Pharmacokinetic Modeling
6. Clinical Trials
6.1. Trial Site Selection
6.2. Trial Outcome Prediction
6.3. Patient-Trial Matching
6.4. Challenges of LLMs in Clinical Tasks
7. Discussion
7.1. Challenges and Future Directions
Funding
Author Contributions
Conflicts of Interest
Use of Artificial Intelligence
References
- Mohs, R.C.; Greig, N.H. Drug discovery and development: Role of basic biological research. Alzheimer’s Dement. Transl. Res. Clin. Interv. 2017, 3, 651–657. [Google Scholar] [CrossRef] [PubMed]
- Huang, K.; Fu, T.; Gao, W.; Zhao, Y.; Roohani, Y.; Leskovec, J.; Coley, C.W.; Xiao, C.; Sun, J.; Zitnik, M. Artificial intelligence foundation for therapeutic science. Nat. Chem. Biol. 2022, 18, 1033–1036. [Google Scholar] [CrossRef] [PubMed]
- Serrano, D.R.; Luciano, F.C.; Anaya, B.J.; Ongoren, B.; Kara, A.; Molina, G.; Ramirez, B.I.; Sánchez-Guirales, S.A.; Simon, J.A.; Tomietto, G.; et al. Artificial intelligence (AI) applications in drug discovery and drug delivery: Revolutionizing personalized medicine. Pharmaceutics 2024, 16, 1328. [Google Scholar] [CrossRef] [PubMed]
- Kanakia, A.; Sale, M.; Zhao, L.; Zhou, Z. AI In Action: Redefining Drug Discovery and Development. Clin. Transl. Sci. 2025, 18, e70149. [Google Scholar] [CrossRef]
- Wagner, J.; Dahlem, A.M.; Hudson, L.D.; Terry, S.F.; Altman, R.B.; Gilliland, C.T.; DeFeo, C.; Austin, C.P. A dynamic map for learning, communicating, navigating and improving therapeutic development. Nat. Rev. Drug Discov. 2018, 17, 150–150. [Google Scholar] [CrossRef]
- Sudlow, C.; Gallacher, J.; Allen, N.; Beral, V.; Burton, P.; Danesh, J.; Downey, P.; Elliott, P.; Green, J.; Landray, M.; et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015, 12, e1001779. [Google Scholar] [CrossRef]
- Dolezalova, N.; Cairo, M.; Despotovic, A.; Booth, A.T.; Reed, A.B.; Morelli, D.; Plans, D. Development of a dynamic type 2 diabetes risk prediction tool: a uk biobank study. arXiv 2021, arXiv:2104.10108. [Google Scholar] [CrossRef]
- Dabbah, M.A.; Reed, A.B.; Booth, A.T.; Yassaee, A.; Despotovic, A.; Klasmer, B.; Binning, E.; Aral, M.; Plans, D.; Morelli, D.; et al. Machine learning approach to dynamic risk modeling of mortality in COVID-19: A UK Biobank study. Sci. Rep. 2021, 11, 16936. [Google Scholar] [CrossRef]
- Fry, A.; Littlejohns, T.J.; Sudlow, C.; Doherty, N.; Adamska, L.; Sprosen, T.; Collins, R.; Allen, N.E. Comparison of Sociodemographic and Health-Related Characteristics of UK Biobank Participants With Those of the General Population. Am. J. Epidemiol. 2017, 186, 1026–1034. [Google Scholar] [CrossRef]
- Richardson, L.; Allen, B.; Baldi, G.; Beracochea, M.; Bileschi, M.L.; Burdett, T.; Burgin, J.; Caballero-Pérez, J.; Cochrane, G.; Colwell, L.J.; et al. MGnify: the microbiome sequence data analysis resource in 2023. Nucleic Acids Res. 2023, 51, D753–D759. [Google Scholar] [CrossRef]
- McLaren, M.R.; Willis, A.D.; Callahan, B.J. Consistent and correctable bias in metagenomic sequencing experiments. elife 2019, 8, e46923. [Google Scholar] [CrossRef]
- Shoemaker, R.H. The NCI60 human tumour cell line anticancer drug screen. Nat. Rev. Cancer 2006, 6, 813–823. [Google Scholar] [CrossRef]
- Inoue, Y.; Lee, H.; Fu, T.; Kuang, R.; Luna, A. drGT: Attention-Guided Gene Assessment of Drug Response Utilizing a Drug-Cell-Gene Heterogeneous Network. arXiv 2026, arXiv:cs. [Google Scholar]
- Safikhani, Z.; El-Hachem, N.; Quevedo, R.; Smirnov, P.; Goldenberg, A.; Birkbak, N.J.; Mason, C.; Hatzis, C.; Shi, L.; Aerts, H.J.; et al. Assessment of pharmacogenomic agreement. F1000Research 2016, 5, 825. [Google Scholar] [CrossRef] [PubMed]
- Yang, W.; Soares, J.; Greninger, P.; Edelman, E.J.; Lightfoot, H.; Forbes, S.; Bindal, N.; Beare, D.; Smith, J.A.; Thompson, I.R.; et al. Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. Nucleic Acids Res. 2012, 41, D955–D961. [Google Scholar] [CrossRef] [PubMed]
- Ghandi, M.; Huang, F.W.; Jané-Valbuena, J.; Kryukov, G.V.; Lo, C.C.; McDonald, E.R., III; Barretina, J.; Gelfand, E.T.; Bielski, C.M.; Li, H.; et al. Next-generation characterization of the cancer cell line encyclopedia. Nature 2019, 569, 503–508. [Google Scholar] [CrossRef]
- Reinhold, W.C.; Wilson, K.; Elloumi, F.; Bradwell, K.R.; Ceribelli, M.; Varma, S.; Wang, Y.; Duveau, D.; Menon, N.; Trepel, J.; et al. CellMinerCDB: NCATS Is a Web-Based Portal Integrating Public Cancer Cell Line Databases for Pharmacogenomic Explorations. Cancer Res. 2023, 83, 1941–1952. [Google Scholar] [CrossRef] [PubMed]
- Inoue, Y.; Song, T.; Fu, T. DrugAgent: Explainable Drug Repurposing Agent with Large Language Model-based Reasoning. arXiv 2024, arXiv:2408.13378. [Google Scholar]
- Huang, K.; Fu, T.; Gao, W.; Zhao, Y.; Roohani, Y.; Leskovec, J.; Coley, C.W.; Xiao, C.; Sun, J.; Zitnik, M. Therapeutics data commons: Machine learning datasets and tasks for drug discovery and development. arXiv 2021, arXiv:2102.09548. [Google Scholar] [CrossRef]
- Davis, M.I.; Hunt, J.P.; Herrgard, S.; Ciceri, P.; Wodicka, L.M.; Pallares, G.; Hocker, M.; Treiber, D.K.; Zarrinkar, P.P. Comprehensive analysis of kinase inhibitor selectivity. Nat. Biotechnol. 2011, 29, 1046–1051. [Google Scholar] [CrossRef]
- Huang, K.; Fu, T.; Glass, L.M.; Zitnik, M.; Xiao, C.; Sun, J. DeepPurpose: a deep learning library for drug–target interaction prediction. Bioinformatics 2020, 36, 5545–5547. [Google Scholar] [CrossRef]
- Ahmad, B.; Ouahada, K.; Hamam, H. Machine learning for drug-target interaction prediction: A comprehensive review of models, challenges, and computational strategies. Comput. Struct. Biotechnol. J. 2026. [Google Scholar] [CrossRef]
- Tang, J.; Szwajda, A.; Shakyawar, S.; Xu, T.; Hintsanen, P.; Wennerberg, K.; Aittokallio, T. Making sense of large-scale kinase inhibitor bioactivity data sets: a comparative and integrative analysis. J. Chem. Inf. Model. 2014, 54, 735–743. [Google Scholar] [CrossRef] [PubMed]
- Liu, T.; Lin, Y.; Wen, X.; Jorissen, R.N.; Gilson, M.K. BindingDB: a web-accessible database of experimentally determined protein–ligand binding affinities. Nucleic Acids Res. 2007, 35, D198–D201. [Google Scholar] [CrossRef]
- Mendez, D.; Gaulton, A.; Bento, A.P.; Chambers, J.; De Veij, M.; Félix, E.; Magariños, M.P.; Mosquera, J.F.; Mutowo, P.; Nowotka, M.; et al. ChEMBL: towards direct deposition of bioassay data. Nucleic Acids Res. 2019, 47, D930–D940. [Google Scholar] [CrossRef]
- Mayr, A.; Klambauer, G.; Unterthiner, T.; Steijaert, M.; Wegner, J.K.; Ceulemans, H.; Clevert, D.A.; Hochreiter, S. Large-scale comparison of machine learning methods for drug target prediction on ChEMBL. Chem. Sci. 2018, 9, 5441–5451. [Google Scholar] [CrossRef]
- Dao, Q.T.; Do, T.M.D.; Thai, Q.M.; Tran, P.T.; Ngo, S.T.; Nguyen, T.H. Identifying Potential BACE1 Inhibitors from the ChEMBL Database Using Machine Learning and Atomistic Simulation Approaches. ACS Omega 2026. [Google Scholar] [CrossRef]
- Mukherjee, S.; Ghosh, M.; Basuchowdhuri, P. Deep Graph Convolutional Network and LSTM based approach for predicting drug-target binding affinity. arXiv 2022, arXiv:2201.06872. [Google Scholar] [CrossRef]
- Kretschmer, F.; Seipp, J.; Ludwig, M.; Klau, G.W.; Böcker, S. Coverage bias in small molecule machine learning. Nat. Commun. 2025, 16, 554. [Google Scholar] [CrossRef]
- Son, H.; Lee, S.; Kim, J.; Park, H.; Hwang, M.H.; Yi, G.S. BASE: A web service for providing compound-protein binding affinity prediction datasets with reduced similarity bias. BMC Bioinform. 2024, 25, 340. [Google Scholar] [CrossRef] [PubMed]
- Tanoli, Z.; Alam, Z.; Vähä-Koskela, M.; Ravikumar, B.; Malyutina, A.; Jaiswal, A.; Tang, J.; Wennerberg, K.; Aittokallio, T. Drug Target Commons 2.0: a community platform for systematic analysis of drug–target interaction profiles. Database 2018, bay083. [Google Scholar] [CrossRef]
- Buniello, A.; Suveges, D.; Cruz-Castillo, C.; Llinares, M.B.; Cornu, H.; Lopez, I.; Tsukanov, K.; Roldán-Romero, J.M.; Mehta, C.; Fumis, L.; et al. Open Targets Platform: facilitating therapeutic hypotheses building in drug discovery. Nucleic Acids Res. 2025, 53, D1467–D1475. [Google Scholar] [CrossRef]
- Han, Y.; Klinger, K.; Rajpal, D.K.; Zhu, C.; Teeple, E. Empowering the discovery of novel target-disease associations via machine learning approaches in the open targets platform. BMC Bioinform. 2022, 23, 232. [Google Scholar] [CrossRef]
- Sterling, T.; Irwin, J.J. ZINC 15–ligand discovery for everyone. J. Chem. Inf. Model. 2015, 55, 2324–2337. [Google Scholar] [CrossRef] [PubMed]
- Liu, Y.; Kashima, H. Chemical property prediction under experimental biases. Sci. Rep. 2022, 12, 8206. [Google Scholar] [CrossRef] [PubMed]
- Wu, Z.; Ramsundar, B.; Feinberg, E.N.; Gomes, J.; Geniesse, C.; Pappu, A.S.; Leswing, K.; Pande, V. MoleculeNet: a benchmark for molecular machine learning. Chem. Sci. 2018, 9, 513–530. [Google Scholar] [CrossRef] [PubMed]
- Grzegorzewski, J.; Brandhorst, J.; Green, K.; Eleftheriadou, D.; Duport, Y.; Barthorscht, F.; Köller, A.; Ke, D.Y.J.; De Angelis, S.; König, M. PK-DB: pharmacokinetics database for individualized and stratified computational modeling. Nucleic Acids Res. 2021, 49, D1358–D1364. [Google Scholar] [CrossRef]
- Chou, W.C.; Lin, Z. Machine learning and artificial intelligence in physiologically based pharmacokinetic modeling. Toxicol. Sci. 2023, 191, 1–14. [Google Scholar] [CrossRef]
- Berman, H.M.; Westbrook, J.; Feng, Z.; Gilliland, G.; Bhat, T.N.; Weissig, H.; Shindyalov, I.N.; Bourne, P.E. The protein data bank. Nucleic Acids Res. 2000, 28, 235–242. [Google Scholar] [CrossRef]
- Ballester, P.J.; Mitchell, J.B. A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking. Bioinformatics 2010, 26, 1169–1175. [Google Scholar] [CrossRef]
- Derry, A.; Carpenter, K.A.; Altman, R.B. Training data composition affects performance of protein structure analysis algorithms. In Proceedings of the PACIFIC SYMPOSIUM ON BIOCOMPUTING 2022, 2021; World Scientific; pp. 10–21. [Google Scholar]
- Wang, R.; Fang, X.; Lu, Y.; Yang, C.Y.; Wang, S. The PDBbind database: methodologies and updates. J. Med. Chem. 2005, 48, 4111–4119. [Google Scholar] [CrossRef]
- Mirdita, M.; Von Den Driesch, L.; Galiez, C.; Martin, M.J.; Söding, J.; Steinegger, M. Uniclust databases of clustered and deeply annotated protein sequences and alignments. Nucleic Acids Res. 2017, 45, D170–D176. [Google Scholar] [CrossRef]
- Orlando, G.; Raimondi, D.; Vranken, W. Observation selection bias in contact prediction and its implications for structural bioinformatics. Sci. Rep. 2016, 6, 36679. [Google Scholar] [CrossRef]
- Suzek, B.E.; Huang, H.; McGarvey, P.; Mazumder, R.; Wu, C.H. UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics 2007, 23, 1282–1288. [Google Scholar] [CrossRef] [PubMed]
- Zarin, D.A.; Tse, T.; Williams, R.J.; Califf, R.M.; Ide, N.C. The ClinicalTrials. gov results database—update and key issues. N. Engl. J. Med. 2011, 364, 852–860. [Google Scholar] [CrossRef] [PubMed]
- Lambert, J. Statistics in Brief: How to Assess Bias in Clinical Studies? Clin. Orthop. Relat. Res. 2011, 469, 1794–1796. [Google Scholar] [CrossRef]
- Criscuolo, C.; Dolci, T.; Salnitri, M. Towards assessing data bias in clinical trials. In Proceedings of the VLDB Workshop on Data Management and Analytics for Medicine and Healthcare, 2022; Springer; pp. 57–74. [Google Scholar]
- Morris, J.A.; Caragine, C.; Daniloski, Z.; Domingo, J.; Barry, T.; Lu, L.; Davis, K.; Ziosi, M.; Glinos, D.A.; Hao, S.; et al. Discovery of target genes and pathways at GWAS loci by pooled single-cell CRISPR screens. Science 2023, 380, eadh7699. [Google Scholar] [CrossRef]
- You, Y.; Lai, X.; Pan, Y.; Zheng, H.; Vera, J.; Liu, S.; Deng, S.; Zhang, L. Artificial intelligence in cancer target identification and drug discovery. Signal Transduct. Target. Ther. 2022, 7, 156. [Google Scholar] [CrossRef] [PubMed]
- Mitra, S.; Malik, R.; Wong, W.; Rahman, A.; Hartemink, A.J.; Pritykin, Y.; Dey, K.K.; Leslie, C.S. Single-cell multi-ome regression models identify functional and disease-associated enhancers and enable chromatin potential analysis. Nat. Genet. 2024, 56, 627–636. [Google Scholar] [CrossRef]
- Zhou, J.; Theesfeld, C.L.; Yao, K.; Chen, K.M.; Wong, A.K.; Troyanskaya, O.G. Deep learning sequence-based ab initio prediction of variant effects on expression and disease risk. Nat. Genet. 2018, 50, 1171–1179. [Google Scholar] [CrossRef]
- Hu, F.; Hu, Y.; Zhang, W.; Huang, H.; Pan, Y.; Yin, P. A multimodal protein representation framework for quantifying transferability across biochemical downstream tasks. Adv. Sci. 2023, 10, 2301223. [Google Scholar] [CrossRef]
- Ma, W.; Bi, X.; Jiang, H.; Wei, Z.; Zhang, S. Annotating protein functions via fusing multiple biological modalities. Commun. Biol. 2024, 7, 1705. [Google Scholar] [CrossRef]
- Mitrofanov, A.; Alkhnbashi, O.S.; Shmakov, S.A.; Makarova, K.S.; Koonin, E.V.; Backofen, R. CRISPRidentify: identification of CRISPR arrays using machine learning approach. Nucleic Acids Res. 2021, 49, e20–e20. [Google Scholar] [CrossRef]
- Cheng, X.; Li, Z.; Shan, R.; Li, Z.; Wang, S.; Zhao, W.; Zhang, H.; Chao, L.; Peng, J.; Fei, T.; et al. Modeling CRISPR-Cas13d on-target and off-target effects using machine learning approaches. Nat. Commun. 2023, 14, 752. [Google Scholar] [CrossRef]
- Van de Sande, B.; Lee, J.S.; Mutasa-Gottgens, E.; Naughton, B.; Bacon, W.; Manning, J.; Wang, Y.; Pollard, J.; Mendez, M.; Hill, J.; et al. Applications of single-cell RNA sequencing in drug discovery and development. Nat. Rev. Drug Discov. 2023, 22, 496–520. [Google Scholar] [CrossRef]
- Kharchenko, P.V.; Silberstein, L.; Scadden, D.T. Bayesian approach to single-cell differential expression analysis. Nat. Methods 2014, 11, 740–742. [Google Scholar] [CrossRef]
- Hou, W.; Ji, Z.; Ji, H.; Hicks, S.C. A systematic evaluation of single-cell RNA-sequencing imputation methods. Genome Biol. 2020, 21, 218. [Google Scholar] [CrossRef] [PubMed]
- Inoue, Y. scVGAE: A Novel Approach using ZINB-Based Variational Graph Autoencoder for Single-Cell RNA-Seq Imputation. arXiv 2024, arXiv:2403.08959. [Google Scholar]
- Inoue, Y.; Kulman, E.; Kuang, R. BiGCN: Leveraging Cell and Gene Similarities for Single-cell Transcriptome Imputation with Bi-Graph Convolutional Networks. bioRxiv 2024, 2024–04. [Google Scholar]
- Cheng, Y.; Ma, X.; Yuan, L.; Sun, Z.; Wang, P. Evaluating imputation methods for single-cell RNA-seq data. BMC Bioinform. 2023, 24, 302. [Google Scholar] [CrossRef]
- Marks, D.S.; Colwell, L.J.; Sheridan, R.; Hopf, T.A.; Pagnani, A.; Zecchina, R.; Sander, C. Protein 3D structure computed from evolutionary sequence variation. PLoS ONE 2011, 6, e28766. [Google Scholar] [CrossRef]
- Senior, A.W.; Evans, R.; Jumper, J.; Kirkpatrick, J.; Sifre, L.; Green, T.; Qin, C.; Žídek, A.; Nelson, A.W.; Bridgland, A.; et al. Improved protein structure prediction using potentials from deep learning. Nature 2020, 577, 706–710. [Google Scholar] [CrossRef]
- Kryshtafovych, A.; Schwede, T.; Topf, M.; Fidelis, K.; Moult, J. Critical assessment of methods of protein structure prediction (CASP)—Round XIII. Proteins Struct. Funct. Bioinform. 2019, 87, 1011–1020. [Google Scholar] [CrossRef]
- Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef]
- Abramson, J.; Adler, J.; Dunger, J.; Evans, R.; Green, T.; Pritzel, A.; Ronneberger, O.; Willmore, L.; Ballard, A.J.; Bambrick, J.; et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 2024, 630, 493–500. [Google Scholar] [CrossRef] [PubMed]
- Steinegger, M.; Mirdita, M.; Söding, J. Protein-level assembly increases protein sequence recovery from metagenomic samples manyfold. Nat. Methods 2019, 16, 603–606. [Google Scholar] [CrossRef]
- Xu, L.; Ru, X.; Song, R. Application of machine learning for drug–target interaction prediction. Front. Genet. 2021, 12, 680117. [Google Scholar] [CrossRef] [PubMed]
- Su, M.; Yang, Q.; Du, Y.; Feng, G.; Liu, Z.; Li, Y.; Wang, R. Comparative assessment of scoring functions: the CASF-2016 update. J. Chem. Inf. Model. 2018, 59, 895–913. [Google Scholar] [CrossRef]
- Li, J.; Jiang, H.; Ma, W.; Bi, X.; Chen, R.; Lu, W.; Cai, Q.; Yang, F.; Wei, Z.; Zhang, S. Geometric deep learning for protein-ligand affinity prediction with hybrid message passing strategies. IEEE J. Biomed. Health Inform. 2025. [Google Scholar] [CrossRef] [PubMed]
- Bi, X.; Zhang, S.; Ma, W.; Jiang, H.; Wei, Z. HiSIF-DTA: a hierarchical semantic information fusion framework for drug-target affinity prediction. IEEE J. Biomed. Health Inform. 2023, 29, 1579–1590. [Google Scholar] [CrossRef]
- Huang, X.; Bi, X.; Xing, N.; Ma, W.; Jiang, H.; Cai, Q.; Lu, W.; Yang, F.; Wei, Z.; Zhang, S. LightDTA: lightweight drug-target affinity prediction via random-walk network embedding and knowledge distillation. Mol. Divers. 2026, 1–24. [Google Scholar] [CrossRef]
- Li, J.; Bi, X.; Ma, W.; Jiang, H.; Liu, S.; Lu, Y.; Wei, Z.; Zhang, S. MHAN-DTA: a multiscale hybrid attention network for drug-target affinity prediction. IEEE J. Biomed. Health Inform. 2024, 29, 7910–7921. [Google Scholar] [CrossRef]
- Gider, V.; Budak, C. A physics-informed graph neural network to approximate docking-based binding affinity for DYRK2 in Alzheimer’s drug repurposing. In Scientific Reports; 2026. [Google Scholar]
- Gainza, P.; Sverrisson, F.; Monti, F.; Rodola, E.; Boscaini, D.; et al. Deciphering interaction fingerprints from protein molecular surfaces using geometric deep learning. Nat. Methods 2020, 17, 184–192. [Google Scholar] [CrossRef]
- Xu, W.; Xie, Y.; Jiang, H.; Fu, Y.; Liu, H.; Wei, Z.; Zhang, S. Geometry-Aware Protein–Protein Binding Site Prediction Using Geometric Learning and Pretraining Strategies. J. Chem. Inf. Model. 2025, 65, 12088–12098. [Google Scholar] [CrossRef] [PubMed]
- Tubiana, J.; Schneidman-Duhovny, D.; Wolfson, H.J. ScanNet: an interpretable geometric deep learning model for structure-based protein binding site prediction. Nat. Methods 2022, 19, 730–739. [Google Scholar] [CrossRef]
- Stärk, H.; Ganea, O.; Pattanaik, L.; Barzilay, R.; Jaakkola, T. Equibind: Geometric deep learning for drug binding structure prediction. In Proceedings of the International conference on machine learning. PMLR, 2022; pp. 20503–20521. [Google Scholar]
- Luna, A.; Elloumi, F.; Varma, S.; Wang, Y.; Rajapakse, V.N.; Aladjem, M.I.; Robert, J.; Sander, C.; Pommier, Y.; Reinhold, W.C. CellMiner Cross-Database (CellMinerCDB) version 1.2: Exploration of patient-derived cancer cell line pharmacogenomics. Nucleic Acids Res. 2021, 49, D1083–D1093. [Google Scholar] [CrossRef] [PubMed]
- Adam, G.; Rampášek, L.; Safikhani, Z.; Smirnov, P.; Haibe-Kains, B.; Goldenberg, A. Machine learning approaches to drug response prediction: challenges and recent progress. npj Precis. Oncol. 2020, 4, 19. [Google Scholar] [CrossRef]
- Peng, W.; Chen, T.; Dai, W. Predicting drug response based on multi-omics fusion and graph convolution. IEEE J. Biomed. Health Inform. 2021, 26, 1384–1393. [Google Scholar] [CrossRef] [PubMed]
- Liu, X.; Song, C.; Huang, F.; Fu, H.; Xiao, W.; Zhang, W. GraphCDR: a graph neural network method with contrastive learning for cancer drug response prediction. Brief. Bioinform. 2022, 23, bbab457. [Google Scholar] [CrossRef]
- Li, X.; Das, T.; Bhattarai, K.; Rajaganapathy, S.; Buchner, V.C.; Wang, Y.; Su, C.; Sun, L.; Wang, L.; Cerhan, J.R.; et al. Leveraging multi-source data to resolve inconsistency across pharmacogenomic datasets in drug sensitivity prediction. In Proceedings of the AMIA Annual Symposium Proceedings, 2025, Vol. 2024, p. 744.
- Rahman, R.; Dhruba, S.R.; Matlock, K.; De-Niz, C.; Ghosh, S.; Pal, R. Evaluating the consistency of large-scale pharmacogenomic studies. Brief. Bioinform. 2019, 20, 1734–1753. [Google Scholar] [CrossRef] [PubMed]
- Elloumi, F.; Reinhold, W.C.; Varma, S.; Wang, Y.; Kinali, M.; Arakawa, Y.; Inoue, Y.; Aladjem, M.I.; Pommier, Y.; Luna, A. CellMiner cross-database (CellMinerCDB) version 2.2 for explorations of patient-derived cancer cell line pharmacogenomics. Nucleic Acids Res. 2026, 54, D1345–D1354. [Google Scholar] [CrossRef]
- Dost, K.; Pullar-Strecker, Z.; Brydon, L.; Zhang, K.; Hafner, J.; Riddle, P.J.; Wicker, J.S. Combatting over-specialization bias in growing chemical databases. J. Cheminformatics 2023, 15, 53. [Google Scholar] [CrossRef] [PubMed]
- Nguyen, T.M.; Nguyen, T.; Tran, T. Mitigating cold-start problems in drug-target affinity prediction with interaction knowledge transferring. Brief. Bioinform. 2022, 23, bbac269. [Google Scholar] [CrossRef]
- Rajapakse, V.N.; Luna, A.; Yamade, M.; Loman, L.; Varma, S.; Sunshine, M.; Iorio, F.; Sousa, F.G.; Elloumi, F.; Aladjem, M.I.; et al. CellMinerCDB for integrative cross-database genomics and pharmacogenomics analyses of cancer cell lines. Iscience 2018, 10, 247–264. [Google Scholar] [CrossRef]
- He, D.; Liu, Q.; Wu, Y.; Xie, L. A context-aware deconfounding autoencoder for robust prediction of personalized clinical drug response from cell-line compound screening. Nat. Mach. Intell. 2022, 4, 879–892. [Google Scholar] [CrossRef]
- Jayagopal, A.; Zhang, Y.; Walsh, R.J.; Tan, T.Z.; Jeyasekharan, A.D.; Rajan, V. GANDALF: Generative AttentioN based Data Augmentation and predictive modeLing Framework for personalized cancer treatment. In Proceedings of the The Thirteenth International Conference on Learning Representations, 2025. [Google Scholar]
- Peng, W.; Liu, H.; Dai, W.; Yu, N.; Wang, J. Predicting cancer drug response using parallel heterogeneous graph convolutional networks with neighborhood interactions. Bioinformatics 2022, 38, 4546–4553. [Google Scholar] [CrossRef] [PubMed]
- Bik, E.M. Publishing negative results is good for science. 2024. [Google Scholar] [CrossRef]
- Briere, G.; Stosskopf, T.; Loire, B.; Baudot, A. Benchmarking the Impact of Data Leakage on the Performance of Knowledge Graph Embedding Models for Biomedical Link Prediction. bioRxiv 2025, 2025–01. [Google Scholar] [CrossRef]
- Asiaee, A.; Strauch, J.; Azinfar, L.; Pal, S.; Pua, H.H.; Long, J.P.; Coombes, K.R. Widespread data leakage inflates performance estimates in cancer drug response prediction. bioRxiv 2026, 2026–02. [Google Scholar]
- Papadatos, G.; Gaulton, A.; Hersey, A.; Overington, J.P. Activity, assay and target data curation and quality in the ChEMBL database. J. Comput.-Aided Mol. Des. 2015, 29, 885–896. [Google Scholar] [CrossRef]
- Kramer, C.; Kalliokoski, T.; Gedeck, P.; Vulpetti, A. The experimental uncertainty of heterogeneous public K i data. J. Med. Chem. 2012, 55, 5165–5173. [Google Scholar] [CrossRef]
- Zdrazil, B.; Felix, E.; Hunter, F.; Manners, E.J.; Blackshaw, J.; Corbett, S.; De Veij, M.; Ioannidis, H.; Lopez, D.M.; Mosquera, J.F.; et al. The ChEMBL Database in 2023: a drug discovery platform spanning multiple bioactivity data types and time periods. Nucleic Acids Res. 2024, 52, D1180–D1192. [Google Scholar] [CrossRef] [PubMed]
- Collins, G.S.; Moons, K.G.; Dhiman, P.; Riley, R.D.; Beam, A.L.; Van Calster, B.; Ghassemi, M.; Liu, X.; Reitsma, J.B.; Van Smeden, M.; et al. TRIPOD+ AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods. BMJ 2024, 385. [Google Scholar] [CrossRef]
- Gibson, A.D.; White, N.M.; Collins, G.S.; Barnett, A.G. Evidence of unreliable data and poor data provenance in clinical prediction model research and clinical practice. medRxiv 2026, 2026–02. [Google Scholar] [CrossRef]
- Wallach, I.; Dzamba, M.; Heifets, A. AtomNet: a deep convolutional neural network for bioactivity prediction in structure-based drug discovery. arXiv 2015, arXiv:1510.02855. [Google Scholar]
- Zhou, G.; Rusnac, D.V.; Park, H.; Canzani, D.; Nguyen, H.M.; Stewart, L.; Bush, M.F.; Nguyen, P.T.; Wulff, H.; Yarov-Yarovoy, V.; et al. An artificial intelligence accelerated virtual screening platform for drug discovery. Nat. Commun. 2024, 15, 7761. [Google Scholar] [CrossRef]
- Bickerton, G.R.; Paolini, G.V.; Besnard, J.; Muresan, S.; Hopkins, A.L. Quantifying the chemical beauty of drugs. Nat. Chem. 2012, 4, 90–98. [Google Scholar] [CrossRef]
- Zeng, X.; Wang, F.; Luo, Y.; Kang, S.g.; Tang, J.; Lightstone, F.C.; Fang, E.F.; Cornell, W.; Nussinov, R.; Cheng, F. Deep generative molecular design reshapes drug discovery. Cell Rep. Med. 2022, 3. [Google Scholar] [CrossRef]
- Li, X.; Wang, P.; Fu, T.; Gao, W.; Li, C.; Shi, L.; Liu, J. AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design. arXiv 2024, arXiv:2404.02003. [Google Scholar] [CrossRef]
- Fu, T.; Gao, W.; Coley, C.; Sun, J. Reinforced genetic algorithm for structure-based drug design. Adv. Neural Inf. Process. Syst. 2022, 35, 12325–12338. [Google Scholar]
- Fu, T.; Gao, W.; Xiao, C.; Yasonik, J.; Coley, C.W.; Sun, J. Differentiable scaffolding tree for molecular optimization. arXiv 2021, arXiv:2109.10469. [Google Scholar]
- Lin, H.; Huang, Y.; Zhang, O.; Ma, S.; Liu, M.; Li, X.; Wu, L.; Wang, J.; Hou, T.; Li, S.Z. Diffbp: Generative diffusion of 3d molecules for target protein binding. Chem. Sci. 2025, 16, 1417–1431. [Google Scholar] [CrossRef] [PubMed]
- Bohde, M.; Manjrekar, M.; Wang, R.; Ji, S.; Coley, C.W. Diffms: Diffusion generation of molecules conditioned on mass spectra. arXiv 2025, arXiv:2502.09571. [Google Scholar] [CrossRef]
- Lu, W.; Zhang, J.; Huang, W.; Zhang, Z.; Jia, X.; Wang, Z.; Shi, L.; Li, C.; Wolynes, P.G.; Zheng, S. DynamicBind: predicting ligand-specific protein-ligand complex structure with a deep equivariant generative model. Nat. Commun. 2024, 15, 1071. [Google Scholar] [CrossRef]
- Zhao, Q.; Zhao, H.; Guo, L.; Zheng, K.; Li, Y.; Ling, Q.; Tang, J.; Li, Y.; Wang, J. ColdstartCPI: Induced-fit theory-guided DTI predictive model with improved generalization performance. Nat. Commun. 2025, 16, 6436. [Google Scholar] [CrossRef]
- Jaeger, S.; Fulle, S.; Turk, S. Mol2vec: unsupervised machine learning approach with chemical intuition. J. Chem. Inf. Model. 2018, 58, 27–35. [Google Scholar] [CrossRef]
- Elnaggar, A.; Heinzinger, M.; Dallago, C.; Rehawi, G.; Wang, Y.; Jones, L.; Gibbs, T.; Feher, T.; Angerer, C.; Steinegger, M.; et al. ProtTrans: toward understanding the language of life through self-supervised learning. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 44, 7112–7127. [Google Scholar] [CrossRef]
- Sun, Y.; Li, Y.Y.; Leung, C.K.; Hu, P. iNGNN-DTI: prediction of drug–target interaction with interpretable nested graph neural network and pretrained molecule models. Bioinformatics 2024, 40, btae135. [Google Scholar] [CrossRef] [PubMed]
- Irwin, R.; Dimitriadis, S.; He, J.; Bjerrum, E.J. Chemformer: a pre-trained transformer for computational chemistry. Mach. Learn. Sci. Technol. 2022, 3, 015022. [Google Scholar] [CrossRef]
- Lin, Z.; Akin, H.; Rao, R.; Hie, B.; Zhu, Z.; Lu, W.; Smetanin, N.; Verkuil, R.; Kabeli, O.; Shmueli, Y.; et al. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science 2023, 379, 1123–1130. [Google Scholar] [CrossRef]
- Swanson, K.; Liu, G.; Catacutan, D.B.; Arnold, A.; Zou, J.; Stokes, J.M. Generative AI for designing and validating easily synthesizable and structurally novel antibiotics. Nat. Mach. Intell. 2024, 6, 338–353. [Google Scholar] [CrossRef]
- Fu, T.; Xiao, C.; Li, X.; Glass, L.M.; Sun, J. Mimosa: Multi-constraint molecule sampling for molecule optimization. In Proceedings of the Proceedings of the AAAI Conference on Artificial Intelligence, 2021, Vol. 35, pp. 125–133. [CrossRef]
- Fu, T.; Xiao, C.; Sun, J. Core: Automatic molecule optimization using copy & refine strategy. In Proceedings of the Proceedings of the AAAI Conference on Artificial Intelligence, 2020, Vol. 34, pp. 638–645.
- Fu, T.; Xiao, C.; Glass, L.M.; Sun, J. MOLER: Incorporate molecule-level reward to enhance deep generative model for molecule optimization. IEEE Trans. Knowl. Data Eng. 2021, 34, 5459–5471. [Google Scholar] [CrossRef]
- Stumpfe, D.; Bajorath, J. Exploring activity cliffs in medicinal chemistry: miniperspective. J. Med. Chem. 2012, 55, 2932–2942. [Google Scholar] [CrossRef] [PubMed]
- Maggiora, G.M. On outliers and activity cliffs why QSAR often disappoints. 2006. [Google Scholar]
- Van Tilborg, D.; Alenicheva, A.; Grisoni, F. Exposing the limitations of molecular machine learning with activity cliffs. J. Chem. Inf. Model. 2022, 62, 5938–5951. [Google Scholar] [CrossRef]
- Patel, C.N.; Kumar, S.P.; Rawal, R.M.; Patel, D.P.; Gonzalez, F.J.; Pandya, H.A. A multiparametric organ toxicity predictor for drug discovery. Toxicol. Mech. Methods 2020, 30, 159–166. [Google Scholar] [CrossRef]
- Lin, Z.; Will, Y. Evaluation of drugs with specific organ toxicities in organ-specific cell lines. Toxicol. Sci. 2012, 126, 114–127. [Google Scholar] [CrossRef]
- Cremer, J.; Medrano Sandonas, L.; Tkatchenko, A.; Clevert, D.A.; De Fabritiis, G. Equivariant graph neural networks for toxicity prediction. Chem. Res. Toxicol. 2023, 36, 1561–1573. [Google Scholar] [CrossRef] [PubMed]
- Teng, S.; Yin, C.; Wang, Y.; Chen, X.; Yan, Z.; Cui, L.; Wei, L. MolFPG: multi-level fingerprint-based graph transformer for accurate and robust drug toxicity prediction. Comput. Biol. Med. 2023, 164, 106904. [Google Scholar] [CrossRef]
- Sun, D.; Gao, W.; Hu, H.; Zhou, S. Why 90% of clinical drug development fails and how to improve it? Acta Pharm. Sin. B 2022, 12, 3049–3062. [Google Scholar] [CrossRef]
- Zhu, H.W.; Yi, H.C.; You, Z.H.; He, S.H.; Shang, X. A Multimodal Information Fusion Framework for Accurate and Robust Prediction of Drug-Drug Interactions. Inf. Fusion 2025, 103981. [Google Scholar] [CrossRef]
- Bi, X.; Ma, W.; Jiang, H.; Cai, Q.; Nie, J.; Wei, Z.; Zhang, S. Subgraph-Focused Biomedical Knowledge Embedding with Bi-Semantic Encoder for Multi-Type Drug-Drug Interaction Prediction. Inf. Fusion 2025, 104109. [Google Scholar] [CrossRef]
- Rationalizing combination therapies. Nat. Med. 2017, 23, 1113. [CrossRef]
- Preuer, K.; Lewis, R.P.; Hochreiter, S.; Bender, A.; Bulusu, K.C.; Klambauer, G. DeepSynergy: predicting anti-cancer drug synergy with Deep Learning. Bioinformatics 2018, 34, 1538–1546. [Google Scholar] [CrossRef]
- Wang, H.; Luna, A.; Yan, G.; Li, X.; Babur, O.; Mills, G.B.; Sander, C.; Korkut, A. Targeting adaptation to cancer treatment by drug combinations. bioRxiv 2021, 2021–04. [Google Scholar] [CrossRef]
- Kong, W.; Midena, G.; Chen, Y.; Athanasiadis, P.; Wang, T.; Rousu, J.; He, L.; Aittokallio, T. Systematic review of computational methods for drug combination prediction. Comput. Struct. Biotechnol. J. 2022, 20, 2807–2814. [Google Scholar] [CrossRef]
- Li, Y.; Meng, Q.; Yang, M.; Liu, D.; Hou, X.; Tang, L.; Wang, X.; Lyu, Y.; Chen, X.; Liu, K.; et al. Current trends in drug metabolism and pharmacokinetics. Acta Pharm. Sin. B 2019, 9, 1113–1144. [Google Scholar] [CrossRef] [PubMed]
- Lu, J.; Deng, K.; Zhang, X.; Liu, G.; Guan, Y. Neural-ODE for pharmacokinetics modeling and its advantage to alternative machine learning models in predicting new dosing regimens. iScience 2021, 24. [Google Scholar] [CrossRef]
- Chen, R.T.; Rubanova, Y.; Bettencourt, J.; Duvenaud, D.K. Neural ordinary differential equations. In Proceedings of the Advances in Neural Information Processing Systems, 2018; Vol. 31. [Google Scholar]
- Lu, J.; Bender, B.; Jin, J.Y.; Guan, Y. Deep learning prediction of patient response time course from early data via neural-pharmacokinetic/pharmacodynamic modelling. Nat. Mach. Intell. 2021, 3, 696–704. [Google Scholar] [CrossRef]
- Laubenbacher, R.; Sluka, J.P.; Glazier, J.A. Using digital twins in viral infection. Science 2021, 371, 1105–1106. [Google Scholar] [CrossRef] [PubMed]
- Elias, M.; König, M. A digital twin of glimepiride for personalized and stratified diabetes treatment. Front. Pharmacol. 2025, 16, 1686415. [Google Scholar] [CrossRef] [PubMed]
- Shen, M.d.; Chen, S.b.; Ding, X.d. The effectiveness of digital twins in promoting precision health across the entire population: a systematic review. npj Digit. Med. 2024, 7, 145. [Google Scholar] [CrossRef]
- Srinivasa, R.S.; Qian, C.; Theodorou, B.; Spaeder, J.; Xiao, C.; Glass, L.; Sun, J. Clinical trial site matching with improved diversity using fair policy learning. arXiv 2022, arXiv:2204.06501. [Google Scholar] [CrossRef]
- Theodorou, B.; Glass, L.; Xiao, C.; Sun, J. FRAMM: Fair ranking with missing modalities for clinical trial site selection. Patterns 2024, 5. [Google Scholar] [CrossRef]
- Atkinson, J.A.; Wells, R.; Page, A.; Dominello, A.; Haines, M.; Wilson, A. Applications of system dynamics modelling to support health policy. Public Health Res. Pract. 2015, 25, e2531531. [Google Scholar] [CrossRef]
- Chen, X.; Orom, H.; Hay, J.L.; Waters, E.A.; Schofield, E.; Li, Y.; Kiviniemi, M.T. Differences in rural and urban health information access and use. J. Rural Health 2019, 35, 405–417. [Google Scholar] [CrossRef]
- Weiss, D.; Nelson, A.; Vargas-Ruiz, C.; Gligorić, K.; Bavadekar, S.; Gabrilovich, E.; Bertozzi-Villa, A.; Rozier, J.; Gibson, H.; Shekel, T.; et al. Global maps of travel time to healthcare facilities. Nat. Med. 2020, 26, 1835–1838. [Google Scholar] [CrossRef]
- Fu, T.; Huang, K.; Xiao, C.; Glass, L.M.; Sun, J. HINT: Hierarchical interaction network for clinical-trial-outcome predictions. Patterns 2022, 3, 100445. [Google Scholar] [CrossRef]
- Chen, T.; Hao, N.; Lu, Y.; Rechem, C.V. Uncertainty Quantification on Clinical Trial Outcome Prediction. arXiv 2024, arXiv:cs. [Google Scholar] [CrossRef]
- Zheng, W.; Peng, D.; Xu, H.; Zhu, H.; Fu, T.; Yao, H. Multimodal clinical trial outcome prediction with large language models. arXiv 2024, arXiv:2402.06512. [Google Scholar]
- Klein, H.; Mazor, T.; Siegel, E.; Trukhanov, P.; Ovalle, A.; Vecchio Fitz, C.D.; Zwiesler, Z.; Kumari, P.; Van Der Veen, B.; Marriott, E.; et al. MatchMiner: an open-source platform for cancer precision medicine. npj Precis. Oncol. 2022, 6, 69. [Google Scholar] [CrossRef]
- Hamer, D.M.d.; Schoor, P.; Polak, T.B.; Kapitan, D. Improving Patient Pre-screening for Clinical Trials: Assisting Physicians with Large Language Models. arXiv 2023, arXiv:2304.07396. [Google Scholar] [CrossRef]
- Jin, Q.; Wang, Z.; Floudas, C.S.; Chen, F.; Gong, C.; Bracken-Clarke, D.; Xue, E.; Yang, Y.; Sun, J.; Lu, Z. Matching patients to clinical trials with large language models. Nat. Commun. 2024, 15, 9074. [Google Scholar] [CrossRef]
- Shool, S.; Adimi, S.; Saboori Amleshi, R.; Bitaraf, E.; Golpira, R.; Tara, M. A systematic review of large language model (LLM) evaluations in clinical medicine. BMC Med. Inform. Decis. Mak. 2025, 25, 117. [Google Scholar] [CrossRef]
- Rao, A.S.; Esmail, K.P.; Lee, R.S.; Jiang, S.; Arraiza Carlo, B.; Gill, J.; Khanna, P.; Kalmowitz, E.; Montagnese, B.; Heydari, K.; et al. Large language model performance and clinical reasoning tasks. JAMA Netw. Open 2026, 9, e264003. [Google Scholar] [CrossRef] [PubMed]
- Zhou, J.; Li, H.; Chen, S.; Chen, Z.; Han, Z.; Gao, X. Large language models in biomedicine and healthcare. npj Artif. Intell. 2025, 1, 44. [Google Scholar] [CrossRef]
- Asgari, E.; Montaña-Brown, N.; Dubois, M.; Khalil, S.; Balloch, J.; Yeung, J.A.; Pimenta, D. A framework to assess clinical safety and hallucination rates of LLMs for medical text summarisation. npj Digit. Med. 2025, 8, 274. [Google Scholar] [CrossRef] [PubMed]
- Bean, A.M.; Payne, R.E.; Parsons, G.; Kirk, H.R.; Ciro, J.; Mosquera-Gómez, R.; Hincapié M, S.; Ekanayaka, A.S.; Tarassenko, L.; Rocher, L.; et al. Reliability of LLMs as medical assistants for the general public: a randomized preregistered study. Nat. Med. 2026, 1–7. [Google Scholar] [CrossRef]
- Zhong, X.; Li, S.; Chen, Z.; Ge, L.; Yu, D.; Wang, S.; You, L.; Shang, H. Considerations for patient privacy of Large Language Models in health care: scoping review. J. Med. Internet Res. 2025, 27, e76571. [Google Scholar] [CrossRef] [PubMed]
- Schoolcraft, D.; Meltzer, A.C.; Sangal, R.; Terry, A.T.; Robertson, K.; Buckland, D.; Motalib, S.; Genes, N.; Vukmir, R.; Waseem, T.; et al. Health Insurance Portability and Accountability Act Liability in the Age of Generative Artificial Intelligence. JACEP Open 2026, 7, 100317. [Google Scholar] [CrossRef]
- Shah, S.V. Accuracy, consistency, and hallucination of large language models when analyzing unstructured clinical notes in electronic medical records. JAMA Netw. Open 2024, 7, e2425953. [Google Scholar] [CrossRef]
- Croxford, E.; Gao, Y.; First, E.; Pellegrino, N.; Schnier, M.; Caskey, J.; Oguss, M.; Wills, G.; Chen, G.; Dligach, D.; et al. Evaluating clinical AI summaries with large language models as judges. npj Digit. Med. 2025, 8, 640. [Google Scholar] [CrossRef]
- Guo, Y.; Guo, M.; Su, J.; Yang, Z.; Zhu, M.; Li, H.; Qiu, M.; Liu, S.S. Bias in large language models: Origin, evaluation, and mitigation. arXiv 2024, arXiv:2411.10915. [Google Scholar] [CrossRef]
- Gu, J.; Jiang, X.; Shi, Z.; Tan, H.; Zhai, X.; Xu, C.; Li, W.; Shen, Y.; Ma, S.; Liu, H.; et al. A survey on llm-as-a-judge. The Innovation 2024. [Google Scholar] [CrossRef]
- Genovese, A.; Hegstrom, L.; Prabha, S.; Gomez-Cabello, C.A.; Haider, S.A.; Collaco, B.; Wood, N.G.; Forte, A.J. Artificial Authority: The Promise and Perils of LLM Judges in Healthcare. Bioengineering 2026, 13, 108. [Google Scholar] [CrossRef] [PubMed]
- Laskar, M.T.R.; Jahan, I.; Dolatabadi, E.; Peng, C.; Hoque, E.; Huang, J. Improving automatic evaluation of large language models (LLMs) in biomedical relation extraction via LLMs-as-the-judge. In Proceedings of the Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025, pp. 25483–25497.
- Wang, L.; Chen, X.; Deng, X.; Wen, H.; You, M.; Liu, W.; Li, Q.; Li, J. Prompt engineering in consistency and reliability with the evidence-based guideline for LLMs. npj Digit. Med. 2024, 7, 41. [Google Scholar] [CrossRef]
- Chen, Y.; Zou, J. GenePT: A Simple But Effective Foundation Model for Genes and Cells Built From ChatGPT. bioRxiv 2023. [Google Scholar] [CrossRef] [PubMed]
- Steshin, S. Lo-hi: Practical ml drug discovery benchmark. Adv. Neural Inf. Process. Syst. 2023, 36, 64526–64554. [Google Scholar]
- Sheridan, R.P. Time-split cross-validation as a method for estimating the goodness of prospective prediction. J. Chem. Inf. Model. 2013, 53, 783–790. [Google Scholar] [CrossRef]
- Talevi, A.; Morales, J.F.; Hather, G.; Podichetty, J.T.; Kim, S.; Bloomingdale, P.C.; Kim, S.; Burton, J.; Brown, J.D.; Winterstein, A.G.; et al. Machine learning in drug discovery and development part 1: a primer. CPT Pharmacomet. Syst. Pharmacol. 2020, 9, 129–142. [Google Scholar] [CrossRef]
- Fooladi, H.; Vu, T.N.L.; Mathea, M.; Kirchmair, J. Evaluating machine learning models for molecular property prediction: performance and robustness on out-of-distribution data. J. Chem. Inf. Model. 2025, 65, 9871–9891. [Google Scholar] [CrossRef]
- Ji, Y.; Zhang, L.; Wu, J.; Wu, B.; Li, L.; Huang, L.K.; Xu, T.; Rong, Y.; Ren, J.; Xue, D.; et al. Drugood: Out-of-distribution dataset curator and benchmark for ai-aided drug discovery–a focus on affinity prediction problems with noise annotations. In Proceedings of the Proceedings of the AAAI Conference on Artificial Intelligence, 2023, Vol. 37, pp. 8023–8031. [CrossRef]
- Abdar, M.; Pourpanah, F.; Hussain, S.; Rezazadegan, D.; Liu, L.; Ghavamzadeh, M.; Fieguth, P.; Cao, X.; Khosravi, A.; Acharya, U.R.; et al. A review of uncertainty quantification in deep learning: Techniques, applications and challenges. Inf. Fusion 2021, 76, 243–297. [Google Scholar] [CrossRef]
- Rudin, C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat. Mach. Intell. 2019, 1, 206–215. [Google Scholar] [CrossRef]
- Tonekaboni, S.; Joshi, S.; McCradden, M.D.; Goldenberg, A. What clinicians want: contextualizing explainable machine learning for clinical end use. In Proceedings of the Machine learning for healthcare conference. PMLR, 2019; pp. 359–380. [Google Scholar]
- Chen, Y.; Zhang, L.; Zhong, C.; Li, S.; Li, R.; Jia, Z.; Huang, P.; Wang, S.; He, Z.; Lin, H.; et al. Rare Cancer Explorer 1.0 (RaCE 1.0): a dedicated database and analytical platform focused on rare cancers. Nucleic Acids Res. 2026, 54, D1590–D1607. [Google Scholar] [CrossRef]
- Pillai, R.K.; Jayasree, K. Rare cancers: Challenges & issues. Indian J. Med. Res. 2017, 145, 17–27. [Google Scholar] [PubMed]
- Thapa, K.; Kinali, M.; Pei, S.; Luna, A.; Babur, Ö. Strategies to include prior knowledge in omics analysis with deep neural networks. Patterns 2025, 6. [Google Scholar] [CrossRef] [PubMed]
- Babur, Ö.; Luna, A.; Korkut, A.; Durupinar, F.; Siper, M.C.; Dogrusoz, U.; Jacome, A.S.V.; Peckner, R.; Christianson, K.E.; Jaffe, J.D.; et al. Causal interactions from proteomic profiles: Molecular data meet pathway knowledge. Patterns 2021, 2. [Google Scholar] [CrossRef] [PubMed]
- Omar, M.; Sorin, V.; Collins, J.D.; Reich, D.; Freeman, R.; Gavin, N.; Charney, A.; Stump, L.; Bragazzi, N.L.; Nadkarni, G.N.; et al. Multi-model assurance analysis showing large language models are highly vulnerable to adversarial hallucination attacks during clinical decision support. Commun. Med. 2025, 5, 330. [Google Scholar] [CrossRef]
- Ayoub, M.; Zhao, H.; Li, L.; Yang, D.; Hussain, S.; Wahid, J.A. Structured clinical approach to enable large language models to be used for improved clinical diagnosis and explainable reasoning. Commun. Med. 2026. [Google Scholar] [CrossRef]
- Ding, C.; Bian, M.; Chen, P.; Zhang, H.; Li, T.; Liu, L.; Chen, J.; Li, Z.; Zhong, Y.; Liu, Y.; et al. Building a human-verified clinical reasoning dataset via a human LLM hybrid pipeline for trustworthy medical AI. arXiv 2025, arXiv:2505.06912. [Google Scholar] [CrossRef]



| Name | Usage / ML Application Task | Limitations (see Section 3.4) |
|---|---|---|
| • UK Biobank [6] | Large-scale genotype-phenotype cohort; Application: C/R [7,8] | Sampling bias due to non-representative cohort (e.g., excludes people of age > 69 and < 40) [9] |
| • MGnify [10] | Microbiome genomic annotations; Application: Other | Sampling bias of particular taxa [11] |
|
•• National Cancer Institute 60 (NCI60) [12] •• Genomics of Drug Sensitivity in Cancer (GDSC) [15] •• Cancer Cell Line Encyclopedia (CCLE) [16] |
Large-scale drug response measurements across cancer cell lines; Application: C/R [13] | Cross-dataset inconsistency (e.g., AUC vs IC50) [14] |
|
•• CellMiner Cross Database (CellMinerCDB) [17] •• Therapeutics Data Commons (TDC) [19] |
Cross-dataset multi-omics & drug response integration; Application: CellMinerCDB: C/R [13], TDC: C/R [18] | Cross-dataset inconsistency (e.g., AUC vs IC50) [14] |
|
• Davis kinase inhibitors DB [20] • Kinase Inhibitor Bioactivity Data (KIBA) [23] • BindingDB [24] |
Experimental binding affinity datasets for protein-ligand interactions; Application: C/R [21] | Sampling bias and class imbalance [22] |
|
•• ChEMBL [25] •• DrugTargetCommons (DTC) [31] |
Curated bioactivity/literature databases; Application: ChEMBL: C/R [26,27], DTC: R [28] | Coverage bias [29] and similarity bias [30] |
| •• OpenTargets [32] | Integrated target-disease association and prioritization; Application: C [33] | Evidence weighting subjectivity, data source imbalance [32] |
| •• ZINC ligand discovery database [34] | Virtual chemical library for ligand screening and generation; Application: C/R [35] | Coverage bias due to incomplete representation of chemical or biological space [29] |
| • MoleculeNet [36] | Benchmark datasets for evaluating molecular property prediction; Application: C/R [36] | Coverage bias due to incomplete representation of chemical or biological space [29] |
| • PK-DB [37] | ADME/PK experimental data; Application: C/R [38] | Data incompleteness, heterogeneous reporting, and aggregation bias [37] |
|
•• RCSB Protein Data Bank (PDB) [39] • PDBBind [42] |
High-quality protein 3D structures with protein-ligand binding benchmark; Application: R [40] | Bias toward crystallizable, stable, and well-folded proteins [41] |
|
• Uniclust [43] • Uniref [45] |
Clustered protein sequences with different similarity levels available; Application: Other | Selection bias due to uneven homolog availability, over-representing well-characterized proteins [44] |
| • ClinicalTrials.gov [46] | Clinical trial registry and metadata for study design and outcomes; Application: Other | Selection bias, information / classification bias, confounding bias [47,48] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).