Submitted:
24 April 2026
Posted:
27 April 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Research Questions
- What are the main classes of molecular docking algorithms currently in existence, and what is their architectural evolution?
- What are the comparative characteristics of accuracy, computational scalability, and physical validity of classical and AI-oriented tools according to independent benchmarks?
- What infrastructure solutions (pipelines, graphical interfaces) have been developed for high-throughput virtual screening, and what is their efficiency?
- What approaches are used to model conformational receptor flexibility and perform blind docking?
- What are the current trends and predicted directions for the development of hybrid pipelines that combine classical force fields and neural network models?
2.2. Information Sources and Search Strategy
- (“molecular docking” OR “protein-ligand docking”) AND (“virtual screening” OR “high-throughput screening”)
- (“docking software” OR “docking tools”) AND (“AutoDock Vina” OR “Glide” OR “GOLD”)
- (“deep learning” OR “machine learning” OR “generative AI”) AND (“molecular docking” OR “drug design”)
- (“DiffDock” OR “GNINA” OR “AlphaFold 3” OR “DynamicBind” OR “FABFlex”) AND (“docking” OR “binding pose”)
- (“benchmark” OR “PoseBusters” OR “comparative assessment”) AND (“docking accuracy” OR “scoring function”)
- (“receptor flexibility” OR “induced fit” OR “blind docking”) AND (“molecular docking”)
- (“pipeline” OR “workflow” OR “high-throughput”) AND (“EasyDock” OR “virtual screening”)
2.3. Selection Criteria
- Original research articles describing new algorithms, programs, or web servers for molecular docking.
- Review articles that systematically analyze docking methods.
- Preprints (arXiv, ChemRxiv, bioRxiv) if they contained a description of a tool or benchmark that, at the time of the search, did not have a peer-reviewed version.
- Conference proceedings (ICLR, NeurIPS, MLSB) describing new models.
- Software documentation and repositories to clarify technical specifications.
- Publications focused solely on the application of existing tools for a specific pharmacological problem (case studies) without methodological innovations.
- Conference abstracts of less than 2 pages.
- Works that did not contain information on evaluating the accuracy or speed of the tool.
2.4. Selection Process and Data Extraction
2.5. Synthesis of Results
3. Classical Docking Platforms
3.1. AutoDock Vina and Empirical Approaches
3.2. Commercial Solutions: Glide and GOLD
4. Scalable Infrastructure Solutions for Screening
4.1. Local Interfaces: EasyDockVina
4.2. Distributed Computing: EasyDock
5. Paradigm Shift: Machine Learning and Hybrid Architectures
5.1. GNINA and Convolutional Neural Networks
5.2. ArtiDock and Optimization for Industrial Scales
6. Generative Artificial Intelligence and Endogenous Modeling
6.1. Diffusion Generative Models: DiffDock
6.2. Flow Matching and Geometric Innovations
7. Overcoming the Problem of Receptor Flexibility
7.1. Backbone Transformation and DynamicBind
7.2. High-Speed Alternatives: FABFlex
8. Co-Folding: Ab Initio Prediction of Macromolecular Complexes
9. Specialized Therapeutic Modalities
9.1. Covalent Docking
9.2. Peptides, Metalloproteins, and Macrocycles
10. Discussion: Independent Benchmarking and the Neural Network Crisis
11. Future Research Perspectives
- Use of AI for full-atom modeling of receptor flexibility.
- Sieving of colossal libraries through distributed cluster pipelines with basic physical evaluation.
- Hybrid docking combining diffusion models with local optimization.
- Rescoring with an ensemble of 3D-CNNs followed by physical minimization.
- Application of co-folding models to the final group of ligands.
12. Conclusions
- A classification of docking methods is proposed, reflecting their architectural evolution. Three main generations of tools are identified: (i) classical biophysical platforms (AutoDock Vina, Glide, GOLD), based on stochastic search and empirical scoring functions; (ii) hybrid approaches integrating machine learning for rescoring or pose refinement (GNINA, ArtiDock); (iii) end-to-end generative models (DiffDock, AlphaFold 3, DynamicBind) that predict complex structure directly, without iterative conformational search.
- Analysis of independent benchmarks (PoseBusters, Bento, NextTopDocker) allowed for a balanced assessment of the capabilities of different approaches. It is shown that claims of the total superiority of AI models are premature. Classical algorithms maintain leadership in terms of physical validity (<3% incorrect structures), whereas generative models, while minimizing RMSD, often generate stereochemically impossible poses with atom collisions. At the same time, deep learning methods demonstrate unprecedented speed and the ability to account for macroscopic receptor flexibility, confirming their high potential when integrated correctly.
- The review revealed that specialized pipelines have been developed to address the challenge of high-throughput screening. It is shown that graphical interfaces (EasyDockVina) democratize access to docking for researchers without programming skills, while distributed frameworks (EasyDock) ensure unprecedented scalability (docking thousands of compounds in minutes on cluster architectures), automating the entire data processing cycle—from 3D structure generation to ensuring fault tolerance.
- Approaches to solving the fundamental problem of protein flexibility were analyzed in detail. It is demonstrated that the latest diffusion models (DynamicBind, FABFlex) overcome the limitations of classical “induced fit” modeling, allowing simulation of protein backbone transformations and detection of cryptic pockets. At the same time, a trade-off between accuracy (DynamicBind) and computational speed (FABFlex) was identified, which determines the choice of tool depending on the scale of the task.
- The main result of this work is the substantiation of the thesis regarding the inevitability of forming hybrid pipelines. The “classical vs. AI” dichotomy is recognized as false. The optimal future strategy lies in deep integration, where generative models provide rapid hypothesis generation and account for conformational mobility, classical force fields ensure physical realism and correct neural network artifacts, and distributed infrastructures (EasyDock) enable scaling this process to billion-sized libraries.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- DiMasi, J.A.; Grabowski, H.G.; Hansen, R.W. Innovation in the pharmaceutical industry: New estimates of R&D costs. J. Health Econ. 2016, 47, 20–33. [Google Scholar]
- Meng, X.Y.; Zhang, H.X.; Mezei, M.; Cui, M. Molecular Docking: A Powerful Approach for Structure-Based Drug Discovery. Curr. Comput. Aided Drug. Des. 2011, 7, 146–157. [Google Scholar] [CrossRef]
- Paggi, J.M.; Pandit, A.; Dror, R.O. The Art and Science of Molecular Docking. Annu. Rev. Biochem. 2024, 93, 389–410. [Google Scholar] [CrossRef] [PubMed]
- Oliveira, A.S. Bridging traditional and contemporary approaches in computational medicinal chemistry: opportunities for innovation in drug discovery. RSC Med. Chem. 2025, 16, 5953–5963. [Google Scholar] [CrossRef]
- Jumper, J.; Evans, R.; Pritzel, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef]
- Li, Y.; Yi, J.; Li, H.; et al. Decoding the limits of deep learning in molecular docking for drug discovery. Chem. Sci. 2025, 16, 17374–17390. [Google Scholar] [CrossRef]
- Tricco, A.C.; Lillie, E.; Zarin, W.; et al. PRISMA Extension for Scoping Reviews (PRISMA-ScR): Checklist and Explanation. Ann. Intern. Med. 2018, 169, 467–473. [Google Scholar] [CrossRef]
- Trott, O.; Olson, A.J. AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J. Comput. Chem. 2010, 31, 455–461. [Google Scholar] [CrossRef] [PubMed]
- Friesner, R.A.; Banks, J.L.; Murphy, R.B.; et al. Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. J. Med. Chem. 2004, 47, 1739–1749. [Google Scholar] [CrossRef] [PubMed]
- Jones, G.; Willett, P.; Glen, R.C.; Leach, A.R.; Taylor, R. Development and validation of a genetic algorithm for flexible docking. J. Mol. Biol. 1997, 267, 727–748. [Google Scholar] [CrossRef]
- Buttenschoen, M.; Morris, G.M.; Deane, C.M. PoseBusters: AI-based docking methods fail to generate physically valid poses or generalise to novel sequences. Chem. Sci. 2024, 15, 3130–3139. [Google Scholar] [CrossRef]
- ElTijani, A.; Alsafi, M.Y.; Ahmed, A.F. EasyDockVina: Graphical Interface for Ligand Optimization and High Throughput Virtual Screening with Vina. Zenodo 2019, v2.2. [Google Scholar]
- Minibaeva, G.; Ivanova, A.; Polishchuk, P. EasyDock: customizable and scalable docking tool. J. Cheminform. 2023, 15, 102. [Google Scholar] [CrossRef] [PubMed]
- McNutt, A.T.; Li, Y.; Meli, R.; Aggarwal, R.; Koes, D.R. GNINA 1.3: the next increment in molecular docking with deep learning. J. Cheminform. 2025, 17, 28. [Google Scholar] [CrossRef]
- Voitsitskyi, T.; Koleiev, I.; Stratiichuk, R.; et al. ArtiDock: Accurate Machine Learning Approach to Protein–Ligand Docking Optimized for High-Throughput Virtual Screening. J. Chem. Inf. Model. 2026, 66, 1–15. [Google Scholar] [CrossRef] [PubMed]
- Corso, G.; Stärk, H.; Jing, B.; Barzilay, R.; Jaakkola, T. DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking. In Proceedings of the International Conference on Learning Representations (ICLR), Kigali, Rwanda, 1–5 May 2023. [Google Scholar]
- Lu, W.; Zhang, J.; Huang, W.; et al. DynamicBind: predicting ligand-specific protein-ligand complex structure with a deep equivariant generative model. Nat. Commun. 2024, 15, 1071. [Google Scholar] [CrossRef]
- Zhang, Z.; Wu, L.; Gao, K.; Yao, J.; Qin, T.; Han, B. Fast and Accurate Blind Flexible Docking. In Proceedings of the Thirteenth International Conference on Learning Representations (ICLR), Singapore, 24–28 April 2025. [Google Scholar]
- Abramson, J.; Adler, J.; Dunger, J.; et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 2024, 630, 493–500. [Google Scholar] [CrossRef]
- Shen, C.; Du, H.; Zhang, X.; et al. CarsiDock-Cov: A deep learning-guided approach for automated covalent docking and screening. Acta Pharm. Sin. B 2025, 15, 5758–5771. [Google Scholar] [CrossRef]
- Wang, Y.; Wang, F.; Feng, L.; Zhang, C.; Lai, L. DiffPepDock: Efficient protein–peptide docking and binder screening via SE(3)-equivariant diffusion. Protein Sci. 2025, 34, e70338. [Google Scholar] [CrossRef]
- Zhang, H.; Su, Q.; Zheng, Y.; et al. MetalloDock: Decoding Metalloprotein–Ligand Interactions via Physics-Aware Deep Learning for Metalloprotein Drug Discovery. J. Am. Chem. Soc. 2026. [Google Scholar] [CrossRef]
- Liang, H.; Huang, S.; Xu, X.; et al. Designing Macrocyclic Kinase Inhibitors Using Macrocycle Scaffold Hopping with Reinforced Learning (Macro-Hop). J. Med. Chem. 2025, 68, 6698–6717. [Google Scholar] [CrossRef]
- Pak, M.A.; Frolova, D.; Nikolenko, S.A.; et al. Bento: Benchmarking Classical and AI Docking on Drug Design–Relevant Data. bioRxiv 2025, 2025.12.30.696741. [Google Scholar]
- Alcaide, E.; Gao, Z. NextTopDocker: the largest-to-date docking power benchmark reveals that deep learning performs generally much worse than logistic regression models. ChemRxiv 2024. [Google Scholar] [CrossRef]
| Characteristic | EasyDockVina (2019) | EasyDock (2023) |
|---|---|---|
| Interface | Graphical (GUI) | Command Line (CLI) / Python Module |
| Scalability | Local Workstation | Cluster Architecture (Dask, SSH) |
| Data Processing | File Conversion (.mol → .pdbqt) | 3D Generation from SMILES, RDKit, Desalting |
| Chemical Management | Absent | Tautomer Generation, pH 7.4 |
| Fault Tolerance | Basic (write to TXT/CSV) | Advanced (SQLite DB, Resumability) |
| Supported Engines | AutoDock Vina | Vina, Smina, GNINA, QVina |
| Specialized Functions | Standard Parameterization | Boron Compound Docking (Covalent Inhibitors) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).