Version 1
: Received: 8 October 2019 / Approved: 10 October 2019 / Online: 10 October 2019 (07:40:31 CEST)
Version 2
: Received: 24 October 2019 / Approved: 25 October 2019 / Online: 25 October 2019 (11:21:43 CEST)
How to cite:
Ancuceanu, R.; Tamba, B.; Stoicescu, C.S.; Dinu, M. Use of QSAR Global Models and Molecular Docking for Developing New Inhibitors of c-Src Tyrosine Kinase. Preprints2019, 2019100113
Ancuceanu, R.; Tamba, B.; Stoicescu, C.S.; Dinu, M. Use of QSAR Global Models and Molecular Docking for Developing New Inhibitors of c-Src Tyrosine Kinase. Preprints 2019, 2019100113
Ancuceanu, R.; Tamba, B.; Stoicescu, C.S.; Dinu, M. Use of QSAR Global Models and Molecular Docking for Developing New Inhibitors of c-Src Tyrosine Kinase. Preprints2019, 2019100113
APA Style
Ancuceanu, R., Tamba, B., Stoicescu, C.S., & Dinu, M. (2019). Use of QSAR Global Models and Molecular Docking for Developing New Inhibitors of c-Src Tyrosine Kinase. Preprints. https://doi.org/
Chicago/Turabian Style
Ancuceanu, R., Cristina Silvia Stoicescu and Mihaela Dinu. 2019 "Use of QSAR Global Models and Molecular Docking for Developing New Inhibitors of c-Src Tyrosine Kinase" Preprints. https://doi.org/
Abstract
Prototype of a family of at least nine members, c-src tyrosine kinase is a therapeutically interesting target, because its inhibition might be of interest not only in a number of malignancies, but also in a diverse array of conditions, from neurodegenerative pathologies to certain viral infections. Computational methods in drug discovery are considerably cheaper than conventional methods and offer opportunities of screening very large numbers of compounds in conditions that would be simply impossible within the wet lab experimental settings. We have explored the use of global QSAR models and molecular ligand docking in the discovery of new c-src tyrosine kinase inhibitors. Using a data set of 1038 compounds from ChEMBL and 19 blocks of molecular descriptors, we have developed over 200 QSAR classification models, based on six machine learning algorithms and 17 feature selection methods. We have selected 49 with reasonably good performance (positive predictive value and balanced accuracy higher than 70% in nested cross validation) and the models were assembled by stacking with a simple majority vote and used for the virtual screening of over the “named” ZINC data set (over 100,000 compounds). 744 compounds were predicted by at least 50% of the QSAR models as active, 147 compounds were within the applicability domain and predicted by at least 75% of the models to be active. The latter 147 compounds were submitted to molecular ligand docking using Vina and Ledock, and a number of 90 were predicted to be active based on the binding energy. External data from CHEMBL and PUBCHEM confirmed that at least 7.83% (in the case of QSAR) or 6.67% (in the case of integrated QSAR and molecular docking) of the compounds are active on the c-src target.
Computer Science and Mathematics, Mathematical and Computational Biology
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Received:
25 October 2019
Commenter:
Robert Ancuceanu
Commenter's Conflict of Interests:
Author
Comment:
We have added keywords (in the initial version they were missing) and removed a few lines about y-scrambling which were redundant (they were repeated in different words in the following paragraph). We also added a title to the first table.
Commenter: Robert Ancuceanu
Commenter's Conflict of Interests: Author