Working Paper Article Version 2 This version is not peer-reviewed

Use of QSAR Global Models and Molecular Docking for Developing New Inhibitors of c-Src Tyrosine Kinase

Version 1 : Received: 8 October 2019 / Approved: 10 October 2019 / Online: 10 October 2019 (07:40:31 CEST)
Version 2 : Received: 24 October 2019 / Approved: 25 October 2019 / Online: 25 October 2019 (11:21:43 CEST)

How to cite: Ancuceanu, R.; Tamba, B.; Stoicescu, C.S.; Dinu, M. Use of QSAR Global Models and Molecular Docking for Developing New Inhibitors of c-Src Tyrosine Kinase. Preprints 2019, 2019100113 Ancuceanu, R.; Tamba, B.; Stoicescu, C.S.; Dinu, M. Use of QSAR Global Models and Molecular Docking for Developing New Inhibitors of c-Src Tyrosine Kinase. Preprints 2019, 2019100113

Abstract

Prototype of a family of at least nine members, c-src tyrosine kinase is a therapeutically interesting target, because its inhibition might be of interest not only in a number of malignancies, but also in a diverse array of conditions, from neurodegenerative pathologies to certain viral infections. Computational methods in drug discovery are considerably cheaper than conventional methods and offer opportunities of screening very large numbers of compounds in conditions that would be simply impossible within the wet lab experimental settings. We have explored the use of global QSAR models and molecular ligand docking in the discovery of new c-src tyrosine kinase inhibitors. Using a data set of 1038 compounds from ChEMBL and 19 blocks of molecular descriptors, we have developed over 200 QSAR classification models, based on six machine learning algorithms and 17 feature selection methods. We have selected 49 with reasonably good performance (positive predictive value and balanced accuracy higher than 70% in nested cross validation) and the models were assembled by stacking with a simple majority vote and used for the virtual screening of over the “named” ZINC data set (over 100,000 compounds). 744 compounds were predicted by at least 50% of the QSAR models as active, 147 compounds were within the applicability domain and predicted by at least 75% of the models to be active. The latter 147 compounds were submitted to molecular ligand docking using Vina and Ledock, and a number of 90 were predicted to be active based on the binding energy. External data from CHEMBL and PUBCHEM confirmed that at least 7.83% (in the case of QSAR) or 6.67% (in the case of integrated QSAR and molecular docking) of the compounds are active on the c-src target.

Keywords

c-src-tyrosine kinase; QSAR; molecular descriptors; virtual screening; drug discovery; cancer; molecular docking

Subject

Computer Science and Mathematics, Mathematical and Computational Biology

Comments (1)

Comment 1
Received: 25 October 2019
Commenter: Robert Ancuceanu
Commenter's Conflict of Interests: Author
Comment: We have added keywords (in the initial version they were missing) and removed a few lines about y-scrambling which were redundant (they were repeated in different words in the following paragraph). We also added a title to the first table.
+ Respond to this comment

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 1
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.