Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

A Simple Machine Learning-Based Quantitative Structure-Activity Relationship Model for Predicting pIC50 Inhibition Values of FLT3 Tyrosine Kinase

Version 1 : Received: 24 March 2024 / Approved: 24 March 2024 / Online: 25 March 2024 (08:47:40 CET)

How to cite: Alcázar, J.J.; Sánchez, I.; Merino, C.; Monasterio, B.; Sajuria, G.; Miranda, D.; Díaz, F.; Campodónico, P.R. A Simple Machine Learning-Based Quantitative Structure-Activity Relationship Model for Predicting pIC50 Inhibition Values of FLT3 Tyrosine Kinase. Preprints 2024, 2024031433. https://doi.org/10.20944/preprints202403.1433.v1 Alcázar, J.J.; Sánchez, I.; Merino, C.; Monasterio, B.; Sajuria, G.; Miranda, D.; Díaz, F.; Campodónico, P.R. A Simple Machine Learning-Based Quantitative Structure-Activity Relationship Model for Predicting pIC50 Inhibition Values of FLT3 Tyrosine Kinase. Preprints 2024, 2024031433. https://doi.org/10.20944/preprints202403.1433.v1

Abstract

In this study, a simple machine learning-based quantitative structure-activity relationship (QSAR) model was developed to predict the inhibitory potency (pIC50 values) of FLT3 tyrosine kinase inhibitors, pivotal in treating Acute Myeloid Leukemia (AML). Distinctively, our model leverages an extensive and diverse dataset, 14 times larger than those employed in prior studies within this field, enabling an unparalleled scope of compound analysis. This vast dataset, combined with further exploration of molecular descriptors, enabled predictions of extraordinary precision, covering a broader spectrum of FLT3 inhibitors than was previously possible. The Random Forest Regressor (RFR) algorithm, selected for its superior predictive performance, was trained with 1080 inputs and validated through comprehensive external and internal methods. It achieved an remarkable coefficient of determination (R^2) of 0.941 and a standard deviation of 0.235 on a test set of 270 compounds, highlighting the efficacy of model in predicting FLT3 inhibitory activity. Key molecular descriptors were identified, enhancing our understanding of structural requirements for inhibitor potency. Additionally, we developed a user-friendly computational tool that enables the rapid prediction of pIC50 values. Utilizing this tool, potential FLT3 inhibitors were identified through ligand-based virtual screening. This study represents a major advancement in FLT3 inhibitor discovery by utilizing a simple QSAR-machine learning model. It enables more efficient and precise identification of potential drug candidates at an early stage, promising a faster development of targeted therapies and streamlining the ligand-based drug design process.

Keywords

FLT3 Inhibitors; Ligand-based Drug Design; Computer-Aided Drug Design; QSAR Modeling; AML Treatment

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.