Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Exploring Pattern of Relapse in Pediatric Patients with Acute Lymphocytic Leukemia and Acute Myeloid Leukemia Undergoing Stem Cell Transplant Using Machine Learning Methods

Version 1 : Received: 15 May 2024 / Approved: 16 May 2024 / Online: 16 May 2024 (17:18:14 CEST)

How to cite: Shyr, D.; Zhang, B.; Saini, G.; Brewer, S. Exploring Pattern of Relapse in Pediatric Patients with Acute Lymphocytic Leukemia and Acute Myeloid Leukemia Undergoing Stem Cell Transplant Using Machine Learning Methods. Preprints 2024, 2024051116. https://doi.org/10.20944/preprints202405.1116.v1 Shyr, D.; Zhang, B.; Saini, G.; Brewer, S. Exploring Pattern of Relapse in Pediatric Patients with Acute Lymphocytic Leukemia and Acute Myeloid Leukemia Undergoing Stem Cell Transplant Using Machine Learning Methods. Preprints 2024, 2024051116. https://doi.org/10.20944/preprints202405.1116.v1

Abstract

In this report, we present a study demonstrating how machine learning methods can reveal the interaction of different risk factors of post-transplant leukemic relapse and obtain robust predictions even with a modest clinical dataset. Using a cohort of 63 pediatric patients with acute lymphocytic leukemia (ALL) and 46 patients with acute myeloid leukemia (AML) who underwent stem cell transplant at a single institution, we built predictive models of leukemic relapse with both pretransplant and posttransplant patient variables (specifically lineage-specific chimerism) using the random forest classifier. The random forest classifier revealed different important predictive factors between ALL and AML in our relapse models consistent with previous knowledge. Furthermore, it also distinguished donor CD34 chimerism as most impactful in relapse prediction compare to donor chimerism of other cellular lineages in our dataset for ALL but CD3 for AML. Our models greatly improved sensitivity and specificity at predicting relapses in cross validation compared to a reference model based on our own institutional incidence of relapse and inferential statistical principles. Local Interpretable Model-Agnostic Explanations, an interpretable machine learning tool, confirmed our Random Forest Classification result and provided an intuitive explanation of how our machine learning models made the relapse prediction for each individual patient.

Keywords

Leukemia; relapse; predictive model; random forest; machine learning

Subject

Medicine and Pharmacology, Hematology

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.