PreprintArticleVersion 1Preserved in Portico This version is not peer-reviewed
Exploring Pattern of Relapse in Pediatric Patients with Acute Lymphocytic Leukemia and Acute Myeloid Leukemia Undergoing Stem Cell Transplant Using Machine Learning Methods
Shyr, D.; Zhang, B.; Saini, G.; Brewer, S. Exploring Pattern of Relapse in Pediatric Patients with Acute Lymphocytic Leukemia and Acute Myeloid Leukemia Undergoing Stem Cell Transplant Using Machine Learning Methods. Preprints2024, 2024051116. https://doi.org/10.20944/preprints202405.1116.v1
APA Style
Shyr, D., Zhang, B., Saini, G., & Brewer, S. (2024). Exploring Pattern of Relapse in Pediatric Patients with Acute Lymphocytic Leukemia and Acute Myeloid Leukemia Undergoing Stem Cell Transplant Using Machine Learning Methods. Preprints. https://doi.org/10.20944/preprints202405.1116.v1
Chicago/Turabian Style
Shyr, D., Gopin Saini and Simon Brewer. 2024 "Exploring Pattern of Relapse in Pediatric Patients with Acute Lymphocytic Leukemia and Acute Myeloid Leukemia Undergoing Stem Cell Transplant Using Machine Learning Methods" Preprints. https://doi.org/10.20944/preprints202405.1116.v1
Abstract
In this report, we present a study demonstrating how machine learning methods can reveal the interaction of different risk factors of post-transplant leukemic relapse and obtain robust predictions even with a modest clinical dataset. Using a cohort of 63 pediatric patients with acute lymphocytic leukemia (ALL) and 46 patients with acute myeloid leukemia (AML) who underwent stem cell transplant at a single institution, we built predictive models of leukemic relapse with both pretransplant and posttransplant patient variables (specifically lineage-specific chimerism) using the random forest classifier. The random forest classifier revealed different important predictive factors between ALL and AML in our relapse models consistent with previous knowledge. Furthermore, it also distinguished donor CD34 chimerism as most impactful in relapse prediction compare to donor chimerism of other cellular lineages in our dataset for ALL but CD3 for AML. Our models greatly improved sensitivity and specificity at predicting relapses in cross validation compared to a reference model based on our own institutional incidence of relapse and inferential statistical principles. Local Interpretable Model-Agnostic Explanations, an interpretable machine learning tool, confirmed our Random Forest Classification result and provided an intuitive explanation of how our machine learning models made the relapse prediction for each individual patient.
Keywords
Leukemia; relapse; predictive model; random forest; machine learning
Subject
Medicine and Pharmacology, Hematology
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.