Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Machine Learning based Survival Group Prediction in Glioblastoma

Version 1 : Received: 1 February 2022 / Approved: 3 February 2022 / Online: 3 February 2022 (12:00:23 CET)

How to cite: Kalya, M.; Kel, A.; Leha, A.; Altynbekova, K.; Wingender, E.; beissbarth, T. Machine Learning based Survival Group Prediction in Glioblastoma . Preprints 2022, 2022020051. https://doi.org/10.20944/preprints202202.0051.v1 Kalya, M.; Kel, A.; Leha, A.; Altynbekova, K.; Wingender, E.; beissbarth, T. Machine Learning based Survival Group Prediction in Glioblastoma . Preprints 2022, 2022020051. https://doi.org/10.20944/preprints202202.0051.v1

Abstract

Glioblastoma (GBM) is a very aggressive malignant brain tumor with the vast majority of patients surviving less than 12 months (Short-term survivors [STS]). Only around 2% of patients survive more than 36 months (Long-term survivors [LTS]). Studying these extreme survival groups might help in better understanding GBM biology. This work aims at exploring application of machine learning methods in predicting survival groups(STS, LTS). We used age and gene expression profiles belonging to 249 samples from publicly available datasets. 10 Machine learning methods have been implemented and compared for their performances. Hyperparameter tuned random forest model performed best with accuracy of 80% (AUC of 74% and F1_score of 85%). The performance of this model is validated on external test data of 16 samples. The model predicted the true survival group for 15 samples achieving an accuracy of 93.75%. This classification model is deployed as a web tool GlioSurvML. The top 1500 features which retained classification efficiency (Accuracy of 80%, AUC of 74%) were studied for enriched pathways and disease-causal biomarker associations using the HumanPSDTM database. We identified 199 genes as possible biomarkers of GBM and/or similar diseases (like Glioma, astrocytoma, and others). 57 of these genes are shown to be differentially expressed across survival groups and/or have impact on survival. This work demonstrates the application of machine learning methods in predicting survival groups of GBM.

Supplementary and Associated Material

Keywords

Glioblastoma; survival prediction; Machine Learning; biomarkers; HumanPSDTM; Long-term survivor

Subject

Computer Science and Mathematics, Mathematical and Computational Biology

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.