Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Forecasting the Spread of COVID-19 and ICU Requirements

Version 1 : Received: 16 March 2021 / Approved: 17 March 2021 / Online: 17 March 2021 (14:56:27 CET)

A peer-reviewed article of this Preprint also exists.

Podder, P.; Khamparia, A.; Mondal, M. R. H.; Rahman, M. A.; Bharati, S. Forecasting the Spread of COVID-19 and ICU Requirements. International Journal of Online and Biomedical Engineering (iJOE), 2021, 17, 81. https://doi.org/10.3991/ijoe.v17i05.20009. Podder, P.; Khamparia, A.; Mondal, M. R. H.; Rahman, M. A.; Bharati, S. Forecasting the Spread of COVID-19 and ICU Requirements. International Journal of Online and Biomedical Engineering (iJOE), 2021, 17, 81. https://doi.org/10.3991/ijoe.v17i05.20009.

Abstract

Since December 2019, the world is fighting against coronavirus disease (COVID-19). This disease is caused by a novel coronavirus termed as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). This work focuses on the applications of machine learning algorithms in the context of COVID-19. Firstly, regression analysis is performed to model the number of confirmed cases and death cases. Our experiments show that autoregressive integrated moving average (ARIMA) can reliably model the increase in the number of confirmed cases and can predict future cases. Secondly, a number of classifiers are used to predict whether a COVID-19 patient needs to be admitted to an intensive care unit (ICU) or semi-ICU. For this, classification algorithms are applied to a dataset having 5644 samples. Using this dataset, the most significant attributes are selected using features selection by ExtraTrees classifier, and Proteina C reativa (mg/dL) is found to be the highest-ranked feature. In our experiments, random forest, logistic regression, support vector machine, XGBoost, stacking and voting classifiers are applied to the top 10 selected attributes of the dataset. Results show that random forest and hard voting classifiers achieve the highest classification accuracy values near 98%, and the highest recall value of 98% in predicting the need for admission into ICU/semi ICU units.

Keywords

COVID-19; ICU; feature selection; classification; ARIMA model

Subject

Computer Science and Mathematics, Algebra and Number Theory

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.