Preprint
Article

This version is not peer-reviewed.

Personalized Assessment of Mortality Risk and Hospital Stay Duration in Hospitalized Patients with COVID-19 Treated with Remdesivir: A Machine Learning Approach

A peer-reviewed article of this preprint also exists.

Submitted:

22 February 2024

Posted:

24 February 2024

You are already at the latest version

Abstract
Despite widespread vaccination, early treatments, and improved understanding of the disease, the effects of SARS-CoV-2 infection remain significant worldwide. Many patients still suffer from severe COVID-19, necessitating admission to intensive care units. Remdesivir is a primary treatment option among viral RNA polymerase inhibitors for hospitalized SARS-CoV-2 patients. However, there is a lack of studies examining factors influencing its effectiveness in this context. We conducted a retrospective study throughout 2022, analyzing clinical, laboratory, and sociodemographic data from 252 hospitalized COVID-19 patients treated with remdesivir. Six machine learning algorithms were compared and validated to predict factors associated with the loss of clinical benefit from remdesivir in terms of mortality and hospital stay. Data were extracted from electronic health records. The eXtreme Gradient Boost (XGB) method achieved the highest balanced accuracy for both mortality (95.45%) and hospital stay (94.24%). Factors associated with worse outcomes of remdesivir use in terms of mortality included limitation of life support treatment, need for ventilatory support (especially invasive mechanical ventilation) on day 14 after the first dose of remdesivir, lymphopenia, low levels of albumin and hemoglobin, presence of flu and/or coinfection, and cough. Factors associated with worse outcomes of remdesivir use in terms of hospital stay included the number of doses of the COVID-19 vaccine, patchy lung density, bilateral pulmonary radiological status, number of comorbidities, oxygen therapy, troponin and lactate dehydrogenase levels, and asthenia. These findings highlight the effectiveness of XGB as a strong candidate for accurately categorizing COVID-19 patients undergoing remdesivir treatment.
Keywords: 
;  ;  ;  ;  ;  ;  

1. Introduction

The Coronavirus Disease 2019 (COVID-19) first emerged in late December 2019 in Wuhan, China. The genetic sequencing of the virus revealed it to be a novel beta-coronavirus, named Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) [1]. As of December 13, 2023, the World Health Organization (WHO) reports that the global impact of the COVID-19 pandemic has resulted in nearly 7 million deaths [2]. The Omicron variant displaced the Delta variant at the end of 2021. The actual variant is less severe clinically than earlier SARS-CoV-2 variants, likely due to widespread global vaccination [3]. Different treatments have also helped lower hospitalization and death rates [4,5].
COVID-19 is a highly contagious disease with significant risks of morbidity and death. It can cause bilateral pneumonia, severe respiratory failure necessitating invasive mechanical ventilation (IMV) and/or damage to multiple organs, sometimes leading to tragic outcomes [6].
SARS-CoV-2 affects various organs, as seen in autopsy findings (such as the kidneys, heart, intestines, liver, and brain). However, it has a specific tendency to infect the respiratory system [7]. Most individuals infected will experience symptoms such as cough, fever, fatigue, and muscle pain. However, a small percentage will progress to develop a hyperinflammatory state with pulmonary edema and cellular infiltration, leading to acute respiratory distress syndrome (ARDS) [8]. Additionally, it may lead to complications like kidney issues, coagulation disorders, or shock, which can result in mortality rates exceeding 30% [9].
The virus enters cells using structures like angiotensin-converting enzyme 2 and transmembrane serine protease 2, which are present in respiratory tract, cornea, and gastrointestinal cells [10].
In COVID-19, factors contributing to the severity and progression of the infection include advanced age, various health conditions, and changes in lab results. Elevated levels of certain markers like C-reactive protein (CRP), lactate dehydrogenase (LDH), ferritin, procalcitonin, and proinflammatory cytokines such as interleukin (IL)-6, IL-2, IL-1β, TNF-α, and G-CSF have been identified as significant indicators [11]. Furthermore, the neutrophil-to-lymphocyte ratio is often elevated in patients with severe COVID-19 [12].
Presently, the broad vaccination of the population stands out as the most effective public health measure in the ongoing battle against SARS-CoV-2.
Though only a few drugs prove effective in treating severe cases of COVID-19, early diagnosis and prompt initiation of therapy, complemented by nutritional and organ support, can markedly enhance outcomes.
Remdesivir is a bioactive molecule that has demonstrated in vitro activity against SARS-CoV-2, along with prophylactic and therapeutic efficacy in non-clinical models of other viruses, such as SARS-CoV, MERS-CoV, and Ebola [12]. In October 2020, it became the inaugural drug authorized by the Food and Drug Administration (FDA) for the treatment of COVID-19. Based on various clinical trials, its use was approved for adults and pediatric patients aged 12 and older, weighing at least 40 kilograms, for the treatment of COVID-19 requiring hospitalization [13,14,15].
The preliminary findings from the WHO-funded Solidarity trial revealed that lopinavir, hydroxychloroquine, interferon (IFN)-β1a, and remdesivir had little to no effect on hospitalized COVID-19 patients, as indicated by overall mortality, initiation of ventilation, and hospital stay duration [16]. The final results highlight the ineffectiveness of the drugs under investigation, except for remdesivir. While it doesn't significantly impact COVID-19 patients already on ventilation, it does show a modest effect on mortality or progression to ventilation (or both) in other hospitalized patients [17].
Currently, little is known about the predictive factors linked to poor outcomes in hospitalized COVID-19 patients treated with antivirals in general and specifically with remdesivir in clinical practice.
The goal of our study is to use machine learning (ML) models to categorize hospitalized COVID-19 patients undergoing remdesivir treatment based on the risk of mortality and/or hospital stay duration. ML, a subset of artificial intelligence (AI), employs statistical and mathematical algorithms to extract patterns from the data, aiding in making complex decisions [18]. Unlike classical statistical models created for inferences about variable relationships, these models are designed to make accurate predictions using data from a multitude of variables.
AI tools have been implemented in various areas to combat COVID-19, including drug and vaccine discovery or repurposing [19,20].
To our knowledge, this is the first study to develop, compare and validate six supervised ML models predicting factors associated with a high risk of mortality and/or hospital stay duration in patients with SARS-CoV-2 infection undergoing treatment with remdesivir.

2. Materials and Methods

2.1. Data Source

Patient information was systematically gathered from various internal hospital channels, utilizing two primary sources: (1) the electronic medical records (EMR) system, equipped with specialized modules for documenting clinical analysis results, radiological imaging findings, and electronic medical prescriptions, and (2) the intensive care unit (ICU) electronic prescription program. This comprehensive approach facilitated the synthesis of extensive data, enabling the methodical development of a personalized data collection questionnaire (DCQ) for each individual patient.

2.2. Study Design and Population

A retrospective observational study was conducted at a high-complexity tertiary hospital. Initially, 285 patients began remdesivir treatment, but 33 were excluded for not reaching the minimum required dose. Exclusions were based on various criteria, including a glomerular filtration rate < 30 ml/min, mortality or hospital discharge, clinical decision, symptom onset beyond 7 days of starting remdesivir, and a negative diagnostic test. Ultimately, the study focused on 252 patients (58.3% male) admitted with microbiologically confirmed SARS-CoV-2 between January 1 and December 31, 2022, using reverse transcription-polymerase chain reaction (RT-PCR) from nasopharyngeal swabs.
In this study, inclusion criteria involved patients ≥ 12 years old and weighing ≥ 40 kg, admitted to the hospital with a COVID-19 diagnosis. Selected patients met specific requirements for remdesivir administration according to the hospital's internal protocol at the study's outset. These criteria included symptom onset ≤ 7 days before the first remdesivir dose, no need for oxygen therapy or low-flow oxygen therapy, and meeting at least two of the following three conditions: 1) respiratory rate (RR) ≥ 24 breaths per minute; 2) baseline oxygen saturation (SpO2) < 94% in ambient air; and 3) PAFI index (PaO2/FIO2) < 300 mmHg. Patients were required to have received a minimum of three doses of the drug. The approved remdesivir regimen consisted of an initial 200 mg loading dose administered intravenously over 30-120 minutes in 100-250 ml of sterile, pyrogen-free 0.9% sodium chloride solution, followed by maintenance doses of 100 mg for a duration ranging from 5 to 10 days, depending on the patient's level of immunosuppression. Participants provided informed consent before participating in the study, and it was approved by the Ethics Committee of the General University Hospital of Valencia.

2.3. Study Data

The DCQ included data on demographic, clinical, and laboratory variables, organized into 8 sections:
1. Patient profile: This section covered demographic factors like age and sex, as well as clinical aspects including:
a) Weight and height or body mass index.
b) Presence of comorbidities: such as smoking, obesity, hypertension, diabetes mellitus (DM), chronic obstructive pulmonary disease (COPD), asthma, and other chronic respiratory conditions. Details on other serious underlying conditions were provided in an open-text section. The number of comorbidities was categorized (1, 2, 3, and > 3).
c) Pre-admission pharmacological treatment: This included the use of angiotensin-converting enzyme inhibitors/angiotensin receptor blockers, non-steroidal anti-inflammatory drugs, antihistamines, and/or montelukast, as well as information on the type of COVID-19 vaccine received and the number of doses administered.
d) COVID-19 Symptoms: A detailed list of symptoms was recorded, including fever, cough, dyspnea, fatigue, loss of taste or smell, headache, myalgia, sore throat, nasal congestion, rhinorrhea, conjunctivitis, rash, nausea, vomiting, and diarrhea.
2. Initial hospitalization details:
-
Date of admission to the emergency department.
-
Date of admission to the hospital.
-
Date of symptom onset.
-
Date of microbiological confirmation of SARS-CoV-2 infection.
-
Any life support limitations and date implemented.
-
Whether the patient needed ICU admission.
3. ICU admission details:
-
ICU admission date.
-
Mortality risk assessed by the CURB-65 scale.
-
Level of consciousness evaluated using the Glasgow Coma Scale.
-
Other clinical variables in the first 24 hours: fever (≥ 38ºC), RR > 24 breaths/minute, systolic blood pressure < 90 mmHg, SpO2, and number of lung quadrants affected in imaging tests (ranging from 1 to 4).
-
Severity of illness assessed using the APACHE II scale within the first 24 hours of admission
-
Patient's condition evaluated using the SOFA scale during their ICU stay.
4. Analytical and radiological data overview:
This section covers laboratory tests done just after hospital admission (in the emergency department or upon admission) before starting remdesivir treatment, as well as those performed after completing the remdesivir treatment.
The included laboratory parameters were: PAFI index; FIO2; SpO2; leukocytes; neutrophils; lymphocytes; monocytes: platelets; mean corpuscular volume; hemoglobin; erythrocyte sedimentation rate; CRP; aspartate aminotransferase; alanine aminotransferase; LDH; blood urea nitrogen; serum creatinine; albumin and/or total proteins; total cholesterol; procalcitonin; lactic acid, bicarbonate and pH; creatine phosphokinase and/or troponin; D-dimer and ferritin.
Additionally, the determination of cycle threshold (Ct) values from the virus RT-PCR at the start of treatment (or the closest available) is included. Ct is a semi-quantitative value inversely related to the amount of RNA in the sample. The SARS-CoV-2 antigenic variant is also identified.
A classification of pulmonary radiological status was included:
-
Affected side (bilateral or unilateral)
-
Type of lung injury (ground-glass opacity, consolidation, or mixed)
-
Density pattern (patchy, confluent, or mixed)
5. Pharmacological treatment during hospitalization: The medications considered included drugs that modulate inflammation and the immune system, such as IL-6 receptor antagonists (e.g., tocilizumab and sarilumab), IL-1 receptor antagonist (anakinra), Janus kinase inhibitors (e.g., baricitinib and tofacitinib), or Bruton's tyrosine kinase inhibitors (e.g., ibrutinib and acalabrutinib). Additionally, other medications like immunosuppressants (e.g., corticosteroids, cyclosporine, and tacrolimus) or immunoglobulins were considered. Antibiotics, vasopressors, and low molecular weight heparin at prophylactic doses were also part of the treatment plan. For each medication, the dosage and duration of treatment were recorded.
The patient's initial condition upon starting remdesivir therapy was categorized into two groups: a) patients not needing extra oxygen and b) patients requiring low-flow oxygen. The patient's clinical status on days 7 and 14 after the first dose of remdesivir was documented as follows: a) discharged from the hospital and resumed normal activities; b) discharged from the hospital but with difficulties resuming normal activities; c) hospitalized without needing extra oxygen; d) hospitalized needing extra oxygen but not IMV; e) hospitalized needing IMV; and f) deceased.
6. Microbiological testing:
We considered the isolated microorganism in every case. Tests included tracheal aspirates, blood cultures, detection of influenza and/or coinfection, as well as tests for pneumococcal and legionella antigens in urine.
7. Medical procedures during hospitalization:
The following procedures were included:
-
Oxygen therapy
-
Non-invasive ventilation (NIV)
-
IMV
-
Prone ventilation
-
Hemodialysis/hemofiltration
-
Extracorporeal membrane oxygenation
8. Patient final outcome:
Complications during hospitalization were noted, including ARDS, sepsis, septic shock, nosocomial pneumonia (non-COVID-19), other nosocomial infections (non-COVID-19, non-pneumonia), acute renal failure, and acute hepatic failure.
The clinical benefit consisted of symptom improvement (fever, cough, etc.), along with improved radiological findings and/or a PAFI index ≥ 300 mmHg or SpO2 > 93 without oxygen support within the first 5, 14, or 28 days, depending on the length of hospitalization.
Time taken to clear SARS-CoV-2 was considered, and outcomes were divided into hospital discharge or mortality. Collected data included length of hospital stay and date of ICU discharge. Any rehospitalizations within 7 days post-discharge were also documented.
To evaluate remdesivir effectiveness, information on overall in-hospital mortality and duration of hospitalization was recorded. Length of hospital stay was defined as the period from admission to death or discharge.

2.4. Method

2.4.1. Model Development

In this study, the eXtreme Gradient Boost (XGB) method was used as the reference algorithm, thanks to its notable features, such as fast execution, scalability, and high processing capability through parallel computing [21,22]. XGB consistently outperforms other algorithms in accurately solving various data science problems [22,23,24]. Additionally, a comparative analysis was conducted with other supervised ML systems.
Considering a dataset S=xj, yj, the XGB model was formulated using the following:
y j ^ = p = 1 P t p x j
where xj represents the input vector with m time variables, y j ^ denotes the predicted output, yj shows the output, tp represents a tree with leaf weight wp and structure up, j = 1; 2; ...; n, and P corresponds to the number of trees.
The formulated objective function for the proposed method is expressed in Equation (2). Employing a second-order Taylor expansion is integral to improving prediction accuracy in approximating the XGB objective function [21].
R = j r y j ^ , y j + p Ψ t p
In Equation (3), fp represents the number of leaves on the tree. The R () function penalizes the complexity of the method. The learning rate is denoted by λ and wp is the vector of leaf scores. To control the complexity weight of the system, a parameter γ is employed. The aim is to optimize Equation (2) [22].
Ψ t p = λ f p + 1 2 γ ω p 2
In this study, we tested several ML algorithms to evaluate the performance of our proposed method. We chose the top-performing five algorithms from those widely recognized in the scientific community. These include Decision Trees (DT) [25], Gaussian Naive Bayes (GNB) [26], Bayesian Linear Discriminant Analysis (BLDA) [27], K-Nearest Neighbors (KNN) [28], and Support Vector Machines (SVM) [29]. We built the models using the MatLab Statistical and Machine Learning Toolbox (MatLab 2022a; The MathWorks, Natick, Massachusetts, USA). The dataset was split into two parts, with 70% used for training and the remaining 30% for testing, ensuring that patient information was not shared between the sets. To validate the results and prevent overfitting, we conducted 5-fold cross-validation.
Optimizing the ML algorithms involves adjusting various hyperparameters during the training phase. Bayesian techniques were employed in this study to determine optimal hyperparameter values. This optimization method significantly improves the outcomes of the developed methods.
Throughout all simulations, 100 iterations were executed to derive mean and standard deviation values in a uniformly random manner. This systematic approach mitigates the impact of noise, facilitating the calculation of relevant values and ensuring the attainment of statistically valid results [30]. The procedural phases employed in this study are delineated in Figure 1. Initially, subjects for study were selected, followed by the implementation of the database, and subsequent training and validation of ML methods.

2.4.2. Performance Evaluation

In this study, various methods were compared using the following metrics: specificity, precision (positive predictive value), recall (sensitivity), balanced accuracy, degenerate Youden index (DYI), F1-score, Matthew’s correlation coefficient (MCC), Cohen’s Kappa index (CKI), receiver operating characteristic (ROC), and area under the curve (AUC) [30]. The F1 score is defined as:
Preprints 99643 i001
MCC was additionally employed to evaluate the performance of the ML methods, and it is defined as:
Preprints 99643 i002
where TP denotes the number of true positives, TN represents the number of true negatives, FP is the number of false positives, and FN corresponds to the number of false negatives. CKI was used to assess the overall performance of the system [31].

3. Results

In this section, we discuss the results obtained from patient records used for training and validation to identify predictors of increased in-hospital mortality and hospital stay in COVID-19 patients treated with remdesivir. We compare the performance of our proposed system with various supervised ML classification methods widely accepted in the scientific community.
Table 1 presents the performance outcomes regarding mortality of various classification methods, including DT, GNB, BLDA, KNN, SVM, and our novel XGB system. It is noteworthy that GNB and BLDA-based approaches demonstrate a relatively lower balanced accuracy, falling short of the 82% benchmark. Conversely, DT and SVM techniques exhibit superior classification prowess, nearing a balanced accuracy of 90%, surpassing the effectiveness of GNB and BLDA. In contrast, the KNN method is the one that achieves a result closest to the proposed XGB method resulting in improved predictive capabilities. Particularly, the XGB system achieves an outstanding score surpassing 95%, showcasing remarkable performance in classification tasks.
Table 2 presents the performance results for the hospital stay variable using the same classification methods. As can be observed, the outcomes are similar to those of the mortality variable, with XGB achieving a balanced accuracy exceeding 94%.
KNN and DT stand out as the algorithms that come closest to XGB in terms of precision and recall values, surpassing SVM and notably outperforming BLDA and GNB in results. Moreover, this pattern is reflected in Table 1 and Table 2 when considering the F1 score parameter, where XGB achieves high values, serving as a clear indicator of improved classification performance.
To assess the performance of the proposed XGB system in classifying COVID-19 patients treated with remdesivir with a higher risk of mortality and/or extended hospital stay, we calculated several widely used parameters from the literature, including AUC, MCC, DYI, and the kappa index. Among these, MCC stands out as one of the most reliable statistical indices, as it yields a high score only when predictions have been accurate across all four categories of the confusion matrix.
The results of these four categories (true positives, false negatives, true negatives, and false positives) are directly influenced by the size of positive and negative elements within the dataset. As shown in Table 1 and Table 2, the XGB method achieved MCC values of 84.70% and 83.62%, respectively, clearly surpassing the values obtained by other methods.
In the same vein, when analyzing the kappa index, XGB reached nearly 85% for the final variable of mortality, marking a significant advancement over KNN and DT by 5.4% and 8.2%, respectively. As for the final variable of hospital stay, XGB approached about 84%. This trend persists when considering the AUC and DYI parameters, where XGB surpasses the rest of the methods with higher values. These findings emphasize the superior ability of XGB to accurately categorize COVID-19 patients treated with remdesivir, in terms of mortality and/or hospital stay.
Figure 2 and Figure 3 provide a comprehensive analysis, comparing the performance of the XGB method with alternative classifiers across a variety of essential metrics, for both mortality and hospital stay, respectively. These metrics include balanced accuracy, recall, specificity, precision, F1 score, CKI, MCC, AUC, and DYI.
Furthermore, the ROC curve is a crucial tool for assessing and comparing how effectively the proposed system classifies compared to other machine learning methods. It is constructed by plotting sensitivity against specificity across various threshold values. Figure 4 depict the outcomes of different classification systems, aligning with the primary goal of categorizing patients in the study.
In particular, the XGB method exhibits a significantly larger area under the ROC curve, indicating its enhanced ability to accurately classify the two distinct classes, both in terms of mortality and hospital stay. This is further supported by the specific numerical values provided in Table 1 and Table 2.
For better clarity, we've arranged all metrics for each dataset—both training and validation—and represented them in a radar plot (Figure 5 and Figure 6). In an ideal situation, where the model performs exceptionally well in all metrics, the plot would create a circle covering the entire grid. In our study, the training sets consistently show higher scores across all metrics, whereas the validation sets generally display lower scores.
The radar plots give us a quick look at the model's performance. A bigger circle on the validation set suggests a better predictive method. Figure 5 shows that our suggested XGB system is a good example of a well-balanced model. Importantly, both the training and validation sets have similar radar plots, indicating there's neither overfitting nor underfitting. This boosts the model's ability to work well with new inputs, delivering accurate results effectively.
On the flip side, the GNB method consistently comes in as the least effective performer in all metrics. Based on these findings, we can confidently state that our proposed XGB system excels at categorizing patients in line with the study's goals. It provides high accuracy and automation, making it a valuable tool for clinical practice.
With the proposed XGB method, the predictive factors linked to a poorer outcome in COVID-19 patients treated with remdesivir, in terms of mortality, include: limitation of life support treatment, need for ventilatory support (especially IMV) on day 14 after the first dose of remdesivir, lymphopenia, low levels of albumin and hemoglobin, presence of flu and/or coinfection, and cough. Factors associated with a worse outcome of remdesivir use in terms of hospital stay include: number of doses of the COVID-19 vaccine, patchy lung density, bilateral pulmonary radiological status, number of comorbidities, oxygen therapy, troponin and LDH levels, and asthenia. Figure 7 depicts a bar graph illustrating the weights of predictive variables that significantly enhance the classification accuracy of various ML methods.
The main baseline clinical data of the 252 patients included in the study are presented in Table 3.

4. Discussion

Among the approved medications for COVID-19, remdesivir stands out as the preferred antiviral treatment for hospitalized patients infected with SARS-CoV-2. Additionally, other drugs like tocilizumab and baricitinib show promise, particularly for severe cases, including those requiring IMV [32]. While remdesivir can reduce viral levels and shorten symptom duration, identifying the patients who benefit most from it remains uncertain. This knowledge is crucial for minimizing unnecessary side effects and costs, while optimizing resource use [33].
The European Medicines Agency (EMA) initially granted conditional approval for marketing across the EU on July 3, 2020, which later transitioned to full marketing authorization on August 8, 2022. This medication is approved for use in adults and children as young as 4 weeks old, weighing at least 3 kg, who have pneumonia and need supplemental oxygen (either low- or high-flow oxygen or other non-invasive ventilation at the start of treatment). Additionally, it can be used in adults and children weighing at least 40 kg who do not need supplemental oxygen but are at higher risk of developing severe COVID-19 [34].
On the other hand, AI has been employed to identify genomic sequences of SARS-CoV-2, including antigenic variants, as well as to develop drugs and vaccines for COVID-19 [35]. It has also been used to discover drug combinations against COVID-19 [36].
Drug repurposing trials have aimed at finding potential treatments for COVID-19, including antiviral therapies, anti-inflammatory drugs, antithrombotic agents, and immunomodulators [37]. In a study by Basit SA et al., a deep learning model was employed to predict the effectiveness of different medications, identifying remdesivir as highly effective against COVID-19 with a 95% positive score [38].
Understanding the physiopathology of COVID-19 can help scientists develop effective antiviral drugs by uncovering unknown viral pathways and structures. With the advancements in AI and ML, it's reasonable to use these methods to explore new candidates. Various studies, like the one by Imtiaz F and Pasha MK, have focused on examining the structure of the RNA-dependent RNA polymerase (RdRp) using ML techniques. RdRp is crucial for virus replication and holds potential as a promising target for COVID-19 treatment [39]. Remdesivir, which transforms into an analog of adenosine triphosphate during intracellular metabolism, works mainly by integrating into the developing RNA chain by the RdRp. This disrupts viral replication, a key aspect of its antiviral activity [40]. Monitoring the impact of emerging mutations on viral replication and response to antiviral drugs is essential. Remdesivir shows stability as an RdRp inhibitor compared to other antivirals in the presence of mutations at this level of viral replication [41].
The WHO's Solidarity trial was the first major study to show the limited clinical effectiveness, in terms of mortality, of three repurposed antivirals in hospitalized COVID-19 patients: lopinavir, hydroxychloroquine, and interferon (IFN)-β1a. The remdesivir arm continued, with 4,146 patients receiving remdesivir and 4,129 assigned to the control group. It was observed that remdesivir does not have a significant effect on COVID-19 patients already on ventilation. Among other hospitalized patients, it has a minor impact on mortality or progression to ventilation [42]. Remdesivir may be beneficial in the clinical course for both hospitalized and non-hospitalized patients, but certainty remains limited [43]. There is evidence supporting the clinical benefit of a 5-day regimen of remdesivir in patients with moderate COVID-19 infection (lung infiltrates and SpO2 in ambient air > 94%)[44]. For patients with severe COVID-19 (SpO2 ≤ 94% while breathing ambient air and radiological evidence of pneumonia) who do not require mechanical ventilation, there doesn't seem to be a significant difference between a 5-day and a 10-day course of remdesivir [45]. Remdesivir significantly reduces hospitalization days and lowers inflammatory markers compared to standard treatment in patients with moderate to severe COVID-19 [46]. Compared to standard care, remdesivir quickly improves low oxygen levels (reducing the need for ventilatory support) and reduces inflammation (lowering IL-6 levels), leading to a better course of moderate to severe COVID-19 [47]. Patients treated with remdesivir spend less time in the ICU and have better survival rates [48].
Recent meta-analyses confirm that the use of remdesivir can help reduce mortality in COVID-19 patients and shorten the time to clinical improvement [32,49]. Observational studies have demonstrated benefits in hospital mortality with remdesivir therapy [50,51].
Remdesivir appears to lower mortality rates in hospitalized COVID-19 patients who don't require oxygen support or only need standard oxygen therapy. However, it doesn't seem to help patients on mechanical ventilation [52]. A recent meta-analysis by Huang C et al. found that hospitalized adult COVID-19 patients who didn't need extra oxygen or only required low-flow oxygen and were treated with remdesivir had a lower risk of death. However, those needing high-flow oxygen or IMV didn't see the same benefit [53]. Remdesivir also seems to speed up recovery, reduce complications, and might slightly decrease the need for ventilation [32,54,55]. Additionally, a 5-day treatment course appears to provide more benefits with fewer side effects and lower costs for non-ventilated patients compared to a 10-day course [54].
Remdesivir is safe to use, but when combined with corticosteroids, it doesn't seem to offer extra clinical benefits [49]. However, when paired with baricitinib, it's not only safe but also seems to be more effective than using remdesivir alone. This combination can reduce recovery time and speed up clinical improvement in COVID-19 patients, particularly those needing high-flow oxygen or NIV [56].
This study leads the way in developing, comparing, and evaluating six supervised ML methods to predict factors that reduce the effectiveness of remdesivir in hospitalized patients with SARS-CoV-2 pneumonia. We collected data on 133 demographic, clinical, and laboratory variables. Among the ML algorithms tested, XGB stood out as the best performer, achieving the highest balanced accuracy rates for predicting mortality (95.4%) and hospital stay duration (94.2%).
The invasion of respiratory tract cells by SARS-CoV-2 causes cellular damage and triggers a local immune response, leading to the release of proinflammatory substances like IFNγ, IL-1β, IL-6, and TNF-α [57]. IL-6 plays a crucial role in transitioning from mild inflammation to severe hyperinflammatory states, such as cytokine release syndrome (CRS) and ARDS, which can result in high mortality for severely ill COVID-19 patients. ARDS develops due to an unbalanced immune response, intense inflammation, and disruption of blood clotting, with CRS being central to this intricate relationship [58]. This syndrome is the leading cause of death among these patients, with mortality rates exceeding 70% [8]. Therefore, treatment strategies for hospitalized COVID-19 patients mainly focus on regulating the immune response and achieving antiviral, antithrombotic, or anticoagulant effects [8].
In our investigation, we observed a hospital mortality rate of 13.5%, with 3.9% of all patients requiring IMV. These findings align with those of other studies [42,52,59]. The median duration of hospitalization following the administration of remdesivir was 8 days (Interquartile Range, IQR, 5–12).
Our analysis involved testing various ML classifiers, among which the XGB method stood out as the most precise in identifying patients at higher risk of mortality and/or prolonged hospital stay. After thorough examination, we compared the XGB model with several other supervised ML methods commonly found in existing literature, such as BLDA, GNB, DT, KNN, and SVM. It's important to note that in biomedical scenarios, current ML classification techniques consistently outperform unsupervised methods, achieving higher average accuracy rates for both regression and classification tasks [60]. In our study, BLDA and GNB performed the poorest among the methods examined, while KNN's performance closely matched that of XGB. These findings align with previous research on the predictive capabilities of supervised ML algorithms for COVID-19 mortality and hospitalization durations [61,62].
In our study, we use a radar graph to evaluate how well ML models perform during both training and testing. The results clearly show that the XGB model excels, particularly in handling large datasets without overfitting. It outperforms other methods by achieving higher precision, recall, and overall accuracy. The reliable performance of the XGB model makes it incredibly valuable, especially in biomedical applications like predicting cancer stages for patients [63].
In our cohort, 58.3% were male, with a median age of 77 (IQR 66.7–85.2). We found several factors linked to negative outcomes after receiving remdesivir, notably regarding mortality and hospital stay. For mortality, adverse outcomes were associated with life support limitations, the need for ventilatory support (particularly IMV) 14 days after the initial remdesivir dose, lymphopenia, low albumin and hemoglobin levels, flu and/or coinfection, and cough. When it comes to hospital stay, factors associated with a worse outcome with remdesivir use included the COVID-19 vaccine doses, patchy lung density, bilateral pulmonary radiological status, comorbidity count, need for oxygen therapy, high levels of troponin and LDH, and presence of asthenia.
These identified factors provide valuable insights into the potential determinants of adverse outcomes associated with remdesivir use. In the realm of research on factors influencing the response to remdesivir in hospitalized patients with COVID-19, few studies have been conducted [64,65,66].
Previous studies have identified factors that predict the severity and mortality of COVID-19 patients, but these were conducted before standard treatment with remdesivir was introduced. ML techniques have confirmed that demographic factors (like age), clinical factors (such as comorbidities or symptoms), and analytical factors are associated with the severity, mortality, and length of hospital stay of COVID-19 patients, regardless of the specific treatment used [67,68]. Adamidi et al.'s systematic review also found predictors of disease progression and mortality using ML techniques, similar to our study. They emphasized that age, PCR and LDH levels, lymphopenia, and chest X-ray and CT scan findings are commonly linked to adverse outcomes in COVID-19 patients [62]. Additionally, other studies have highlighted the effectiveness of the XGB method in predicting adverse outcomes in hospitalized COVID-19 patients [69].
In Choi YJ et al.'s study, multivariate analysis confirmed that a high National Early Warning Score (NEWS) and Charlson Comorbidity Index (CCI) at admission, along with dyspnea, were independent risk factors for 30-day mortality in COVID-19 pneumonia patients treated with remdesivir and dexamethasone [64]. NEWS is based on a scoring system that combines six key physiological measurements (respiratory rate, SpO2, temperature, systolic blood pressure, heart rate, and level of consciousness) upon hospital admission or monitoring. This tool is highly sensitive and specific in predicting early mortality in prehospital and emergency department settings [70]. It emphasizes that COVID-19 admissions carry a significantly higher mortality risk compared to non-COVID-19 admissions, highlighting the elevated baseline mortality risk associated with COVID-19 [71].
In our study, factors indicating a higher risk of mortality, similar to NEWS and shortness of breath, included the limitation of life support treatment and the need for ventilatory support, especially IMV, 14 days after the initial dose of remdesivir. Regarding hospital stay, a similar factor was the requirement for oxygen therapy. These respiratory parameters and symptoms indicate respiratory failure and suggest a more severe decline in hospitalized COVID-19 patients.
Several studies confirm that the main clinical symptoms in patients admitted with COVID-19 are shortness of breath, cough, and fever [70]. In our study, cough was associated with a higher risk of mortality, while asthenia was linked to prolonged hospital stay.
In the study conducted by Choi YJ et al., similar to our findings, the majority of hospitalized cases of COVID-19 pneumonia occurred in adults aged ≥ 65 years; thus, there were no significant differences in the prognosis based on age. Additionally, within the laboratory parameters, low lymphocyte count, high levels of CRP, and elevated LDH indicated an unfavorable prognosis [64]. In our case, lymphopenia was one of the factors strongly associated with mortality in patients treated with remdesivir, along with low levels of albumin and hemoglobin. Lymphopenia signals an imbalanced T-cell response and weakened adaptive immunity. SARS-CoV-2 infection predominantly affects T lymphocytes, specifically CD4+ and CD8+ T cells, leading to reduced counts. Lymphopenia is more frequently observed in severe cases [72]. Georgakopoulou VE et al., confirmed that low levels of albumin and the C-reactive protein to albumin ratio were predictors of mortality, similar to our findings [66]. For hospital stay, elevated troponin and LDH levels were the most relevant predictors, along with observing bilateral lung involvement and patchy lung density in imaging tests. In the study by Terkes V et al., advanced age, elevated CRP, and the Computed Tomography (CT) score were identified as significant predictors of disease outcome [73]. However, the intense inflammatory response triggered by the infection can lead to alterations in hemostasis and coagulation parameters [74]. In our research, platelet count was not a significant predictor to influence the sought-after final outcome.
On the other hand, comorbidities themselves result from inflammation and can induce a proinflammatory state. The CCI is a simple, easy-to-apply, and valid method for classifying comorbidities and estimating mortality from COVID-19 [75].
Recent studies have taken into account the pre-existing health conditions (comorbidities) of patients infected with SARS-CoV-2 and their association with the progression of the disease in terms of mortality and hospital stay [65].
Aging and pre-existing health conditions can create a state of meta-inflammation, amplifying inflammation in COVID-19 and increasing the risk of mortality. Several studies demonstrate a correlation in patients affected by COVID-19 between age, number of comorbidities and certain laboratory markers [76]. There appears to be a positive link between inflammation biomarkers such as CRP, ferritin, and LDH, and the number of comorbidities in COVID-19 patients. The same pattern is observed in hematological parameters like the neutrophil-to-lymphocyte ratio. Similar to these studies, in our research, the number of comorbidities were associated with worse outcome for the patients and extended hospital stay.
Chronic kidney disease, the incidence of acute kidney injury, and atrial fibrillation have been shown to be comorbidities associated with reduced survival in patients hospitalized for COVID-19 [77]. Other studies confirm that lymphopenia, often observed in cancer patients, is associated with a higher risk of mortality [78]. Hematological disorders are also seen as additional health conditions that can negatively impact the effectiveness of antiviral treatments such as remdesivir [79]. In other studies, hypertension and type 2 diabetes, linked with obesity as metabolic syndrome, are considered significant risk factors for adverse outcomes [80]. In our study, the presence of influenza and/or co-infection constituted the comorbidity associated with higher mortality.
In the recent meta-analysis by Amstutz A et al., neither age, comorbidities, nor the use of corticosteroids had an impact on the effectiveness of remdesivir in terms of mortality [52]. Similarly, the need for increased respiratory support in patients has shown limited effectiveness of remdesivir in reducing mortality, aligning with our own research outcomes [17].
In our investigation, 22.2% of patients developed ARDS according to the latest definition [81]. ARDS is a clinical syndrome of acute hypoxemic respiratory failure due to lung inflammation, not caused by cardiogenic pulmonary edema. Various studies using these techniques have found that ARDS is associated with fatal outcomes in COVID-19 patients making ventilatory support essential, including IMV. [62,68]. In our study, both IMV and oxygen therapy were predictors of poor prognosis.
In contrast to Choi YJ et al.'s study, in ours, a lower number of COVID-19 vaccine doses was linked to clinical deterioration and a longer hospital stay [64]. The study by Georgakopoulou VE et al. confirmed that, regardless of vaccination status, pre-existing comorbidities, age, and gender, patients with a combination of biomarkers indicating acute inflammatory response, cell death, and hypercoagulability – specifically, CRP, LDH, and fibrinogen – reflected the severity of COVID-19 [82]. More recently, Mikulska M et al. demonstrated that receiving fewer doses of the COVID-19 vaccine was a predictive factor for treatment failure with antivirals such as remdesivir [79]. Several studies consistently show that individuals who are not vaccinated or receive fewer vaccine doses are more likely to experience negative outcomes, including the need for mechanical ventilation or death during hospitalization [83]. This aligns with our findings.
In the study by Shimizu H et al., the time intervals between symptom onset, diagnosis, and antiviral treatment were significant predictors of moderate illness [84]. In our study, following the applied protocol, remdesivir treatment was supposed to commence upon microbiological confirmation of SARS-CoV-2 infection and within 7 days of clinical symptom onset.
None of the drug treatments, including early antibiotic use, had a significant effect on the final outcome, similar to findings in other studies [64].
Most current studies using ML techniques confirm that respiratory parameters like SpO2 and the need for invasive ventilatory support are considered the most important predictors for mortality in hospitalized COVID-19 patients treated with remdesivir. While hypertension and worsening renal function are also considered mortality predictors in these studies, they didn't hold enough significance in our research [85,86]. Kuno T et al. developed a predictive model for in-hospital mortality using ML methods in COVID-19 patients treated with steroids and remdesivir. Variables associated with mortality included age, hypertension, SpO2, blood urea nitrogen, ICU admission, and endotracheal intubation [85].
On the other hand, evidence indicates the presence of different COVID-19 patient phenotypes showing diverse inflammatory and immune responses, mortality risks, and treatment outcomes [87,88]. In a study by Chen H et al., two COVID-19 phenotypes emerged: hypo-inflammatory and hyper-inflammatory, with the latter marked by heightened pro-inflammatory cytokine levels and increased complication rates. Corticosteroid therapy was linked to lower 28-day mortality (HR, 0.45; 95% CI, 0.25-0.80; p = 0.0062) in the hyper-inflammatory type [88]. The lack of efficacy with remdesivir treatment may lend support to this idea, underscoring the importance of identifying factors for personalized treatments.
Our study has both limitations and strengths. The main limitations come from its retrospective, single-center design and limited sample size. However, these drawbacks are balanced by the use of powerful methodological tools like ML. ML methods have the advantage of being effective even with small datasets, resulting in simple and fast classification for our proposed method. We've also used data augmentation techniques to enhance our analysis [30].
One strength of our study is its inclusion of a diverse patient population, covering individuals with common health conditions often overlooked in standard clinical trials. This wider representation makes our findings applicable to a broader range of patients. Our methodology effectively identifies patients who could benefit from remdesivir, potentially leading to better survival rates and shorter hospital stays. Additionally, similar research using ML techniques has identified factors linked to worse outcomes in severe COVID-19 patients treated with tocilizumab [89]. Comparative studies suggest that ML methods may offer greater accuracy and efficiency compared to traditional logistic regression analysis, particularly with limited sample sizes.
The XGB method is a simple binary classification system that is easy to use and train. As more data is gathered, this algorithm gets better at making predictions.

5. Conclusions

Remdesivir has proven to be beneficial in patients with SARS-CoV-2 pneumonia, especially in those without critical illness criteria. However, a significant number of patients still die or require longer hospital stays despite treatment with remdesivir. Therefore, we utilized ML techniques, which are increasingly important in predicting important events. Among the six supervised ML methods we validated, XGB achieved the highest accuracy in predicting factors associated with worse outcomes, such as mortality and hospital stay length, in hospitalized COVID-19 patients treated with remdesivir. This tool can assist healthcare professionals in making early and effective clinical decisions to optimize remdesivir treatment for COVID-19 patients who meet specific clinical criteria.

Author Contributions

Conceptualization, Antonio Ramón, Andrés Bas, Santiago Herrero, Pilar Blasco, Miguel Suárez-Matías and Jorge Mateo; Data curation, Antonio Ramón, Andrés Bas, Santiago Herrero, Pilar Blasco, Miguel Suárez-Matías and Jorge Mateo; Formal analysis, Antonio Ramón, Andrés Bas, Santiago Herrero, Pilar Blasco, Miguel Suárez-Matías and Jorge Mateo; Investigation, Antonio Ramón, Andrés Bas, Santiago Herrero, Pilar Blasco, Miguel Suárez-Matías and Jorge Mateo; Methodology, Antonio Ramón, Andrés Bas, Santiago Herrero, Pilar Blasco, Miguel Suárez-Matías and Jorge Mateo; Project administration, Antonio Ramón, Andrés Bas, Santiago Herrero, Pilar Blasco, Miguel Suárez-Matías and Jorge Mateo; Resources, Antonio Ramón, Andrés Bas, Santiago Herrero, Pilar Blasco, Miguel Suárez-Matías and Jorge Mateo; Software, Antonio Ramón, Andrés Bas, Santiago Herrero, Pilar Blasco, Miguel Suárez-Matías and Jorge Mateo; Supervision, Antonio Ramón, Andrés Bas, Santiago Herrero, Pilar Blasco, Miguel Suárez-Matías and Jorge Mateo; Validation, Antonio Ramón, Andrés Bas, Santiago Herrero, Pilar Blasco, Miguel Suárez-Matías and Jorge Mateo; Visualization, Antonio Ramón, Andrés Bas, Santiago Herrero, Pilar Blasco, Miguel Suárez-Matías and Jorge Mateo; Writing – original draft, Antonio Ramón, Andrés Bas, Santiago Herrero, Pilar Blasco, Miguel Suárez-Matías and Jorge Mateo; Writing – review & editing, Antonio Ramón, Andrés Bas, Santiago Herrero, Pilar Blasco, Miguel Suárez-Matías and Jorge Mateo. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no funding.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Institutional Review Board (or Ethics Committee) of General University Hospital of Valencia (GUHV). The study was approved by the Ethics Committee of GUHV with protocol code XGB-COVID-19 on September 24, 2021.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The datasets employed and analyzed in the current study are accessible upon reasonable request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Lu, R.; Zhao, X.; Li, J.; et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. Lancet. 2020, 395, 565–574. [Google Scholar] [CrossRef] [PubMed]
  2. World Health Organization. Coronavirus disease 2019 (COVID-19) outbreak [Internet]. Geneva: World Health Organization; [cited 2024 Feb 12]. Available from: https://www.who.int/emergencies/diseases/novel-coronavirus-2019.
  3. Yuan, Z.; Shao, Z.; Ma, L.; Guo, R. Clinical Severity of SARS-CoV-2 Variants during COVID-19 Vaccination: A Systematic Review and Meta-Analysis. Viruses. 2023, 15, Published 2023 Sep 26. [Google Scholar] [CrossRef]
  4. Chen, Z.; Tian, F. Evaluation of oral small molecule drugs for the treatment of COVID-19 patients: a systematic review and network meta-analysis. Ann Med. 2023, 55, 2274511. [Google Scholar] [CrossRef] [PubMed]
  5. Qin, Z.; Li, Y.; Sun, W.; et al. Effect of anti-inflammatory drugs on the storm of inflammatory factors in respiratory tract infection caused by SARS-CoV-2: an updated meta-analysis. Front Public Health 2023, 11, 1198987, Published 2023 Oct 2. [Google Scholar] [CrossRef] [PubMed]
  6. Wang, D.; Hu, B.; Hu, C.; et al. Clinical Characteristics of 138 Hospitalized Patients With 2019 Novel Coronavirus-Infected Pneumonia in Wuhan, China [published correction appears in JAMA. 2021 Mar 16;325(11):1113]. JAMA. 2020, 323, 1061–1069. [Google Scholar] [CrossRef] [PubMed]
  7. Krishnan, A.; Hamilton, J.P.; Alqahtani, S.A.; Woreta, T.A. A narrative review of coronavirus disease 2019 (COVID-19): clinical, epidemiological characteristics, and systemic manifestations. Intern Emerg Med. 2021, 16, 815–830. [Google Scholar] [CrossRef]
  8. Thapa, K.; Verma, N.; Singh, T.G.; Kaur Grewal, A.; Kanojia, N.; Rani, L. COVID-19-Associated acute respiratory distress syndrome (CARDS): Mechanistic insights on therapeutic intervention and emerging trends. Int Immunopharmacol. 2021, 101 (Pt A), 108328. [Google Scholar] [CrossRef]
  9. Wang, Z.; Deng, H.; Ou, C.; et al. Clinical symptoms, comorbidities and complications in severe and non-severe patients with COVID-19: A systematic review and meta-analysis without cases duplication. Medicine (Baltimore). 2020, 99, e23327. [Google Scholar] [CrossRef]
  10. Sungnak, W.; Huang, N.; Bécavin, C.; et al. SARS-CoV-2 entry factors are highly expressed in nasal epithelial cells together with innate immune genes. Nat Med. 2020, 26, 681–687. [Google Scholar] [CrossRef]
  11. Zeng, F.; Huang, Y.; Guo, Y.; et al. Association of inflammatory markers with the severity of COVID-19: A meta-analysis. Int J Infect Dis. 2020, 96, 467–474. [Google Scholar] [CrossRef]
  12. Gordon, C.J.; Tchesnokov, E.P.; Woolner, E.; et al. Remdesivir is a direct-acting antiviral that inhibits RNA-dependent RNA polymerase from severe acute respiratory syndrome coronavirus 2 with high potency. J Biol Chem. 2020, 295, 6785–6797. [Google Scholar] [CrossRef] [PubMed]
  13. Beigel, J.H.; Tomashek, K.M.; Dodd, L.E.; et al. Remdesivir for the Treatment of Covid-19 - Final Report. N Engl J Med. 2020, 383, 1813–1826. [Google Scholar] [CrossRef]
  14. Spinner, C.D.; Gottlieb, R.L.; Criner, G.J.; et al. Effect of Remdesivir vs Standard Care on Clinical Status at 11 Days in Patients With Moderate COVID-19: A Randomized Clinical Trial. JAMA. 2020, 324, 1048–1057. [Google Scholar] [CrossRef] [PubMed]
  15. Goldman, J.D.; Lye, D.C.B.; Hui, D.S.; et al. Remdesivir for 5 or 10 Days in Patients with Severe Covid-. N Engl J Med. 2020, 383, 1827–1837. [Google Scholar] [CrossRef]
  16. WHO Solidarity Trial Consortium; Pan, H.; Peto, R.; et al. Repurposed Antiviral Drugs for Covid-19 - Interim WHO Solidarity Trial Results. N Engl J Med. 2021, 384, 497–511. [Google Scholar] [CrossRef]
  17. WHO Solidarity Trial Consortium. Remdesivir and three other drugs for hospitalised patients with COVID-19: final results of the WHO Solidarity randomised trial and updated meta-analyses [published correction appears in Lancet. 2022 Oct 29;400(10362):1512] [published correction appears in Lancet. 2024 Jan 13;403(10422):146]. Lancet. 2022, 399, 1941–1953. [Google Scholar] [CrossRef]
  18. Vollmer, S.; Mateen, B.A.; Bohner, G.; et al. Machine learning and artificial intelligence research for patient benefit: 20 critical questions on transparency, replicability, ethics, and effectiveness [published correction appears in BMJ. 2020 Apr 1;369:m1312]. BMJ. 2020, 368, l6927, Published 2020 Mar 20. [Google Scholar] [CrossRef] [PubMed]
  19. Syeda, H.B.; Syed, M.; Sexton, K.W.; et al. Role of Machine Learning Techniques to Tackle the COVID-19 Crisis: Systematic Review. JMIR Med Inform. 2021, 9, e23811, Published 2021 Jan 11. [Google Scholar] [CrossRef]
  20. Rasheed, J.; Jamil, A.; Hameed, A.A.; Al-Turjman, F.; Rasheed, A. COVID-19 in the Age of Artificial Intelligence: A Comprehensive Review. Interdiscip Sci. 2021, 13, 153–175. [Google Scholar] [CrossRef]
  21. Chen, C.; Dong, D.; Qi, B.; Petersen, I.R.; Rabitz, H. Quantum Ensemble Classification: A Sampling-Based Learning Control Approach. IEEE Trans Neural Netw Learn Syst. 2017, 28, 1345–1359. [Google Scholar] [CrossRef]
  22. Chang, W.; Liu, Y.; Wu, X.; Xiao, Y.; Zhou, S.; Cao, W. A New Hybrid XGBSVM Model: Application for Hypertensive Heart Disease. IEEE Access. 2019, 7, 175248–175258. [Google Scholar] [CrossRef]
  23. Chen, T.; Guestrin, C. 2016. Xgboost: A scalable tree boosting system, in: Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, August 13, 2016. San Francisco, California, USA, 785-794.
  24. Mateo, J.; Rius-Peris, J.M.; Maraña-Pérez, A.I.; Valiente-Armero, A.; Torres, A.M. Extreme gradient boosting machine learning method for predicting medical treatment in patients with acute bronchiolitis. Biocybern Biomed Eng. 2021, 41, 792–801. [Google Scholar] [CrossRef]
  25. Rivera-Lopez, R.; Canul-Reich, J. Construction of Near-Optimal Axis-Parallel Decision Trees Using a Differential-Evolution-Based Approach. IEEE Access. 2018, 6, 5548–5563. [Google Scholar] [CrossRef]
  26. Das, B.K.; Dutta, H.S. GFNB: Gini index-based Fuzzy Naive Bayes and blast cell segmentation for leukemia detection using multi-cell blood smear images. Med Biol Eng Comput. 2020, 58, 2789–2803. [Google Scholar] [CrossRef] [PubMed]
  27. Ma, D.; Yuan, S.; Shang, J.; et al. The Automatic Detection of Seizure Based on Tensor Distance And Bayesian Linear Discriminant Analysis. Int J Neural Syst. 2021, 31, 2150006. [Google Scholar] [CrossRef] [PubMed]
  28. Xing, W.; Bei, Y. Medical Health Big Data Classification Based on KNN Classification Algorithm. IEEE Access. 2020, 8, 28808–28819. [Google Scholar] [CrossRef]
  29. Yu, S.; Li, X.; Zhang, X.; Wang, H. The OCS-SVM: An Objective-Cost-Sensitive SVM With Sample-Based Misclassification Cost Invariance. IEEE Access. 2019, 7, 118931–118942. [Google Scholar] [CrossRef]
  30. Pei, J.; Han, J.; Tong, H. Data Mining: Concepts and Techniques. Elsevier Science & Technology; 2022.
  31. Zhou, X.H.; McClish, D.K.; Obuchowski, N.A. Statistical Methods in Diagnostic Medicine. Wiley & Sons, Incorporated, John; 2014.
  32. Marko, M.; Pawliczak, R. Assessment of the available therapeutic approaches for severe COVID-19: a meta-analysis of randomized controlled trials. Sci Rep. 2023, 13, 17114, Published 2023 Oct 10. [Google Scholar] [CrossRef]
  33. Kuno, T.; Miyamoto, Y.; Iwagami, M.; Ishimaru, M.; Takahashi, M.; Egorova, N.N. The association of remdesivir and in-hospital outcomes for COVID-19 patients treated with steroids. J Antimicrob Chemother. 2021, 76, 2690–2696. [Google Scholar] [CrossRef]
  34. European Medicines Agency. Veklury: European Public Assessment Report (EPAR). Date of access: February 10, 2024. Available at: https://www.ema.europa.eu/en/medicines/human/EPAR/veklury.
  35. Abubaker Bagabir, S.; Ibrahim, N.K.; Abubaker Bagabir, H.; Hashem Ateeq, R. Covid-19 and Artificial Intelligence: Genome sequencing, drug development and vaccine discovery. J Infect Public Health. 2022, 15, 289–296. [Google Scholar] [CrossRef]
  36. Jin, W.; Stokes, J.M.; Eastman, R.T.; et al. Deep learning identifies synergistic drug combinations for treating COVID-19. Proc Natl Acad Sci U S A. 2021, 118, e2105070118. [Google Scholar] [CrossRef] [PubMed]
  37. Augustin, Y.; Staines, H.M.; Velavan, T.P.; Kamarulzaman, A.; Kremsner, P.G.; Krishna, S. Drug repurposing for COVID-19: current evidence from randomized controlled adaptive platform trials and living systematic reviews. Br Med Bull. 2023, 147, 31–49. [Google Scholar] [CrossRef] [PubMed]
  38. Basit, S.A.; Qureshi, R.; Musleh, S.; et al. COVID-19Base v3: Update of the knowledgebase for drugs and biomedical entities linked to COVID-19. Front Public Health. 2023, 11, 1125917, Published 2023 Mar 6. [Google Scholar] [CrossRef] [PubMed]
  39. Imtiaz, F.; Pasha, M.K. A systematic review of RdRp of SARS-CoV-2 through artificial intelligence and machine learning utilizing structure-based drug design strategy. Turk J Chem. 2021, 46, 583–594, Published 2021 Dec 27. [Google Scholar] [CrossRef]
  40. Gordon, C.J.; Tchesnokov, E.P.; Woolner, E.; et al. Remdesivir is a direct-acting antiviral that inhibits RNA-dependent RNA polymerase from severe acute respiratory syndrome coronavirus 2 with high potency. J Biol Chem. 2020, 295, 6785–6797. [Google Scholar] [CrossRef]
  41. Abd-Elshafy, D.N.; Nadeem, R.; Nasraa, M.H.; Bahgat, M.M. Analysis of the SARS-CoV-2 nsp12 P323L/A529V mutations: coeffect in the transiently peaking lineage C.36.3 on protein structure and response to treatment in Egyptian records. Z Naturforsch C J Biosci. Published online January 24, 2024. [CrossRef]
  42. WHO Solidarity Trial Consortium. Remdesivir and three other drugs for hospitalised patients with COVID-19: final results of the WHO Solidarity randomised trial and updated meta-analyses [published correction appears in Lancet. 2022 Oct 29;400(10362):1512]. Lancet. 2022, 399, 1941–1953. [Google Scholar] [CrossRef]
  43. Grundeis, F.; Ansems, K.; Dahms, K.; et al. Remdesivir for the treatment of COVID-19. Cochrane Database Syst Rev. 2023, 1, CD014962, Published 2023 Jan 25. [Google Scholar] [CrossRef]
  44. Spinner, C.D.; Gottlieb, R.L.; Criner, G.J.; et al. Effect of Remdesivir vs Standard Care on Clinical Status at 11 Days in Patients With Moderate COVID-19: A Randomized Clinical Trial. JAMA. 2020, 324, 1048–1057. [Google Scholar] [CrossRef]
  45. Goldman, J.D.; Lye, D.C.B.; Hui, D.S.; et al. Remdesivir for 5 or 10 Days in Patients with Severe Covid-19. N Engl J Med. 2020, 383, 1827–1837. [Google Scholar] [CrossRef]
  46. Chaudhary, B.R.; Dudhrejia, P.J.; Gambhir, R.M.; Rathod, M.M. Study of Efficacy of Injection Remdesivir in Patients of COVID-19. J Assoc Physicians India. 2023, 71, 11–12. [Google Scholar] [CrossRef]
  47. Libra, A.; Ciancio, N.; Sambataro, G.; et al. Use of Remdesivir in Patients Hospitalized for COVID-19 Pneumonia: Effect on the Hypoxic and Inflammatory State. Viruses. 2023, 15, 2101, Published 2023 Oct 17. [Google Scholar] [CrossRef]
  48. Lavrentieva, A.; Kaimakamis, E.; Voutsas, V.; Bitzani, M. An observational study on factors associated with ICU mortality in Covid-19 patients and critical review of the literature. Sci Rep. 2023, 13, 7804, Published 2023 May 13. [Google Scholar] [CrossRef]
  49. Chen, C.; Fang, J.; Chen, S.; et al. The efficacy and safety of remdesivir alone and in combination with other drugs for the treatment of COVID-19: a systematic review and meta-analysis. BMC Infect Dis. 2023, 23, 672, Published 2023 Oct 9. [Google Scholar] [CrossRef]
  50. Mozaffari, E.; Chandak, A.; Zhang, Z.; et al. Remdesivir Treatment in Hospitalized Patients With Coronavirus Disease 2019 (COVID-19): A Comparative Analysis of In-hospital All-cause Mortality in a Large Multicenter Observational Cohort. Clin Infect Dis. 2022, 75, e450–e458. [Google Scholar] [CrossRef]
  51. Paules, C.I.; Gallagher, S.K.; Rapaka, R.R.; et al. Remdesivir for the Prevention of Invasive Mechanical Ventilation or Death in Coronavirus Disease 2019 (COVID-19): A Post Hoc Analysis of the Adaptive COVID-19 Treatment Trial-1 Cohort Data. Clin Infect Dis. 2022, 74, 1260–1264. [Google Scholar] [CrossRef] [PubMed]
  52. Amstutz, A.; Speich, B.; Mentré, F.; et al. Effects of remdesivir in patients hospitalised with COVID-19: a systematic review and individual patient data meta-analysis of randomised controlled trials [published correction appears in Lancet Respir Med. 2023 Aug;11(8):e77]. Lancet Respir Med. 2023, 11, 453–464. [Google Scholar] [CrossRef]
  53. Huang, C.; Lu, T.L.; Lin, L. Remdesivir Treatment Lacks the Effect on Mortality Reduction in Hospitalized Adult COVID-19 Patients Who Required High-Flow Supplemental Oxygen or Invasive Mechanical Ventilation. Medicina (Kaunas). 2023, 59, 1027, Published 2023 May 26. [Google Scholar] [CrossRef] [PubMed]
  54. Kaka, A.S.; MacDonald, R.; Linskens, E.J.; et al. Major Update 2: Remdesivir for Adults With COVID-19: A Living Systematic Review and Meta-analysis for the American College of Physicians Practice Points. Ann Intern Med. 2022, 175, 701–709. [Google Scholar] [CrossRef] [PubMed]
  55. Wei, B.; Zhang, R.; Zeng, H.; et al. Impact of some antiviral drugs on health care utilization for patients with COVID-19: a systematic review and meta-analysis [published online ahead of print, 2023 Sep 5]. Expert Rev Anti Infect Ther. 2023, 1–17. [Google Scholar] [CrossRef]
  56. Kalil, A.C.; Patterson, T.F.; Mehta, A.K.; et al. Baricitinib plus Remdesivir for Hospitalized Adults with Covid-19. N Engl J Med. 2021, 384, 795–807. [Google Scholar] [CrossRef]
  57. Hariharan, A.; Hakeem, A.R.; Radhakrishnan, S.; Reddy, M.S.; Rela, M. The Role and Therapeutic Potential of NF-kappa-B Pathway in Severe COVID-19 Patients. Inflammopharmacology. 2021, 29, 91–100. [Google Scholar] [CrossRef]
  58. Osuchowski, M.F.; Winkler, M.S.; Skirecki, T.; et al. The COVID-19 puzzle: deciphering pathophysiology and phenotypes of a new disease entity. Lancet Respir Med. 2021, 9, 622–642. [Google Scholar] [CrossRef] [PubMed]
  59. Ali, K.; Azher, T.; Baqi, M.; et al. Remdesivir for the treatment of patients in hospital with COVID-19 in Canada: a randomized controlled trial. CMAJ. 2022, 194, E242–E251. [Google Scholar] [CrossRef] [PubMed]
  60. Khosrow-Pour, M. Advanced Methodologies and Technologies in Network Architecture, Mobile Computing, and Data Analytics. IGI Global; 2018.
  61. Bottino, F.; Tagliente, E.; Pasquini, L.; et al. COVID Mortality Prediction with Machine Learning Methods: A Systematic Review and Critical Appraisal. J Pers Med. 2021, 11, 893, Published 2021 Sep 7. [Google Scholar] [CrossRef] [PubMed]
  62. Adamidi, E.S.; Mitsis, K.; Nikita, K.S. Artificial intelligence in clinical care amidst COVID-19 pandemic: A systematic review. Comput Struct Biotechnol J. 2021, 19, 2833–2850. [Google Scholar] [CrossRef] [PubMed]
  63. Ma, B.; Meng, F.; Yan, G.; Yan, H.; Chai, B.; Song, F. Diagnostic classification of cancers using extreme gradient boosting algorithm and multi-omics data. Comput Biol Med. 2020, 121, 103761. [Google Scholar] [CrossRef] [PubMed]
  64. Choi, Y.J.; Song, J.Y.; Hyun, H.; et al. Prognostic factors of 30-day mortality in patients with COVID-19 pneumonia under standard remdesivir and dexamethasone treatment. Medicine (Baltimore). 2022, 101, e30474. [Google Scholar] [CrossRef] [PubMed]
  65. Amiri, P.; Montazeri, M.; Ghasemian, F.; et al. Prediction of mortality risk and duration of hospitalization of COVID-19 patients with chronic comorbidities based on machine learning algorithms. Digit Health 2023, 9, 20552076231170493, Published 2023 Jun 6. [Google Scholar] [CrossRef]
  66. Georgakopoulou, V.E.; Gkoufa, A.; Makrodimitri, S.; et al. Early 3 day course of remdesivir for the prevention of the progression to severe COVID 19 in the elderly: A single centre, real life cohort study. Exp Ther Med. 2023, 26, 462, Published 2023 Aug 9. [Google Scholar] [CrossRef]
  67. Vaid, A.; Somani, S.; Russak, A.J.; et al. Machine Learning to Predict Mortality and Critical Events in a Cohort of Patients With COVID-19 in New York City: Model Development and Validation. J Med Internet Res. 2020, 22, e24018, Published 2020 Nov 6. [Google Scholar] [CrossRef]
  68. Aktar, S.; Talukder, A.; Ahamad, M.M.; et al. Machine Learning Approaches to Identify Patient Comorbidities and Symptoms That Increased Risk of Mortality in COVID-19. Diagnostics (Basel). 2021, 11, 1383, Published 2021 Jul 31. [Google Scholar] [CrossRef] [PubMed]
  69. Guan, X.; Zhang, B.; Fu, M.; et al. Clinical and inflammatory features based machine learning model for fatal risk prediction of hospitalized COVID-19 patients: results from a retrospective cohort study. Ann Med. 2021, 53, 257–266. [Google Scholar] [CrossRef] [PubMed]
  70. Wei, S.; Xiong, D.; Wang, J.; Liang, X.; Wang, J.; Chen, Y. The accuracy of the National Early Warning Score 2 in predicting early death in prehospital and emergency department settings: a systematic review and meta-analysis. Ann Transl Med. 2023, 11, 95. [Google Scholar] [CrossRef] [PubMed]
  71. Richardson, D.; Faisal, M.; Fiori, M.; Beatson, K.; Mohammed, M. Use of the first National Early Warning Score recorded within 24 hours of admission to estimate the risk of in-hospital mortality in unplanned COVID-19 patients: a retrospective cohort study. BMJ Open. 2021, 11, e043721, Published 2021 Feb 22. [Google Scholar] [CrossRef] [PubMed]
  72. Qin, C.; Zhou, L.; Hu, Z.; et al. Dysregulation of Immune Response in Patients With Coronavirus 2019 (COVID-19) in Wuhan, China. Clin Infect Dis. 2020, 71, 762–768. [Google Scholar] [CrossRef]
  73. Terkes, V.; Lisica, K.; Marusic, M.; Verunica, N.; Tolic, A.; Morovic, M. Remdesivir Treatment in Moderately Ill COVID-19 Patients: A Retrospective Single Center Study. J Clin Med. 2022, 11, 5066, Published 2022 Aug 29. [Google Scholar] [CrossRef]
  74. Liu, Y.; Gao, W.; Guo, W.; et al. Prominent coagulation disorder is closely related to inflammatory response and could be as a prognostic indicator for ICU patients with COVID-19. J Thromb Thrombolysis. 2020, 50, 825–832. [Google Scholar] [CrossRef]
  75. Comoglu, S.; Kant, A. Does the Charlson comorbidity index help predict the risk of death in COVID-19 patients? North Clin Istanb. 2022, 9, 117–121, Published 2022 Apr 12. [Google Scholar] [CrossRef]
  76. Kurien, S.S.; David, R.; Varma, R.P.; Dev, A.S.; Chellappan, A.; Yadev, I.P. Correlation Between Biomarkers and Age-Adjusted Charlson Comorbidity Index in Patients With COVID-19: A Cross-Sectional Study in a Tertiary Care Center in South India. Cureus. 2023, 15, e36000, Published 2023 Mar 10. [Google Scholar] [CrossRef]
  77. Carannante, M.; D'Amato, V.; Iaccarino, G. The Future Evolution of the Mortality Acceleration Due to the COVID-19: The Charlson Comorbidity Index in Stochastic Setting. Front Cardiovasc Med. 2022, 9, 938086, Published 2022 Jul 14. [Google Scholar] [CrossRef]
  78. Raad, I.I.; Hachem, R.; Masayuki, N.; et al. International multicenter study comparing COVID-19 in patients with cancer to patients without cancer: Impact of risk factors and treatment modalities on survivorship. Elife. 2023, 12, e81127, Published 2023 Jan 30. [Google Scholar] [CrossRef] [PubMed]
  79. Mikulska, M.; Testi, D.; Russo, C.; et al. Outcome of early treatment of SARS-CoV-2 infection in patients with haematological disorders. Br J Haematol. 2023, 201, 628–639. [Google Scholar] [CrossRef] [PubMed]
  80. Alharbi, A.M.; Rabbani, S.I.; Halim Mohamed, A.A.; et al. Analysis of potential risk factors associated with COVID-19 and hospitalization. Front Public Health. 2022, 10, 921953, Published 2022 Aug 5. [Google Scholar] [CrossRef] [PubMed]
  81. Matthay, M.A.; Arabi, Y.; Arroliga, A.C.; et al. A New Global Definition of Acute Respiratory Distress Syndrome. Am J Respir Crit Care Med. 2024, 209, 37–47. [Google Scholar] [CrossRef] [PubMed]
  82. Georgakopoulou, V.E.; Vlachogiannis, N.I.; Basoulis, D.; et al. A Simple Prognostic Score for Critical COVID-19 Derived from Patients without Comorbidities Performs Well in Unselected Patients. J Clin Med. 2022, 11, 1810, Published 2022 Mar 25. [Google Scholar] [CrossRef] [PubMed]
  83. Brosh-Nissimov, T.; Hussein, K.; Wiener-Well, Y.; et al. Hospitalized Patients With Severe Coronavirus Disease 2019 During the Omicron Wave in Israel: Benefits of a Fourth Vaccine Dose. Clin Infect Dis. 2023, 76, e234–e239. [Google Scholar] [CrossRef]
  84. Shimizu, H.; Kawase, J.; Hayashi, M.; Imaizumi, K.; Ito, Y.; Okazawa, M. COVID-19 symptom-onset to diagnosis and diagnosis to treatment intervals are significant predictors of disease progression and hospitalization in high-risk patients: A real world analysis. Respir Investig. 2023, 61, 220–229. [Google Scholar] [CrossRef]
  85. Kuno, T.; Sahashi, Y.; Kawahito, S.; Takahashi, M.; Iwagami, M.; Egorova, N.N. Prediction of in-hospital mortality with machine learning for COVID-19 patients treated with steroid and remdesivir. J Med Virol. 2022, 94, 958–964. [Google Scholar] [CrossRef]
  86. Banoei, M.M.; Rafiepoor, H.; Zendehdel, K.; et al. Unraveling complex relationships between COVID-19 risk factors using machine learning based models for predicting mortality of hospitalized patients and identification of high-risk group: a large retrospective study. Front Med (Lausanne). 2023, 10, 1170331, Published 2023 May 4. [Google Scholar] [CrossRef]
  87. Gutiérrez-Gutiérrez, B.; Del Toro, M.D.; Borobia, A.M.; et al. Identification and validation of clinical phenotypes with prognostic implications in patients admitted to hospital with COVID-19: a multicentre cohort study [published correction appears in Lancet Infect Dis. 2023 Jun;23(6):e198]. Lancet Infect Dis. 2021, 21, 783–792. [Google Scholar] [CrossRef]
  88. Chen, H.; Xie, J.; Su, N.; et al. Corticosteroid Therapy Is Associated With Improved Outcome in Critically Ill Patients With COVID-19 With Hyperinflammatory Phenotype. Chest. 2021, 159, 1793–1802. [Google Scholar] [CrossRef] [PubMed]
  89. Ramón, A.; Zaragozá, M.; Torres, A.M.; et al. Application of Machine Learning in Hospitalized Patients with Severe COVID-19 Treated with Tocilizumab. J Clin Med. 2022, 11, 4729, Published 2022 Aug 12. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The figure illustrates the framework employed in the training and testing processes of this study.
Figure 1. The figure illustrates the framework employed in the training and testing processes of this study.
Preprints 99643 g001
Figure 2. Graphic representation of different metrics for each of the machine learning models used, in percentage. Training phase (above) and test phase (below) for predicting mortality in COVID-19 patients undergoing remdesivir treatment. Abbreviations: AUC: area under curve; BLDA: Bayesian linear discriminant analysis; DT: decision tree; DYI: degenerated Younden index; GNB: Gaussian naïve Bayes; KNN: K-nearest neighbour; MCC: Matthew’s correlation coefficient; SVM: support vector machine; XGB: extreme gradient boost.
Figure 2. Graphic representation of different metrics for each of the machine learning models used, in percentage. Training phase (above) and test phase (below) for predicting mortality in COVID-19 patients undergoing remdesivir treatment. Abbreviations: AUC: area under curve; BLDA: Bayesian linear discriminant analysis; DT: decision tree; DYI: degenerated Younden index; GNB: Gaussian naïve Bayes; KNN: K-nearest neighbour; MCC: Matthew’s correlation coefficient; SVM: support vector machine; XGB: extreme gradient boost.
Preprints 99643 g002
Figure 3. Graphic representation of different metrics for each of the machine learning models used, in percentage. Training phase (above) and test phase (below) for predicting hospital stay in COVID-19 patients undergoing remdesivir treatment. Abbreviations: AUC: area under curve; BLDA: Bayesian linear discriminant analysis; DT: decision tree; DYI: degenerated Younden index; GNB: Gaussian naïve Bayes; KNN: K-nearest neighbour; MCC: Matthew’s correlation coefficient; SVM: support vector machine; XGB: extreme gradient boost.
Figure 3. Graphic representation of different metrics for each of the machine learning models used, in percentage. Training phase (above) and test phase (below) for predicting hospital stay in COVID-19 patients undergoing remdesivir treatment. Abbreviations: AUC: area under curve; BLDA: Bayesian linear discriminant analysis; DT: decision tree; DYI: degenerated Younden index; GNB: Gaussian naïve Bayes; KNN: K-nearest neighbour; MCC: Matthew’s correlation coefficient; SVM: support vector machine; XGB: extreme gradient boost.
Preprints 99643 g003aPreprints 99643 g003b
Figure 4. ROC curves for mortality (above) and hospital stay (below) for the six assessed machine learning predictors. Abbreviations: BLDA: Bayesian linear discriminant analysis; DT: decision tree; GNB: Gaussian naïve Bayes; KNN: K-nearest neighbour; ROC: receiver operating characteristic; SVM: support vector machine; XGB: extreme gradient boost.
Figure 4. ROC curves for mortality (above) and hospital stay (below) for the six assessed machine learning predictors. Abbreviations: BLDA: Bayesian linear discriminant analysis; DT: decision tree; GNB: Gaussian naïve Bayes; KNN: K-nearest neighbour; ROC: receiver operating characteristic; SVM: support vector machine; XGB: extreme gradient boost.
Preprints 99643 g004aPreprints 99643 g004b
Figure 5. Radar plot depicting the training phase (above) and test phase (below) for predicting mortality in COVID-19 patients undergoing remdesivir treatment. AUC: area under curve; BLDA: Bayesian linear discriminant analysis; DT: decision tree; DYI: degenerated Younden index; GNB: Gaussian naïve Bayes; KNN: K-nearest neighbour; MCC: Matthew’s correlation coefficient; SVM: support vector machine; XGB: extreme gradient boost.
Figure 5. Radar plot depicting the training phase (above) and test phase (below) for predicting mortality in COVID-19 patients undergoing remdesivir treatment. AUC: area under curve; BLDA: Bayesian linear discriminant analysis; DT: decision tree; DYI: degenerated Younden index; GNB: Gaussian naïve Bayes; KNN: K-nearest neighbour; MCC: Matthew’s correlation coefficient; SVM: support vector machine; XGB: extreme gradient boost.
Preprints 99643 g005aPreprints 99643 g005b
Figure 6. Radar plot depicting the training phase (above) and test phase (below) for predicting hospital stay in COVID-19 patients undergoing remdesivir treatment. AUC: area under curve; BLDA: Bayesian linear discriminant analysis; DT: decision tree; DYI: degenerated Younden index; GNB: Gaussian naïve Bayes; KNN: K-nearest neighbour; MCC: Matthew’s correlation coefficient; SVM: support vector machine; XGB: extreme gradient boost.
Figure 6. Radar plot depicting the training phase (above) and test phase (below) for predicting hospital stay in COVID-19 patients undergoing remdesivir treatment. AUC: area under curve; BLDA: Bayesian linear discriminant analysis; DT: decision tree; DYI: degenerated Younden index; GNB: Gaussian naïve Bayes; KNN: K-nearest neighbour; MCC: Matthew’s correlation coefficient; SVM: support vector machine; XGB: extreme gradient boost.
Preprints 99643 g006
Figure 7. Graphical representation of the predictive variables with the most significant impact on classifying COVID-19 patients undergoing remdesivir treatment in terms of mortality (above) and hospital stay (below).
Figure 7. Graphical representation of the predictive variables with the most significant impact on classifying COVID-19 patients undergoing remdesivir treatment in terms of mortality (above) and hospital stay (below).
Preprints 99643 g007aPreprints 99643 g007b
Table 1. Summary of the mean values and standard deviation of balanced accuracy, recall, precision, F1 score, AUC, MCC, DYI and Kappa index of the machine learning models and the proposed method implemented in this study for the mortality.
Table 1. Summary of the mean values and standard deviation of balanced accuracy, recall, precision, F1 score, AUC, MCC, DYI and Kappa index of the machine learning models and the proposed method implemented in this study for the mortality.
Methods Balanced Accuracy Recall Precision F1 Score
SVM 84.80 ± 0.75 84.90 ± 0.73 84.20 ± 0.74 84.55 ± 0.73
BLDA 81.61 ± 0.84 81.70 ± 0.81 81.03 ± 0.82 81.36 ± 0.81
DT 86.25 ± 0.71 86.35 ± 0.70 85.63 ± 0.71 85.99 ± 0.70
GNB 76.80 ± 0.94 76.89 ± 0.93 76.25 ± 0.95 76.57 ± 0.94
KNN 89.43 ± 0.60 89.54 ± 0.58 88.79 ± 0.59 89.16 ± 0.58
XGB 95.45 ± 0.46 95.56 ± 0.45 94.77 ± 0.47 95.17 ± 0.44
Methods AUC MCC DYI Kappa
SVM 0.84 ± 0.02 75.25 ± 0.75 84.80 ± 0.73 75.49 ± 0.74
BLDA 0.81 ± 0.03 72.41 ± 0.82 81.61 ± 0.81 72.65 ± 0.81
DT 0.86 ± 0.02 76.53 ± 0.71 86.25 ± 0.71 76.78 ± 0.72
GNB 0.76 ± 0.03 68.15 ± 0.95 76.80 ± 0.94 68.37 ± 0.94
KNN 0.89 ± 0.02 79.35 ± 0.57 89.43 ± 0.58 79.62 ± 0.58
XGB 0.95 ± 0.02 84.70 ± 0.42 95.45 ± 0.46 84.98 ± 0.43
Abbreviations: AUC: Area Under Curve; BLDA: Bayesian Linear Discriminant Analysis; DT: Decision Tree; DYI: Degenerated Younden Index; GNB: Gaussian Naïve Bayes; KNN: K-Nearest Neighbour; MCC: Matthew’s Correlation Coefficient; SVM: Support Vector Machine; XGB: eXtreme Gradient Boost.
Table 2. Summary of the mean values and standard deviation of balanced accuracy, recall, precision, F1 score, AUC, MCC, DYI and Kappa index of the machine learning models and the proposed method implemented in this study for the hospital stay.
Table 2. Summary of the mean values and standard deviation of balanced accuracy, recall, precision, F1 score, AUC, MCC, DYI and Kappa index of the machine learning models and the proposed method implemented in this study for the hospital stay.
Methods Balanced Accuracy Recall Precision F1 Score
SVM 83.80 ± 0.77 83.89 ± 0.76 83.20 ± 0.78 83.54 ± 0.76
BLDA 80.54 ± 0.85 80.64 ± 0.84 79.94 ± 0.86 80.30 ± 0.85
DT 85.67 ± 0.73 85.61 ± 0.72 84.90 ± 0.75 85.25 ± 0.74
GNB 75.41 ± 0.98 75.50 ± 0.95 74.86 ± 0.96 75.18 ± 0.97
KNN 88.53 ± 0.66 88.61 ± 0.64 87.85 ± 0.67 88.24 ± 0.68
XGB 94.24 ± 0.48 94.35 ± 0.44 93.57 ± 0.46 93.96 ± 0.47
Methods AUC MCC DYI Kappa
SVM 0.83 ± 0.02 74.35 ± 0.74 83.80 ± 0.77 74.60 ± 0.73
BLDA 0.80 ± 0.03 71.47 ± 0.83 80.54 ± 0.85 71.71 ± 0.84
DT 0.85 ± 0.03 75.89 ± 0.71 85.51 ± 0.73 76.13 ± 0.71
GNB 0.75 ± 0.03 66.91 ± 0.92 75.43 ± 0.95 67.15 ± 0.93
KNN 0.88 ± 0.02 78.53 ± 0.63 88.51 ± 0.66 78.79 ± 0.64
XGB 0.94 ± 0.02 83.62 ± 0.41 94.24 ± 0.45 83.90 ± 0.42
Table 3. Main basal clinical data of patients. Data are n (%) or median (IQR), unless otherwise stated.
Table 3. Main basal clinical data of patients. Data are n (%) or median (IQR), unless otherwise stated.
Variable Cohort
Number of patients 252
Age (years) (IQR) 77 (66.7–85.2)
Male, n (yes %) 147 (58.3)
Hospital admission (days) after remdesivir administration (IQR) 8 (5–12)
Exitus, n (yes %) 34 (13.5)
Patients with duration of remdesivir treatment for 4-5 days, n (yes %) 194 (76.9)
Patients with duration of remdesivir treatment > 5 days, n (yes %) 14 (5.6)
Duration (days) from onset of symptoms to microbiological confirmation (IQR) 2 (1-4)
Oxygen therapy, n (yes %) 188 (74.6)
Patients who need admission to the ICU, n (yes %) 14 (5.6)
Limitation of life support treatment, n (yes %) 47 (18.6)
IMV, n (yes %) 10 (3.9)
Baseline situation at the start of remdesivir:
- Patient who does not require supplementary oxygen, n (yes %)
- Patient requiring low-flow oxygen therapy, n (yes %)
 
96 (38.1)
156 (61.9)
Hypertension, n (yes %) 158 (62.7)
Diabetes, n (yes %) 89 (35.3)
Dyslipemia, n (yes %) 112 (44.4)
Smoker, n (yes %) 55 (21.8)
Obesity, n (yes %) 45 (17.8)
COPD, n (yes %) 35 (13.9)
Heart failure, n (yes %) 51 (20.2)
Ischemic heart disease, n (yes %) 46 (18.2)
Chronic kidney disease, n (yes %) 26 (10.3)
Chronic neurological or neurodegenerative disease, n (yes %) 62 (24.6)
Mental health disorder, n (yes %) 60 (23.8)
Active hematological or oncological neoplasia, n (yes %) 63 (25.0)
Patients with ≥ 3 comorbidities, n (yes %) 193 (76.6)
Fever, n (yes %) 148 (58.7)
Cough, n (yes %) 156 (61.9)
Dyspnea, n (yes %) 158 (62.7)
Asthenia, n (yes %) 106 (42.1)
Presence of flu and/or coinfection, n (yes %) 29 (11.5)
Angiotensin-converting enzyme inhibitors/angiotensin receptor blockers, n (yes %) 109 (43.2)
Antibiotics, n (yes %) 234 (92.8)
Immunosuppressants and/or immunomodulators, n (yes %)
- Corticoides, n (yes %)
- Tocilizumab, n (yes %)
- Baricitinib, n (yes %)
197 (78.2)
124 (49.2)
33 (13.1)
32 (12.7)
Albumin (g/dL) (IQR) 3.4 (2.9-3.6)
Hemoglobin (g/dL) (IQR) 12.4 (10.9-13.7)
Troponin (ng/mL) (IQR) 12 (7-27)
CRP (mg/dL) (IQR) 6.9 (2.4–14.1)
LDH (U/L) (IQR) 463 (359.5–609.5)
Ferritin (µg/L) (IQR) 413 (193–782)
D-dimer (ng/mL) (IQR) 878 (499.7-1,534.2)
Creatinina (mg/dL) (IQR) 0.8 (0.6-1.1)
CK (U/L) (IQR) 98.5 (51.0-192.0)
PAFI (IQR) 323.8 (257.0–368.0)
Lymphocytes (109/L) (IQR) 0.8 (0.6–1.2)
Bilateral lung radiological status, n (yes %) 98 (38.9)
Ground-glass opacity lung injury, n (yes %) 69 (27.4)
Patchy lung density, n (yes %) 64 (25.4)
Patients with ≥ 3 doses of COVID-19 vaccine, n (yes %) 199 (78.9)
Acute Respiratory Distress Syndrome, n (yes %) 56 (22.2)
Clinical status on day 14 after the first administration of remdesivir
- Hospitalized and requires supplementary oxygen but not IMV, n (yes %)
- Hospitalized and requires IMV, n (yes %)

23 (9.1)
5 (1.9)
Abbreviations: CK: creatine kinase, COPD: chronic obstructive pulmonary disease, CRP: C-reactive protein, ICU: intensive care unit, IQR: interquartile range, IMV: invasive mechanical ventilation, LDH: lactate dehydrogenase, PAFI: ratio of arterial oxygen partial pressure (PaO2) to fractional inspired oxygen (FiO2).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated