- 1Unit of Non-Communicable Diseases, Department of Public Health, Institute of Tropical Medicine Antwerp, Antwerp, Belgium
- 2Department of Epidemiology and Public Health, Sciensano, Brussels, Belgium
Objectives: To adopt a multi-state risk prediction model for critical disease/mortality outcomes among hospitalised COVID-19 patients using nationwide COVID-19 hospital surveillance data in Belgium.
Materials and methods: Information on 44,659 COVID-19 patients hospitalised between March 2020 and June 2021 with complete data on disease outcomes and candidate predictors was used to adopt a multi-state, multivariate Cox model to predict patients’ probability of recovery, critical [transfer to intensive care units (ICU)] or fatal outcomes during hospital stay.
Results: Median length of hospital stay was 9 days (interquartile range: 5–14). After admission, approximately 82% of the COVID-19 patients were discharged alive, 15% of patients were admitted to ICU, and 15% died in the hospital. The main predictors of an increased probability for recovery were younger age, and to a lesser extent, a lower number of prevalent comorbidities. A patient’s transition to ICU or in-hospital death had in common the following predictors: high levels of c-reactive protein (CRP) and lactate dehydrogenase (LDH), reporting lower respiratory complaints and male sex. Additionally predictors for a transfer to ICU included middle-age, obesity and reporting loss of appetite and staying at a university hospital, while advanced age and a higher number of prevalent comorbidities for in-hospital death. After ICU, younger age and low levels of CRP and LDH were the main predictors for recovery, while in-hospital death was predicted by advanced age and concurrent comorbidities.
Conclusion: As one of the very few, a multi-state model was adopted to identify key factors predicting COVID-19 progression to critical disease, and recovery or death.
Introduction
As of July 4, 2022, a number of 4,294,880 confirmed cases of COVID-19 have been reported in Belgium, with 128,814 associated hospitalisations, and 31,977 deaths (1). This unprecedented burden on the health care system exposed the need for tools to facilitate informed clinical decision on patients’ management, and resource allocation for effective health systems, particularly during outbreaks of uncertain aetiology and heterogeneous prognosis. A robust risk prediction model of COVID-19 progression after hospitalisation, and the identification of predicting factors for patients’ outcomes would be key for not only delivering targeted treatments and strategies in high-risk patients, and hereby potentially increase survival while managing the pressure on the health care system, but also needed for informed decision on futile medical care.
Toward this need, more than 100 prognostic models for patients diagnosed with COVID-19 have been developed using a variety of data sources from the early phases of the pandemic. Most were discussed to carry a high risk of bias and lacking external validation, as summarised in the living systematic literature review of Wynants and co-workers (2). Standard logistic regression was a frequently used tool to predict COVID-19 severity, though likely to introduce important selection bias if patient data are not available until discharge; a plausible situation during a pandemic where data mainly cover active cases who are still in the hospital. More recent COVID-19 risk prediction models, however, accounted for some of the data complexity by considering time-to-event outcomes, hereby incorporating the time elapsing before an event is experienced or the observation is censored, either using Cox proportional hazard models (3–9) or in a competing risk framework (10–12). However, to the best of our knowledge, only a very few prediction scores have been built using time-to-event data in a multi-state setting that should fit better hospital progression, as documented by an increasing number of studies using a multi-state approach in the COVID-19 context (13–17).
A multi-state model framework allows for the simultaneous investigation of clinically competing outcomes (e.g., discharged alive versus in-hospital death), as well as an intermediate progression to a critical disease state [transfer to intensive care units (ICU)], while also accounting for censoring due to loss of follow up (18). The analysis of patients’ progression during hospitalisation is particularly challenged by the presence of competing events, where the occurrence of one event prevents the observation of another, and methods should account for this if interested in the probability of failing from an event (19). In addition, the probability of experiencing a certain event is likely to be influenced by patient’s risk factors, including time-dependent factors that are generally present with progression of disease. When it is of interest to model non-fatal intermediate events, the competing risk model can be easily extended to a multi-state model for the analysis of successive events (18). When analysing hospital data, especially in the context of COVID-19, it is important to acknowledge the presence of competing risks and intermediate events using adequate statistical models to provide robust findings (17).
The objective of the present study is to adopt a tool for multi-state risk prediction and identifying the key factors predicting COVID-19 progression to critical disease status, and subsequent recovery or death, using nationwide hospital surveillance data from COVID-19 in Belgium. This model aims to predict individual probabilities of experiencing an intermediate (ICU stay) and final events (recovery or death) after hospital admission, using survival methods in a multi-state setting.
Materials and methods
Data source
Nationwide hospital surveillance data on COVID-19 patients in Belgium are since the beginning of the pandemic routinely collected by Sciensano, the Belgian Institute of Public Health. The methodology of the Belgian surveillance system has been previously described in detail (20). Briefly, the clinical survey collects individual data of patients hospitalised in Belgium with confirmed COVID-19, as reported through a structured questionnaire at hospital admission and discharge (and a third questionnaire at ICU discharge if appropriate), and hereby achieving a national coverage at patient level of around 65%. Information reported at admission include patient demographics (such as age, sex, residence in a retirement home), type of the hospital at admission, date of hospital admission, symptoms at admission, the presence of chronic pre-existing comorbidities and risk factors, including current smoking. Information reported at discharge include laboratory values at admission, COVID-19 related treatment during hospital stay, measures on critical disease state (such as the need for transfer to ICU, invasive ventilation support and/or extracorporeal membrane oxygenation (ECMO), with only dates available for ICU transfer), development of complications [such as a bacterial and/or fungal superinfection, pneumonia, and/or acute respiratory distress syndrome (ARDS)], date of discharge, and health status at discharge. The ICU occupancy (percentage of recognized ICU beds occupied by confirmed- and suspected COVID-19 patients) is reported through a different surveillance system, the surge capacity survey (20).
This COVID-19 clinical surveillance was authorised by an independent administrative authority protecting privacy and personal data, and was approved by the ethical committee of Ghent University Hospital (BC-07507), which waived the informed consent because of the anonymous and retrospective data analyses, and because it was considered an extra burden on the hospitals during the pandemic. Ethical approval for present data analyses was obtained from the Institutional Review Board of the Institute of Tropical Medicine (ITM) after revision of the research protocol (num. 1488/21, 06/04/2021).
Study population
A total of 55,737 adult patients with a community- or non-hospital-acquired SARS-CoV-2 infection confirmed by polymerase chain reaction (PCR) and/or suggestive imaging alterations on chest CT combined with typical clinical presentation at admission in Belgium from March, 2020 to June, 2021 were considered for inclusion in the present study. From this, excluded were patients with incomplete or implausible dates of admission, ICU transfer and/or discharge (15%), admitted to a psychiatric/categorical hospital (0.2%), and those with missing data on complete sections of the surveys: i.e., missing all symptoms (0.1%), all comorbidities (0.2%), and all laboratory values (6%), leaving 44,550 patients for the risk prediction model after multiple imputation (MI). Missing data among important potential candidate predictors were handled by 10-fold MI using the “mice” package in R (21).
Similar analyses were performed in a dataset excluding patients with missing data on any of the potential candidate predictors (66%), this complete-case analysis included 18,994 hospitalised COVID-19 patients in Belgium with complete data on necessary outcome variables for the prediction model (i.e., length of hospital stay, the time from admission to ICU for those admitted to ICU, and status at discharge) and potential candidate predictors.
Patient predictor variables
Candidate predictor variables were identified on the basis of the presence of existing clinical vulnerability criteria, as identified by the WHO, previous literature and expert knowledge, and included the following variables measured at hospital admission: patients’ demographics [age (in quartiles), sex, residence in a retirement home], prevalent risk factors and comorbidities (current smoking, pregnancy, obesity, high blood pressure, diabetes, chronic renal disease, cardiovascular disease, chronic lung disease, chronic liver disease, malignant solid neoplasms, haematological cancers, immunosuppression, and including the number of comorbidities in four categories), symptoms at admission (fever, upper respiratory complaints, lower respiratory complaints, gastro-intestinal complaints, anosmia, loss appetite, and typical symptoms of viral infection), laboratory values at admission [c-reactive protein (CRP), lactate dehydrogenase (LDH), lymphocyte count; all in quartiles], as well as hospital characteristics [hospital type (in three categories) and ICU occupancy at admission and at ICU admission (in quartiles)].
Patient outcome variables
Patients were considered to have recovered when their status at discharged was reported as “recovered” or “other” (with the latter representing recovery at home, a revalidation centre or a nursing home), to have in-hospital death when reported as “died,” and to be lost to follow-up when status at discharge was “unknown” or “transferred”. Hospital length of stay was calculated as the time in days starting from hospital admission until date of hospital discharge (either recovery, in-hospital death, or lost to follow-up). Only transfer to ICU was captured in the database as a time-defined critical state of COVID-19, with time to critical disease state calculated as time between hospital admission and date of transfer to ICU. If ICU transfer was on the same day as hospital admission, then we assumed half a day in hospital before a transfer to ICU.
In the multi-state model, transfer to ICU was modelled as a transient state, using time from admission to ICU, and recovery and in-hospital death as absorbing states either after hospital admission or via a transfer to ICU using hospital length of stay.
The multi-state prediction model framework in the hospital setting
A multi-state model in the hospital setting describes the course of hospital stay from admission to discharge, including intermediate events of disease progression among the hospitalised patients. The present analysis is built upon the same set-up of the multi-state model as devised in our previous work (17), but we have extended the multi-state methodology for the deployment of a risk prediction model.
In our multi-state model, patients entered in one initial state at day 0: State 1: “Hospitalisation.” From this State 1, they can either transition to a transient state: State 2: “ICU” (as a proxy for severe disease progression) or to one of the two absorbing states: discharged alive (State 3: “Recovery”) or in-hospital death (State 4: “Death”). From State 2, a transition to State 3 or State 4 was once again possible. Hereby, this four-state model incorporated five possible transitions: Transition 1: “Hospitalisation to ICU”, Transition 2: “Hospitalisation to recovery”, Transition 3: “Hospitalisation to death”, Transition 4: “ICU to recovery”, and Transition 5: “ICU to death”, as presented in Figure 1. In our model, being in the State 2 of ICU stay reflected a past transfer to ICU, but not necessarily a present stay at ICU, and also no back-transition to State 1 was considered.
Figure 1. Schematic representation of the multi-state model with four states and five transition events, including the event matrix for training and Testing set. aNo event represents censored or absorbing state. bTraining set was taken in 10-fold for building the prediction model on stacked multiple-imputed data. cTesting set for calculating prediction error includes complete cases only (excluding 57% of the Testing patients with missing data on any of the predicting variables).
Training and testing set
The dataset is split 70/30 into a Training and Testing set. The prediction models were built in the Training set (including the variable selection from predefined candidate predictors and regression coefficient estimation). Predictions were made in the Testing set with the accuracy of the prediction evaluated using the Brier prediction error for multi-state models with pseudo-values to handle the presence of right censoring (22).
Variable selection using the training set
Variable selection in the Training set was performed on the stacked MI data (23) using an elastic net procedure with alpha set to 0.5 and lambda within one standard error from the optimal lambda, determined by a 10-fold cross-validation using the “cv.glmnet” package in R (24).
Prediction model adoption in the training set
This prediction model aims to predict individual state-occupation probabilities for the intermediate and the absorbing states at a future time t after hospital admission. To this end, we performed an elastic net Cox regression model for the variable selection of the five transition hazards, and subsequently, the four state-occupation probabilities were predicted by combining the (exponentiated) regression coefficients of the selected variables with the cumulative baseline hazards, as estimated by the Breslow estimator with the Aalen estimator of variance (25). Hereafter, we fitted also on the Training data a multi-state model without any covariates, i.e., a null model, with the aim to compare predicting performance of the prediction model with that of a null model.
Prediction of state-occupation probabilities in testing set
The predictions of the state-occupation probabilities were calculated in R using the “mstate” package with “msfit” function to obtain the cumulative baseline hazards, and the “probtrans” function to compute the transition probabilities (25) in the Testing set for the model with the selected predictor variables and the null model.
Risk prediction error measurement using testing set
To evaluate the prediction performance of the model, the prediction error based on the Brier Score was calculated as a measure of predictive accuracy that evaluates both discrimination and calibration simultaneously (26). Prediction error was calculated up to 30 days after hospital admission, since most of the patients had been discharged by then. Briefly, the Brier score is a function of the differences between the predictions and the observations and can be seen as the mean squared error of these differences. With the use of pseudo-values, the Brier score can also be estimated for survival models derived from incomplete data because of right censoring (27).
Pseudo-values were calculated in the Testing set, by directly modelling state-occupation probabilities in the Testing set by using a pseudo-value regression (28, 29). This approach replaces all observations, censored or not, with pseudo-values that are estimated by recalculating the Aalen-Johansen estimator derived from the entire Testing set and a Testing set with leave-one-out, repeatedly for each individual. This leave-one-out jack-knife method for the Aalen-Johansen estimator of the marginal state-occupation probabilities permits to assign pseudo-values for the state-occupation probabilities to each individual at each time point. In this way, a pseudo-value reflects the extent to which the overall marginal estimator is affected by the presence or absence of that individual in the set. This implies that pseudo-values are intuitively related to the covariates at the individual level, and thus contain information on how covariates of that individual affect the overall marginal estimator.
Subsequently, in the Testing set, the Brier prediction error estimated with pseudo-values for the state-occupation probabilities was calculated (22). The prediction errors were calculated for predictions made with the prediction models as well as for predictions made with the null model (i.e., the non-parametric multi-state model derived from the Training set) in order to quantify the relative reduction in the prediction error after covariate-inclusion.
Building the nomogram
The selected predictor variables of the transition hazards were incorporated in a nomogram to predict the 2-, 3-, 7- and 14-day probability for the transitions from hospital admission to ICU-admission, to recovery and to in-hospital death. These days were chosen based on the distribution of time from hospital admission to ICU admission and total hospital length of stay, in particular day 2 and 3 mainly reflecting the need for an ICU transfer as 75% of the ICU patients are transferred before or at day 3 after hospitalisation, and day 7 and 14 mainly reflecting potential clinical outcomes of recovery or in-hospital death, in case of no transfer to ICU, as approximately one-third of the patients will have been recovered or died after 1 week of hospitalisation and up to 75% after 2 weeks. Using the “rms” package in R, the nomogram function provided for each variable a point score, ranging from 0 to 10 (low to high risk) and attributed according to the predictive importance of the selected variables on the specific transition probability, as represented by the format of the axes (30, 31).
Results
Descriptive characteristics of the study population
Patient’s characteristics were similar to those reported earlier in analyses using the Belgian COVID-19 surveillance data (17, 32–35) (Table 1). Approximately 85% of the hospitalised COVID-19 patients were aged ≥ 50 years and more than half of them were men. In general, the most frequent reported symptoms were lower respiratory complaints (69.7%), followed by fever (49.7%) and other symptoms typically associated with a viral infection (43.1%). A number of patients had a least one comorbidity with arterial hypertension (40.6%), cardiovascular disease (32.7%) and diabetes mellitus (23.8%) as the most common. Median length of hospital stay was 9 days. Approximately 15% of the patients were admitted to ICU with a median time from hospital to ICU admission of 1 day, and a total of 80.3% recovered, 16.9% died in the hospital and only 2.8% were lost to follow-up.
Table 1. Description of the information of COVID-19 patients according to health states during hospitalisation available from the Belgian COVID-19 hospital surveillance database between March 2020 and June 2021.
Compared to non-ICU patients, the ICU-patients were slightly younger, and more frequently males, more likely to report symptoms of fever, lower respiratory complaints, and loss of appetite (but less gastro-intestinal complaints), to suffer from pre-existing conditions (particularly obesity, arterial hypertension, and diabetes mellitus), and to present lower levels of lymphocyte and higher levels of LDH and CRP. In addition, median length of hospital stay was more than double for those admitted to ICU (median 17 days in ICU-patients vs. 8 days in non-ICU-patients; data of non-ICU patients; not shown), and more patients experienced an in-hospital death (33.4 vs. 14.1%).
Compared with patients that recovered, the patients who died in the hospital were older, more frequently males, and more likely to report less symptoms (but more lower respiratory complaints), to suffer from pre-existing conditions, and to present lower levels of lymphocyte and higher levels of LDH and CRP. Median length of hospital say did not differ by final health states, but as compared with patients that recovered, the patients with an in-hospital death were more often transferred to ICU (28.9 vs. 10.6%), and also the patients that were lost-to-follow-up were more often transferred to ICU (42.4%).
Risk prediction models in a multi-state setting
The estimates for the transition hazards for the risk prediction model of COVID-19 disease progression, as applied to 31,205 patients included in the training set, are presented in Table 2.
Table 2. Hazard ratios and 95% confidence intervals for transitions of COVID-19 disease progression in a multi-state risk prediction setting using the Training dataset with 10-fold multiple imputation.
Recovery after hospital admission (transition 2) was strongly positively predicted by pregnancy (HR: 1.28, 95% CI: 1.23–1.33), reporting symptoms of anosmia (1.17, 1.15–1.19), presenting high levels of lymphocytes (1.16, 1.14–1.17), while inversely by advanced age (50–69 years: 0.79, 0.78–0.80; 70–79 years: 0.51, 0.50–0.52; 80+ years: 0.34, 0.34–0.35), presenting two or more comorbidities (2: 0.85, 0.84–0.87; 3+ years: 0.82, 0.80–0.84), and in particular neurological disorders (0.83, 0.81–0.84), haematological malignancies (0.85, 0.82–0.88), cognitive disorders (0.87, 0.86–0.89), chronic lung disease (0.89, 0.88–0.90) and chronic renal disease (0.90, 0.87–0.92), as well as presenting high levels of CRP (0.82, 0.81–0.83) and of LDH (0.89, 0.88–0.90).
Strong predictors for an increased probability of ICU transfer (transition 1) were presenting high values for CRP (2.10, 2.05–2.15) and LDH (2.02, 1.98–2.07), being admitted to a University Hospital (1.88, 1.83–1.94), reporting symptoms of loss of appetite (1.74, 1.65–1.85), suffering from obesity (1.46, 1.43–1.49), reporting symptoms of lower respiratory complaints (1.38, 1.35–1.42) and being male (1.30, 1.27–1.33), while a lower risk was observed for aged 80+ years (0.43, 0.41–0.45), residence in a retirement home (0.63, 0.61–0.66), suffering from cognitive disorders (0.68, 0.65–0.71), and overloaded ICU occupancy (0.79, 0.77–0.81).
The probability of in-hospital death after hospital admission (transition 3) was mainly positively predicted by advanced age (70–79 years: 2.27, 2.18–2.37; 80+ years: 4.50, 4.34–4.68), a residence in retirement home (1.86, 1.81–1.90), presenting high levels of CRP (1.93, 1.87–2.00) and LDH (1.81, 1.75–1.86), reporting symptom of lower respiratory complaints (1.33, 1.30–1.36), presenting two or more comorbidities (2: 1.17, 1.11–1.23; 3+: 1.12, 1.05–1.08), and in particular chronic liver disease (1.45, 1.36–1.54), haematological (1.32, 1.24–1.40) and solid (1.27, 1.23–1.31) malignancies, cardiovascular disease (1.25, 1.22–1.28) and chronic neurological disorders (1.24, 1.20–1.26), while inversely by reporting symptoms of anosmia (0.75, 0.70–0.80) and of gastro-intestinal complaints (0.81, 0.79–0.84), and presenting high values of lymphocytes (0.79, 0.77–0.82).
For the transition to recovery after a transfer to ICU (transition 4), advanced age was identified as a strong inverse predictor (50–69 years: 0.76, 0.72–0.79; 70–79 years: 0.45, 0.44–0.47, 80+ years: 0.50, 0.47–0.53) as well as high levels of CRP (0.76, 0.72–0.79) and LDH (0.81, 0.78–0.83), while reporting loss of appetite (1.55, 1.45–1.65) as a positive predictor. For the transition to in-hospital death after a transfer to ICU (transition 5), the predictors for a higher probability were advanced age (70–79 years: 1.78, 1.65–1.92; 80+ years: 3.08, 2.84–3.33; residence in retirement home: 1.67, 1.57, 1.78) and suffering from comorbidities (2: 1.23, 1.14–1.33; 3+ year: 1.40, 1.25–1.57), and in particular haematological malignancies (1.52, 1.40–1.66), chronic lung disease (1.27, 1.09–1.48), transplant (1.27, 1.22, 1.33) and chronic renal disease (1.23, 1.17–1.29), while staying at a university hospital (0.69, 0.65–0.73) was the main predictor for a lower probability.
Figure 2 shows the performance of the risk prediction model in the Testing set as compared to a null model. Overall, the prediction error for both the null and full model was increasing during the first days after hospitalisation, and started decreasing after 7 days to flatten after day 20. At 7 days after hospital admission, the prediction error was 0.64 for the null and 0.53 for the full model, pointing to a proportional reduction of 16% which was, on average, also observed for the following days. Stratified by health state, the prediction errors were taking a different course over time for each state, and followed the flow of patients during the hospital stay, i.e., a fast increase and a flattened decrease for the state of hospitalisation and ICU as well as for the state of recovery but at a slower speed, while only a flattened increase in prediction error for the state of in-hospital death.
Figure 2. Prediction error of the multi-state prediction model as compared with a null model, calculated using the Brier score, overall (A) and stratified by state (B).
State probabilities of a reference patient versus a high-risk patient
Risk prediction outcomes were visualised using plots for the cumulative hazards for the five possible transitions and the final state-occupation probabilities, and in this case for a high-risk patient versus a reference patient (Figure 3). Overall, the cumulative hazards for recovery were markedly greater than that of in-hospital death as well as that of the transfer to ICU, which was associated with an increased hazard for in-hospital death and decreased hazard for recovery (Figure 3A). Transition hazards were, however, different between a high-risk and a reference patient. For a high-risk patient, a lower hazard to recovery was observed, independently of transfer to ICU, as well as a higher transition hazard for a transfer to ICU and in-hospital death after hospitalisation and after ICU. Similarly, patients with a high-risk profile had higher state-occupation probabilities for ICU, starting immediately after hospital admission and slowly levelling off at 5 days after hospital admission, and also had a higher probability for in-hospital death along with a lower probability for recovery (Figure 3B). However, for both patient profiles, the state-occupation probabilities of recovery were higher than those of in-hospital death, by which the recovery probability were showing an increase over time that stabilised after 15–20 days of hospitalisation and those of in-hospital death a gentle increase over the 30-day hospital stay.
Figure 3. Plots for the cumulative transition hazards (A) and state-occupation probabilities (B) in a multi-state model considering a high-risk patient (tick solid line) versus a reference patient (thin solid line). The reference patient represents an average patient characterised by being male, aged between 55 and 69 years old, admitted to a general hospital with a medium-high ICU occupancy at hospital admission, experiencing lower respiratory complaints, having zero co-morbidities, being a non-smoker, and presenting high levels of lymphocyte and medium-high levels (Q3) of lactate dehydrogenase (LDH) and C-reactive protein (CRP). The high-risk patient represents an existing patient with the worst risk profile, while having the same age, sex, hospital type and ICU occupancy at admission as the reference patient.
Nomogram
Using the first three transitions in a competing risk framework, a nomogram was built assigning a score to each of the selected predictor variables associated with the recovery, ICU admission, and in-hospital death, respectively (Figure 4).
Figure 4. Nomograms to predict the transition probability for admission to recovery (A), ICU (B), and in-hospital death (C) after 2, 3, 7, and 14 days of hospitalisation in COVID-19 hospitalised COVID-19 patients. Patients’ values are located on the axis of each variable where 0 refers to “No” and 1 “Yes”; drawing an upward line at 90° angle to determine the number of points for that particular variable. The sum of these numbers of points is located on the total score axis; drawing a downward line at 90° angle to determine the probability of experiencing that particular transition at day 2, 3, 7, and 14.
The transition from hospital admission to recovery was mainly predicted by age, receiving a number of 10 points when younger than 80+ plus 6.3 points when younger than 70–70 years and plus 2.2 points when younger than 50–69 years, and to a lesser extent by pregnancy (2.3 points) and the number of comorbidities (1.8 plus 1.5 plus 0.8 points when having less than 3 comorbidities, less than 2, and less than 1, respectively) and in particular the comorbidity of chronic neurological disorders (1.8 point when absent; Figure 4A). Points were lower than 1.5 for all other predictor variables relevant for recovery after hospital admission.
For a transition from hospital admission to ICU admission, the higher numbers of points were assigned to being of non-advanced age (10 when younger than 80+ plus 5.5 when not having a residence in a retirement home), high levels of CRP (8.8) and LDH (8.4) followed by hospital type (7.5 when admitted to university hospital), reporting the symptom of loss of appetite (6.6), and the lower points were for having not having cognitive disorders (4.5), but suffering from obesity (4.5) and reporting the symptom of lower respiratory complaints (3.9), and being male (3.1; Figure 4B). Points were lower than 3 for all other predictor variables relevant for ICU admission.
For the transition from hospital admission to in-hospital death, the higher numbers of points were assigned to advanced age (10 when 80+, 5.5 when 70–79 years, and 4.1 for residence in retirement home), followed by high levels of CRP (4.4) and LDH (3.9), and the comorbidity of chronic liver disease (2.5; Figure 4C). Points were lower than 2 for all other predictor variables relevant for in-hospital death after hospital admission.
For example, using the nomogram for calculating the transition probabilities for the first three transitions at 7 days after hospital admission for a reference patient resulted in a transition probability for ICU of 10%, for recovery of 50% and for in-hospital death less than 5%, while for the high-risk patient the transition probabilities of 40, 30, and 10%, respectively, corresponding with Figure 3B. Estimated c-index for the individual transitions, as illustrated by the nomograms, was 0.677 for the risk prediction of ICU transfer, and 0.629 and 0.766 for predicting recovery and in-hospital death, respectively, after hospital admission.
Complete-case analysis
Patient’s characteristics of the complete-cases only did not differ from the 44,550 patients used for the MI (Supplementary Table 1). Less predictors were selected for each transition in the complete-case analysis (Supplementary Table 2), while showing a similar reduction in prediction error (data not shown). Predictors selected in both the imputed and complete-case dataset had regression coefficients pointing toward the same direction, and often of the same or a marginally weaker strength for the complete-case analysis.
Discussion
In this study, a multi-state prediction model has been adopted for one of the first times to identify predictive factors of COVID-19 outcomes after hospitalisation, including recovery, ICU transfer (as proxy for critical disease state) and death, in a multi-state framework. Our findings indicate that all possible transitions are predicted by a different set of variables with varying magnitudes, but having in common the following variables: sex, age, levels of CRP and LDH, complaints related to the lower respiratory tract, and pre-existing comorbidities. The risk prediction model had good performance with 15% reduction in prediction error as compared to a null model derived from the Training set.
This study identified age, sex, symptoms of lower respiratory complaints, higher levels of CRP and LDH, as well as living in a retirement home, as the set of variables predicting progression to critical/fatal COVID-19, however varying in terms of hazard ratios for critical disease state and in-hospital death. These findings align with those of a meta-analysis using data of 69,762 patients, which reported age, cerebrovascular disease, CRP, LDH and cardiac troponin I as relevant prognostic factors for severe COVID-19 outcomes (36). In addition, an umbrella review and meta-analysis summarising estimates for 42 pre-existing comorbidities reported an increased risk of severe COVID-19 outcomes in patients with diabetes, obesity, cardiovascular disease, chronic obstructive pulmonary disease and dementia as well as liver cirrhosis and active cancer from studies conducted in Europe and US (37). In accordance with these results, our analysis found obesity to be a relevant predictor for ICU admission, and cardiovascular disease, chronic liver disease, neurological disorders and malignancies for in-hospital mortality, while recovery was predicted to be significant, but modestly lower for patients with one or more pre-existing conditions. Pre-existing comorbidities were also associated with higher mortality among COVID-19 patients previously admitted to ICU along with increasing age, high levels on disease severity scores, and disrupted immune-response (38). Also, in this study, increasing age and suffering from pre-existing conditions were selected to be of prognostic relevance for patients admitted to ICU with COVID-19. A note of caution is due here since results are based on a limited sample size of 6,520 ICU patients (15% of the total cohort).
Debate continues regarding the role of smoking status with COVID-19 severity. A recent meta-analysis including 517,020 patients from 109 studies found smoking to be significantly associated with ICU admission (OR: 1.73; 95% CI: 1.36, 2.19) and increased mortality (OR: 1.58, 95% CI: 1.38, 1.81) (39), in agreement with previous meta-analyses (40–42), while not confirmed by other meta-analyses (43–45) and recent large retrospective cohort studies (46, 47). This discrepancy could be attributed to the mediating role of comorbidities and their risk factors (46, 47) as well as the lower than expected prevalence of smoking among COVID-19 patients (44, 46, 47). The latter also validates the lower risk of SARS-CoV-2 infection observed among smokers (RR: 0.74; 95% CI: 0.58, 0.93), as established in a meta-analyses based on 45 studies providing data on SARS-CoV-2 infection in adults (45). The smoking rate of patients is, however, considered as insufficient to judge any association between smoking and COVID-19 because of the higher probability of smokers for being tested owing to symptoms similar to COVID-19 (48). In our study, smoking was not identified as a relevant strong predictor for COVID-19 severity (i.e., weak HRs only seen in the dataset with 10-fold MI), which concurs with the recorded lower smoking prevalence among included hospitalised COVID-19 patients (8.1%) as compared to the national prevalence (16.1%), as inquired by the Belgian Health Interview Survey of 2018 (49). However, with this apparently low proportion of smokers requiring hospitalisation for their COVID-19, caution must be applied, as an “unknown” smoking status was recorded for around 40% of the patients considered for inclusion, without any reason provided but presumably more likely among smokers. In general, it appears that, if existing, any predictive value of current smoking status on critical/fatal COVID-19 outcomes is likely to be small, but this needs to clarified with more accurate data.
Considering the choice of the methodology for the predictive models, a systematic review, including 107 risk prediction models for severe COVID-19 outcomes, concluded that many of the proposed models carry a high risk of bias because of inappropriate statistical methods ignoring time-to-event and the presence of competing risks (2). Integrating competing risk models into a multi-state framework allows simultaneous modelling of time-to-event outcomes and disease progression, and hereby enables the calculation of transition and state occupation probabilities, adding an extra layer of information (18, 50). Like this, the multi-state model more accurately describes the evolution of hospitalised patients, by accounting for intermediate events of disease progression that likely influence disease outcomes over time, i.e., time and event-related dependencies of disease progression. Similar to a smaller sample size study covering only one region of Spain (51), the present study devised the multi-state framework for building a risk prediction model for COVID-19 critical state (ICU transfer) and death versus recovery, and consistently co-morbidities were more frequently observed to be the more prevalent among those experiencing worst COVID-19 outcomes, like hospitalisation, and subsequently in-hospital death. In addition, we thereby identified male sex, high levels of CRP and LDH as well as reporting lower respiratory complaints as common important predictive factors for transitioning to ICU or in-hospital death after hospitalisation due to SARS-CoV-2 infection, with additionally middle-age, obesity, reporting loss of appetite and staying in a university hospital for a transfer to ICU, while advanced age and a higher number of comorbidities for in-hospital death.
The use of the nationwide hospital surveillance data, along with appropriate statistical models fitting the hospital setting, are of crucial importance for informed evidence-based clinical decision-making. In the present study, the underlying time-to-event analyses allowed prediction of the state-occupation probabilities through the transition hazards and cumulative hazards, i.e., an indirect approach by which variable selection for the prediction model occurred at the level of transition hazards of the time-to-event models instead of state-occupation probabilities. Alternatively, a pseudo-value regression model would offer the possibility to directly model the state-occupation probabilities for intermediate and absorbing state, and modelling this only for a predefined future time t after hospital admission, as chosen in advance (28, 29). Further, information on when the event occurred, as needed to account for time dependencies, was in our data only reported for hospital admission and discharge (i.e., in-hospital death vs. recovery) and for admission to ICU, and therefore this latter was the only intermediate event considered as a proxy for critical disease progression. Also, the lack of information on withdrawal and/or the Limitation of Life-Sustaining Therapies might limit overall conclusions. However, with the large number of hospitalised COVID-19 patients in Belgium, it is anticipated to have a sufficient number of events per variable to, despite the presence of low-prevalence predictors, eliminate bias in the regression coefficients and improve predictive accuracy for all five transitions considered (52, 53). Albeit the large sample size of our study allowed us to use a random split approach for internal validation, other internal validation tests, such as bootstrapping or cross-validation, should be used to fully account for overfitting and optimism in model performance (54). The Brier score, used for the overall evaluation of the model performance as well as the individual transitions, limited comparison with other established published prediction models, though this evaluation supported by providing the c-index for the individual transitions as illustrated in the nomogram. This prediction model is based on patient’s electronic medical files relying on the clinicians’ report of clinical observations and anamnesis and medical registration personnel reporting accurately, which might vary across hospitals and during the peak periods of the pandemic. In this respect, an important limitation is the number of missing values without any reason for the incomplete data, in particular an issue for current smoking. A complete-case analysis was performed to confirm the results obtained after MI, but warrants caution because of the underlying assumption of missing at random. Finally, the model’s flexibility for adaptation to other settings and countries remains to be examined, by way of an external validation judging not only discrimination but also calibration and clinical utility in a dataset with a minimum of 100 events collected using appropriate study designs and representative of the target population (55).
In conclusion, integrating the standard Cox models into a multi-state framework allows the study of separate competing outcomes simultaneously as well as the disease progression through intermediate states. This paper shows the application of the multi-state framework for the deployment of a risk prediction model for COVID-19 disease progression after hospitalisation. Each transition, i.e., from hospitalisation to critical disease state, and subsequently to recovery or in-hospital death, has its own set of variables with varying magnitudes, and commonly including sex, age, levels of CRP and LDH, complaints related to the lower respiratory tract, and pre-existing comorbidities as predicting variables. The deployment in a risk score to predict the first three potential transitions shows the potential to be utilised by potential stakeholders, such as health care providers and policy makers, for informed clinical decision on patients’ management, and resource allocation for effective health systems.
Data availability statement
The data that support the findings of this study are available from Sciensano, but restrictions apply to the availability of these data, which were used under licence for the current study, and so are not publicly available. Data are however available from the corresponding author upon reasonable request and with permission of Sciensano.
Membership of the Belgian collaborative group on COVID-19 hospital surveillance
The Belgian Collaborative Group on COVID-19 Hospital Surveillance consists of all local investigators who are responsible for the design of the COVID surveillance, identifying variables for inclusion, and data collection in each of the hospitals. They have all approved the final manuscript. Anke Vanhoenacker, Ziekenhuisnetwerk Antwerpen, Belgium; Benjamin Kerzmann, Clinique Notre Dame de Grâce, Gosselies, Belgium; BS, Sciensano, Brussels, Belgium; Camelia Rossi, Centre Hospitalier Universitaire Ambroise Paré, Mons, Belgium; Carole Schirvel, CHIREC, Brussels, Belgium; Caroline Gheysen, Jan Yperman Ziekenhuis, Poperinge, Belgium; Catherine Nachtergal, Cliniques de l’Europe, Brussels, Belgium; Chloé Wyndham-Thomas, Sciensano, Brussels, Belgium.; Denis Piérard, Universitair Ziekenhuis Brussel, Brussels; Didier Delmarcelle, Clinique St. Jean, Brussels, Belgium; Dominique Van Beckhoven, Sciensano, Brussels, Belgium; Elise Willems, Algemeen Ziekenhuis Nikolaas, Sint-Niklaas, Belgium; Erica Sermijn, A.S.Z. Ziekenhuis, Aalst, Belgium; Eva Van Braeckel, Universiteit Gent, Gent, Belgium; Fabio Taccone, Hôpital Erasme, Brussels, Belgium; Filip Triest, Algemeen Ziekenhuis Sint Lucas, Gent, Belgium; Frank Staelens, Onze Lieve Vrouwziekenhuis, Aalst, Belgium; Geert Meyfroidt, Katholieke Universiteit Leuven, Leuven, Belgium; Jean-Marc Minon, Centre Hospitalier Régional de la Citadelle, Liège, Belgium; Jenny Chung, Sciensano, Brussels, Belgium; Jens Van Praet, Algemeen Ziekenhuis Sint Jan, Brugge-Oostende, Belgium; Koen Blot, Sciensano, Brussels, Belgium; Kristof Bafort, Mariaziekenhuis, Pelt, Belgium; Leïla Belkhir, Cliniques Universitaires Saint-Luc, Brussels, Belgium; Mélanie Delvallee, Centre Hospitalier de Wallonie Picarde, Tournai, Belgium; Nathalie Bossuyt, Sciensano, Brussels, Belgium; Nicolas Dauby, Centre Hospitalier Universitaire Saint-Pierre, Brussels, Belgium; Nina Van Goethem, Sciensano, Brussels, Belgium; Paul De Munter, Universitair Ziekenhuis Leuven, Leuven, Belgium; Philippe Minette, Centres Hospitaliers Jolimont, Belgium; Pierre Yves Machurot, Centre Hospitalier de l’Ardenne, Belgium; Rémy Demeester, Centre Hospitalier Universitaire de Charleroi, Charleroi, Belgium; Robby De Pauw, Sciensano, Brussels, Belgium; Roeland Verstraete, Algemeen Ziekenhuis, Monica, Antwerpen, Belgium; Samy Amir Aouachria, Centre Hospitalier Chrétien, Liège, Belgium; Saphia Mokrane, Hôpitaux Iris Sud, Brussels, Belgium; Sarah Cooreman, Algmeen Ziekenhuis Monica, Antwerp, Belgium; Sarah Loof, Algemeen Ziekenhuis Maria Middelares Gent, Gent, Belgium; Séverine Noirhomme, Centre Hospitalier Régional de Namur, Belgium; Steven Callens, Universiteit Gent, Gent, Belgium; Thierry Dugernier, Clinique Saint-Pierre, Ottignies, Belgium; Vincent Colombie, Centre Hospitalier Epicura, Baudour, Belgium; Xavier Holemans, Grand Hôpital de Charleroi, Charleroi, Belgium.
Author contributions
EM and JP: concept and design, statistical analysis, and interpretation of data. BS and MV: provision of data, curation of data, and acquisition of data. EM: drafting of the manuscript. EM, BS, MV, and JP: critical revision of the manuscript for important intellectual content. JP: supervision. All authors contributed to the article and approved the submitted version.
Acknowledgments
We acknowledge the earlier contribution of Els Genbrugge in the statistical analyses and data interpretation as well as in the critical revision of the manuscript for important intellectual content.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2022.1027674/full#supplementary-material
References
2. Wynants L, Van Calster B, Collins GS, Riley RD, Heinze G, Schuit E, et al. Prediction models for diagnosis and prognosis of COVID-19 infection: systematic review and critical appraisal. BMJ. (2020) 369:m1328.
3. Lu J, Hu S, Fan R, Liu Z, Yin X, Wang Q, et al. ACP risk grade: a simple mortality index for patients with confirmed or suspected severe acute respiratory syndrome coronavirus 2 disease (COVID-19) during the early stage of outbreak in Wuhan, China. medRxiv [Preprint]. (2020). doi: 10.1101/2020.02.20.20025510
4. Wang S, Zha Y, Li W, Wu Q, Li X, Niu M, et al. A Fully automatic deep learning system for COVID-19 diagnostic and prognostic analysis. Eur Respir J. (2020) 56:2000775. doi: 10.1183/13993003.00775-2020
5. Bello-Chavolla OY, Bahena-Lopez JP, Antonio-Villa NE, Vargas-Vázquez A, González-Díaz A, Márquez-Salinas A, et al. Predicting mortality due to SARS-CoV-2: a mechanistic score relating obesity and diabetes to COVID-19 outcomes in Mexico. medRxiv [Preprint]. (2020). doi: 10.1101/2020.04.20.20072223
6. Guo Y, Liu Y, Lu J, Fan R, Zhang F, Yin X, et al. Development and validation of an early warning score (EWAS) for predicting clinical deterioration in patients with coronavirus disease 2019. medRxiv [Preprint]. (2020). doi: 10.1101/2020.04.17.20064691
7. Ji D, Zhang D, Xu J, Chen Z, Yang T, Zhao P, et al. Prediction for progression risk in patients with COVID-19 pneumonia: the CALL score. Clin Infect Dis. (2020) 71:1393–9.
8. Liang W, Yao J, Chen A, Lv Q, Zanin M, Liu J, et al. Early triage of critically ill COVID-19 patients using deep learning. Nat Commun. (2020) 11:3543.
9. Labenz C, Kremer WM, Schattenberg JM, Wörns MA, Toenges G, Weinmann A, et al. Clinical Frailty Scale for risk stratification in patients with SARS-CoV-2 infection. J Investig Med. (2020) 68:1199–202.
10. Galloway JB, Norton S, Barker RD, Brookes A, Carey I, Clarke BD, et al. A clinical risk score to identify patients with COVID-19 at high risk of critical care admission or death: an observational cohort study. J Infect. (2020) 81:282–8.
11. Zeng L, Li J, Liao M, Hua R, Huang P, Zhang M, et al. Risk assessment of progression to severe conditions for patients with COVID-19 pneumonia: a single-center retrospective study. medRxiv [Preprint]. (2020). doi: 10.1101/2020.03.25.20043166
12. Clift AK, Coupland CAC, Keogh RH, Diaz-Ordaz K, Williamson E, Harrison EM, et al. Living risk prediction algorithm (QCOVID) for risk of hospital admission and mortality from coronavirus 19 in adults: national derivation and validation cohort study. BMJ. (2020) 371:m3731. doi: 10.1136/bmj.m3731
13. Burn E, Tebe C, Fernandez-Bertolin S, Aragon M, Recalde M, Roel E, et al. The natural history of symptomatic COVID-19 in Catalonia, Spain: a multi-state model including 109,367 outpatient diagnoses, 18,019 hospitalisations, and 5,585 COVID-19 deaths among 5,627,520 people. medRxiv [Preprint]. (2020). doi: 10.1101/2020.07.13.20152454
14. Hazard D, Kaier K, von Cube M, Aragon M, Recalde M, Roel E, et al. Joint analysis of duration of ventilation, length of intensive care, and mortality of COVID-19 patients: a multistate approach. BMC Med Res Methodol. (2020) 20:206. doi: 10.1186/s12874-020-01082-z
15. von Cube M, Grodd M, Wolkewitz M, Hazard D, Lambert J. Harmonizing heterogeneous endpoints in COVID-19 trials without loss of information - an essential step to facilitate decision making. medRxiv [Preprint]. (2020). doi: 10.1101/2020.03.31.20049007
16. Deschepper M, Eeckloo K, Malfait S, Benoit D, Callens S, Vansteelandt S, et al. Prediction of hospital bed capacity during the COVID-19 pandemic. BMC Health Serv Res. (2020) 21:468. doi: 10.1186/s12913-021-06492-3
17. Peñalvo JL, Genbrugge E, Mertens E, Sagastume D, van der Sande MAB, Widdowson MA, et al. Insights into the association of ACEIs/ARBs use and COVID-19 prognosis: a multistate modelling study of nationwide hospital surveillance data from Belgium. BMJ Open. (2021) 11:e053393. doi: 10.1136/bmjopen-2021-053393
18. Putter H, Fiocco M, Geskus RB. Tutorial in biostatistics: competing risks and multi-state models. Stat Med. (2007) 26:2389–430. doi: 10.1002/sim.2712
19. Wolkewitz M, Lambert J, von Cube M, Bugiera L, Grodd M, Hazard D, et al. Statistical analysis of clinical COVID-19 data: a concise overview of lessons learned. common errors and how to avoid them. Clin Epidemiol. (2020) 12:925–8. doi: 10.2147/CLEP.S256735
20. Van Goethem N, Vilain A, Wyndham-Thomas C, Deblonde J, Bossuyt N, Lernout T, et al. Rapid establishment of a national surveillance of COVID-19 hospitalizations in Belgium. Arch Public Health. (2020) 78:121.
21. van Buuren S, Groothuis-Oudshoorn K. mice: multivariate imputation by chained equations in R. J Stat Softw. (2011) 45:1–67.
22. Spitoni C, Lammens V, Putter H. Prediction errors for state occupation and transition probabilities in multi-state models. Biom J. (2018) 60:34–48. doi: 10.1002/bimj.201600191
23. Thao LTP, Geskus R. A comparison of model selection methods for prediction in the presence of multiply imputed data. Biom J. (2019) 61:343–56. doi: 10.1002/bimj.201700232
24. Friedman J, Hastie T, Simon N, Tibshirani R, Hastie MT, Matrix D. Package ‘glmnet’. J Stat Softw. (2017) 33:1–22.
25. de Wreede LC, Fiocco M, Putter H. The mstate package for estimation and prediction in non-and semi-parametric multi-state and competing risks models. Comput Methods Programs Biomed. (2010) 99:261–74. doi: 10.1016/j.cmpb.2010.01.001
26. Schoop R, Beyersmann J, Schumacher M, Binder H. Quantifying the predictive accuracy of time-to-event models in the presence of competing risks. Biom J. (2011) 53:88–112.
27. Andersen PK, Klein JP, Rosthøj S. Generalised linear models for correlated pseudo-observations, with applications to multi-state models. Biometrika. (2003) 90:15–27.
28. Apkkjpr S. Generalised linear models for correlated pseudoobservations, with applications to multi-state models. Biometrika. (2003) 90:15–27.
29. Dutta S, Datta S, Datta S. Temporal prediction of future state occupation in a multistate model from high-dimensional baseline covariates via pseudo-value regression. J Stat Comput Simul. (2017) 87:1363–78. doi: 10.1080/00949655.2016.1263992
30. Zhang Z, Geskus RB, Kattan MW, Zhang H, Liu T. Nomogram for survival analysis in the presence of competing risks. Ann Transl Med. (2017) 5:403–403.
31. Harrell FE. Regression Modeling Strategies: With Applications To Linear Models, Logistic And Ordinal Regression, And Survival Analysis. Berlin: Springer (2015).
32. Faes C, Abrams S, Van Beckhoven D, Meyfroidt G, Vlieghe E, Hens N, et al. Time between symptom onset, hospitalisation and recovery or death: statistical analysis of Belgian COVID-19 patients. Int J Environ Res Public Health. (2020) 17:7560.
33. Catteau L, Dauby N, Montourcy M, Bottieau E, Hautekiet J, Goetghebeur E, et al. Low-dose hydroxychloroquine therapy and mortality in hospitalised patients with COVID-19: a nationwide observational study of 8075 participants. Int J Antimicrob Agents. (2020) 56:106144. doi: 10.1016/j.ijantimicag.2020.106144
34. de Azambuja E, Brandão M, Wildiers H, Laenen A, Aspeslagh S, Fontaine C, et al. Impact of solid cancer on in-hospital mortality overall and among different subgroups of patients with COVID-19: a nationwide, population-based analysis. ESMO Open. (2020) 5:e947–947. doi: 10.1136/esmoopen-2020-000947
35. Taccone FS, Van Goethem N, De Pauw R, Wittebole X, Blot K, Oyen HV, et al. The role of organizational characteristics on the outcome of COVID-19 patients admitted to the ICU in Belgium. Lancet Reg Health Eur. (2021) 2:100019. doi: 10.1016/j.lanepe.2020.100019
36. Katzenschlager S, Zimmer AJ, Gottschalk C, Grafeneder J, Schmitz S, Kraker S, et al. Can we predict the severe course of COVID-19 - a systematic review and meta-analysis of indicators of clinical outcome? PLoS One. (2021) 16:e0255154. doi: 10.1371/journal.pone.0255154
37. Treskova-Schwarzbach M, Haas L, Reda S, Pilic A, Borodova A, Karimi K, et al. Pre-existing health conditions and severe COVID-19 outcomes: an umbrella review approach and meta-analysis of global evidence. BMC Med. (2021) 19:212. doi: 10.1186/s12916-021-02058-6
38. Taylor EH, Marson EJ, Elhadi M, Macleod KDM, Yu YC, Davids R, et al. Factors associated with mortality in patients with COVID-19 admitted to intensive care: a systematic review and meta-analysis. Anaesthesia. (2021) 76:1224–32.
39. Zhang H, Ma S, Han T, Qu G, Cheng C, Uy JP, et al. Association of smoking history with severe and critical outcomes in COVID-19 patients: a systemic review and meta-analysis. Eur J Integr Med. (2021) 43:101313.
40. Reddy RK, Charles WN, Sklavounos A, Dutt A, Seed PT, Khajuria A, et al. The effect of smoking on COVID-19 severity: a systematic review and meta-analysis. J Med Virol. (2021) 93:1045–56.
41. Dorjee K, Kim H, Bonomo E, Dolma R. Prevalence and predictors of death and severe disease in patients hospitalized due to COVID-19: a comprehensive systematic review and meta-analysis of 77 studies and 38,000 patients. PLoS One. (2020) 15:e0243191. doi: 10.1371/journal.pone.0243191
42. Mesas AE, Cavero-Redondo I, Álvarez-Bueno C, Sarriá Cabrera MA, Maffei de Andrade S, Sequí-Dominguez I, et al. Predictors of in-hospital COVID-19 mortality: a comprehensive systematic review and meta-analysis exploring differences by age, sex and health conditions. PLoS One. (2020) 15:e0241742. doi: 10.1371/journal.pone.0241742
43. Silverio A, Di Maio M, Citro R, Esposito L, Iuliano G, Bellino M, et al. Cardiovascular risk factors and mortality in hospitalized patients with COVID-19: systematic review and meta-analysis of 45 studies and 18,300 patients. BMC Cardiovasc Disord. (2021) 21:23. doi: 10.1186/s12872-020-01816-3
44. Farsalinos K, Bagos PG, Giannouchos T, Niaura R, Barbouni A, Poulas K, et al. Smoking prevalence among hospitalized COVID-19 patients and its association with disease severity and mortality: an expanded re-analysis of a recent publication. Harm Reduct J. (2021) 18:9. doi: 10.1186/s12954-020-00437-5
45. Simons D, Shahab L, Brown J, Perski O. The association of smoking status with SARS-CoV-2 infection, hospitalization and mortality from COVID-19: a living rapid evidence review with Bayesian meta-analyses (version 7). Addiction. (2021) 116:1319–68. doi: 10.1111/add.15276
46. Puebla Neira D, Watts A, Seashore J, Polychronopoulou E, Kuo YF, Sharma G, et al. Smoking and risk of COVID-19 hospitalization. Respir Med. (2021) 182:106414.
47. Williamson EJ, Walker AJ, Bhaskaran K, Bacon S, Bates C, Morton CE, et al. Factors associated with COVID-19-related death using OpenSAFELY. Nature. (2020) 584:430–6.
48. de Lusignan S, Dorward J, Correa A, Jones N, Akinyemi O, Amirthalingam G, et al. Risk factors for SARS-CoV-2 among patients in the Oxford royal college of general practitioners research and surveillance centre primary care network: a cross-sectional study. Lancet Infect Dis. (2020) 20:1034–42. doi: 10.1016/S1473-3099(20)30371-6
49. Sciensano. Gezondheidsenquête België, Health Interview Survey Belgium. Brussels: Sciensano (2018).
50. Meira-Machado L, de Uña-Alvarez J, Cadarso-Suárez C, Andersen PK. Multi-state models for the analysis of time-to-event data. Stat Methods Med Res. (2009) 18:195–222.
51. Álvarez-Esteban PC, del Barrio E, Rueda OM, Rueda C. Predicting COVID-19 progression from diagnosis to recovery or death linking primary care and hospital records in Castilla y León (Spain). PLoS One. (2021) 16:e0257613. doi: 10.1371/journal.pone.0257613
52. Austin PC, Allignol A, Fine JP. The number of primary events per variable affects estimation of the subdistribution hazard competing risks model. J Clin Epidemiol. (2017) 83:75–84. doi: 10.1016/j.jclinepi.2016.11.017
53. Ogundimu EO, Altman DG, Collins GS. Adequate sample size for developing prediction models is not simply related to events per variable. J Clin Epidemiol. (2016) 76:175–82.
54. Steyerberg EW, Harrell FE Jr. Prediction models need appropriate internal, internal-external, and external validation. J Clin Epidemiol. (2016) 69:245–7. doi: 10.1016/j.jclinepi.2015.04.005
Keywords: multistate modelling, risk prediction model, COVID-19, hospital data, Belgium
Citation: Mertens E, Serrien B, Vandromme M and Peñalvo JL (2022) Predicting COVID-19 progression in hospitalized patients in Belgium from a multi-state model. Front. Med. 9:1027674. doi: 10.3389/fmed.2022.1027674
Received: 25 August 2022; Accepted: 03 November 2022;
Published: 23 November 2022.
Edited by:
Reza Lashgari, Shahid Beheshti University, IranReviewed by:
Yuntao Chen, University College London, United KingdomMin Lu, University of Miami, United States
Copyright © 2022 Mertens, Serrien, Vandromme and Peñalvo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Elly Mertens, ZWxseW1lcnRlbnNAaXRnLmJl