Introduction

The first cases of pneumonia of unknown origin were detected in Wuhan (Hubei, China) in early December 20191. On 7 January 2020, Chinese scientists isolated a novel coronavirus that was named as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), and the relevant disease was called coronavirus-2019 disease (COVID-19)2. Since then, the dramatic increase in cases has posed numerous challenges to even the most sophisticated and advanced health systems, which led the World Health Organization (WHO) to declare the outbreak a pandemic in March 20203. To date, health systems worldwide have experienced an exponential increase in hospitalizations and admissions to intensive care units (ICUs) associated with COVID-194.

COVID-19 can cause a wide variety of symptoms ranging from asymptomatic infection to life-threatening complications such as acute respiratory distress, multi-organic failure, and death1,5,6,7. Some studies have evidenced that older patients with comorbidities (including arterial hypertension, cardio-respiratory disease or diabetes)6 and patients with more elevated levels of cytokines in blood1 are the ones at a higher risk for experiencing severe complications8,9.

At this time, there are no specific vaccines or treatments for COVID-19. Accurate diagnosis and prognosis of the disease are crucial to alleviating the burden on the health system while the best care possible is provided to patients. A predictive model that combines several variables or parameters to estimate the risk for a poor outcome would help the clinician to estimate the prognosis of patients when limited healthcare resources are available. Thus, early identification of patients at risk of serious complications is clinically relevant10. A recent systematic literature review found ten prognostic models for predicting mortality or progression to severe disease, but only a study involved patients from countries other than China, and all studies had been categorized as being at a high risk of bias11.

Therefore, the aim of this study was to develop and validate a prognostic model to identify inpatients with COVID-19 pneumonia at a greater risk for developing severe/critical complications, including death.

Methods

Source of data

Data were collected from the medical reports of patients diagnosed with COVID-19 and admitted to the Complejo Hospitalario Universitario of Santiago de Compostela in Spain, a hospital with over 1000 beds, from March 12, 2020 (date of first COVID-19 diagnosis) to April 11. The study was conducted in accordance with the guidelines of the Declaration of Helsinki and the principles of good clinical practice and was approved by the Institutional Review Board (IRB) of the Galician Health Service on April 3, 2020 (#2020/194). Informed consent forms were waived by the IRB.

A confirmed case of COVID-19 was defined as a positive result in the reverse transcription polymerase chain reaction (RT-PCR) test on samples obtained from nasal or throat swabs performed in accordance with WHO protocol12. Only laboratory-confirmed cases were considered for analysis. Patients with uncomplicated disease, with symptoms of upper airway infection, headache, myalgia, anosmia, dysgeusia or anorexia, but with an SaO2 > 95% and a respiratory rate < 25 breaths/min, were monitored as follows: (1) At home by the TELEA system13, a home monitoring platform that allows to monitor respiratory and heart rate, temperature, and SaO2; (2) patients without an Internet connection at home were monitored via 2–3 telephone calls daily. If the clinical status of the patient deteriorated, a physician contacted them to decide where hospitalization was required or not; (3) previously-institutionalized patients or those without enough assistance at home were transferred to a socio-health center adapted as a hospital. All patients diagnosed with COVID-19 pneumonia were hospitalized.

All patients with COVID-19 pneumonia were eligible for inclusion. Pneumonia was defined as an acute respiratory disorder characterized by cough, chest radiograph findings of lung shadowing that is likely to be new, and fever ≥ 4 days or dyspnea/tachypnea14. Exclusion criteria were simultaneous infection by another germ or in other organ. Fever was defined as an axillary temperature ≥ 37.5 °C.

The data extracted from the medical history of patients included symptoms, clinical signs, and laboratory test and radiological results at admission (+ 1 day). The comorbidities considered were arterial hypertension, diabetes mellitus, chronic obstructive pulmonary disease, other respiratory diseases, kidney and liver disease, heart failure, ischemic heart disease, heart valve surgery, active neoplasm, systemic disease, and psoriasis. Previous use of drugs such as angiotensin-converting enzyme inhibitors, angiotensin II receptor antagonists, statins, corticosteroids, immunosuppressants, antiplatelet agents, and anticoagulants were recorded. The totality of laboratory tests were performed as a function of the clinical care needs of the patients. Determinations in blood included a complete hemogram, coagulation tests (including D-dimer), an evaluation of liver and kidney function, and determination of electrolytes, C-reactive protein (CRP), procalcitonin, lactate dehydrogenase (LDH), creatine kinase, ferritin, and interleukin 6. COVID-19 was considered severe and the patient was candidate to ICU admission if required mechanical ventilation or had a fraction of inspired oxygen (FiO2) of at least 60% or more15. Radiological anomalies were collected from reports of the Unit of Radiology.

Outcomes of interest were death from any cause, use of mechanic invasive ventilation, or ICU stay.

Statistical analysis

Multiple imputation was used to impute missing data by creating 100 datasets. Missing values were predicted on the basis of all other predictors considered for assessing outcomes16. For each of them, 250 bootstrap datasets were generated, and backwards feature selection with the Akaike Information Criterion was performed on every set17.

The predictors returned by this procedure in 70% of datasets were selected for the final model, and their regression coefficients were calculated according to “Rubin's Rules”18. We also studied the possible nonlinear effect of each predictor on the outcome by means of Generalized Additive models (GAM) using splines17. Results are presented as Odds Ratio (OR) with 95% confidence intervals (CI). The Nagelkerke R2 was used to calculate the proportion of variance in clinical outcomes that could be explained by the selected predictors. The different aspects of model performance were studied, including calibration and discrimination19. Calibration was assessed using the Brier score, and by plotting the non-parametric estimate of the association between the observed frequencies and the predicted probabilities. The receiver operating characteristics (ROC) curves (and the correspondent area under the ROC curve-AUC) were calculated to test for discrimination. To correct optimism, internal validation was performed using the bootstrap procedure. The procedure was repeated in each imputed dataset, and the average estimates for the AUC, the Brier Score, and the Nagelkerke R2 were extracted to assess discrimination, calibration, and overall fit, respectively20. The final model was selected to derive a score for clinical use and a nomogram was created. Criteria for this selection included both discriminant ability (defined by the AUC) and model simplicity. Statistical significance was set at p < 0.05. All statistical analyses were carried out in R version 3.5.1 using the mice rms and psfmi packages). These packages are freely available at https://cran.r-project.org. The analysis conforms to TRIPOD reporting standards21.

Results

During the study period, 1152 patients were infected by COVID-19, of whom 229 (19.9%) were admitted for pneumonia. None of the hospitalized patients were lost to follow-up. During hospitalization, 51 (22.3%) cases progressed to severe disease (Fig. 1), of whom 26 (11.4%) needed intensive care, 17 (7.4%) underwent invasive mechanical ventilation, and 32 (14%) died of any cause. The majority of patients (90%) received antibiotic therapy and hydroxychloroquine, and 83.8% received lopinavir/ritonavir. Additionally, 30% were given systemic glucocorticoids, and 8.3% were administered tocilizumab.

Figure 1
figure 1

Flow chart for management of patients with Covid-2019 in health area of Santiago de Compostela.

Clinical characteristics

Baseline demographic, clinical, and laboratory data are presented in Table 1. The most common symptoms at onset of illness were cough (77.3%), fever (75.5%), and dyspnea (52.8%). Patients with severe disease were significantly older than those with nonsevere disease and were more likely to have higher systolic blood pressure levels (p = 0.003), and lower SaO2 concentrations (p < 0.001). The presence of comorbidities such as diabetes, heart failure, and chronic kidney disease was significantly higher in patients who progressed to severe disease. No statistically significant differences were observed between the two groups in main symptoms, although slightly more patients in the severe group manifested confusion (p = 0.025).

Table 1 Clinical characteristics of the study patients.

On admission, patients who progressed to severe disease had a lower baseline lymphocyte and platelet count, lower levels of hemoglobin, and higher levels of neutrophils, serum creatinine, urea, CRP, and interleukin-6 (all p < 0.01) (Table 2). Abnormalities on chest X-ray images were detected in all patients. More than half of patients (57.2%) had bilateral pneumonia (Table 3). The most common findings in 51 patients who progressed to severe disease were bilateral multiple areas of consolidation (28; 54.9%) (Table 3). Table 3 shows a comparison of gasometric parameters between the two groups by level of severity.

Table 2 Laboratory characteristics of the patients hospitalized with COVID-19 pneumonia at admission.
Table 3 Radiological and gasometric characteristics of patients hospitalized with COVID-19 pneumonia at admission.

Multivariate prediction models

A regression model was built based on the aforementioned clinical, laboratory, gasometric, and radiographic data to predict the risk of progression to severe/critical disease. Five predictors were identified: Diabetes, Age, Lymphocyte count, SaO2, and pH (DALSH score, Nagelkerke R2 0.45, Table 4). As shown in Fig. 2, whereas risk increases linearly with increasing age and decreasing SaO2, the relationship with leukocyte count and pH is not linear.

Table 4 Multivariate logistic regression analysis predicting severity in subjects hospitalized with COVID-19 pneumonia.
Figure 2
figure 2

Effects of age, oxygen saturation, lymphocytes and pH on the risk for progression to severe disease (death, use of mechanic invasive ventilation, or intensive care unit stay) in patients hospitalized with COVID-19 pneumonia.

The receiver operating characteristic curve for this combination of predictors (Fig. 3) confirmed its good clinical performance (AUC 0.87 CI 0.81, 0.92). The AUC was higher than that obtained with the CURB-65 (AUC 0.73 CI 0.68, 0.78). The DALSH score showed an acceptable calibration (Brier score = 0.11, see supplementary material, Figure 1S). After correcting optimism by bootstrapping, Nagelkerke R2, AUC, and Brier score were 0.38, 0.85, and 0.13, respectively.

Figure 3
figure 3

Receiver operating characteristic (ROC) curve for risk of severity (death, use of mechanic invasive ventilation, or intensive care unit stay) in COVID-19 patients with pneumonia. Figures show Area under the Curve [AUC (95% Confidence Interval)].

Figure 2S (see supplementary material) illustrates a method to estimate the risk of progression to severe disease based on an overall score calculated by the sum of the individual scores obtained in the five variables of the model.

Table 5 shows the individual score of each of the DALSH predictors. The total score indicates the estimated individual risk of severity of each patient. For instance, a patient with diabetes, 31 points; 70 years old, 50 points; and SaO2 90%, 19 points; has a total of 100 points which is equivalent to the risk about 23%. The same patient with a lymphocyte count of 1000 cells/mm3, 10 points; has a total of 110 points which is equivalent to a risk of 32–33%. (Table 5, Figure S2). As many as 30% (n = 69) of patients had a ≤ 5% risk of progression (arbitrarily considered as low risk); 39% (n = 89) had a risk of 6–25% (intermediate risk), and 31% (n = 71) had a > 25% risk (high risk). No patients with a DALSH score < 66 points (low risk) progressed to severe disease (ICU admission, need for invasive mechanical ventilation, or death); whereas 12% of patients with a DALSH score of 66–100 points (intermediate risk) progressed to severe disease; finally, 50% of patients with a DALSH score > 100 points progressed to severe disease. Half of patients identified as high-risk had a mean age of 74 years. Notably, low-risk patients had a mean age of 54 years and none had diabetes.

Table 5 The DALSH score calculation for predicting risk of severity in patients hospitalized with COVID-19 pneumonia.

Discussion

Given the high transmission rate, potential severity of symptoms, and scarcity of resources that may result from the COVID-19 pandemic, predicting the course of coronavirus is crucial to guaranteeing that patients receive the best care possible. In this study we derived and validated a clinical prediction rule for the prognosis of hospitalized patients with COVID-19 pneumonia. Our results also show that the presence of diabetes, advanced age, lymphopenia, hypoxemia and pH alterations on admission were all associated with disease severity.

The course of COVID-19 pneumonia is uncertain and may lead to death. Predicting the course of COVID-19 pneumonia at 24 h of admission based on clinical data is challenging but crucial. In a large number of cases, the disease can be controlled by closely monitoring the patient, but severely-ill patients will need aggressive treatment and intensive care. Predicting the course of the disease will enable the early adoption of a management approach in line with the estimated prognosis.

A multiplicity of studies have demonstrated that age is a relevant predictor of progression and mortality5,7,22,23, which is confirmed by the results of this study (OR 1.06 CI 1.03, 1.09; p = 0.000). This may be explained by age-related effects on T- and B-cell function and the excessive production of type 2 cytokines. Immunosenescence, or age-related defects in the human immune system, affects the adaptive immune response, as evidenced by major defects in cell-mediated immunity and impairment of humoral immune responses with age. These alterations impair the body’s ability to control viral replication and prolonged inflammatory response, thereby resulting in disease progression24.

Pneumonia may induce ischemia, endothelial dysfunction, and alterations in atherosclerotic plaques, which increases the short-term risk of cardiovascular events, especially in patients with previously known cardiovascular disease25. This may explain the fact that some studies—but not all1—have revealed a relationship between previous cardiovascular disease and a poorer prognosis of COVID-195,7,22. In our study, patients with a previous diagnosis of heart failure were very likely to have a bad prognosis (OR 10.6 CI 3.25, 35.5; p < 0.001). Nevertheless, a relationship was not observed with ischemic heart disease. A variety of studies have established a relationship between diabetes and disease progression5,7,22,23,26. In our case, COVID-19 pneumonia was five-fold more likely to progress to severe disease in patients with diabetes (OR 5.20 CI 2.61, 10.35; p < 0.001). However, no studies have been conducted to date to clarify the relationship between diabetes and COVID-19.

Although prognosis is favourable in most patients, those with COVID-19 pneumonia can develop dyspnea and hypoxemia. The underlying pathophysiology of disease progression seems to be that of severe acute respiratory distress syndrome (ARDS). In these cases, the activation of alveolar macrophages by COVID-19 may trigger the release of powerful proinflammatory mediators and chemokines, the accumulation of neutrophils and monocytes, and the production of toxic mediators, which would lead to a loss of alveolar endothelial and epithelial barrier function, and ultimately induce alveolar and interstitial edema27. Laboratory test values alterations identified in critical patients with COVID-19 pneumonia could be associated with adverse ARDS outcomes. This suggests that infection could induce cell-mediated immunity alterations, the activation of coagulation, and myocardial, hepatic or renal damage. In line with other publications, our study demonstrates that the risk of disease progression increases with alterations in SaO27 (OR 0.85 CI 0.79, 0.92; p < 0.001); LDH5,6,7,9,22 (OR 1.23 CI 1.05, 1.44; p = 0.012); CRP6,7,9 (OR 1.12 CI 1.06, 1.18; p < 0.001); IL-66 (OR 1.01 CI 1.01, 1.02; p < 0.001); and lymphocyte count5,6,7,9,22 (OR 0.84 CI 0.77, 0.91; p < 0.001), which are the most commonly used inflammation markers.

Our prognostic model is based on five predictors: diabetes, age, lymphocyte count, SaO2 and pH, which we call the DALSH score. This score has demonstrated a good discrimination power (AUC 0.87 CI 0.81, 0.92). The DALSH score may be highly useful in clinical practice as these predictors can be easily determined in most settings and are usually recorded on admission. In addition, this score makes it possible to establish risk levels (even arbitrarily: ≤ 5%, low; 6–25%, intermediate and > 25% high) that may be useful to guide decision-making. Thus, a patient identified as low-risk is unlikely to experience disease progression; a relatively low number of patients identified as intermediate-risk will develop disease progression, and a large proportion of high-risk patients will experience disease progression. The cases observed in our series were 0%, 12% and 50%, respectively. Determination of risk would help clinicians adopt the appropriate therapeutic measures. However the non-linear relationships between two predictors (lymphocyte count and pH) and the outcomes means that its use in clinical settings is not so simple.

Any pH alteration is associated with a higher risk for progression. pH decrease is probably related to hypercapnia secondary to alveolar hypoventilation, whereas its increase may be due to the presence of respiratory alkalosis with progressive hyperventilation caused by hypoxemia. The two settings can co-occur with disease progression (Fig. 2D). A decrease in lymphocytes count causes an increase in OR, and the same occurs when it increases, although the effect is more subtle (Fig. 2C). A recent systematic literature review on prognostic models for predicting mortality or progression to severe disease revealed that these models have good to excellent performance. However, only a study involved patients from countries other than China, and all studies had been categorized as being at a high risk of bias, mainly because the sample of control patients was not representative, patients who had not experienced the event of interest by the end of the study were excluded; and model overfitting. The performance estimated of these studies were rated to be optimistic and misleading11. This systematic review includes 107 studies describing 145 prediction models for identifying people at risk in the general population; diagnostic models for detecting COVID-19; and prognostic models for predicting mortality risk, progression to severe disease, intensive care unit admission, ventilation, intubation, or length of hospital stay. The most frequently reported predictors of diagnosis and prognosis of COVID-19 are age, body temperature, lymphocyte count, and lung imaging features, similar to those found in our study. Unlike some of those studies, in our study all patients were followed-up until discharge or death, and objective predictors were used. A major strength of our study is the high-quality data obtained for all predictors and the minimal rate of missing data.

This study has several limitations. First, since the model was developed in a single population, a major limitation is the lack of external validation. Second, sample size was not large enough to adequately develop a multivariate regression model in which 53 predictors were entered. For this reason, after multivariate analysis, we resampled the development sample using a bootstrapping technique. The purpose was to avoid overfitting and estimate the stability of the dataset. As a result, five variables were retained in the model.

In summary, our study identified five straightforward, objective predictors easily determined on admission, which are associated with progression to severe or critical state in hospitalized patients with COVID-19 pneumonia. A risk score based on these factors predicted disease progression and allowed us to adopt therapeutic measures in accordance with patient’s prognosis from the very moment of diagnosis. To verify the external validity of the DALSH score, studies from the other hospitals even in the same area, and also from the other areas, should be required as this study was conducted using the patients only from one hospital.