Background
Sepsis is defined as life-threatening organ dysfunction caused by dysregulated host responses to infection [
1]. Worldwide, sepsis has a high incidence, morbidity, and mortality and represents a major public health problem [
2,
3]. Given this background, the WHO has announced sepsis as a global health priority [
4].
The Sequential Organ Failure Assessment (SOFA) score was developed in 1996 [
5], and this score is now extensively used in critically ill patients. Moreover, the development of a new sepsis definition, which adopts SOFA score as a main diagnostic tool, has broadened the score’s application [
1]. However, the cardiovascular SOFA score has critical limitations. When first developed, the guideline recommended the use of dopamine as the first-line vasopressor in septic shock [
6,
7]; but, in 2008, this first-line vasopressor recommendation was changed to norepinephrine. This use of norepinephrine has become standard management [
8].
Sepsis-3 defines septic shock as a subset of sepsis with circulatory dysfunction and cellular metabolic abnormality which can be estimated by hyperlactatemia [
1]. Because an elevated lactate level is reflective of tissue hypoxia caused by insufficient tissue oxygen delivery and impaired aerobic respiration, lactate is an essential biomarker in sepsis [
9].
Considering the importance of the SOFA score, we propose that the SOFA score be modified to reflect the current clinical practice patterns for vasopressor use and the diagnostic importance of lactate level. Our proposed modified SOFA scoring system is based on data from multiple cohorts. We developed and internally and externally validated our modified SOFA scoring system, and we compared this system with the original SOFA scoring system in terms of predictive validity.
Methods
Study design, setting, and population
Three retrospective or prospective cohorts from seven emergency departments (EDs) were used in this study. One cohort was the suspicious infection cohort from one hospital (suspected infection cohort), the second cohort was for sepsis from three hospitals (sepsis cohort), and the third was for septic shock from the Korean Shock Society (KoSS) septic shock registry (septic shock cohort). Only adult patients (age ≥ 18 years) who presented to EDs were included in the cohorts.
The suspected infection cohort was used to develop and internally validate the modified CV SOFA score. This cohort was retrospectively assembled from data gathered from December 2019 to December 2020 at the ED of the Samsung Medical Center (a 1960-bed, university-affiliated, tertiary care referral hospital located in Seoul, Korea, with an annual census of over 70,000). Suspected infection was defined as cases in which blood culture and antibiotic therapy were performed in the ED [
10].
Two prospective, multi-center ED registries were evaluated for external validation. First, we analyzed sepsis cohort data from adult patients who were admitted to the EDs of three urban tertiary teaching hospitals between May 2014 and December 2017 (SNU CARE registry, external validation cohort 1). These three hospitals are affiliated with the College of Medicine of Seoul National University. Patients who met the criteria for severe sepsis and septic shock according to the Sepsis-2 definition [
11] were included. From March 2016 to December 2017, patients with sepsis were enrolled based on the Sepsis-3 definition [
1].
We also analyzed septic shock cohort data (external validation cohort 2) from the Korean Shock Society (KoSS) septic shock registry between October 2015 and December 2019 [
12]. Inclusion criteria of the registry were adult patients who had a suspected or confirmed infection and evidence of refractory hypotension or hypoperfusion. Refractory hypotension was defined as persistent hypotension despite the administration of fluid challenge (20–30 mL/kg or at least 1 L of crystalloid solution administered over 30 min). Hypotension was defined as systolic blood pressure (SBP) < 90 mmHg, mean arterial pressure < 70 mmHg, or SBP decrease > 40 mmHg from baseline. Hypoperfusion was defined as serum lactate levels ≥ 4 mmol/L.
In the suspected infection cohort and the septic shock cohort, we excluded patients who had previously signed a “Do Not Attempt Resuscitation (DNAR)” order and patients with terminal malignancy who had limitations on invasive care.
Data collection and outcome
The suspected infection cohort data were retrospectively collected by extraction from the hospital’s clinical data warehouse and review of the electronic medical record (EMR). Eligible cases were electronically identified based on the definition of suspected infection. The following data were extracted from the hospital database: general patient characteristics, including age, gender, and comorbidities; vital signs; infection focus on final diagnosis; laboratory tests; therapeutic interventions including vasopressor and mechanical ventilation use; ED disposition; and survival data. Three research coordinators reviewed the extracted data and the EMR to collect components of the SOFA score for each system (respiratory, coagulation, liver, cardiovascular, central nervous, and renal) (Additional file
1: Fig. S1). If the PaO
2 was not available, we estimated the respiratory SOFA score by using the peripheral arterial oxygen saturation (SaO
2) [
13]. The Glasgow coma scale (GCS) was obtained with electronic medical records, and in case of no documentation, the AVPU system was used to convert to the GCS [
14]. In the external validation cohort (the sepsis cohort and the septic shock cohort), data were prospectively collected by trained research coordinators or experts after informed consent was obtained. The SOFA score was calculated using maximum values for the time window within 24 h from ED arrival in all cohorts. Initial ED lactate values were used. If variables including lactate and SOFA components were missing, a single normal value was imputed for each variable. The primary outcome was 28-day mortality after admission to the ED. Survival data were extracted from the registry data or hospital database. We also used visit history after discharge, Statistics Korea mortality data, and telephone interviews to gather survival data.
Candidate models for a modified cardiovascular SOFA score
The suspected infection cohort was split randomly into derivation and internal validation samples (70/30). To develop a modified CV SOFA, we constructed candidate models combining hypotension (mean arterial pressure, MAP < 70 mmHg), dose of vasopressor with or without lactate level (Additional file
1: Table S1 and Table S2).
We derived multiple cut-off points of the total norepinephrine equivalent dose, and each dose of vasopressor (dopamine, epinephrine, and vasopressin) was converted to a norepinephrine equivalent dose (Additional file
1: Table S3) [
15]. We used peak doses administered for at least one hour during a 24-h period from ED arrival. The cut-off values were selected based on the tertile dose; optimal cut-offs using the Youden index and the closest-to-(0,1) on the area under the receiver operating characteristic curve (AUROC) for 28-day mortality; and reference values from previous studies [
16‐
18]. The optimal cut-offs were rounded to the nearest 0.05 equivalent dose interval value. We made combinations of low and high cut-offs that we included in candidate models.
In modified models with the combination of vasopressor use and lactate, we incorporated lactate level in modified CV SOFA models as a marker of circulatory shock [
19]. In cases of CV SOFA score of 0 to 3 points, we added one point if the initial lactate level was elevated without changing the five-point scale (0 to 4 points). We used two cut-off values for lactate ≥ 2 mmol/L and ≥ 4 mmol/L.
We made candidate models in two ways. First, in cases with MAP < 70 mmHg or use of low dose vasopressor, we allocated to the models the modified CV SOFA score of 1, corresponding to MAP < 70 mmHg in the original CV SOFA [
5]. Modified cut-offs of vasopressor dose were incorporated from score 1 to score 4. Second, we did not change the MAP criteria of the scores 0 and 1. Vasopressor dose cut-offs were included from score 2 to score 4, which were similar to the original scoring. Lactate criteria were included in all models. Other components of the SOFA score were not revised.
Deriving a modified cardiovascular SOFA score and validation
To select a final model, we first considered the incidence and the corresponding mortality rate according to the CV and total SOFA score in each model. Second, we evaluated discrimination power with AUROC, calibration of CV score, and total SOFA score for the original SOFA and candidate SOFA models in the derivation cohort. We compared the predictive accuracy of AUROCs using an individual unadjusted analysis by a non-parametric approach and adjusted the analysis in conjunction with a baseline risk model for 28-day mortality including variables for age, gender, and comorbidities [
20,
21]. Calibration was evaluated with calibration plots of predicted and observed probability. We evaluated the model’s net reclassification improvement and the integrated discrimination improvement compared with the original SOFA score, but we did not use these methods for the final model selection due to suggested limitations in the previous study [
1].
We validated a final modified CV score in terms of discrimination and calibration for the internal validation cohort. We also tested the model for external validation using the sepsis and septic shock cohorts.
Agreement with the original SOFA score
Because the SOFA score has been widely used to identify sepsis according to the clinical Sepsis-3 definition, we evaluated the agreement between the original SOFA score and the final modified SOFA score using the Cohen's kappa of the suspected infection cohort [
22]. The clinical sepsis criteria, defined as a change of total SOFA score of 2 or more [
1], were also compared between the two models in terms of agreement and diagnostic performance for predicting 28-day mortality. The baseline SOFA score was assumed to be zero.
Sensitivity analysis
We performed a sensitivity analysis using a complete data set without missing values in the suspected infection cohort, the sepsis cohort, and the septic shock cohort.
Other cardiovascular SOFA models
We additionally tested discrimination and calibration of these CV SOFA models: (1) a lactate-based CV score model without blood pressure criteria and vasopressor dose and (2) a model using norepinephrine equivalent dose in the original CV score.
Statistics
Continuous data are presented as mean (standard deviation, SD) or median (interquartile range, IQR) as appropriate. Categorical data are presented as numbers with percentages. For comparisons, continuous variables were analyzed using Student's t-test, while categorical variables were analyzed using chi-square tests. Predicted mortality in calibration and 95% confidence interval (CI) were estimated by the bootstrap method. A two-tailed p value < 0.05 was considered statistically significant. All analyses were performed using the R version 3.6.3 (R Foundation for Statistical Computing, Vienna, Austria) and STATA version 17.0 (STATA Corporation, College Station, TX).
Study approval
This study was approved by the institutional review boards of Samsung Medical Center, Seoul National University Hospital, Seoul Metropolitan Government–Seoul National University Boramae Medical Center, Seoul National University Bunding Hospital, Asan Medical Center, Gangnam Severance Hospital, and Hanyang University Medical Center. Informed consent was waived or obtained depending on cohort or hospital requirements.
Discussion
In this study, we demonstrated that the modified CV SOFA score reflecting the current sepsis guidelines could be more useful both in prognostication for sepsis and detection of sepsis at risk. These current guidelines include the use of norepinephrine as vasopressor of choice and the use of lactate level as an important tissue perfusion biomarker.
The SOFA score was created by the Working Group of the European Society of Intensive Care Medicine. The SOFA score aimed to describe as quantitatively and objectively as possible the degree of organ dysfunction/failure in sepsis patients [
5]. Recently, the SOFA score has been advocated and adopted as means of identifying sepsis among patients with suspected infection in 2016 [
10]. The new definition is important in research, performance monitoring, and accreditation [
23]. However, the CV SOFA score has a critical issue in terms of the use of vasopressors. The SOFA score was introduced in 1996 when dopamine was the drug of choice as vasopressor in sepsis [
6,
7]. Thereafter, dopamine was used in the CV SOFA score. However, in 2008, norepinephrine replaced dopamine as the first-line vasopressor in sepsis [
8]. This changed clinical practice, but the change was not accounted for in the CV SOFA score. Reflecting this, in our 3 cohorts, there were few cases with CV SOFA score of 2, which is defined as use of dopamine less than 5 µg/kg/min or any dose of dobutamine, and this is consistent with recent studies [
24,
25]. Even when equivalent dose of norepinephrine has been used to overcome this, original CV SOFA score performance is not good. Therefore, modification of the CV SOFA scoring system is urgently needed and provides the motivation behind this study.
Our modified SOFA score model showed significantly improved mortality-discriminant power than the original SOFA score in the suspected infection, sepsis, and septic shock cohorts. In previous studies, the mortality rate of each SOFA score did not show incremental tendency [
26‐
30]. In this study, the same findings were detected in all three independent cohorts. However, the newly-developed modified SOFA score showed a more incremental tendency. In addition, the modified SOFA model can detect more patients at risk of septic mortality than the original SOFA score. Moreover, in the suspected infection cohort, the modified score showed high agreement with the current SOFA score (Cohen’s kappa, 0.916), implying that this modified SOFA could have clinical applications.
We decided that lactate level should be included in the modified CV SOFA score with the presence of pre-existing hypotension and the use/dosage of vasopressor in the original CV SOFA score. Lactate has been extensively investigated as a good biomarker for tissue perfusion, and lactate level is widely used in sepsis [
31]. In the Sepsis-3 definition study, lactate level was identified for testing in cohort studies by the Delphi consensus, and lactate level was included in the definition of septic shock [
1]. Lactate level was also proposed as a screening tool for sepsis or septic shock, but this level was not included in the final quick SOFA. The group extensively investigated the usefulness of lactate level and found that 1 added point to qSOFA score for elevated serum lactate level 2.0 mmol/L or more significantly increased predictive validity of qSOFA [
10]. However, the group designated lactate level’s inclusion in the quick SOFA as an “area of further inquiry”. The group proposed that lactate levels could be used for patients with borderline qSOFA values or could substitute for individual qSOFA variables in healthy systems in which lactate levels are reliably measured at low cost and in a timely manner. Interestingly, the group did not investigate the value that lactate addition could have with SOFA score. We used the various cut-off levels of lactate used in previous investigations [
10] in our derivation and validation models. We ultimately decided on the cut-off level as 2, which has been used in the new septic shock definition, to be included in the modified CV SOFA score [
1,
19].
We determined multiple cut-off points of vasopressor dose referring to both “a priori” and “data-driven” optimal values [
32,
33]. We incorporated these into the model and decided on two cut-off points regarding incidence and mortality rate according to the CV/total SOFA score, discrimination, and calibration. We could not be confident that these cut-off values are consistently valid in other cohorts, leaving generalizability concerns. We did not include the use and dose of arginine vasopressin as independent scoring variables. Given that vasopressin and its analogs are commonly used in clinical practice for the management of sepsis [
8], the modified CV SOFA score could be more accurate if their use were included. However, the limited score of 0 to 4 on the CV SOFA becomes too complex when too many variables are added. Instead, a conversion table for vasopressor doses might be used [
34].
In the modified CV SOFA score, the use of NE was included from the score of 1, rather than the score of 2 in the original CV SOFA score. Recently, the beneficial effect of early use of NE in septic shock has been investigated [
35] and has led to the early use of this vasopressor in current clinical practice [
36]. Therefore, we decided that a modified CV score of 1 should include the use of small doses of NE.
We tried to modify cardiovascular SOFA with blood pressure and various cut-offs of vasopressors, but their performances were not better than those with the original cardiovascular SOFA model. The AUROC of the vasopressor only model were 0.64 at best, which as included in Table
3. Therefore, we included lactate in the modified SOFA model and found that the discriminative performances were significantly improved (range 0.648–0.716) than that of the original CV SOFA (range 0.557–0.638) and that of vasopressor only model (range 0.610–0.640). We, therefore, decided to include lactate value in the model since lactate has been investigated as significant mortality-associated factor, independently with blood pressure or vasopressor use [
12,
37,
38]. Lactate was also included in the Sepsis-3 definition [
1].
The performance of the modified total SOFA score could be considered modestly increased in clinical aspect. However, the modified CV SOFA performance seems to be significant in the clinical aspect. SOFA score has 6 sub-categories and a change of one category might have a modest increase in total SOFA scores.
The discriminatory power of the SOFA in our three cohorts was similar to that in previous studies [
10,
26,
39], implying the reliability of our cohorts.
We modified the SOFA score to detect sepsis. Even though the SOFA score is used to detect sepsis, it is not limited to septic patients, and this inherent limitation of the SOFA score should be considered in further study.
With this study, we could not propose the global use of our modified CV SOFA score, but this study offers a good starting point for SOFA score modification. Modification is necessary to reflect current guidelines regarding the clinical use of vasopressors and diagnostic use of lactate levels in sepsis patients.
Limitations
There are some limitations to this study. First, all cohorts were derived from emergency departments, and validation with ICU data is required. Second, the characteristics of the three cohorts are different. The sepsis cohorts were collected in accordance with either the Sepsis-2 or Sepsis-3 definition depending on the period. The septic shock cohort was collected in accordance with the sepsis-3 definition, and the suspected infection cohort included patients in whom antibiotics and blood culture were administered. However, this could be a study strength because the newly-modified SOFA score could be applied to differently defined cohorts, meaning more generalizability. Third, these cohorts are all from a single country and all from university-based hospitals. Multinational and multi-level center validation is necessary. Fourth, the purpose of the three cohorts used in this study was not to develop the new CV SOFA score. Fifth, we did not develop an entirely new scoring system that is usually performed according to the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) recommendations [
40]. The SOFA score-based definition of sepsis has been widely adopted, so entirely changing the CV SOFA scoring system would not be useful at this time. Therefore, we intended to change the CV SOFA scoring system as minimally as possible. Supporting this, the agreement between the original SOFA score and the modified SOFA score was excellent. Sixth, we postulated a baseline SOFA score of 0, but in clinical practice, this is an inevitable limitation. Seventh, the cohort we used in derivation and internal validation is from a single center. However, we tested 16 candidate models in two other large cohorts (external validation cohorts); and model 3 performed better than the other models in terms of incidence, mortality rate, discrimination, and calibration (Additional file
1: Figs. S8-10). Eighth, we used the initial lactate level in the modified CV model. Even though lactate is widely used in sepsis, there are some controversies about the role of spot lactate (initial lactate level) in sepsis. We could not find any review or meta-analysis study about the role of the initial level of lactate in sepsis, which could be conclusive on the utility of lactate. Also, the changes in lactate levels over time are relatively slow, so the patients still have a lactate level above the normal range even after they were resuscitated [
41]. This concept could be a major obstacle to include lactate in the modified SOFA score. This needs further evaluation with larger and multi-national cohorts. Ninth, we developed and tested modify SOFA model with mortality as a primary outcome. Even though we did not perform this study to propose modified prognostic scoring systems but modify the SOFA score as a tool to detect sepsis, we tactically used mortality as a primary outcome to investigate modified models following the method of developing the Sepsis-3 definition. Lastly, there were some missing data in all three cohorts. However, the missing data rate was low in most cases, and the results of complete analysis among patients without missing data were nearly identical, implying minimal effects of missing data on the primary analysis (Additional file
1: Fig. S7).
Acknowledgements
Seung Sik Hwang and So Yeon Ahn for statistical advice.
Korean Shock Society; Sangchun Choi, MD9; Tae Nyoung Chung, MD8; Jae Hyuk Lee, MD5; Kyung Su Kim, MD5; Yoo Seok Park10 MD; Young-Hoon Yoon, M.D6; Han Sung Choi, MD11; Kap Su Han, MD6; GuHyun Kang, MD12; Youn-Jung Kim, MD3; Hanjin Cho, MD6
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.