Figures
Abstract
Background
Sepsis-3 definition uses SOFA score to discriminate sepsis from uncomplicated infection, replacing SIRS criteria that were criticized for being inaccurate. Eligibility of sepsis-3 criteria for sepsis diagnosis and the applied validation methodology using mortality as endpoint are topic of ongoing debate. We assessed the impact of different criteria on sepsis diagnosis in our ICU and devised a mathematical approach for mortality-based validation of sepsis criteria. As infectious status is often unclear at clinical deterioration, we integrated non-infected patients into analysis.
Methods
Suspected infection, SOFA and SIRS were captured for an ICU cohort of a university center over one year. For raw scores (SIRS/SOFA) and sepsis criteria (SIRS≥2/SOFA≥2/SOFA_change≥2) frequencies and associations with in-hospital mortality were assessed. Using a mathematical approach, we estimated the correlation between sepsis and in-hospital mortality serving as reference for evaluation of observed mortality correlations of sepsis criteria.
Results
Of 791 patients, 369 (47%) were infected and 422 (53%) non-infected, with an in-hospital mortality of 39% and 15%. SIRS≥2 indicated sepsis in 90% of infected patients, SOFA≥2 in 99% and SOFA_change≥2 in 77%. In non-infected patients, SIRS, SOFA and SOFA_change were ≥2 in 78%, 88% and 58%. In AUROC analyses neither SOFA nor SIRS displayed superior mortality discrimination in infected compared to non-infected patients. The mathematically estimated correlation of sepsis and in-hospital mortality was 0.10 in infected and 0 in non-infected patients. Among sepsis criteria, solely SIRS≥2 agreed with expected correlations in both subgroups (infected: r = 0.19; non-infected: r = 0.02).
Conclusions
SOFA≥2 yielded a more liberal sepsis diagnosis than SIRS≥2. None of the criteria showed an infection specific occurrence that would be essential for reliable sepsis detection. However, SIRS≥2 matched the mortality association pattern of a valid sepsis criterion, whereas SOFA-based criteria did not. With this study, we establish a mathematical approach to mortality-based evaluation of sepsis criteria.
Citation: Centner F-S, Schoettler JJ, Fairley A-M, Lindner HA, Schneider-Lindner V, Weiss C, et al. (2020) Impact of different consensus definition criteria on sepsis diagnosis in a cohort of critically ill patients—Insights from a new mathematical probabilistic approach to mortality-based validation of sepsis criteria. PLoS ONE 15(9): e0238548. https://doi.org/10.1371/journal.pone.0238548
Editor: Robert Ehrman, Wayne State University, UNITED STATES
Received: May 24, 2020; Accepted: August 18, 2020; Published: September 8, 2020
Copyright: © 2020 Centner et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files. The anonymized data set necessary to replicate our study findings can be found as supporting information file (S3 Appendix) accompanying our manuscript.
Funding: This work was supported by the Klaus Tschira Foundation (grant number: 00.277.2015; website: https://www.klaus-tschira-stiftung.de). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Sepsis is a dysregulated systemic inflammatory and immune response to microbial invasion [1] and the primary cause of death from infection [2]. As mortality increases with delay in treatment [3–5] the diagnosis of sepsis mandates immediate administration of broad spectrum antibiotics according to Surviving Sepsis Campaign guidelines [6]. On the other hand, the non-indicated use of broad spectrum antibiotics has serious adverse consequences for both the population and the individual patient, as it is connected to the emergence of multi-drug resistant bacteria and morbidity caused by side effects [7, 8]. Therefore an accurate sepsis diagnosis is of crucial importance [2].
As a gold standard diagnostic test for sepsis is still missing, clinical sepsis criteria serve to diagnose sepsis and differentiate it from ‘uncomplicated infection’ [2, 9]. Because the formerly used Systemic Inflammatory Response Syndrome (SIRS) criteria (sepsis-1 and -2) [10, 11] were criticized for occurring too frequently among infected and non-infected patients and their eligibility as sepsis criteria was questioned [2, 12, 13], new clinical sepsis criteria were introduced by a consensus definition of the Society of Critical Care Medicine (SCCM) and the European Society of Intensive Care Medicine (ESICM) in 2016, referred to as sepsis-3. Accordingly, a patient with a (suspected) infection and an acute change in Sequential Organ Failure Assessment (SOFA) score of ≥2 points is diagnosed as septic [2]. A large validation study which assessed the predictive power for in-hospital mortality as primary outcome showed stronger discrimination for in-hospital mortality of SOFA and an acute change in SOFA compared to SIRS in infected patients [9]. Sepsis-3 authors argued that this proved superior validity of SOFA-based criteria for sepsis diagnosis [2, 9]. Although these new criteria were introduced in order to provide better diagnostic guidance for sepsis, in the original sepsis-3 ICU cohort, application of SOFA≥2 resulted in a higher sepsis frequency than SIRS≥2 (91% versus 84% [9]), which is in conflict with the intention to generate a more specific criterion.
In clinical practice not only the differentiation between ‘uncomplicated infection’ and sepsis is challenging. Additionally, by the time a patient develops signs of organ dysfunction or systemic inflammation it is often unclear whether these conditions are related to an infection or not [2, 14–16]. Ideal sepsis criteria should therefore not only differentiate between patients with sepsis and with uncomplicated infection, but also help to separate patients who are critically ill due to sepsis from those suffering from non-infectious conditions. The ability of sepsis-3 criteria to make this differentiation was not analyzed in the sepsis-3 validation study, which limited analysis to the subgroup of infected patients. We integrated non-infected patients into analysis, thereby addressing this named limitation of sepsis-3 [2, 9].
In this context, we designed this study to contribute to the evaluation of sepsis criteria in the ICU setting by addressing the following issues:
- We assessed the impact of the application of sepsis-3 versus sepsis-1/2 clinical criteria in an ICU cohort, especially on the frequencies of sepsis and the differentiation between uncomplicated infection and sepsis.
- As the use of mortality as endpoint for validation of sepsis criteria in sepsis-3 [2, 9] has been criticized [17–21] we developed a mathematical approach to reasonably use mortality for the validity evaluation of sepsis criteria. Thereby, we estimated the correlation between sepsis and in-hospital mortality, which enabled the introduction of quantitative references for mortality-based evaluation of sepsis criteria.
- By including non-infected patients in our study, we were able to analyze whether the occurrence or the mortality association of the proposed clinical sepsis criteria differ dependent on the infectious status of a patient. Applying our probabilistic approach, we investigated if the strength of the mortality association of the proposed clinical sepsis criteria was within the mathematically estimated range for infected and non-infected patients.
Materials and methods
Study design and settings
This prospective observational study [22] was conducted at the 25-bed ICU of the Department of Anaesthesiology and Surgical Intensive Care Medicine at University Medical Center Mannheim. All encounters (age ≥18 years) with complete ICU stay between June 1st 2016 and July 9th 2017 were included. Starting date was set due to introduction of daily SOFA scoring in reaction to sepsis-3 [2, 9]. The Ethics Commission II of Medical Faculty Mannheim approved the study (2016-800R-MA) and waived the need for informed consent.
Measurements and definitions
SOFA scores [23] were determined daily by intensivists, and SIRS criteria [10] were extracted by computational query from the electronic health record. For identification of suspected infection and its onset the approach chosen in sepsis-3 validation study was applied [9] and sepsis-3 criteria were evaluated in the same time frames around infection onset [9] (For further details on all measures, see the S1 Appendix section A: Details on Definitions). To define equivalent evaluation time points for non-infected patients, these were selected by computational query to achieve a distribution similar to infection onset in infected patients (Fig A in S1 Appendix section A). We applied maximum SOFA and acute change in SOFA (henceforth referred to as ‘SOFA’ and ‘SOFA_change’) as closely as possible to sepsis-3 implementation [9] (For details see S1 Appendix section A). As it was considered closest to the sepsis-1/2 definition [10, 11] the 24 hour window before infection onset was chosen for SIRS analyses.
Statistical analysis
Statistical analysis was operated with R 3.3.2 [24] and the pROC package [25]. A p-value ≤0.05 was regarded statistically significant. No adjustment for multiplicity was applied.
Assessment of the impact of different sepsis criteria on sepsis diagnosis.
Raw scores (SIRS/SOFA) and the proposed sepsis criteria (SIRS≥2/SOFA≥2/SOFA_change≥2) were analyzed separately for infected and non-infected patients. For scores and in-hospital mortality, Area Under the Receiver Operating Characteristic Curve (AUROC) analyses was performed, applying similar baseline risk modeling as in sepsis-3 [9]. We calculated the discrimination capacity of score plus baseline risk model and score alone (For details see S1 Appendix section B). The latter was chosen for primary reporting because the baseline model variables are not part of sepsis-3 criteria. The score threshold of 2 was evaluated by sensitivity, specificity, positive and negative predictive value, risk ratio and odds ratio for in-hospital mortality.
Investigating the correlation between sepsis and (in-hospital) mortality.
In a ‘framework for the development and interpretation of different sepsis definitions and clinical criteria’, sepsis-3 authors defined criterion validity as the extent to which a proposed measure (clinical criteria) of a disease (sepsis) agrees with an existing accepted measure (in-hospital mortality) [26, 27]. They highlighted the importance to set expectations about this agreement when assessing validity. Nevertheless, it was omitted during the sepsis-3 validation process to formulate an expectation for the agreement between sepsis and in-hospital mortality [9]. Instead, it was assumed to be rather large [9]. We replaced this assumption with an empirical estimate using sepsis-3 validation study data [9] in conjunction with SOFA ≥2 as septic condition to determine the septic status of a patient. This approach yielded an expected correlation between sepsis and in-hospital mortality of 0.10 for infected patients (Details on the probabilistic mathematical model of the relation between sepsis and in-hospital death and the performed calculations can be found in the S1 Appendix section D and S2 Appendix). This value served as reference for statistical examination of the observed correlations in our cohort. As sepsis cannot cause death in non-infected patients, the correlation between in-hospital mortality and sepsis must be 0 in this subgroup. This facilitated the investigation of an infection-specific mortality association pattern that must be displayed by valid sepsis criteria. Pearson correlation coefficients were calculated to assess mortality associations and statistical examination of correlations was based on refined Fisher transformation [28].
Sensitivity analysis.
For sensitivity analyses, examination was limited to the day of ICU admission for two reasons: It entailed timely fixation for comparison between infected and non-infected patients, and major studies examining sepsis-3 in the ICU limited observation to the 24 hours after admission [29–33] (for details see S1 Appendix section C).
Results
Frequencies of proposed sepsis criteria in our ICU cohort and descriptive results in relation to in-hospital mortality
Of 791 examined patients, 369 (47%) were assigned infected and 422 (53%) non-infected, with an in-hospital mortality of 39% and 15% respectively (Table 1). SOFA distribution at infection onset and evaluation time points is displayed in Fig 1 (for definition of infection onset and evaluation time points see S1 Appendix section A). Frequencies of positive sepsis criteria by reference population are shown in Table 2. Frequencies of sepsis diagnoses in dependence on applied sepsis criteria in the overall ICU cohort were 42% (SIRS≥2), 46% (SOFA≥2), and 36% (SOFA_change≥2). In infected patients, SIRS≥2 indicated sepsis in 90% at infection onset, SOFA≥2 in 99% and SOFA_change≥2 in 77%. In non-infected patients at the respective evaluation time points, SIRS, SOFA and SOFA_change were ≥2 in 78%, 88% and 58% respectively. Further, SIRS was ≥2 in 5231 of 5978 (88%) observed patient days in the infected group, and in 1292 of 1724 days (75%) in the non-infected group. The respective percentages for SOFA≥2 were 100% in infected and 94% non-infected patients respectively (for SOFA_change 12% and 23%, respectively) (Table 3).
All scores on shaded background fulfill the threshold ≥2 proposed by sepsis-3 as clinical sepsis criterion. SOFA was ≥2 in 93% of all patients (infected: 99%; non-infected patients: 88%). The x-axis is the score range, with SOFA truncated at 23 points for illustration.
Regarding the relation of proposed sepsis criteria to in-hospital mortality (Table 4), at infection onset, SIRS displayed a sensitivity of 97% [95% Confidence Interval (CI) 94%–99%] and specificity of 14% [10%–19%] for in-hospital mortality. SOFA was ≥2 in 100% of decedents and in 99% of survivors, resulting in a sensitivity of 100% [100%–100%] and specificity of 1% [0%–3%] for in-hospital mortality at infection onset. In non-infected patients the respective values were for SIRS≥2 79% [68%-89%] and 22% [18%-27%] and for SOFA≥2 100% [100%-100%] and 14% [11%-18%] respectively. Positive and negative predictive values for in-hospital mortality in infected patients for SIRS≥2 were 42% and 89%, and for SOFA≥2 39% and 100% respectively. In non-infected patients positive and negative predictive values for SIRS≥2 were 15% and 86% and for SOFA≥2 17% and 100% respectively. Risk ratio and odds ratio for in-hospital mortality were significant for SIRS≥2 in infected patients at infection onset (risk ratio: 3.8 [1.4–9.7]; odds ratio: 5.8 [2.0–16.8]), but not significant in non-infected patients (risk ratio: 1.1 [0.6–2.0]; odds ratio: 1.1 [0.5–2.2]). Because all deceased patients had a SOFA score ≥2 at time points of infection or evaluation, risk and odds ratios could not be calculated for SOFA≥2. Risk ratio for in-hospital mortality of SOFA_change≥2 was similar in infected and non-infected patients (2.2 [1.4–3.5]) and 2.2 [1.2–3.6] respectively). Of note, 20% of infected patients that had a change in SOFA <2 points at infection onset died in hospital later on. In-hospital-death was observed for 11% of infected patients with SIRS <2 at infection onset.
Discrimination of raw scores (SIRS/SOFA) for in-hospital mortality
AUROC of SOFA for in-hospital mortality was 0.75 ([95%CI 0.69–0.80]; p<0.001) for infected patients, and significantly higher for non-infected patients 0.85 ([0.80–0.90]; p<0.001) (Fig 2). SOFA_change discriminated for in-hospital mortality similarly in infected (AUROC 0.70 [0.65–0.76]; p<0.001) and non-infected patients (AUROC 0.74 [0.65–0.82]; p<0.001). SIRS was associated with in-hospital mortality in both subgroups with lower AUROCs compared to SOFA and SOFA_change (AUROC infected: 0.61 [0.55–0.66]; p<0.001; non-infected: 0.62 [0.53–0.70]; p = 0.002) (Fig 2).
Discrimination capacity of SOFA for in-hospital mortality was significantly stronger in non-infected than in infected patients (p = 0.006). SIRS discriminated for in-hospital mortality in both subgroups with lower AUC compared to SOFA applications. If applied with the proposed threshold, SOFA≥2 was more sensitive and less specific for in-hospital mortality than SIRS≥2. AUC: Area Under the Curve; SIRS: Systemic Inflammatory Response Syndrome; SOFA: Sequential Organ Failure Assessment; sens: sensitivity; spec: specificity.
Correlation of raw scores (SIRS/SOFA) and sepsis criteria (SIRS≥2/SOFA≥2/SOFA_change≥2) with in-hospital mortality in relation to expected correlations between sepsis and in-hospital mortality
The expected correlation between sepsis and in-hospital mortality was 0.10 for infected and 0 for non-infected patients (details on the probabilistic mathematical model can be found in S1 Appendix section D). As a raw score, without a threshold, the observed correlation of SOFA with in-hospital mortality was significantly stronger than the expected correlation for a sepsis criterion in both subgroups (infected: r = 0.43, p<0.01; non-infected: r = 0.50, p<0.01). Correlation of SIRS and in-hospital mortality was scarcely in range of expectations in infected patients (r = 0.20; p = 0.0504) (Table 5).
Examining the threshold of 2, SIRS≥2 agreed with expected correlations in both subgroups (infected: r = 0.19, p = 0.10; non-infected: r = 0.01, p = 0.77). SOFA≥2 was in range of expectations in infected patients, but deviated significantly from the expected mortality association pattern in non-infected patients (r = 0.16, p<0.01). SOFA_change≥2 deviated from expectations in both patient subgroups (infected: r = 0.21, p = 0.04; non-infected: r = 0.14, p<0.01) (Table 5).
Sensitivity analysis
Sensitivity analysis restricting examination to ICU admission showed that 261 patients (33% of total cohort) entered the ICU with an infection (in-hospital mortality 44%). Results were overall consistent with those reported for the entire cohort (details can be found in S1 Appendix section C).
Discussion
In this study, we assessed the impact of sepsis-3 criteria on sepsis diagnosis in comparison to sepsis-1/2 criteria within an ICU cohort and we performed a new, reference-guided mathematical approach to mortality-based validation of sepsis criteria.
Major findings of our study were: i) Compared to SIRS≥2, SOFA≥2 was more frequently observed in infected and non-infected patients at infection onset or the corresponding evaluation time points. The same was true for respective comparisons based on patient days (Tables 2 and 3). ii) Only SIRS≥2 showed significantly high risk ratio and odds ratio for in-hospital mortality in infected but not in non-infected patients, whereas risk ratio of SOFA_change≥2 was similar in both patient subgroups and SOFA ≥2 was met by all non-survivors in both patient groups preventing mortality risk prediction even in non-infected patients (Table 4). iii) In-hospital mortality association evaluated by AUROC analyses showed higher values for SOFA applications compared to SIRS score but neither displayed higher values for infected than for non-infected patients. In contrast, AUROC of SOFA was even significantly higher in non-infected compared to infected patients (Fig 2). And finally iv) Solely SIRS≥2 but neither SOFA≥2 nor SOFA_change≥2 met the pattern and strength of correlation with in-hospital mortality that would be expected of a valid sepsis criterion (Table 5) as estimated by our probabilistic mathematical model (see S1 Appendix section D) developed in response to the sepsis-3 criterion validity evaluation approach, as previously requested [9, 26, 27].
Based on these findings we advise caution to consider SOFA-based sepsis criteria to be of superior validity as compared to SIRS-based sepsis criteria in diagnosing sepsis in critically ill patients regarding the additional analytical aspects found in this study and discussed in the following:
SOFA application: Maximum versus change
Main analyses in the sepsis-3 validation study were undertaken and reported for SOFA≥2 [9], while investigation of an acute change in SOFA ≥2 resulted from a post hoc analysis [9]. Nevertheless an acute change in SOFA ≥2 was chosen as final criterion [2]. Because a baseline SOFA score to calculate a change from can be unknown in clinical practice and research datasets, especially around hospital and ICU admission respectively, the task force suggested to assume the baseline SOFA score to be zero in patients not known to have preexisting organ dysfunction [2]. In the original sepsis-3 ICU cohort, onset of infection happened within 48 hours of admission in 77% [9]. Dependent on how often 0 was assumed as baseline SOFA score, which was not reported, SOFA≥2 and SOFA_change≥2 lead to more or less similar results. Later it was stated by the corresponding author of the sepsis-3 definitions that two SOFA points were sufficient to meet criteria for sepsis, and that there was no requirement to calculate a change [2, 34]. This results in heterogeneous SOFA operationalizations in studies using sepsis-3 clinical criteria for sepsis diagnosis in the ICU: Major studies primarily analyzed SOFA≥2 [9, 32, 33]. In other studies the used sepsis-3 implementation is called “an acute change in SOFA≥2” but zero is used as baseline SOFA without taking preexisting organ dysfunction into account, consequently the used criterion is equivalent to SOFA≥2 [29, 30, 35]. Others found individual solutions to operationalize preexistent organ dysfunction for calculation of “an acute change in SOFA≥2” as primary analyzed criterion [31, 36] or in sensitivity analyses [29, 30, 32]. In some studies, it is not documented how the “acute change in SOFA” was implemented [21, 37]. This heterogeneity in operationalization of sepsis-3 criteria is problematic, as variation in sepsis defining criteria and their application have been identified as major obstacle in sepsis reporting [38, 39]. In our study we analyzed both, SOFA≥2 and SOFA_change≥2, which revealed that, depending on the performed analysis, the two forms of SOFA application captured discrepant results and thus cannot be used interchangeably. While discrimination for in-hospital mortality of SOFA and SOFA_change were similar in the sepsis-3 validation study and our ICU cohort (AUROC for SOFA and SOFA_change respectively: 0.75 and 0.70 in our data while 0.74 and 0.70 in sepsis-3 [9]), consecutive frequencies of sepsis diagnosis differed significantly in our study: SOFA_change≥2 assigned 22% less infected patients as septic compared to SOFA≥2 (Table 2).
For reasons of comparability, in the following section we focus the discussion on SOFA≥2, because main results of the sepsis-3 validation study concerning the ICU are reported for this SOFA application [9] and the majority of studies analyzing sepsis-3 criteria in the ICU in fact studied SOFA≥2 [9, 29, 30, 32, 33, 35].
Impact of SOFA≥2 as criterion in infected patients
In our cohort, SOFA was ≥2 in 99% of infected patients at infection onset, SIRS in 90% (Table 2). Consequently, SIRS≥2 indicated discrimination between uncomplicated infection and sepsis in 10 of 100 infected ICU patients in our study, SOFA≥2 only in 1 of 100. Hence, SOFA≥2 diminished this differentiation, which significantly impacts epidemiological figures and potential clinical consequences: Also in the original sepsis-3 ICU cohort, SOFA≥2 resulted in a higher frequency of sepsis diagnoses than SIRS≥2 at infection onset (91% and 84%) [9], which was not discussed. Likewise, for mixed and cancer patients admitted to ICUs with infection, higher occurrence of SOFA≥2 (90%, 97% and 87%) compared to SIRS≥2 (87%, 77% and 59%) was reported [29, 31, 35]. SOFA≥2 resulted in an increase in sepsis diagnoses of 4% compared to SIRS≥2 in our overall cohort, and of 9% within the subgroup of infected patients (Table 2). Fittingly, Fullerton et al. reported an increase in sepsis incidence of 4% and 28% respectively regarding comparable ICU populations [35]. These higher frequencies of sepsis diagnoses were observed although sepsis-1/2 criteria for ‘simple’ sepsis [10, 11], not ‘severe sepsis’, were applied. Considering that sepsis-3 sought to require the presence of organ dysfunction and was thought to replace the former ‘severe sepsis’ [2], the increase in sepsis diagnoses according to SOFA≥2 contradicts expectations [18, 19, 40–43], reinforcing the question whether SOFA along with the threshold of 2 was an adequate operationalization of organ dysfunction [21, 31, 44–46]. This is of special interest, as the importance of considering patients’ severity of illness when deciding about the breadth of antibiotics has been highlighted [47, 48] and studies document that patients with less severe disease suffer from negative consequences of antibiotic overuse [8, 47, 49, 50]. Our investigation raises concerns in that a switch from SIRS≥2 to SOFA≥2 as underlying criterion may increase the frequency of sepsis diagnoses in the ICU.
Infection-specific occurrence of sepsis criteria
By the time a patient develops signs of an organ dysfunction or systemic inflammation it is often unclear whether these conditions are related to an infection or not [2, 14–16]. An infection-specific occurrence would therefore be essential for a reliable sepsis criterion [51]. In our study, none of the evaluated criteria displayed this desirable pattern of occurrence. SOFA was ≥2 on 100% (SIRS: 88%) of observed days in infected patients and on 94% (SIRS: 78%) in non-infected patients (Table 3). A major argument to replace SIRS as sepsis criterion had been that it was present in too many patients, including those who never develop infection [2, 12]. Therefore, Singer et al. denied SIRS’ validity as sepsis criterion. In our cohort, these concerns are even more substantial for SOFA≥2. Fittingly, before sepsis-3 it was evident that SOFA, as raw score, captures adverse outcome [14, 52–54] and organ dysfunction [14, 23], but irrespective of the cause [14, 42, 55] and in particular irrespective of the presence of sepsis [14]. This gave reason to rename SOFA from ‘Sepsis-related Organ Failure Assessment’ to ‘Sequential Organ Failure Assessment’ [14]. If a patient is severely ill, may this be evident because of signs of organ dysfunction or systemic inflammation or both, one of the crucial clinical question remains whether this condition is caused by an infection or not [16, 56]. Our study underlines that neither of the proposed sepsis criteria are helpful to answer this question (compare frequencies of SIRS and SOFA based sepsis criteria in infected and non-infected patients (Tables 2 and 3)), as also others have highlighted [12, 14, 51].
Associations of sepsis criteria with in-hospital mortality
It was the pivotal effort of the sepsis-3 validation strategy to demonstrate superior criterion validity of SOFA as sepsis measure by showing a stronger association with in-hospital mortality for SOFA than SIRS in infected patients, measured by AUROC [2, 9]. In very good agreement with sepsis-3 validation study, we observed moderate discrimination capacity of SOFA applications for in-hospital mortality in infected patients (AUROC for SOFA and SOFA_change respectively: 0.75 and 0.70 in our data while 0.74 and 0.70 in sepsis-3 [9]) that was higher than SIRS’ AUROC for in-hospital mortality. Further studies in the ICU setting replicating this evaluation approach also reported comparable results [29, 31, 32, 37]. However, the use of AUROC for the assessment of risk prediction in general [57] and for sepsis-3 validation in particular [17, 58] was criticized. AUROC is a measure for discrimination, but it can lead to false conclusions if it is used to assess predictive correctness [57]. Moreover, AUROC analyses of neither SOFA nor SIRS displayed superior discrimination for in-hospital mortality in infected compared to non-infected patients. AUROC of SOFA for in-hospital mortality was even significantly higher in non-infected patients than in infected patients (Fig 2). Thus, neither SOFA nor SIRS, as raw scores, help to discriminate infectious from non-infectious causes of mortality in critically ill patients. Notably, AUROC assesses SOFA and SIRS as raw scores and carries no information about the proposed sepsis criteria threshold for SIRS≥2, SOFA≥2 or SOFA_change≥2 [17, 19]. Therefore we introduced additional analyses in our study (Table 4). Our observation that, at infection onset, SOFA was ≥2 in 100% of decedents and also in 99% of survivors, which was similarly reported in the sepsis-3 validation study [9] (98% and 90% respectively), revealed SOFA with the threshold of 2 to yield poor guidance as prognostic criterion in infected ICU patients. High sensitivity for in-hospital mortality of SOFA≥2 at the expense of specificity was reported before [21, 31]. Evaluated by risk ratios and correlation coefficients, SIRS≥2 captured a similar risk for in-hospital mortality compared to SOFA_change≥2 in infected patients. However, while 20% of infected patients that had a change in SOFA <2 points at infection onset died in hospital later on, this was observed in 11% for SIRS <2.
Infection-specific mortality association patterns of sepsis criteria–A new reference for mortality-based evaluation of sepsis criteria
Further doubt was raised whether mortality was an eligible endpoint for the validation of sepsis criteria [17–21]. To address this, we established references for the correlation between sepsis and in-hospital mortality, which was postulated [26, 27] but omitted during sepsis-3 validation process [9]. Based on sepsis-3 data, the expected correlation could be estimated as 0.10 for infected ICU patients, demonstrating that in-hospital mortality can be used to examine criterion validity of sepsis criteria. But contrary to the postulated way to consider this association, namely the larger the better [2, 9, 37], application of our mathematical probabilistic model prompts that a valid criterion is expected to show a positive but weak correlation with in-hospital mortality in infected patients and no correlation in non-infected patients, as sepsis cannot cause death in non-infected patients.
Consequently, the consideration of non-infected patients in our model revealed that in order to be valid, a sepsis measure must display this infection-specific mortality association capturing two questions: If the criterion is fulfilled (namely SIRS, SOFA and SOFA_change are ≥2, respectively), does it correlate with mortality in infected patients, but not in non-infected patients? And is the displayed correlation with mortality within the mathematical estimation for sepsis in infected patients?
In this study, the expected infection-specific mortality association pattern was solely observed for SIRS≥2, which was—adequate for a sepsis measure—prognostic in infected patients and non-prognostic in non-infected patients (Table 5). This means that for an infected patient the presence of SIRS≥2 significantly worsens the prognosis, and it does so to the extent that is expected for the occurrence of sepsis, while in non-infected patients the presence of SIRS≥2 does not alter the prognosis. Fittingly, risk ratio for in-hospital mortality was significant for SIRS≥2 in infected patients and non-significant in non-infected patients (Table 3). In contrast, SOFA≥2 showed higher correlations with in-hospital mortality in non-infected than in infected patients which contradicted expectations for a valid sepsis criterion. SOFA_change≥2 deviated from expectations for correlation with in-hospital mortality in infected and non-infected patients. In our study, none of the SOFA applications captured the mortality pattern of sepsis.
As was considered before [18, 32, 41], our analysis substantiates, and for the first time establishes by mathematical reasoning, that good mortality prediction and valid sepsis detection are separate issues. This gave reason to investigate an infection-specific mortality association pattern as a validity measure of sepsis criteria. This expected pattern was not displayed by SOFA, SOFA≥2, SOFA_change≥2 or SIRS, but solely by SIRS≥2.
Limitations and strengths
This was a single center study including 791 patients with higher morbidity compared to the sepsis-3 ICU cohort [9]. Consequently, in-hospital mortality was higher, but in good agreement with results of 11883 patients from 133 German ICUs, in which in-hospital mortality for sepsis was 40% [59] compared to 39% in this study. The higher sepsis mortality in European compared to US or Australian ICUs is topic of ongoing debate [59, 60]. Nevertheless, AUROC analyses for SOFA, SOFA_change and SIRS were in very good agreement with those reported for the sepsis-3 ICU population [9] indicating comparability. We strongly encourage validation of our results in different settings. To our knowledge, we are the first to establish quantifiable references for the correlation of sepsis and mortality. Calculation of reference correlations was based on published sepsis-3 data gained from 7932 ICU patients [9]. By considering non-infected patients in our study, we address a named limitation of sepsis-3 [2, 9].
Conclusions
In our ICU cohort, application of SOFA≥2 yielded a more liberal sepsis diagnosis than SIRS≥2, diminishing the differentiation between uncomplicated infection and sepsis. In this study, we establish a mathematical probabilistic model as a new approach to mortality-based evaluation of sepsis criteria that substantiates by mathematical reasoning, that good mortality prediction and valid sepsis detection are separate issues. Based on here established references, valid sepsis criteria are characterized by a weak correlation with in-hospital mortality in infected patients and no association in non-infected patients. This expected infection-specific mortality pattern was not displayed by SOFA, SOFA≥2, SOFA_change≥2 or SIRS, but solely by SIRS≥2. Further, our approach revealed that validity evaluation with mortality as endpoint is necessary but not sufficient for validation of sepsis criteria as none of the criteria showed an infection-specific occurrence that would be essential for reliable sepsis detection.
Supporting information
S2 Appendix. Spreadsheet on calculations performed in the data of the UPMC ICU validation cohort of sepsis-3 validation study.
https://doi.org/10.1371/journal.pone.0238548.s002
(XLSX)
S3 Appendix. Anonymized data set underlying our study.
https://doi.org/10.1371/journal.pone.0238548.s003
(XLSX)
Acknowledgments
We are grateful for data preparation by Christoph Scherer (controlling, mortality data) and Sebastian Wegerer (Institute for Clinical Chemistry, microbiological data). And we would like to express our sincere thanks towards our study nurse Corinna Worm for support during the data collection process.
References
- 1. Hotchkiss RS, Moldawer LL, Opal SM, Reinhart K, Turnbull IR, Vincent J-L. Sepsis and septic shock. Nat Rev Dis Primers. 2016;2: 16045. pmid:28117397
- 2. Singer M, Deutschman CS, Seymour CW, Shankar-Hari M, Annane D, Bauer M, et al. The Third International Consensus definitions for sepsis and septic shock (Sepsis-3). JAMA. 2016;315: 801–810. pmid:26903338
- 3. Kumar A, Roberts D, Wood KE, Light B, Parrillo JE, Sharma S, et al. Duration of hypotension before initiation of effective antimicrobial therapy is the critical determinant of survival in human septic shock. Crit Care Med. 2006;34: 1589–1596. pmid:16625125
- 4. Ferrer R, Martin-Loeches I, Phillips G, Osborn TM, Townsend S, Dellinger RP, et al. Empiric antibiotic treatment reduces mortality in severe sepsis and septic shock from the first hour: results from a guideline-based performance improvement program. Crit Care Med. 2014;42: 1749–1755. pmid:24717459
- 5. Levy MM, Dellinger RP, Townsend SR, Linde-Zwirble WT, Marshall JC, Bion J, et al. The Surviving Sepsis Campaign: results of an international guideline-based performance improvement program targeting severe sepsis. Intensive Care Med. 2010;36: 222–231. pmid:20069275
- 6. Rhodes A, Evans LE, Alhazzani W, Levy MM, Antonelli M, Ferrer R, et al. Surviving sepsis campaign: international guidelines for management of sepsis and septic shock: 2016. Intensive Care Med. 2017;43: 304–377. pmid:28101605
- 7. Roger P-M, Montera E, Lesselingue D, Troadec N, Charlot P, Simand A, et al. Risk factors for unnecessary antibiotic therapy: a major role for clinical management. Clin Infec Dis. 2018;69: 466–472.
- 8. Klompas M, Calandra T, Singer M. Antibiotics for Sepsis—Finding the Equilibrium. JAMA. 2018;320: 1433–1434. pmid:30242350
- 9. Seymour CW, Liu VX, Iwashyna TJ, Brunkhorst FM, Rea TD, Scherag A, et al. Assessment of clinical criteria for sepsis: for the Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3). JAMA. 2016;315: 762–774. pmid:26903335
- 10. Bone RC, Balk RA, Cerra FB, Dellinger RP, Fein AM, Knaus WA, et al. Definitions for sepsis and organ failure and guidelines for the use of innovative therapies in sepsis. The ACCP/SCCM Consensus Conference Committee. American College of Chest Physicians/Society of Critical Care Medicine. Chest. 1992;101: 1644–1655. pmid:1303622
- 11. Levy MM, Fink MP, Marshall JC, Abraham E, Angus D, Cook D, et al. 2001 sccm/esicm/accp/ats/sis international sepsis definitions conference. Intensive Care Med. 2003;29: 530–538. pmid:12664219
- 12. Vincent J-L, Opal SM, Marshall JC, Tracey KJ. Sepsis definitions: time for change. Lancet. 2013;381: 774. pmid:23472921
- 13. Kaukonen K-M, Bailey M, Pilcher D, Cooper DJ, Bellomo R. Systemic inflammatory response syndrome criteria in defining severe sepsis. N Engl J Med. 2015;372: 1629–1638. pmid:25776936
- 14. Vincent J-L, De Mendonça A, Cantraine F, Moreno R, Takala J, Suter PM, et al. Use of the SOFA score to assess the incidence of organ dysfunction/failure in intensive care units: results of a multicenter, prospective study. Crit Care Med. 1998;26: 1793–1800. pmid:9824069
- 15. Gucyetmez B, Atalan HK. C-reactive protein and hemogram parameters for the non-sepsis systemic inflammatory response syndrome and sepsis: What do they mean? PloS One. 2016;11: e0148699. pmid:26863002
- 16. Ratzinger F, Schuardt M, Eichbichler K, Tsirkinidou I, Bauer M, Haslacher H, et al. Utility of sepsis biomarkers and the infection probability score to discriminate sepsis and systemic inflammatory response syndrome in standard care patients. PloS One. 2013;8: e82946. pmid:24349403
- 17. Makam AN, Nguyen OK. Clinical Criteria to Identify Patients With Sepsis. JAMA. 2016;316: 453.
- 18. Simpson SQ. New sepsis criteria: a change we should not make. Chest. 2016;149: 1117–1118. pmid:26927525
- 19. Simpson SQ. SIRS in the time of Sepsis-3. Chest. 2018;153: 34–38. pmid:29037526
- 20. Berry M, Patel BV, Brett SJ. New Consensus Definitions for Sepsis and Septic Shock: Implications for Treatment Strategies and Drug Development? Drugs. 2017;77: 353–361. pmid:28188516
- 21. Yoon J, Kym D, Hur J, Kim Y, Yang H-T, Yim H, et al. Comparative Usefulness of Sepsis-3, Burn Sepsis, and Conventional Sepsis Criteria in Patients With Major Burns. Crit Care Med. 2018;46: e656–e662. pmid:29620554
- 22.
Rothman KJ, Greenland S, Lash TL. Types of epidemiologic studies. In: Rothman KJ, Greenland S, Lash TL. Modern epidemiology. Lippincott Williams & Wilkins; 2008. pp 87–99.
- 23. Vincent J-L, Moreno R, Takala J, Willatts S, De Mendonça A, Bruining H, et al. The SOFA (Sepsis-related Organ Failure Assessment) score to describe organ dysfunction/failure. Intensive Care Med. 1996;22: 707–710. pmid:8844239
- 24.
R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing; 2016.
- 25. Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez J-C, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC bioinformatics. 2011;12: 77. pmid:21414208
- 26. Angus DC, Seymour CW, Coopersmith CM, Deutschman CS, Klompas M, Levy MM, et al. A framework for the development and interpretation of different sepsis definitions and clinical criteria. Crit Care Med. 2016;44: e113–e121. pmid:26901559
- 27. Seymour CW, Coopersmith CM, Deutschman CS, Gesten F, Klompas M, Levy M, et al. Application of a framework to assess the usefulness of alternative sepsis criteria. Crit Care Med. 2016;44: e122–e130. pmid:26901560
- 28. Vrbik J. Population moments of sampling distributions. Computational Statistics. 2005;20: 611–21.
- 29. Raith EP, Udy AA, Bailey M, McGloughlin S, MacIsaac C, Bellomo R, et al. Prognostic accuracy of the SOFA score, SIRS criteria, and qSOFA score for in-hospital mortality among adults with suspected infection admitted to the intensive care unit. JAMA. 2017;317: 290–300. pmid:28114553
- 30. Shankar-Hari M, Harrison D, Rubenfeld G, Rowan K. Epidemiology of sepsis and septic shock in critical care units: comparison between sepsis-2 and sepsis-3 populations using a national critical care database. Br J Anaesth. 2017;119: 626–36. pmid:29121281
- 31. Costa RT, Nassar AP Jr, Caruso P. Accuracy of SOFA, qSOFA, and SIRS scores for mortality in cancer patients admitted to an intensive care unit with suspected infection. J Crit Care. 2018;45: 52–7. pmid:29413723
- 32. Fang X, Wang Z, Yang J, Cai H, Yao Z, Li K, et al. Clinical Evaluation of Sepsis-1 and Sepsis-3 in the ICU. Chest. 2018;153: 1169–76. pmid:28711593
- 33. Cheng B, Li Z, Wang J, Xie G, Liu X, Xu Z, et al. Comparison of the performance between sepsis-1 and sepsis-3 in ICUs in China: a retrospective multicenter study. Shock. 2017;48: 301–306. pmid:28448400
- 34. Deutschman CS. Sepsis-3: Of Love and Bliss. Crit Care Med. 2017;45:739–40. pmid:28291096
- 35. Fullerton JN, Thompson K, Shetty A, Iredell JR, Lander H, Myburgh JA, et al. New sepsis definition changes incidence of sepsis in the intensive care unit. Crit Care Resusc. 2017;19: 9–13. pmid:28215126
- 36. Giamarellos-Bourboulis EJ, Tsaganos T, Tsangaris I, Lada M, Routsi C, Sinapidis D, et al. Validation of the new Sepsis-3 definitions: proposal for improvement in early risk identification. Clin Microbiol Infect. 2017;23: 104–109. pmid:27856268
- 37. Estenssoro E, Edul VSK, Loudet CI, Osatnik J, Ríos FG, Vázquez DN, et al. Predictive Validity of Sepsis-3 Definitions and Sepsis Outcomes in Critically Ill Patients: A Cohort Study in 49 ICUs in Argentina. Crit Care Med. 2018;46: 1276–1283. pmid:29742584
- 38. Shankar-Hari M, Harrison DA, Rowan KM. Differences in impact of definitional elements on mortality precludes international comparisons of sepsis epidemiology—a cohort study illustrating the need for standardized reporting. Crit Care Med. 2016;44: 2223–2230. pmid:27352126
- 39. Verboom DM, Frencken JF, Ong DS, Horn J, van der Poll T, Bonten MJ, et al. Robustness of sepsis-3 criteria in critically ill patients. J Intensive Care. 2019;7: 46. pmid:31489199
- 40. Schneider-Lindner V, Lindner HA, Thiel M. Definitions for sepsis and septic shock. JAMA. 2016;316: 457.
- 41. Cortés-Puch I, Hartog CS. Opening the debate on the new sepsis definition change is not necessarily progress: revision of the sepsis definition should be based on new scientific insights. Am J Respir Crit Care Med. 2016;194: 16–18. pmid:27166972
- 42. Sprung CL, Schein RM, Balk RA. To SIRS With Love-An Open Letter. Crit Care Med. 2017;45: 736–738. pmid:27792041
- 43. Schmoch T, Bernhard M, Uhle F, Grundling M, Brenner T, Weigand MA. New Sepsis-3 definition: Do we have to treat sepsis before we can diagnose it from now on?. Anaesthesist. 2017;66: 614–621. pmid:28497242
- 44. Wu F, Zhou X. The comparisons and limitations of Sepsis 2.0 and Sepsis 3.0. J Crit Care. 2018;47: 350–351. pmid:30037661
- 45. Williams JM, Greenslade JH, McKenzie JV, Chu K, Brown AF, Lipman J. Systemic inflammatory response syndrome, quick sequential organ function assessment, and organ dysfunction: insights from a prospective database of ED patients with infection. Chest. 2017;151: 586–596. pmid:27876592
- 46. Ljungström L, Andersson R, Jacobsson G. Incidences of community onset severe sepsis, Sepsis-3 sepsis, and bacteremia in Sweden–A prospective population-based study. PloS One. 2019;14: e0225700. pmid:31805110
- 47. Gilbert DN, Kalil AC, Klompas M, Masur H, Winslow DL. IDSA Position Statement: Why IDSA did not endorse the surviving sepsis campaign guidelines. Clinical Infectious Diseases. 2018;66: 1631–1635.
- 48. Kalil AC, Gilbert DN, Winslow DL, Masur H, Klompas M. Reply to al-hasan and justo. Clin Infec Dis. 2018;68: 1432.
- 49. Hammond DA, Smith MN, Li C, Hayes SM, Lusardi K, Bookstaver PB. Systematic review and metaanalysis of acute kidney injury associated with concomitant vancomycin and piperacillin/tazobactam. Clin Infec Dis. 2017;64: 666–674.
- 50. Hranjec T, Rosenberger LH, Swenson B, Metzger R, Flohr TR, Politano AD, et al. Aggressive versus conservative initiation of antimicrobial treatment in critically ill surgical patients with suspected intensive-care-unit-acquired infection: a quasi-experimental, before and after observational cohort study. The Lancet Infect Dis. 2012;12: 774–780. pmid:22951600
- 51. Anand V, Zhang Z, Kadri SS, Klompas M, Rhee C. Epidemiology of qSOFA Criteria in Undifferentiated Patients and Association with Suspected Infection and Sepsis. Chest. 2019;156: 289–297. pmid:30978329
- 52. Moreno R, Vincent J-L, Matos R, Mendonca A, Cantraine F, Thijs L, et al. The use of maximum SOFA score to quantify organ dysfunction/failure in intensive care. Results of a prospective, multicentre study. Intensive Care Med. 1999;25: 686–696. pmid:10470572
- 53. Ferreira FL, Bota DP, Bross A, Mélot C, Vincent J-L. Serial evaluation of the SOFA score to predict outcome in critically ill patients. JAMA. 2001;286: 1754–1758. pmid:11594901
- 54. Minne L, Abu-Hanna A, de Jonge E. Evaluation of SOFA-based models for predicting mortality in the ICU: A systematic review. Crit Care. 2008;12: R161. pmid:19091120
- 55. Ho KM, Lan NS. Combining quick Sequential Organ Failure Assessment with plasma lactate concentration is comparable to standard Sequential Organ Failure Assessment score in predicting mortality of patients with and without suspected infection. J Crit Care. 2017;38: 1–5. pmid:27829179
- 56. Churpek MM, Snyder A, Sokol S, Pettit NN, Edelson DP. Investigating the impact of different suspicion of infection criteria on the accuracy of qSOFA, SIRS, and early warning scores. Crit Care Med. 2017;45: 1805–1812. pmid:28737573
- 57. Cook NR. Use and misuse of the receiver operating characteristic curve in risk prediction. Circulation. 2007;115: 928–935. pmid:17309939
- 58. Simpson SQ. Diagnosing sepsis: a step forward, and possibly a step back. Ann Transl Med. 2017;5: 55. pmid:28251134
- 59. Marx G, et al. Incidence of severe sepsis and septic shock in german intensive care units- the insep study. Intensive Care Med. 2016;42: 1980–1989. pmid:27686355
- 60. Levy MM, Artigas A, Phillips GS, Rhodes A, Beale R, Osborn T, et al. Outcomes of the Surviving Sepsis Campaign in intensive care units in the USA and Europe: a prospective cohort study. The Lancet Infect Dis. 2012;12: 919–924. pmid:23103175