Background
There is an increasing trend to consider mortality as a sub-optimal primary endpoint in randomized controlled trials (RCTs) involving critically ill patients [
1‐
4]. In acute respiratory failure, the need for invasive mechanical ventilation may appear as a legitimate alternative to avoid negative RCTs. Indeed, over the last two decades, strategies to avoid invasive mechanical ventilation and related complications have been evaluated in patients with acute hypoxemic respiratory failure [
1,
3,
5‐
7]. This consideration is even more relevant in immunocompromised patients in whom higher mortality rates have been reported in patients requiring invasive mechanical ventilation, most particularly in those meeting criteria for acute respiratory distress syndrome [
8,
9]. Thus, noninvasive ventilation (NIV), continuous positive airway pressure (cPAP), and the use of high-flow nasal cannula (HFNC) have been widely explored in this subset of patients [
1,
3,
5‐
7]. Hence, the need for intubation has become a major clinical endpoint in RCTs of patients with acute respiratory failure [
8,
10‐
14].
In both cohort studies and RCTs, invasive mechanical ventilation delivery may however vary across centers, possibly leading to biased conclusions. This heterogeneity in mechanical ventilation use refers to “center effects,” a concern already explored in the intensive care setting [
15]. Actually, center effects can be divided into two components: (i) heterogeneity in the distribution of the study endpoint across centers and (ii) heterogeneity in the treatment effect on the outcome across centers known as “treatment by center interaction.” To date, no study has investigated a potential center effect in studies of immunocompromised patients with hypoxemic acute respiratory failure. In the present study, we sought to identify a center effect about the need for endotracheal intubation in immunocompromised patients with acute respiratory failure.
Methods
Centers and patients
This study included adult immunocompromised patients with hypoxemic acute respiratory failure (ARF) from the TRIAL-OH cohort and the HIGH randomized controlled trial (RCT). Details of each study have been described previously [
3,
16]. Firstly, we derived from the TRIAL-OH study (1011 patients, 17 centers, 2010–2011) [
16] a cohort of hematological patients admitted to ICU for ARF. We selected patients with the following criteria: ARF as defined by tachypnea > 30/min, respiratory distress, SpO2 < 90% at ICU admission, and/or labored breathing. Exclusion criteria were admission for another cause, hypercapnia (defined by a PaCO2 > 50 mmHg), and invasive mechanical ventilation before ICU admission. Secondly, we used the HIGH trial [
3], a RCT which enrolled 776 immunocompromised patients with hypoxemic ARF in 31 ICUs, in order to compare HNFC to standard oxygen at day-28 mortality. In both studies, each participating center has a senior intensivist and a senior hematologist in site available 24/7 and sharing decisions of ICU admission [
17]. Note that 14 centers were common in both the TRIAL-OH cohort and the HIGH trial.
The appropriate ethics committees approved each study.
Variables and risk adjustment
The primary outcome was the need for intubation through ICU stay as a dichotomous variable. In the TRIAL-OH study, the decision to perform endotracheal intubation was left to clinicians in charge at each ICU. In the HIGH trial, the decision to perform endotracheal intubation was based on predefined criteria, in agreement with current guidelines, including the therapeutic response and the clinical status (SpO2, respiratory rate, signs of respiratory distress, and bronchial secretion volume) [
3]. Individual data reported in tables were collected prospectively. ICU-level data (i.e., center characteristics) were also used in the analysis. In the TRIAL-OH cohort, these data were assessed as part of data collection. In the HIGH trial, they were assessed with a secondary survey from all the participant centers.
Statistical analysis
Continuous variables are described as median and interquartile range (IQR), and categorical variables are summarized by counts (percent).
Patient characteristics at admission were examined at the patient level in both studies, and the need for intubation was first assessed in each center without adjusting on potential confounding factors, which is referred as the “crude need for intubation.” Hierarchical logistic regression models were used to examine the variability on the outcome between ICUs and the association between ICU characteristics and the outcome, adjusting for patient characteristics [
18]. To do this, we used the mixed-effect model with a normally distributed random effect for ICU (random intercept). Exchangeability was assumed across all providers [
5]. In practice, the effect of a given ICU was modeled through its own regression coefficient, which compares to the crude average need for intubation across all centers [
5,
6]. This allows us to model between ICU heterogeneity in the average intubation risk. These models provide an estimation of heterogeneity in the form of the variance of the random effects, where the closer the variance is to zero, the smaller the center effect is. Because it could be hard to interpret, we computed the median odds ratio (MOR) to better understand the importance of the center effect on the mean intubation risk, that is on the same scale as traditional prognostic factors [
6,
7,
18,
19]. Briefly, MOR corresponds to the median of all ORs that can be computed between two patients with the same characteristics, but randomly chosen from two different centers, namely in a higher risk center and in a lower risk one [
6,
18]. MOR quantifies the differences between ICUs. If the MOR equals 1, then there are no differences in intubation risk between ICUs.
First, the model was built without any adjustment (“empty model”), which allows us to investigate a potential center effect. This unadjusted model contained only ICU-specific random intercepts. Then, we provided adjustment on predefined individual covariates as fixed effects, without ICU characteristics or interventions. These covariates were specified a priori as potential confounders and included age, comorbidities assessed by the Charlson comorbidity index, type of immunosuppression (in three classes, malignant hemopathy, solid cancer, and others), previous allogeneic stem cell transplantation, performance status > 2 (bedridden or dependent), severity of illness with SOFA score without respiratory item, Pa02/FiO2 ratio in four categories (> 300, 200–300, 100–200, ≤ 100 mmHg, with > 300 mmHg as reference), respiratory rate > 30 /min, and ARF diagnosis. This model was used to estimate adjusted ICU random effects, because we were interested in studying heterogeneity in outcomes. ICU were then ranked by their estimated random effect on intubation risk (adjusted for patient characteristics only) and classified into quartiles. For descriptive purposes, patient and ICU characteristics were compared across quartiles of the risk-adjusted intubation rate, using the Cochran–Mantel–Haenszel row mean score test and nonzero correlation test for testing the difference for categorical and continuous variables, respectively. To illustrate the magnitude of the effect of center on intubation risk, we performed conditional standardization of the regression results for a given patient with median and modal values for the covariates in the patient-level adjusted model. Last, hospital- and ICU-level covariates (center volume, annual volume of ID in ICU, annual rate of IMV patients defined by the number of patients with IMV/number of admission, academic status of hospital) were entered in the model as fixed effects and reported in terms of odds ratios, in order to try to explain the discrepancies between center. Because the time since respiratory symptoms onset and ICU admission could reflect local practices, it was considered as an ICU-level characteristic in the analysis.
To test for the significance of the center effects (empty model and model adjusted on patient and center characteristics), we used permutations test, a recommended approach to test for random effect [
20‐
23]. More details about the methods used are given in the Additional file
1.
These methods were applied in both the observational cohort and the RCT, separately.
Sensitivity analyses
Primary analyses were performed on the complete cases, assuming missing completely at random covariates. Then, sensitivity analyses for such assumptions were performed, based on multiple imputation with chained equation [
24]. We performed exploratory subset analyses, restricting ourselves to patients with full code status, in patients with malignant diseases in the HIGH trial and after exclusion of ICUs with extreme size (ICUs > 20 beds or ICUs< 8 beds). In the HIGH trial, we also investigated the center heterogeneity in the prognostic effect of the oxygenation strategy which was randomly assigned (HFNC or standard oxygen). This was not possible in the observational cohort due to allocation bias. To do this, we introduced two random effects in the patient-level adjusted model for each center: a random effect on the mean intubation (random intercept) risk as previously described and a random effect on the effect of the oxygenation strategy (with standard oxygen as reference) on intubation risk random slope. This allows us to model centers’ variability not only on the average intubation risk but also on the effect of the oxygenation strategy on the need for intubation. We then applied permutation test to investigate for significance of center effect [
20,
22].
All reported
p values are two-sided;
p < .05 was considered statistically significant. All analyses were performed using R version 3.1.0 (R Foundation for Statistical Computing [
http://www.R-project.org/]).
Discussion
This study was the first to explore variability across centers of the risk for invasive mechanical ventilation in critically ill immunocompromised patients with ARF. The significant variability in intubation rates applies to both a large observational cohort and a large randomized controlled trial. Moreover, the significant variability persisted after adjustment on potential individual confounders for IMV. Furthermore, the magnitude of the center effect, summarized herein with the median OR, was quite large. Last, we identified two ICU-level characteristics which could partly explain the observed discrepancy.
These findings suggesting between-center heterogeneity in intubation risk raise several concerns. In immunocompromised patients, ARF is the leading cause for ICU admission, the need for IMV being near 50% and mortality rates reaching up to 70% [
26‐
28]. Also, strategies to improve oxygenation and avoid invasive mechanical ventilation have received a great attention over the last two decades [
29,
30]. It is generally admitted that an ideal trial endpoint should be clinically relevant, accepted in medical practice, and sensitive and specific to detect the anticipated effect of the treatment [
31]. In the ICU setting, mortality has long been a major criterion. However, numerous RCTs were deemed negative because of the inappropriateness of the primary endpoint [
32]. As a consequence, the need for endotracheal intubation could appear as a better target in ARF and has become the commonest primary endpoint of more recent RCTs [
8,
10,
25]. However, in this study, we found that the adjusted intubation risk ranged from 46 to 70% in an observational study and from 28 to 55% in a RCT. Hence, evidences for a significant variation in intubation rate across different ICUs could challenge the validity of such outcome.
Different reasons could be argued to explain these discrepancies: First, organizational practices and local admission policies. In this study, we found that intubation risk increased with the annual ratio of patient with IMV in a given center, and the time from respiratory symptom onset to ICU admission. One could suppose that these two factors could reflect local practices for ICU admission and IMV initiation. Second, physician experience as well as patient conditions could influence the intubation decision [
33]. This point is hard to capture in statistical analysis, but in our study, we found a significant variation in intubation rate despite predefined intubation criteria. Moreover, most of intubation procedures have been made during out of hour period, supporting the importance of workload and personal practice factors. In a study which focused on IMV initiation in septic shock patients, de Montmollin et al. found a good agreement in a panel of intensivist for hypoxemia and respiratory rate as criteria for the need of intubation [
34]. However, in our study, discrepancies persisted after adjustment on these factors. These two points encourage the need for define consensual intubation criteria, as it appears to date as an important outcome. Last, patient severity is an important factor and could vary across centers, but in the present study, a significant center effect on intubation risk persisted even after adjustment on individual confounders.
These findings emphasize the need for several considerations. First, mortality is probably an unrealistic outcome in an era of numerous negative trials and endotracheal intubation appears to suffer for large center-related source of variation due to case-mix heterogeneity and more importantly local practices. Non-mortal outcome such as patients reported outcome or variation of severity parameters (delta-SOFA or oxygenation parameters) could be taken into consideration for future trials [
32]. Then, in observational studies and maybe RCT, it could be of importance to take into account between center variation in analysis. In this way, the random-effect logistic model could be of interest [
35,
36]. Finally, the large variation in endotracheal rate across centers despite adjustment on individual characteristics and patient severity could suggest variation in the decision of endotracheal intubation. As it remains an important outcome in clinical studies, there is maybe the need for a consensual definition of IMV implementation.
This study has several limitations. First, it was a post hoc analysis and we cannot rule the inherent limitations of these methods. However, using adequate statistical methods, we found significant center variations in intubation risk, identified potential explicative factors, and assessed its importance. Second, although we attempted to address ICU variation by adjusting for patient-level clinical factors known to be related with intubation risk, but the possibility of confounding by unmeasured covariates remains. Third, we used data from two studies performed in specialized centers. This could limit the generalizability of the results. Last, we cannot exclude that some of the patients could have been intubated for non-respiratory reasons (i.e., coma, severe shock, or copious tracheal secretions). However, our record of non-respiratory SOFA and the adjustments that were made in the analyses actually takes into account this issue.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.