Introduction
The phenomenon of sepsis related myocardial dysfunction has been recognized for over three decades [
1]. One of the earliest studies by Weisel et al. in 1977, using pulmonary artery catheter thermodilution method measured left ventricular stroke work index, observed that not only was left ventricular depression common in patients with sepsis, but was also potentially reversible [
2]. They also found that the left ventricular systolic function was reduced in non-survivor. Six years later, Hoffman et al., using gated cardiac scintigraphy, documented that right ventricular ejection fraction (RVEF) was depressed in patients with septic shock compared to normal [
3]. It was unclear from this study whether the survivors exhibited a better baseline right ventricular function. The most quoted study on this subject is Parker and co-authors' 1984 landmark paper which reported approximately half of the 20 patients with septic shock in the study had severely reduced baseline left ventricular ejection fraction (LVEF), and that the mean LVEF was paradoxically lower in survivors [
4]. Using radionuclide cineagiography, they also showed that survivors had dilated left ventricles (LV). In a subsequent study, Parker et al. further found that survivors of septic shock also had depressed initial RVEF, a finding that was not shared by Vincent et al., who noted that survivors from septic shock had markedly higher initial RVEF in two studies [
5,
6].
While many physicians believe poor LVEF have better outcome in patients with sepsis, some conflicting data have started to emerge in recent years. For example, Vieillard-Baron et al., by examining the hemodynamic parameters in septic shock patients, found that the initial LVEF were similar in survivors and non-survivors [
7]. Pulido et al. also failed to find any association between 30- or 100-day mortality with initial ventricular dysfunction [
8]. Some studies also casted doubts about the earlier claims that survivors from severe sepsis or septic shock have larger ventricular size [
7,
9].
Despite all these studies, the debates of whether or not ventricular dysfunction or dilatation (increased ventricular cavity size) is associated with lower mortality in severe sepsis or septic shock continue. One of the main reasons is the small sample size in majority of these studies lacked the statistical power to detect genuine differences in effect sizes. In light of this continuing uncertainty, we conducted this meta-analysis to assist in clarifying the situation. The goals of this meta-analysis are: (1) to critically appraise the best available observational studies that correlated venticular function and/or dimension with mortality in adult patients with severe sepsis or septic shock; (2) to synthesize the relevant data and determine if ventricular dysfunction or dilatation is associated with lower mortality rate; and (3) to list the main difficulties encountered in past studies and to propose some suggestions for future studies on this subject.
Methods
Study identification
Our search strategy began by breaking the research question down to: (1) patient type (study population); (2) indicator or indices; and (3) outcomes. For patient type, we included only severe sepsis or septic shock; for indicator, we used myocardial depression and any synonyms; for outcomes, we looked for mortality and any synonyms. Primary studies dated July 2012 and earlier were searched using PubMed and Embase (including Medline). The following terms (some with truncation) were searched as key and text words: 'Sepsis OR septic shock [title]' AND '((Ventricular OR cardiac OR myocardial) AND (failure OR depression OR dysfunction OR function OR impairment OR dimension OR dilat*)) OR (ejection fraction) [title]' AND '(death OR mortality OR surviv* OR outcome OR predict* OR prognos*)'. We limited the search to human studies, English language, clinical study, outcome study, clinical trials, article, article in press, conference abstract, or conference paper. The reference lists of all eligible studies were examined to search for other relevant studies.
Study eligibility
Two investigators (SJH and MN) independently reviewed the titles and abstracts of all relevant studies; all potential relevant studies were retrieved. When disagreement occurred, the reviewers resolved the discrepancies by consensus. Selection criteria used to identify studies were: prospective or retrospective observational studies; adult patients with either severe sepsis or septic shock; the initial measurements must be obtained objectively and as early as practicable; and used mortality as one of the outcome. The 1992 Consensus definitions of sepsis and organ failure was applied in all studies except for those published before 1992 [
10]. For these pre-1992 studies, all of which recruited septic shock patients, we adopted the definitions of septic shock as specified by the primary authors and the minimum criteria had to be: (1) positive blood culture or documented source of infection; (2) hypotension; and (3) clinical evidence of tissue hypoperfusion (such as lactic acidosis, oliguria, or decreased mental state). Studies were excluded if they: included minors (aged <18 years), were unclear about the methods of recruitment, duplicated studies, or contained insufficient information to extract the data.
Appraisal of study quality
Most of the studies were descriptive in nature and were not designed as true prognosis studies. We therefore modified an appraisal checklist for prognosis study developed by the Centre for Evidence Based Medicine, Oxford, to assess the quality of the studies [
11]. Details of the appraisal items are shown in Table
1. Studies that received more than one 'No' or 'Unclear' to the questions were excluded.
Table 1
Appraisal checklist for study quality.
Recruitment
Was the study cohort representative of adult population with severe sepsis or septic shock? (Yes/No/Unclear) Were the patients recruited at an early point in the course of sepsis? (Yes/No/Unclear) Were eligible patients recruited consecutively? Were all eligible patients within the study period recruited? (Yes/No/Unclear) |
Outcome and follow-up
Was 'mortality' clearly defined? (Yes/No/Unclear) Was patient follow-up sufficiently long and complete given the type of mortality defined? (Yes/No/Unclear) |
Measurement
Were all measurements obtained objectively using an established method? (Yes/No/Unclear) Were the initial measurements made sufficient early in the course of sepsis? (i.e. within 24 hours of diagnosis or onset of symptoms) (Yes/No/Unclear) Did the patients receive the same number of measurements using the same method at the same stage? (Yes/No/Unclear) Were the assessors blinded from the measurements or outcome? (Yes/No/Unclear) |
Data abstraction
Data abstraction was conducted by two investigators in each study to obtain information on year of publication, country, patients' demographics, sample size, ventricular function and dimension data, mortality, and other clinical data. Any disagreements were resolved by consensus. We contacted the primary investigators to provide further information when data were missing or unclear. The studies were excluded if the authors did not respond.
The following data were abstracted: for LV systolic function, LVEF and LV fractional area contraction; for LV dimension, LV end-diastolic diameter (LVEDD) or volume (LVEDV); for RV systolic function, RVEF and RV stroke volume change; and for RV dimension, RV end-diastolic diameter (RVEDD), volume (RVEDV), or area (RVEDA).
Data analysis
The meta-analysis was performed using STATA with
mais package. Patients were divided into two groups: survivors and non-survivors from severe sepsis or septic shock. Data from different studies were combined to obtain a pooled outcome effect measure and unless specified otherwise, all data were expressed as standardized mean difference (SMD) (95% confidence interval (CI)), SMD, which is the weighted mean difference (survivors minus non-survivors) of the outcome measure (for example, LVEF). An overall SMD of zero represents no difference between the survivors and non-survivors. A negative value indicates survivors had smaller value, and a positive value indicates survivors had larger value. Since outcome data were presented in form of continuous data, and different studies might use different variable outcomes (for example, LVEDD or LVEDV for ventricular size) or different methods (for example, radionuclide
vs. echocardiography) to obtain the same variable outcome, the SMD was calculated as described by DerSimonian and Laird's random effects model [
12].
Heterogeneity was tested using Cochran's Q test. A
P value of < 0.05 was considered as indication of significant heterogeneity. I
2 statistics were computed from Cochran's Q test as described by Higgins et al. [
13]. I
2 is the percentage variation in SMD attributable to heterogeneity, and values between 25% and 50% indicate low heterogeneity, whereas between 50% and 75% and >75% indicate moderate and high heterogeneity, respectively. Before the analysis, we formed a prior hypothesis that if heterogeneity existed in the LV function studies, the source could be due to: (1) the difference in the definitions of sepsis (that is, pre-1992
vs. post-1992 consensus definition); (2) mortality (for example, in-hospital, 30-day
vs. 90-day mortality); (3) whether or not the study included patients with severe sepsis; and (4) whether or not patients with history of cardiac disease were excluded. To explore this, we decided that a meta-regression analysis containing the above four covariates should be carried out if the heterogeneity existed [
14]. Random permutation test for significance was used to adjust for multiple testings [
14].
Small-study effects (also known as 'publication bias') were examined by funnel plots. To avoid subjectivity in the interpretation of funnel plots, we tested for plot symmetry by Egger's regression test [
15]. Further assessment of the publication bias was tested using the Duval and Tweedie non-parametric 'trim and fill' method. This method estimates the number and outcomes of missing studies, and adjusts the meta-analysis to incorporate the imputed missing data [
16]. Due to the small number of studies, trim and fill analysis was not performed for RV function and dimension analysis.
Discussion
This meta-analysis provides pooled estimates of RV and LV function and dimension in patients with severe sepsis and septic shock from 14 individual studies. Using a set of well-defined inclusion criteria, we found that there was not enough evidence to support the view that survivors from severe sepsis or septic shock suffered from early LV dysfunction. When indexed to BSA or body height, the LV dimensions were similar in survivors and non-survivors. However, un-indexed dimensions and pooled results suggested that the LV were mildly enlarged in survivors. There were no statistical significant differences in RV function or dimensions between survivors and non-survivors.
LV depression is a well-established phenomenon among patients with severe sepsis or septic shock [
27]. Multiple mechanisms are believed to be responsible for sepsis-induced LV depression, including mitochondrial dysfunction, oxidative stress, inflammatory actions, and myocardial injury [
28]. Using radionuclide cineangiography, Parker and colleagues observed that survivors from septic shock developed acute LV and RV depression and dilatation which normalized with recovery, hence the term 'reversible myocardial depression' [
4,
5]. Similar findings were made by Jardin [
21]. Yet, a number of subsequent studies failed to reproduce Parker's results. For example, we analyzed the 1984 study results from Kimchi et al. and could not find any significant differences in LVEF or RVEF between survivors and non-survivors from septic shock (43 ± 13%
vs. 51 ± 22% for LVEF (
P = 0.232) and 35 ± 2%
vs. 39 ± 5% for RVEF (
P = 0.439)) [
17]. Vincent et al., on the other hand, showed that survivors from septic shock had better right ventricle function [
6,
19]. There are several explanations for the discordances reported by these studies, including sample size, definition of sepsis and septic shock, study population, types of cardiac investigations, and effects of therapy.
Sample size
Earlier studies usually had smaller sample sizes and significant group size imbalance. For example, the original study by Parker et al. had only 20 patients with seven non-survivors, Kimchi et al. had 25 with only eight non-survivors, and Vincent et al. had 34 patients (23 non-survivors) with septic shock. The main problems arise from small studies are that they tend to result in imprecise estimates and are more likely to find extreme results. The latter of these can account for inter-study discordant. To detect a 10% difference in LVEF between two groups (assuming a SD of 15%), one needs at least 70 patients (35 in each group) to achieve 80% power. A 5% difference will need over 300 patients in total. While more recent studies tended to be larger in size, few of them had the necessary sample size to provide a conclusive result.
Definition of severe and time of recruitment
Some primary studies used different definitions of septic shock for recruiting patients, and those published before the Consensus definitions defined septic shock differently. Even with the publication of the Consensus definitions, the process has not been made easier [
10]. For instance, depending on whether a restricted or a liberal approach is used, applying the consensus definition resulted in a range of incidences of severe sepsis (6% to 27%) and septic shock (4% to 9%) in the same study [
29]. This highlighted the difficulty in recruiting patients in sepsis studies. Not only could the differences in definitions and recruitment stringency affect the type of patients recruited, but the timing of recruitment could also contribute. One typical situation is that not all SIRS criteria are present at start of infection, which itself is difficult to define [
30]. Although some studies described the times of recruitment and when the first assessments were done, others were not so explicit probably due to the difficulty in defining the onset of sepsis, which might have occurred before admission to ICU. Arguably, some studies might report the worst values, while other might have reported the values on Day 1. Recruiting patients of different types or at different stages could inevitably affect the composition of study population, hence the generalizability.
Study population
To have a homogeneous study population across different studies is a challenge in systematic reviews. Heterogeneity of study populations can arise from two sources: (1) inclusion; and (2) exclusion. Although the study populations in all of the studies were severe sepsis and septic shock, the sources of sepsis varied. Fortunately, the sources of sepsis found were comparable in most of the included studies, and were typical to that found in general ICU. On the other hand, the exclusion criteria varied widely among the studies. The most common exclusion factor was the presence of pre-existing cardiac conditions. Although it is understandable that excluding these patients would reduce the risk of type I error (false-positives) of finding sepsis-induced myocardial depression, such exclusion could be a curb on giving the answer if pre-existing cardiac conditions would pre-dispose patients to sepsis-induced myocardial depression and/or increase the risk of death. In the original study by Parker et al., four of the 20 patients had underlying cardiac conditions (coronary heart disease and cardiomyopathy), but this did not increased the risk of death (OR = 0.6; 95% CI = 0.06 to 5.6) [
4]. In our previous study, none of the 12 patients with pre-existing cardiac conditions developed sepsis-induced myocardial depression and the presence of cardiac conditions did not increase the risk of death statistically (OR = 2.4; 95% CI = 0.5 to 10.7) [
9]. Similar findings were found in a study by ver Elst et al. where previous myocardial infarct did not increase mortality in septic shock (OR = 2.1; 95% CI = 0.5 to 8.3) [
31]. These studies suggested that pre-existing cardiac conditions neither altered the risk of sepsis-induced myocardial depression nor increased the risk of death in severe sepsis and septic shock. As the total sample size was still small in these studies, more research is required in this area.
Assessment methods
In the last three decades, there has been a clear shift in assessment methods for EFs in these studies - from invasive and to non-invasive. Temporally, radionuclide angiography was the mainstream in the earliest research, followed by pulmonary artery catheter thermodilution, and finally by echocardiography in modern era. Some doubts existed in relation to the comparability of LVEF obtained by echocardiography and radionuclide angiography. Takenaka et al. found that LVEF measurements by the two methods could yield different estimates as a result of the state of the patients such as anxiety [
32]. More recent studies, however, demonstrated that the two methods have good comparability [
33,
34]. The comparability of RVEF by thermodilution and radionuclide however was not as good [
35]. Of note, Vincent et al. found that RVEF from thermodilution could be underestimated in the presence of tricuspid regurgitations [
36]. The complicated shape of the right ventricle precludes the use of echocardiography to measure RVEF. Due to a lack of strong evidence to support comparability among the three methods, readers quoting EFs for survivors and non-survivors need to mention the methods used.
Effects of therapy
Depending on the time of cardiac assessments, fluid resuscitation had the potential effects of altering the ventricular dimensions and possibly function. However, most of the studies were not clear whether fluid had been administered before or after measurements. The use of vasopressors and/or inotropes is another factor that could affect the findings. Vieillard-Baron et al. found that while some (39%) patients with septic shock exhibited 'primary' global hypokinesia (depressed LV function) at admission, some (21%) developed 'secondary' global hypokinesia 24 to 48 h after hemodynamic support by norepinephrine [
7]. This 'secondary' hypokinesia could be partially reversed by dobutamine. The authors could not rule out a causative relationship between noradrenaline and hypokinesia, but pointed out that the hypokinesia could be the direct deleterious effects of the associated increase in afterload on a latent dysfunctioning heart in sepsis. If this is true, then studies that had their initial assessments done within 24 h of onset of symptoms could have underestimated the extent of myocardial depression in their cohorts. This could have implications in the interpretation of myocardial depression and mortality.
Mechanical ventilation is another potential confounding factors that could affect the ventricular function and dimension in sepsis via heart-lung interactions. The change in ventilation practice since the ARDSNet study in 2000 [
37] is a possible bias or confounders for ventricular function and dimension. However, the effects of mechanical ventilation, in particular protective ventilation, were not explored due to a lack of consistent ventilation data. In this regards, it is noteworthy that most eligible studies contained a mixed population of sepsis, instead of a pure population with acute lung injury or acute respiratory distress syndrome. Hence, the ventilator settings might be different for different patients.
Effects of indexation on LV dimensions
Our meta-analysis found that when LV dimensions were indexed against BSA or body height, no differences between survivors and non-survivors were found in the pooled results. However, the survivors displayed mildly dilated LV ventricles when non-indexed dimensions were used. Of interest, in Landesberg et al.'s study, non-indexed LVEDD resulted in non-significant results between the survivors and non-survivors, whereas indexed LVEDV led to significant different results. Differences in using indexed and non-indexed parameters are to be expected. While it is natural for heart size to follow both body size, body composition and physiological demands (for example, exercise) to accommodate the greater metabolic needs, the heart is also capable of adapting itself to acute conditions, for example, dilating acutely with expanded intravascular volume. According to our findings, the non-indexed dimensions were more consistent across studies when compared to indexed dimensions. Depending on which parameter is used, this clearly has an impact on study interpretations. More studies are needed in this area.
Excluded studies
Most studies were excluded due to a lack of sufficient data. One relevant study was excluded because it reported the results as median (25th/75th percentile), and the author did not provide the data in mean (± SD) when requested [
38]. Nevertheless, the authors in that study found no difference in LVEF between the survivor and non-survivor groups. Unfortunately, we have to exclude all studies published by Parker and colleagues due to the inclusion of non-adults in their cohorts [
4,
39,
40]. It was also unclear if the patients were recruited consecutively or if all the patients within the study period were recruited. Another interesting point to note about Parker's 1984 studies is that the cohort consisted of a large proportion (14 out of 20) of patients with underlying malignancy, whether or not the cardiac function or dimensions of these patients were already affected by the chemotherapy was unknown.
We also excluded studies that used other non-global estimates for LV or RV function, such as tissue Doppler as it only measures one segment of the myocardium [
30].
Suggestions for future studies
Considering the challenges above, we suggest future studies should be prospective in nature and use the Consensus definition for recruitment. To improve generalizability, exclusion criteria should not be too restrictive and inclusion of patients with underlying cardiac conditions should be considered. To handle possible confounding effects of underlying cardiac conditions, a priori plan of subgroups analyses can be made. The sample size should be estimated beforehand based on published data. To facilitate comparisons between studies, results should report the mean (SD), plus median (95% CI) if necessary. Echocardiography remains as the preferred methods for assessment at present. Although echocardiography cannot assess RVEF, other estimates of RV function (for example, tissue Doppler) can be used instead. Treatments (vasopressors and inotropes) given, and whether or not the patient received fluid, at the time of assessment should be documented. Serial measurements, at least for the first 2 to 3 days and after fluid optimization and norepinephrine administration, should be performed. While time-to-event survival analysis is the preferred method of analysis, ICU and in-hospital mortality should be reported at the very least.
The main purpose of meta-analysis is to summarize the results from smaller studies in a systematic and unbiased manner. Hence, the quality and conclusion of meta-analysis depends on the quality of included studies. The variability in study designs is one of the most concerned factor in meta-analysis. In this study we addressed a number of factors that could possibly explained results heterogeneity and might also affect the quality of this meta-analysis: sample size, definition of sepsis and septic shock, study population, types of cardiac investigations, and effects of therapy. To minimize results heterogeneity, we used a list of objective and unbiased recruitment criteria to identify appropriate studies. However, since meta-analysis is retrospective in nature, statistical manipulation could neither save the differences in study designed nor improve the studies' quality. As such, we are unable to address several important issues due to inconsistent study designs and lack of data: pre-admission ventricular function, the precise effects of early administration of vasopressors and inotropes, the relationship between load-independent myocardial function index and mortality. A definitive answer to these questions will require a large and well-designed prospective study.
Authors' contributions
SJH and ASM initiated and designed the study. SJH and MN performed the literature search, screened and appraised the papers. SJH performed the statistical analysis. SJH, MN, and ASM drafted the manuscript. All authors have read and approved the manuscript for publication.