
Open Access 01.12.2010 | Research article

Evidence at a glance: error matrix approach for overviewing available evidence

Authors: Frederik Keus, Jørn Wetterslev, Christian Gluud, Cornelis JHM van Laarhoven

Published in: BMC Medical Research Methodology | Issue 1/2010

Abstract

Background

Clinical evidence continues to expand and is increasingly difficult to overview. We aimed to conceptualize a visual assessment tool, i.e., a matrix for overviewing studies and their data, in order to assess the clinical evidence at a glance.

Methods

A four-step matrix was constructed using the three dimensions of systematic error, random error, and design error. Matrix step I ranks the identified studies according to the dimensions of systematic errors and random errors. Matrix step II orders the studies according to the design errors. Matrix step III assesses the three dimensions of errors in studies. Matrix step IV assesses the size and direction of the intervention effect.

Results

The application of this four-step matrix is illustrated with two examples: peri-operative beta-blockade initiated in relation to surgery versus placebo for major non-cardiac surgery, and antiarrhythmics for maintaining sinus rhythm after cardioversion of atrial fibrillation. When clinical evidence is deemed both internally and externally valid, the size of the intervention effect is to be assessed.

Conclusion

The error matrix provides an overview of the validity of the available evidence at a glance, and may assist in deciding which interventions to use in clinical practice.
Notes

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2288-10-90) contains supplementary material, which is available to authorized users.


Background

Evidence-based medicine (EBM) was first introduced in 1992 [1], and its increased application is reflected, among other things, by the growth of The Cochrane Library databases as well as the implementation of evidence-based guidelines into clinical practice [2]. EBM holds that information provided by randomized trials and systematic reviews of randomized trials represents the most reliable evidence regarding intervention effects [3, 4]. Thanks to the sustained scientific process (Additional file 1: Table S1), we now know that the reliability of what we observe varies due to a whole array of different factors. Three dimensions that are empirically and theoretically well accepted particularly influence the reliability of our observations in clinical research: the risk of systematic error ('bias'), the risk of random error ('play of chance'), and the risk of design error ('wrong design to answer the posed question') [4, 9].
EBM usually follows a four-phase process starting from a clinical question proceeding to the implementation of new evidence (Figure 1) [3]. Phase 1 is the formulation of a research question and literature search strategy. Phase 2 is the subsequent systematic appraisal and synthesis of the available evidence. Phase 3 covers the initiation of new research. Alternatively, phase 4 is the implementation of all available evidence when statistically and clinically convincing evidence has been obtained.
In daily clinical practice, the question of whether sufficient evidence is available to recommend the implementation of a specific intervention as a treatment arises frequently [3]. Depending on the specific clinical question, often an exhaustive list of references is retrieved when using a sensitive search strategy in multiple databases [3]. After the selection of studies, their data must be interpreted [10-13].
Since results may be contradictory and studies may differ in more than one aspect, it may be problematic to draw a clear, practical conclusion from the publications [14].

Objective

We aimed to conceptualize a visual assessment tool, i.e., a matrix for overviewing studies and their data, in order to assess the clinical evidence. The matrix is constructed from the three dimensions of errors: systematic error ('bias'), random error ('play of chance'), and design error ('wrong design to answer the question posed' or 'wrong context'). The application of this matrix will be illustrated by two examples: peri-operative beta-blockade initiated in relation to surgery versus placebo for major non-cardiac surgery, and antiarrhythmics for maintaining sinus rhythm after cardioversion of atrial fibrillation.

Methods

The three major error dimensions

The risk of systematic error ('bias')

When evaluating a clinical study, one should always try to assess its risk of systematic error [3, 4, 9-16]. There is increasing agreement on how trials and studies can be placed in a hierarchy when assessing the risk of systematic error [3, 4, 9-16], depending on the type of research (therapeutic, diagnostic, etiologic, or prognostic) [3, 10, 11, 17]. The risk of systematic error influences the reliability of observed intervention effects [3, 10, 11, 18, 19]. A significant association between inadequate or unclear bias protection and overestimation of beneficial effects and underreporting of adverse effects has been demonstrated [16, 19-23]. Differences in risk of bias are found both between the different levels of evidence and within each level of evidence [4, 16, 20].
For randomized trials, there is empirical evidence that at least six components are associated with systematic error: generation of the allocation sequence [24], allocation concealment [25], blinding [26], incomplete outcome measure reporting [4], selective outcome measure reporting [4], and other bias mechanisms (e.g., baseline imbalance, early stopping, vested interests, etc.) [4, 16, 20, 27-29]. The impact of early stopping of trials on bias is largely dependent on how the stopping rules were defined and the level of statistical significance of the interim analysis [30-32]. Trials with one or more systematic error components assessed as inadequate or unclear are considered to be at high risk of bias, while trials with all quality components assessed as adequate are considered to be at low risk of bias [15, 27, 33]. Trials with a low risk of bias are more likely to estimate the 'true' effect of the intervention [16, 20, 27, 33].
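To make the low/high risk of bias rule above concrete, here is a minimal sketch in Python; the component names and the grading labels ('adequate', 'unclear', 'inadequate') are assumptions chosen for illustration, not part of the original methodology.

```python
# A minimal sketch of the rule described above: a trial is treated as being
# at low risk of bias only if every bias component is judged adequate;
# otherwise it is treated as being at high risk of bias.
COMPONENTS = [
    "sequence generation",
    "allocation concealment",
    "blinding",
    "incomplete outcome reporting",
    "selective outcome reporting",
    "other bias",
]

def risk_of_bias(assessments: dict) -> str:
    """Return 'low risk of bias' only if all components are graded 'adequate'."""
    if all(assessments.get(c) == "adequate" for c in COMPONENTS):
        return "low risk of bias"
    return "high risk of bias"

adequate_trial = {c: "adequate" for c in COMPONENTS}
unclear_blinding = {**adequate_trial, "blinding": "unclear"}
print(risk_of_bias(adequate_trial))    # low risk of bias
print(risk_of_bias(unclear_blinding))  # high risk of bias
```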
The systematic error dimension can be measured by an ordinal variable expressed in the levels of evidence (Table 1).
Table 1
Categorization of systematic error (bias) of clinical intervention studies into levels of evidence

Category    Studies
Level 1a    Meta-analysis of randomized trials with low risk of bias
Level 1b    Randomized trial with low risk of bias
Level 1c    Meta-analysis of all randomized trials
Level 1d    Randomized trial with high risk of bias
Level 2a    Meta-analysis of cohort studies
Level 2b    Cohort study
Level 3a    Meta-analysis of case-control studies
Level 3b    Case-control study
Level 4     Case-series
Level 5     Expert opinion

The risk of random error ('play of chance')

The risk of random error is the risk of drawing a false conclusion based on sparse data. There are two types of false conclusions: a false rejection of the null hypothesis (type I error; alpha) or a false acceptance of the null hypothesis (type II error; beta). When data are sparse, the so-called 'intervention effect', whether beneficial or harmful, may in fact be caused by randomly skewed variation in prognostic factors between the intervention groups due to sampling error.
The question, however, is how we quantify and compare the risk of random error between different studies with varying numbers of participants. A p-value reflects the probability of obtaining a difference in outcome between two interventions at least as extreme as the one observed, given that the null hypothesis is true. Since randomly low (and randomly high) p-values occur, especially during accumulation of data and sequential testing, the p-value does not sufficiently reflect the true risk of random error. Therefore, the p-values of intervention effect estimates are certainly not suitable for comparing the risk of random error between different studies [32, 34-37]. We suggest using the standard error (SE) for the evaluation of the risk of random error. We used the statistical algorithms from the statistical methods group of the Cochrane Collaboration [38]. The SE in a study may be considered a measure of uncertainty. The SE measures the amount of variability in the sample mean; it indicates how closely the population mean is likely to be estimated by the sample mean. The size of the standard error depends both on how much variation there is in the population and on the size of the sample. When two independent proportions p_1 = a/n_1 and p_2 = c/n_2 (with a and c being the numbers of patients with events, b and d being the numbers of patients without events, and n_1 and n_2 being the total numbers of patients in the intervention group and control group, respectively) are considered in an individual study or trial i, then the relative risk (RR_i) is defined by:
$$RR_i = \frac{p_1}{p_2}$$
The SE of the log risk ratio for an individual study is calculated by the following formula:
$$SE[\ln(RR_i)] = \sqrt{\frac{1}{a_i} + \frac{1}{c_i} - \frac{1}{n_{1i}} - \frac{1}{n_{2i}}}$$
The Peto odds ratio (OR_Peto,i) for an individual study or trial i is defined by:
$$OR_{Peto,i} = \exp\!\left[\frac{Z_i}{V_i}\right]$$
where
$$Z_i = a_i - E[a_i] = a_i - \frac{n_{1i}(a_i + c_i)}{N_i} \quad \text{and} \quad V_i = \frac{n_{1i}\, n_{2i}\, (a_i + c_i)(b_i + d_i)}{N_i^2\,(N_i - 1)}$$
The SE of the log Peto odds ratio for an individual study is defined by:
$$SE[\ln(OR_{Peto,i})] = \frac{1}{\sqrt{V_i}}$$
or
$$SE[\ln(OR_{Peto,i})] = N_i\sqrt{\frac{N_i - 1}{n_{1i}\, n_{2i}\, (a_i + c_i)(b_i + d_i)}}$$
In a meta-analysis, the results of the individual studies or trials are combined into one pooled intervention effect estimate. For the Mantel-Haenszel pooled risk ratio (RR_MH), the natural logarithm of RR_MH has the standard error given by:
$$SE[\ln(RR_{MH})] = \sqrt{\frac{P}{R \cdot S}}$$
where
$$P = \sum_i \frac{n_{1i}\, n_{2i}\, (a_i + c_i) - a_i c_i N_i}{N_i^2}, \quad R = \sum_i \frac{a_i n_{2i}}{N_i}, \quad S = \sum_i \frac{c_i n_{1i}}{N_i}$$
with N_i being the total number of patients in trial i.
For the pooled Peto odds ratio (OR_Peto), the natural logarithm of OR_Peto has the standard error given by:
$$SE[\ln(OR_{Peto})] = \frac{1}{\sqrt{\sum_i V_i}}$$
SE depends on the numbers of events and the sample size.
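As a hedged illustration of the formulas above, the Python sketch below computes the standard errors of the log risk ratio and the log Peto odds ratio for single studies and for a set of studies; the function names and the (a, c, n1, n2) tuple layout are assumptions made for this example, not part of the original article.

```python
from math import sqrt

def se_log_rr(a, c, n1, n2):
    """SE[ln(RR_i)] for one study: a/c events and n1/n2 patients in the
    intervention and control group, respectively."""
    return sqrt(1/a + 1/c - 1/n1 - 1/n2)

def peto_v(a, c, n1, n2):
    """Hypergeometric variance V_i used by the Peto odds ratio."""
    N = n1 + n2
    b, d = n1 - a, n2 - c
    return n1 * n2 * (a + c) * (b + d) / (N**2 * (N - 1))

def se_log_or_peto(a, c, n1, n2):
    """SE[ln(OR_Peto,i)] = 1 / sqrt(V_i) for one study."""
    return 1 / sqrt(peto_v(a, c, n1, n2))

def se_log_rr_mh(studies):
    """SE[ln(RR_MH)] = sqrt(P / (R * S)) for a meta-analysis;
    `studies` is a list of (a, c, n1, n2) tuples."""
    P = R = S = 0.0
    for a, c, n1, n2 in studies:
        N = n1 + n2
        P += (n1 * n2 * (a + c) - a * c * N) / N**2
        R += a * n2 / N
        S += c * n1 / N
    return sqrt(P / (R * S))

def se_log_or_peto_pooled(studies):
    """SE[ln(OR_Peto)] = 1 / sqrt(sum of V_i) for a meta-analysis."""
    return 1 / sqrt(sum(peto_v(*s) for s in studies))

# Hypothetical counts for two small trials (for illustration only).
trials = [(12, 8, 100, 100), (30, 22, 250, 250)]
print(round(se_log_rr(*trials[0]), 2))   # single-trial SE of ln(RR)
print(round(se_log_rr_mh(trials), 2))    # pooled SE of ln(RR_MH)
```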
Spurious results may lead to incorrect type I error inferences. Recent reports indicate that the influence of the 'play of chance' may be much larger than generally perceived [39]. In randomized trials, random error may be one reason for the early stopping of trials at interim analyses when benefit or harm appears to be significant [32, 40]. Increased random error may also play a role in the repeated analyses of accumulating data in both trials and meta-analyses [36, 41-44]. A cumulative meta-analysis subjects accumulating data to repeated testing and is bound to eventually lead to a false rejection of the null hypothesis (a 'false positive' result) [45, 46]. The random error phenomenon, or 'multiplicity', also plays a role in the evaluation of secondary outcome measures [40]. For example, the primary outcome, on which the sample size calculation was based, may not show statistical significance, while another outcome measure, for which no separate sample size calculation was performed, does [47, 48].
Random error may be expressed as a continuous variable using the standard error of, for example, the log Peto odds ratio or the log relative risk.

The risk of design errors (external validity) - the participants included, the outcomes measured, the interventions, etc

When there is sufficient internal validity, i.e., low risks of systematic errors and random errors, it becomes relevant to consider the risks of design errors (external validity). The design (or context) of any piece of research determines its external validity or generalisability (Table 2) [4]. The external validity becomes questionable when a wrong design has been used to answer the question posed. Among the many variables that should be considered, the relevance of different outcome measures is of central importance to clinical research [13]. We, therefore, focus on outcome measures from a patient's perspective.
Table 2
Types of variables to consider when evaluating the risk of design errors ('context errors') and hence external validity of evidence

1. Outcome measures
2. Participants
3. Experimental intervention
4. Control intervention
5. Clinical centres or settings including patients
6. Goal - explanatory or pragmatic
7. Trial structure - parallel group, crossover, etc.
8. Objective - superiority, equivalence, non-inferiority
9. Unit of analysis
Outcome measures can be divided into three categories according to the GRADE classifications (Figure 2) [13]. Primary outcome measures are central in deciding the use of one intervention over another. Large differences in the primary outcome measure between groups in a clinical trial may lead to early termination of a trial (following recommendations of a data safety and monitoring committee) [49]. The choice of the primary outcome should concur with the GRADE category of outcomes 'critical for decision-making' [13]. Secondary outcome measures are additional outcome measures. If they are positively influenced by an intervention, the results may speak for recommending the intervention only if they support a beneficial effect on the primary outcome or if no clinically and statistically significant effect exists on the primary outcomes (e.g., a RR = 1.00 with 95% confidence limits from 0.98 to 1.02). The secondary outcomes should concur with the second and third GRADE categories of 'important, but not critical outcomes' [10-13].
GRADE has schematically ordered outcomes according to patients' perspective on a categorical scale from 1 to 9, with the most critical outcome, mortality, being graded 9 [13]. Depending on the outcomes, this scale should sometimes be considered nominal and in other situations be considered functional. Moreover, the severity of each outcome may differ as well. A stroke can be minor, while a myocardial infarction may involve a substantial worsening of cardiac function. Grading of outcome measures may also vary according to the clinical question. Therefore, outcomes within a category (i.e., critical, important, or not important) may be interchangeable. However, one can hardly argue that outcomes between categories (i.e., critical, important, or not important) are interchangeable (e.g., mortality is always more important than length of stay in hospital).
Eventually, the design error dimension can be expressed by the priority of outcome measures as an ordinal variable according to GRADE [13].

Conceptualization of the error matrix

A four-step matrix can be constructed, building upon the three dimensions: systematic error, random error, and design error. Matrix step I ranks the identified studies according to the dimensions of systematic errors and random errors. Matrix step II orders the studies according to the design errors. Matrix step III assesses the three dimensions of errors in studies. Here, a 'Manhattan-like' error matrix is constructed where the best evidence is represented by the largest skyscrapers located on the 'upper-west side'. Matrix step IV assesses the size and direction of the intervention effect, e.g., by calculating the number-needed-to-treat to obtain benefit or to harm one patient.
The principle of the matrix approach can be used in different situations. The overall effort in research should be to minimize all three risks of errors before the size and the direction of the intervention effect can be assessed reliably. The 'algorithm' of the matrix approach is generally applicable to all kinds of interventions, although details may differ according to the specific clinical question. That is, the character of the three dimensions remains the same, while details may differ according to the specific question, such as the preferred hierarchy for levels of evidence, the chosen formula for the standard error (RR, OR_Peto, or any other association measure), and the types of outcomes.
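The following Python sketch illustrates matrix step I under these assumptions: studies are ordered first by their level-of-evidence category (Table 1) and then by the standard error of a chosen outcome. The study records and the ranking dictionary are hypothetical and only illustrate the ordering, not the actual matrices presented in the Results.

```python
# Ordinal ranking of the systematic error dimension (Table 1): a lower rank
# means a lower risk of systematic error.
LEVEL_RANK = {"1a": 1, "1b": 2, "1c": 3, "1d": 4, "2a": 5,
              "2b": 6, "3a": 7, "3b": 8, "4": 9, "5": 10}

# Hypothetical study records: level of evidence and SE of the log effect
# estimate for one outcome (None when the SE is incalculable or missing).
studies = [
    {"name": "Meta-analysis A", "level": "1c", "se": 0.24},
    {"name": "Trial B",         "level": "1b", "se": 0.85},
    {"name": "Trial C",         "level": "1b", "se": 0.13},
    {"name": "Cohort D",        "level": "2b", "se": 0.02},
]

# Matrix step I: order by systematic error first and random error second.
ordered = sorted(
    studies,
    key=lambda s: (LEVEL_RANK[s["level"]],
                   s["se"] if s["se"] is not None else float("inf")),
)
for s in ordered:
    se = "n/a" if s["se"] is None else f"{s['se']:.2f}"
    print(f"{s['name']:<16} level {s['level']:<2}  SE {se}")
```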

Results

The application of this four-step matrix is illustrated with two examples: peri-operative beta-blockade initiated in relation to surgery versus placebo for major non-cardiac surgery, and antiarrhythmics for maintaining sinus rhythm after cardioversion of atrial fibrillation.

Example 1: Initiating peri-operative beta-blockade for major non-cardiac surgery

A clinical question in PICOT structure illustrates this model. Is initiating peri-operative beta-blockade effective in patients undergoing major non-cardiac surgery?
Patients: patients undergoing major non-cardiac surgery; Intervention: initiating peri-operative beta blockade; Control: placebo; Outcome measure: mortality, myocardial infarction, and stroke; Time: follow-up of at least 30 days.
We searched in CENTRAL in The Cochrane Library, PubMed, EMBASE, and personal files for all article types up to October 2009, in all languages. Specific searches using the terms 'beta-blockade', 'peri-operative', 'placebo', 'mortality', 'randomised', and 'non-cardiac surgery' were undertaken. The search resulted in multiple publications relevant to our question. References were selected from journals on the basis of importance and relevance [50-58]. We included the publications in our matrix evaluation by extracting information on all-cause mortality, cardiovascular mortality, non-fatal myocardial infarction, and non-fatal stroke. However, the matrix may easily be extended to other outcomes.
In step I, we assessed the systematic error and the random error for the chosen outcomes of each study (Figure 3, Table 3). In step II, we evaluated the design error (Figure 4). In step III, we constructed the three-dimensional matrix (Figure 5). We did not elaborate on the matrix step IV in this example.
Table 3
Ordering of evidence according to levels of evidence (systematic error), standard error (random error), and outcome measures (design error) in peri-operative beta-blockade versus placebo for major non-cardiac surgery (example 1).

                      Level of    Standard error
Study                 evidence    All-cause mortality   Cardiovascular mortality   Non-fatal myocardial infarction   Non-fatal stroke
Bangalore [50]        1a          0.12                  0.16                       0.10                              0.28
Poise [51]            1b          0.13                  0.17                       0.10                              0.33
MaVS [52]             1b          1.07                  Z                          N                                 0.66
Dipom [53]            1b          0.34                  0.48                       0.91                              Z
Mangano [54]          1b          0.85                  1.22                       1.22                              1.11
Bangalore [50]        1c          0.11                  0.15                       0.09                              0.28
Wetterslev [55]       1c          0.24                  N                          0.23                              N
Poldermans [56]       1d          0.76                  0.76                       Z                                 N
Lindenauer [57]       2b          0.02                  N                          N                                 N
AHA Guidelines [58]   5           N                     N                          N                                 N

Z: outcome with zero events in one or both treatment arms, which makes the SE incalculable; N: no data.
Some outcome measures may be correlated (e.g., cardiovascular mortality is included in all-cause mortality).
In this example the formulas for SE[ln(RR_i)] for individual studies and SE[ln(RR_MH)] for meta-analyses were used.
From Figure 5 it can be concluded at a glance that peri-operative beta-blockade does not reduce mortality in patients undergoing major non-cardiac surgery; it rather seems to increase all-cause mortality. Peri-operative beta-blockade does, however, reduce non-fatal myocardial infarction, at the expense of an increased cardiovascular mortality and an increased rate of non-fatal stroke.
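As a usage note for the SE[ln(RR)] formula applied in this example, the short sketch below shows how a table entry of this kind could be reproduced from a trial's 2x2 counts; the counts used here are purely hypothetical and are not taken from the included trials.

```python
from math import sqrt

def se_log_rr(a, c, n1, n2):
    """SE of the log risk ratio from a 2x2 table (events a/c, totals n1/n2)."""
    return sqrt(1/a + 1/c - 1/n1 - 1/n2)

# Hypothetical trial: 60 vs 45 deaths among 2000 patients per arm.
print(round(se_log_rr(60, 45, 2000, 2000), 2))  # roughly 0.19
```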

Example 2: Antiarrhythmics for maintaining sinus rhythm after cardioversion of atrial fibrillation [59]

The conclusion of this Cochrane review focuses on the significantly increased mortality associated with the use of class 1a antiarrhythmics (odds ratio 2.39; 95% confidence interval (CI) 1.03 to 5.59) [59]. The data on this outcome for class 1a antiarrhythmics in this review [59], as well as in the included randomised trials [60-67], were analysed using the error matrix approach.
In step I, we assessed the risk of systematic error and the risk of random error for the chosen outcome of each study (Figure 6, Table 4). In step II, the design error should be evaluated by assessing multiple outcome measures. However, in this example we only consider the outcome 'all-cause mortality', since other outcomes were found to be not statistically significantly different [59]. Therefore, no figure of step II is shown. In step III, we constructed the three-dimensional matrix (Figure 7). We did not elaborate on the matrix step IV in this example, since the available studies are not internally valid (high risks of both systematic and random error).
Table 4
Ordering of evidence according to levels of evidence (systematic error), standard error (random error), and outcome measures (design error) in antiarrhythmics for maintaining sinus rhythm after cardioversion of atrial fibrillation (example 2)

Study                     Level of evidence   Standard error (all-cause mortality)
Byrne-Quinn [60]          1d                  2.02
Hillestad [61]            1d                  2.00
Karlson [62]              1d                  1.42
Lloyd [63]                1d                  1.55
PAFAC [64]                1b                  0.78
Sodermark [65]            1d                  0.73
SOPAT [66]                1b                  1.51
Steinbeck [67]            1d                  Z
Lafuente-Lafuente [59]    1c                  0.43

Z: outcome with zero events in both treatment arms, which makes the SE incalculable.
In this example the formulas for SE[ln(OR_Peto,i)] for individual studies and SE[ln(OR_Peto)] for the meta-analysis were used.
From Figure 7 it can be concluded at a glance that there are substantial risks of both systematic and random error in the evidence available so far on mortality associated with class 1a antiarrhythmics. The best available level 1c evidence shows a substantial risk of random error (SE 0.43), and the best available level 1b evidence shows a high risk of random error (SE 0.78). The conclusion in the Cochrane review of a significantly increased mortality is therefore based on data with high risks of both systematic and random errors, and should be considered unreliable.

Discussion

The aim of our matrix is to facilitate the overview of evidence in clinical intervention research. The matrix can serve as a tool to provide visual assessment of reliability of observations with respect to systematic error, random error (internal validity), and design error (external validity).
The matrix should not replace the thorough process of systematically reviewing evidence and profound evaluations of data, but could be integrated within these research activities as a tool for overviewing the results. Also, this matrix is not an absolute measure of the risks of errors. The position of studies in relation to each other is relative rather than absolute.
There is a lack of awareness of the importance of the 'play of chance' for the reliability of study findings. Ordering the standard errors of the studies might be a tool for ranking studies according to the level of random error. We have used natural logarithm (ln) transformations for calculating standard errors, although the logarithm with the base 10 may be used without producing different conclusions.
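A small sketch, assuming the ranking use described above, shows why the choice of logarithm base does not change the conclusions: switching from the natural log to base 10 divides every standard error by the same constant ln(10), so the ordering of studies by random error is preserved.

```python
from math import log

# Standard errors on the natural-log scale (illustrative values only).
se_ln = {"study A": 0.12, "study B": 0.34, "study C": 0.78}

# On the base-10 scale every SE is divided by ln(10) (about 2.303),
# so the relative ordering of the studies is unchanged.
se_log10 = {name: se / log(10) for name, se in se_ln.items()}

ranking_ln = sorted(se_ln, key=se_ln.get)
ranking_log10 = sorted(se_log10, key=se_log10.get)
print(ranking_ln == ranking_log10)  # True
```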
As an alternative, the Bayes factor can be considered [37, 68]. The Bayes factor is a likelihood ratio comparing one hypothesis versus another, and, therefore, varies with the definition of the possible alternative hypotheses. The Bayes factor is a summary measure that provides an alternative to the p-value for the ranking or the flagging of associations as 'significant' [69]. The Bayes factor:
$$\text{Bayes factor} = \frac{\text{Probability(Data, given the null hypothesis)}}{\text{Probability(Data, given the alternative hypothesis)}}$$
or simple approximations to it can be very difficult or even impossible for the clinician to implement, since a search for the maximum of the multidimensional posterior may be required for each association [69]. This also applies to the asymptotic Bayes factor introduced by Wakefield [69]. In contrast to the Bayes factor, the standard error is straightforward to calculate and, when available, provides a tool for comparing the risk of random error between studies of the same intervention.
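For readers who want a rough feel for the alternative discussed above, the sketch below computes Goodman's minimum Bayes factor, exp(-z^2/2); this is only one simple approximation, not the full Bayes factor or Wakefield's asymptotic version, and the z-value is derived here for illustration from the review's point estimate (OR 2.39) and the pooled SE given in Table 4 (0.43).

```python
from math import exp, log

def minimum_bayes_factor(z):
    """Goodman's minimum Bayes factor, exp(-z^2 / 2): the smallest possible
    ratio of the likelihood under the null to that under any simple
    alternative, i.e. the strongest evidence the data can give against
    the null."""
    return exp(-z ** 2 / 2)

# Illustrative z: log effect estimate divided by its standard error.
z = log(2.39) / 0.43
print(round(minimum_bayes_factor(z), 2))  # roughly 0.13
```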
The aim of minimising error risks according to the three dimensions actually combines the methodological efforts of falsifying any alternative hypothesis in the evaluation of an intervention. Thereby, the matrix concept visualises how far the scientific process has evolved towards fulfilling Popper's falsification criterion, which states that researchers should primarily engage in trying to falsify any relevant alternative hypothesis and not only the null hypothesis [5]. The minimisation of systematic errors and random errors, by providing ample room for the null hypothesis, as well as the measurement of important outcomes, constitutes the most audacious attack on any realistic alternative hypothesis. If an array of progressively qualified attacks fails to support the null hypothesis, then we can reliably trust the intervention to be either beneficial or harmful.
The conclusion based on an assessment of the evidence using the matrix approach may be implemented into clinical practice or serve as an incentive for new research. The matrix facilitates the identification of lacunae in our knowledge and is likely to benefit the process of developing evidence-based guidelines.

Preference for the highest evidence

One has to be aware of the multiple forms of bias, potentially present in evidence below level 1 (Table 1). Several examples illustrate that large, apparently beneficial intervention effects from lower level evidence, even from randomized trials [54, 56, 70], may eventually be reversed to harmful effects when new high-quality evidence appears [50, 71]. This is where the three dimensions of error are of central importance in providing a tool for reliability assessment.

Limitations

Apart from the three error dimensions influencing the reliability of data, other factors play a role in incomparability and uncertainty of inferences. Many reports of studies appear incomplete, and the lack of details raises questions. Incomplete reporting limits interpretation, but more importantly, this reporting factor should be distinguished from the methodological quality of the trial [72].
Statements like CONSORT [73], PRISMA [74], and MOOSE [75] aim to improve and to maximize the amount and correctness of information to be retrieved from publications. These guidelines also create awareness among researchers about the most important issues to report so that the quality of future research may increase. By following reporting guidelines the yield of the research question is likely to be increased (phase 1 in Figure 1).
The standard error does not account for the testing of multiple outcomes or for repeated testing on accumulating data, which may also induce risks of random error due to multiplicity as well as correlations.
The division of all outcomes into 'primary' and 'secondary' outcome measures can be helpful, as this division sets the standards for the evaluation of interventions. However, this division is artificial, and outcome measures situated on the border between primary and secondary outcomes exist. For example, one can argue that quality of life is a primary outcome rather than a secondary outcome. Further, there is also a quantitative aspect to the artificial division into primary and secondary outcomes. Small significant differences in primary outcome measures (e.g., bile duct injuries in patients undergoing cholecystectomy) may be found favouring one intervention, while large differences in secondary outcome measures (e.g., costs) may favour the comparator. Eventually, one may prefer the larger advantages in secondary outcomes to the smaller disadvantages in a primary outcome measure.
Another limitation in the outcome measure dimension is that outcome measures are often correlated, and this correlation is mostly ignored. For example, when mortality is one outcome measure and complications is another that again counts deaths as complications, the two outcome measures are correlated. Authors usually carry out multiple univariate analyses ignoring correlations between outcome measures.
Step IV of the matrix includes the assessment of the size of the intervention effect, e.g., expressed in numbers-needed-to-treat to obtain benefit or to harm one patient with the intervention. This step is the last one since it is irrelevant to consider effect sizes and their directions if a study does not appear to be internally and externally valid.
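As a hedged illustration of matrix step IV, the sketch below computes a number-needed-to-treat from two hypothetical event risks; the risks are invented for this example and the simple reciprocal-of-the-risk-difference definition is assumed.

```python
def number_needed_to_treat(risk_control, risk_intervention):
    """NNT (benefit) or NNH (harm): the reciprocal of the absolute risk
    difference between the control and intervention groups."""
    risk_difference = risk_control - risk_intervention
    if risk_difference == 0:
        raise ValueError("No risk difference; NNT is undefined.")
    return 1 / abs(risk_difference)

# Hypothetical risks: 10% of control patients versus 7% of treated patients
# experience the outcome, giving an NNT of about 33.
print(round(number_needed_to_treat(0.10, 0.07)))
```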
Another aspect to consider is heterogeneity [76, 77]. Statistical heterogeneity reflects the between-trial variance of meta-analytic intervention effect estimates rather than the play of chance [76]. Clinical heterogeneity, however, represents differences in populations, procedures, or interventions in daily practice. All these factors of clinical heterogeneity, together with the concordance of inclusion and exclusion criteria, should be considered whenever we want to implement results of the available evidence. Assessment and consideration of heterogeneity or diversity, therefore, forms the final step before new evidence is implemented. Assessment of heterogeneity is not included in our matrix.

Conclusions

Assessment of the risks of systematic error, random error, and design error is essential in evaluating evidence and drawing conclusions. We used the standard error in our matrix to rank studies according to their risk of random error. The risks of these error types were incorporated into a three-dimensional matrix to create a schematic overview of the internal and external validity of the evidence, seen at a glance.

Funding

There was no funding.

Acknowledgements

We thank Dimitrinka Nikolova and Kate Whitfield for linguistic assistance, Esther Lutteke for graphical assistance, and Gert-Jan van der Wilt and Rogier Donders for useful suggestions for improvement.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

FK, JW, CG, and CL contributed to the development of the ideas in the manuscript. FK and JW performed the analyses, created the figures, and drafted the original text. CG and CL commented and contributed to the discussion. All authors read and approved the final manuscript.
References
1.
Evidence-Based Medicine Working Group: Evidence-based medicine. A new approach to teaching the practice of medicine. JAMA. 1992, 268: 2420-5. 10.1001/jama.268.17.2420.
2.
Sutton AJ, Higgins JPT: Recent developments in meta-analysis. Stat Med. 2008, 27: 625-50. 10.1002/sim.2934.
3.
Straus SE, Richardson WS, Glasziou P, Haynes RB: Evidence-based medicine. How to practice and teach EBM. 2005, Edinburgh, UK: Churchill Livingstone
4.
Higgins JPT, Green S: Cochrane Handbook for Systematic Reviews of Interventions. 2008, The Cochrane Collaboration
5.
Popper KR: Logik der Forschung. 1959, Vienna: Springer
6.
Kuhn T: The Structure of Scientific Revolutions. 1962, Chicago: The University of Chicago Press
7.
8.
Quine WV: From a logical point of view. 1953, Cambridge: Harvard University Press
9.
Gluud C: The culture of designing hepato-biliary randomised trials. J Hepatol. 2006, 44: 607-15. 10.1016/j.jhep.2005.12.006.
10.
Atkins D, Eccles M, Flottorp S, Guyatt GH, Henry D, Hill S, Liberati A, O'Connell D, Oxman AD, Phillips B, Schünemann H, Edejer TT, Vist GE, Williams JW, GRADE Working Group: Systems for grading the quality of evidence and the strength of recommendations I: critical appraisal of existing approaches. The GRADE Working Group. BMC Health Serv Res. 2004, 4: 38. 10.1186/1472-6963-4-38.
11.
Atkins D, Briss PA, Eccles M, Flottorp S, Guyatt GH, Harbour RT, Hill S, Jaeschke R, Liberati A, Magrini N, Mason J, O'Connell D, Oxman AD, Phillips B, Schünemann H, Edejer TT, Vist GE, Williams JW, GRADE Working Group: Systems for grading the quality of evidence and the strength of recommendations II: pilot study of a new system. BMC Health Serv Res. 2005, 5: 25. 10.1186/1472-6963-5-25.
12.
Guyatt GH, Oxman AD, Vist GE, Kunz R, Falck-Ytter Y, Alonso-Coello P, Schünemann HJ, GRADE Working Group: GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. BMJ. 2008, 336: 924-6. 10.1136/bmj.39489.470347.AD.
13.
Guyatt GH, Oxman AD, Kunz R, Vist GE, Falck-Ytter Y, Schünemann HJ, GRADE Working Group: What is "quality of evidence" and why is it important to clinicians? BMJ. 2008, 336: 995-8. 10.1136/bmj.39490.551019.BE.
14.
Grimes DA, Schulz KF: An overview of clinical research: the lay of the land. Lancet. 2002, 359: 57-61. 10.1016/S0140-6736(02)07283-5.
15.
Kjaergard LL, Villumsen J, Gluud C: Reported methodologic quality and discrepancies between large and small randomized trials in meta-analyses. Ann Intern Med. 2001, 135: 982-9.
16.
Schulz KF, Chalmers I, Hayes RJ, Altman DG: Empirical evidence of bias. Dimensions of methodological quality associated with estimates of treatment effects in controlled trials. JAMA. 1995, 273: 408-12. 10.1001/jama.273.5.408.
19.
Kunz R, Vist G, Oxman AD: Randomisation to protect against selection bias in healthcare trials. Cochrane Database of Methodology Reviews. 2002, 4
20.
Wood L, Egger M, Gluud LL, Schulz KF, Jüni P, Altman DG, Gluud C, Martin RM, Wood AJ, Sterne JA: Empirical evidence of bias in treatment effect estimates in controlled trials with different interventions and outcomes: meta-epidemiological study. BMJ. 2008, 336: 601-5. 10.1136/bmj.39465.451748.AD.
21.
Grimes DA, Schulz KF: Bias and causal associations in observational research. Lancet. 2002, 359: 248-52. 10.1016/S0140-6736(02)07451-2.
22.
Grimes DA, Schulz KF: Cohort studies: marching towards outcomes. Lancet. 2002, 359: 341-5. 10.1016/S0140-6736(02)07500-1.
23.
Grimes DA, Schulz KF: Descriptive studies: what they can and cannot do. Lancet. 2002, 359: 145-9. 10.1016/S0140-6736(02)07373-7.
24.
Schulz KF, Grimes DA: Generation of allocation sequences in randomised trials: chance, not choice. Lancet. 2002, 359: 515-9. 10.1016/S0140-6736(02)07683-3.
25.
Schulz KF, Grimes DA: Allocation concealment in randomised trials: defending against deciphering. Lancet. 2002, 359: 614-8. 10.1016/S0140-6736(02)07750-4.
26.
Schulz KF, Grimes DA: Blinding in randomised trials: hiding who got what. Lancet. 2002, 359: 696-700. 10.1016/S0140-6736(02)07816-9.
27.
Moher D, Jadad AR, Tugwell P: Assessing the quality of randomized controlled trials. Current issues and future directions. Int J Technol Assess Health Care. 1996, 12: 195-208. 10.1017/S0266462300009570.
28.
Jadad AR, Moore RA, Carroll D, Jenkinson C, Reynolds DJ, Gavaghan DJ, McQuay HJ: Assessing the quality of reports of randomized clinical trials: is blinding necessary? Control Clin Trials. 1996, 17: 1-12. 10.1016/0197-2456(95)00134-4.
29.
Gluud LL: Bias in clinical intervention research. Am J Epidemiol. 2006, 163: 493-501. 10.1093/aje/kwj069.
30.
Mueller PS, Montori VM, Bassler D, Koenig BA, Guyatt GH: Ethical issues in stopping randomized trials early because of apparent benefit. Ann Intern Med. 2007, 146: 878-81.
31.
Goodman SN: Stopping at nothing? Some dilemmas of data monitoring in clinical trials. Ann Intern Med. 2007, 146: 882-7.
32.
Montori VM, Devereaux PJ, Adhikari NK, Burns KE, Eggert CH, Briel M, Lacchetti C, Leung TW, Darling E, Bryant DM, Bucher HC, Schünemann HJ, Meade MO, Cook DJ, Erwin PJ, Sood A, Sood R, Lo B, Thompson CA, Zhou Q, Mills E, Guyatt GH: Randomized trials stopped early for benefit: a systematic review. JAMA. 2005, 294: 2203-9. 10.1001/jama.294.17.2203.
33.
Jüni P, Altman DG, Egger M: Systematic reviews in health care: Assessing the quality of controlled clinical trials. BMJ. 2001, 323: 42-6. 10.1136/bmj.323.7303.42.
34.
Bassler D, Ferreira-Gonzalez I, Briel M, Cook DJ, Devereaux PJ, Heels-Ansdell D, Kirpalani H, Meade MO, Montori VM, Rozenberg A, Schünemann HJ, Guyatt GH: Systematic reviewers neglect bias that results from trials stopped early for benefit. J Clin Epidemiol. 2007, 60: 869-73. 10.1016/j.jclinepi.2006.12.006.
35.
Bassler D, Montori VM, Briel M, Glasziou P, Guyatt GH: Early stopping of randomized clinical trials for overt efficacy is problematic. J Clin Epidemiol. 2008, 61: 241-6. 10.1016/j.jclinepi.2007.07.016.
36.
Wetterslev J, Thorlund K, Brok J, Gluud C: Trial sequential analysis may establish when firm evidence is reached in cumulative meta-analysis. J Clin Epidemiol. 2008, 61: 64-75. 10.1016/j.jclinepi.2007.03.013.
37.
Goodman SN: Toward evidence-based medical statistics. 1: The P value fallacy. Ann Intern Med. 1999, 130: 995-1004.
39.
Vickers AJ: Underpowering in randomized trials reporting a sample size calculation. J Clin Epidemiol. 2003, 56: 717-20. 10.1016/S0895-4356(03)00141-0.
40.
Schulz KF, Grimes DA: Multiplicity in randomised trials II: subgroup and interim analyses. Lancet. 2005, 365: 1657-61. 10.1016/S0140-6736(05)66516-6.
41.
Pogue JM, Yusuf S: Cumulating evidence from randomized trials: utilizing sequential monitoring boundaries for cumulative meta-analysis. Control Clin Trials. 1997, 18: 580-93. 10.1016/S0197-2456(97)00051-2.
42.
Brok J, Thorlund K, Gluud C, Wetterslev J: Trial sequential analysis reveals insufficient information size and potentially false positive results in many meta-analyses. J Clin Epidemiol. 2008, 61: 763-9. 10.1016/j.jclinepi.2007.10.007.
43.
Thorlund K, Devereaux PJ, Wetterslev J, Guyatt GH, Ioannidis JP, Thabane L, Gluud LL, Als-Nielsen B, Gluud C: Can trial sequential monitoring boundaries reduce spurious inferences from meta-analyses? Int J Epidemiol. 2009, 38: 276-86. 10.1093/ije/dyn179.
44.
Brok J, Thorlund K, Wetterslev J, Gluud C: Apparently conclusive meta-analyses may be inconclusive - Trial sequential analysis adjustment of random error risk due to repetitive testing of accumulating data in apparently conclusive neonatal meta-analyses. Int J Epidemiol. 2009, 38: 287-98. 10.1093/ije/dyn188.
45.
Berkey CS, Mosteller F, Lau J, Antman EM: Uncertainty of the time of first significance in random effects cumulative meta-analysis. Control Clin Trials. 1996, 17: 357-71. 10.1016/S0197-2456(96)00014-1.
46.
Lau J, Schmid CH, Chalmers TC: Cumulative meta-analysis of clinical trials builds evidence for exemplary medical care. J Clin Epidemiol. 1995, 48: 45-57. 10.1016/0895-4356(94)00106-Z.
47.
O'Neill RT: Secondary endpoints cannot be validly analyzed if the primary endpoint does not demonstrate clear statistical significance. Control Clin Trials. 1997, 18: 550-6. 10.1016/S0197-2456(97)00075-5.
48.
Schulz KF, Grimes DA: Multiplicity in randomised trials I: endpoints and treatments. Lancet. 2005, 365: 1591-5. 10.1016/S0140-6736(05)66461-6.
49.
Friedman LM, Furberg CD, Demets DL: Fundamentals of clinical trials. 1998, New York: Springer Verlag
50.
Bangalore S, Wetterslev J, Pranesh S, Sawhney S, Gluud C, Messerli FH: Peri-operative beta-blockers in patients undergoing non-cardiac surgery. A meta-analysis and trial sequential analysis of 12,306 patients from randomised trials. Lancet. 2008, 372: 1962-76. 10.1016/S0140-6736(08)61560-3.
51.
POISE Study Group, Devereaux PJ, Yang H, Yusuf S, Guyatt G, Leslie K, Villar JC, Xavier D, Chrolavicius S, Greenspan L, Pogue J, Pais P, Liu L, Xu S, Málaga G, Avezum A, Chan M, Montori VM, Jacka M, Choi P: Effects of extended-release metoprolol succinate in patients undergoing non-cardiac surgery (POISE trial): a randomised controlled trial. Lancet. 2008, 371: 1839-47. 10.1016/S0140-6736(08)60601-7.
52.
Yang H, Raymer K, Butler R, Parlow J, Roberts R: The effects of perioperative beta-blockade: results of the Metoprolol after Vascular Surgery (MaVS) study, a randomized controlled trial. Am Heart J. 2006, 152: 983-90. 10.1016/j.ahj.2006.07.024.
53.
Juul AB, Wetterslev J, Gluud C, Kofoed-Enevoldsen A, Jensen G, Callesen T, Nørgaard P, Fruergaard K, Bestle M, Vedelsdal R, Miran A, Jacobsen J, Roed J, Mortensen MB, Jørgensen L, Jørgensen J, Rovsing ML, Petersen PL, Pott F, Haas M, Albret R, Nielsen LL, Johansson G, Stjernholm P, Mølgaard Y, Foss NB, Elkjaer J, Dehlie B, Boysen K, Zaric D, DIPOM Trial Group, et al: Effect of perioperative beta blockade in patients with diabetes undergoing major non-cardiac surgery: randomised placebo controlled, blinded multicentre trial. BMJ. 2006, 332: 1482. 10.1136/bmj.332.7556.1482.
54.
Mangano DT, Layug EL, Wallace A, Tateo I: Effect of atenolol on mortality and cardiovascular morbidity after noncardiac surgery. Multicenter Study of Perioperative Ischemia Research Group. N Engl J Med. 1996, 335: 1713-20. 10.1056/NEJM199612053352301.
55.
Wetterslev J, Juul AB: Benefit and harms of perioperative beta-blockade. Best Pract Res Clin Anaesthesiol. 2006, 20: 285-302. 10.1016/j.bpa.2005.10.006.
56.
Poldermans D, Boersma E, Bax JJ, Thomson IR, van de Ven LL, Blankensteijn JD, Baars HF, Yo TI, Trocino G, Vigna C, Roelandt JR, van Urk H: The effect of bisoprolol on perioperative mortality and myocardial infarction in high-risk patients undergoing vascular surgery. Dutch Echocardiographic Cardiac Risk Evaluation Applying Stress Echocardiography Study Group. N Engl J Med. 1999, 341: 1789-94. 10.1056/NEJM199912093412402.
57.
Lindenauer PK, Pekow P, Wang K, Mamidi DK, Gutierrez B, Benjamin EM: Perioperative beta-blocker therapy and mortality after major noncardiac surgery. N Engl J Med. 2005, 353: 349-61. 10.1056/NEJMoa041895.
58.
Fleisher LA, Beckman JA, Brown KA, Calkins H, Chaikof EL, Fleischmann KE, Freeman WK, Froehlich JB, Kasper EK, Kersten JR, Riegel B, Robb JF, Smith SC, Jacobs AK, Adams CD, Anderson JL, Antman EM, Buller CE, Creager MA, Ettinger SM, Faxon DP, Fuster V, Halperin JL, Hiratzka LF, Hunt SA, Lytle BW, Nishimura R, Ornato JP, Page RL, Riegel B, et al: ACC/AHA 2007 Guidelines on Perioperative Cardiovascular Evaluation and Care for Noncardiac Surgery: Executive Summary: A Report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines (Writing Committee to Revise the 2002 Guidelines on Perioperative Cardiovascular Evaluation for Noncardiac Surgery) Developed in Collaboration With the American Society of Echocardiography, American Society of Nuclear Cardiology, Heart Rhythm Society, Society of Cardiovascular Anesthesiologists, Society for Cardiovascular Angiography and Interventions, Society for Vascular Medicine and Biology, and Society for Vascular Surgery. J Am Coll Cardiol. 2007, 50: 1707-32. 10.1016/j.jacc.2007.09.001.
59.
Zurück zum Zitat Lafuente-Lafuente C, Mouly S, Longas-Tejero MA, Bergmann JF: Antiarrhythmics for maintaining sinus rhythm after cardioversion of atrial fibrillation. Cochrane Database Syst Rev. 2007, CD005049-4 Lafuente-Lafuente C, Mouly S, Longas-Tejero MA, Bergmann JF: Antiarrhythmics for maintaining sinus rhythm after cardioversion of atrial fibrillation. Cochrane Database Syst Rev. 2007, CD005049-4
60.
Zurück zum Zitat Byrne-Quinn E, Wing AJ: Maintenance of sinus rhythm after DC reversion of atrial fibrilllation. A double-blind controlled trial of long-acting quinidine bisulphate. Br Heart J. 1970, 32: 370-6. 10.1136/hrt.32.3.370.CrossRefPubMedPubMedCentral Byrne-Quinn E, Wing AJ: Maintenance of sinus rhythm after DC reversion of atrial fibrilllation. A double-blind controlled trial of long-acting quinidine bisulphate. Br Heart J. 1970, 32: 370-6. 10.1136/hrt.32.3.370.CrossRefPubMedPubMedCentral
61.
Zurück zum Zitat Hillestad L, Bjerkelund C, Dale J, Maltau J, Storstein O: Quinidine in maintenance of sinus rhythm after electroconversion of chronic atrial fibrillation. A controlled clinical study. Br Heart J. 1971, 33: 518-21. 10.1136/hrt.33.4.518.CrossRefPubMedPubMedCentral Hillestad L, Bjerkelund C, Dale J, Maltau J, Storstein O: Quinidine in maintenance of sinus rhythm after electroconversion of chronic atrial fibrillation. A controlled clinical study. Br Heart J. 1971, 33: 518-21. 10.1136/hrt.33.4.518.CrossRefPubMedPubMedCentral
62.
Zurück zum Zitat Karlson BW, Torstensson I, Abjorn C, Jansson SO, Peterson LE: Disopyramide in the maintenance of sinus rhythm after electroconversion of atrial fibrillation. A placebo-controlled one year follow-up study. Eur Heart J. 1988, 9: 284-90.PubMed Karlson BW, Torstensson I, Abjorn C, Jansson SO, Peterson LE: Disopyramide in the maintenance of sinus rhythm after electroconversion of atrial fibrillation. A placebo-controlled one year follow-up study. Eur Heart J. 1988, 9: 284-90.PubMed
63.
Zurück zum Zitat Lloyd EA, Gersh BJ, Forman R: The efficacy of quinidine and disopyramide in the maintenance of sinus rhythm after electroconversion from atrial fibrillation. A double-blind study comparing quinidine, disopyramide and placebo. S Afr Med J. 1984, 65: 367-9.PubMed Lloyd EA, Gersh BJ, Forman R: The efficacy of quinidine and disopyramide in the maintenance of sinus rhythm after electroconversion from atrial fibrillation. A double-blind study comparing quinidine, disopyramide and placebo. S Afr Med J. 1984, 65: 367-9.PubMed
64.
Zurück zum Zitat Fetsch T, Bauer P, Engberding R, Koch HP, Lukl J, Meinertz T, Oeff M, Seipel L, Trappe HJ, Treese N, Breithardt G, Prevention of Atrial Fibrillation after Cardioversion Investigators: Prevention of atrial fibrillation after cardioversion: results of the PAFAC trial. Eur Heart J. 2004, 25: 1385-94. 10.1016/j.ehj.2004.04.015.CrossRefPubMed Fetsch T, Bauer P, Engberding R, Koch HP, Lukl J, Meinertz T, Oeff M, Seipel L, Trappe HJ, Treese N, Breithardt G, Prevention of Atrial Fibrillation after Cardioversion Investigators: Prevention of atrial fibrillation after cardioversion: results of the PAFAC trial. Eur Heart J. 2004, 25: 1385-94. 10.1016/j.ehj.2004.04.015.CrossRefPubMed
65.
Zurück zum Zitat Södermark T, Jonsson B, Olsson A, Orö L, Wallin H, Edhag O, Sjögren A, Danielsson M, Rosenhamer G: Effect of quinidine on maintaining sinus rhythm after conversion of atrial fibrillation or flutter. A multicentre study from Stockholm. Br Heart J. 1975, 37: 486-92. 10.1136/hrt.37.5.486.CrossRefPubMedPubMedCentral Södermark T, Jonsson B, Olsson A, Orö L, Wallin H, Edhag O, Sjögren A, Danielsson M, Rosenhamer G: Effect of quinidine on maintaining sinus rhythm after conversion of atrial fibrillation or flutter. A multicentre study from Stockholm. Br Heart J. 1975, 37: 486-92. 10.1136/hrt.37.5.486.CrossRefPubMedPubMedCentral
66.
Zurück zum Zitat Patten M, Maas R, Bauer P, Lüderitz B, Sonntag F, Dluzniewski M, Hatala R, Opolski G, Müller HW, Meinertz T, SOPAT Investigators: Suppression of paroxysmal atrial tachyarrhythmias-results of the SOPAT trial. Eur Heart J. 2004, 25: 1395-404. 10.1016/j.ehj.2004.06.014.CrossRefPubMed Patten M, Maas R, Bauer P, Lüderitz B, Sonntag F, Dluzniewski M, Hatala R, Opolski G, Müller HW, Meinertz T, SOPAT Investigators: Suppression of paroxysmal atrial tachyarrhythmias-results of the SOPAT trial. Eur Heart J. 2004, 25: 1395-404. 10.1016/j.ehj.2004.06.014.CrossRefPubMed
67.
Zurück zum Zitat Steinbeck G, Doliwa R, Bach P: Therapy of paroxysmal atrial fibrillation. Cardiac glycosides alone or combined with antiarrhythmia agents?. Dtsch Med Wochenschr. 1988, 113: 1867-71. 10.1055/s-2008-1067903.CrossRefPubMed Steinbeck G, Doliwa R, Bach P: Therapy of paroxysmal atrial fibrillation. Cardiac glycosides alone or combined with antiarrhythmia agents?. Dtsch Med Wochenschr. 1988, 113: 1867-71. 10.1055/s-2008-1067903.CrossRefPubMed
68.
Zurück zum Zitat Goodman SN: Toward evidence-based medical statistics. 2: The Bayes factor. Ann Intern Med. 1999, 130: 1005-13.CrossRefPubMed Goodman SN: Toward evidence-based medical statistics. 2: The Bayes factor. Ann Intern Med. 1999, 130: 1005-13.CrossRefPubMed
69.
Zurück zum Zitat Wakefield J: Bayes factors for genome-wide association studies: comparison with P-values. Genet Epidemiol. 2008, 33: 79-86. 10.1002/gepi.20359.CrossRef Wakefield J: Bayes factors for genome-wide association studies: comparison with P-values. Genet Epidemiol. 2008, 33: 79-86. 10.1002/gepi.20359.CrossRef
70.
Zurück zum Zitat Deeks JJ, Dinnes J, D'Amico R, Sowden AJ, Sakarovitch C, Song F, Petticrew M, Altman DG, International Stroke Trial Collaborative Group; European Carotid Surgery Trial Collaborative Group: Evaluating non-randomised intervention studies. Health Technol Assess. 2003, 7: iii-x,1-173. Deeks JJ, Dinnes J, D'Amico R, Sowden AJ, Sakarovitch C, Song F, Petticrew M, Altman DG, International Stroke Trial Collaborative Group; European Carotid Surgery Trial Collaborative Group: Evaluating non-randomised intervention studies. Health Technol Assess. 2003, 7: iii-x,1-173.
71.
Zurück zum Zitat Jüni P, Nartey L, Reichenbach S, Sterchi R, Dieppe PA, Egger M: Risk of cardiovascular events and rofecoxib: cumulative meta-analysis. Lancet. 2004, 364: 2021-9. 10.1016/S0140-6736(04)17514-4.CrossRefPubMed Jüni P, Nartey L, Reichenbach S, Sterchi R, Dieppe PA, Egger M: Risk of cardiovascular events and rofecoxib: cumulative meta-analysis. Lancet. 2004, 364: 2021-9. 10.1016/S0140-6736(04)17514-4.CrossRefPubMed
72.
Zurück zum Zitat Devereaux PJ, Choi PT, El Dika S, Bhandari M, Montori VM, Schünemann HJ, Garg AX, Busse JW, Heels-Ansdell D, Ghali WA, Manns BJ, Guyatt GH: An observational study found that authors of randomized controlled trials frequently use concealment of randomization and blinding, despite the failure to report these methods. J Clin Epidemiol. 2004, 57: 1232-6. 10.1016/j.jclinepi.2004.03.017.CrossRefPubMed Devereaux PJ, Choi PT, El Dika S, Bhandari M, Montori VM, Schünemann HJ, Garg AX, Busse JW, Heels-Ansdell D, Ghali WA, Manns BJ, Guyatt GH: An observational study found that authors of randomized controlled trials frequently use concealment of randomization and blinding, despite the failure to report these methods. J Clin Epidemiol. 2004, 57: 1232-6. 10.1016/j.jclinepi.2004.03.017.CrossRefPubMed
75.
Zurück zum Zitat Stroup DF, Berlin JA, Morton SC, Olkin I, Williamson GD, Rennie D, Moher D, Becker BJ, Sipe TA, Thacker SB: Meta-analysis of observational studies in epidemiology: a proposal for reporting. JAMA. 2000, 283: 2008-12. 10.1001/jama.283.15.2008.CrossRefPubMed Stroup DF, Berlin JA, Morton SC, Olkin I, Williamson GD, Rennie D, Moher D, Becker BJ, Sipe TA, Thacker SB: Meta-analysis of observational studies in epidemiology: a proposal for reporting. JAMA. 2000, 283: 2008-12. 10.1001/jama.283.15.2008.CrossRefPubMed
76.
Zurück zum Zitat Higgins JP, Thompson SG: Quantifying heterogeneity in a meta-analysis. Stat Med. 2002, 21: 1539-58. 10.1002/sim.1186.CrossRefPubMed Higgins JP, Thompson SG: Quantifying heterogeneity in a meta-analysis. Stat Med. 2002, 21: 1539-58. 10.1002/sim.1186.CrossRefPubMed
77.
Zurück zum Zitat Wetterslev J, Thorlund K, Brok J, Gluud C: Estimating required information size by quantifying diversity in random-effects model meta-analyses. BMC Med Res Methodol. 2009, 9: 86-10.1186/1471-2288-9-86.CrossRefPubMedPubMedCentral Wetterslev J, Thorlund K, Brok J, Gluud C: Estimating required information size by quantifying diversity in random-effects model meta-analyses. BMC Med Res Methodol. 2009, 9: 86-10.1186/1471-2288-9-86.CrossRefPubMedPubMedCentral