nach oben

BMC Medical Research Methodology

Erschienen in:

Open Access 01.12.2013 | Research article

A counterfactual approach to bias and effect modification in terms of response types

verfasst von: Etsuji Suzuki, Toshiharu Mitsuhashi, Toshihide Tsuda, Eiji Yamamoto

Erschienen in: BMC Medical Research Methodology | Ausgabe 1/2013

Abstract

Background

The counterfactual approach provides a clear and coherent framework to think about a variety of important concepts related to causation. Meanwhile, directed acyclic graphs have been used as causal diagrams in epidemiologic research to visually summarize hypothetical relations among variables of interest, providing a clear understanding of underlying causal structures of bias and effect modification. In this study, the authors aim to further clarify the concepts of bias (confounding bias and selection bias) and effect modification in the counterfactual framework.

Methods

The authors show how theoretical data frequencies can be described by using unobservable response types both in observational studies and in randomized controlled trials. By using the descriptions of data frequencies, the authors show epidemiologic measures in terms of response types, demonstrating significant distinctions between association measures and effect measures. These descriptions also demonstrate sufficient conditions to estimate effect measures in observational studies. To illustrate the ideas, the authors show how directed acyclic graphs can be extended by integrating response types and observed variables.

Results

This study shows a hitherto unrecognized sufficient condition to estimate effect measures in observational studies by adjusting for confounding bias. The present findings would provide a further understanding of the assumption of conditional exchangeability, clarifying the link between the assumptions for making causal inferences in observational studies and the counterfactual approach. The extension of directed acyclic graphs using response types maintains the integrity of the original directed acyclic graphs, which allows one to understand the underlying causal structure discussed in this study.

Conclusions

The present findings highlight that analytic adjustment for confounders in observational studies has consequences quite different from those of physical control in randomized controlled trials. In particular, the present findings would be of great use when demonstrating the inherent distinctions between observational studies and randomized controlled trials.

Additional file 1: Appendices 1 to 5.(PDF 141 KB)

Additional file 2: Tables S1 and S2: Risk ratios in terms of response types. (PDF 41 KB)

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2288-13-101) contains supplementary material, which is available to authorized users.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

ES conceptualized the authors' views and drafted the manuscript. TM, TT, and EY critically revised the manuscript for intellectual content. EY supervised the study. All authors read and approved the final manuscript.

DAG

Directed acyclic graph

Risk ratio

Background

The counterfactual approach provides a clear and coherent framework to think about a variety of important concepts related to causation [1, 2]. In particular, the counterfactual approach to confounding has been widely accessible to epidemiologists since the publication of a classic methods paper by Greenland and Robins [3], and the concept of bias is now explained in the counterfactual framework [4‐12]. (Note that an update of the classic methods paper was recently published [13]). Meanwhile, directed acyclic graphs (DAGs) have long been used as causal diagrams in epidemiologic research to visually summarize hypothetical relations among variables of interest [14, 15]. DAGs have been used extensively to determine the variables for which it is necessary to control for confounding bias to estimate causal effects [14‐20]. Besides, Hernán et al. [21] showed that various types of selection bias share a common underlying causal structure, and referred to conditioning on common effects as selection bias. Furthermore, VanderWeele and Robins [22] provided a structural classification of effect modification by using DAGs. Indeed, the different approaches provide complementary perspectives, and can be employed together to provide a clearer understanding of causality [23].

In this study, we aim to further clarify the concepts of bias (confounding bias and selection bias) and effect modification in the counterfactual framework. To achieve this, we show how theoretical data frequencies can be described by using unobservable response types both in observational studies and in randomized controlled trials. These descriptions also demonstrate sufficient conditions to estimate effect measures in observational studies, which would provide a further understanding of the assumption of conditional exchangeability. To illustrate the ideas, DAGs are employed, and we show how one can extend the original DAGs by integrating response types and observed variables. We deal only with structural (systematic) relations among the underlying variables of interest, so that an issue of random variation does not arise. Throughout this article, we assume that the consistency condition is met [24‐28].

Methods

Definitions and notation

A causal diagram and causal effects

We use a total of 4 binary variables as shown in Figure 1. We let D denote a binary outcome of interest (1: outcome occurred, 0: outcome did not occur) and let E denote a binary cause of interest (1: exposed, 0: unexposed) that is potentially manipulable. We let C denote a binary common cause of E and D (1: present, 0: absent), which is also potentially manipulable. Typically, C is called a confounder of the effect of E on D. (Note that we assume that C precedes E temporally in this study, which is in general not necessary for C to be a confounder. Recently, VanderWeele and Shpitser [29, 30] further discussed the definition of a confounder.) As explained later, C can also act as a direct effect modifier for the causal effect of E on D because C is a direct cause of D[22]. Finally, we let S denote selection variable (1: selected, 0: not selected), which is a common effect of E and D. Adjustment for S yields a spurious association between E and D, which is called selection bias [21]. Alternatively, one may assume that S is also directly influenced by C, as shown by using a dashed arrow in Figure 1. Although the arrow is assumed to be absent throughout this paper to avoid technical complications, the following discussion can be readily extended to the situations in which the dashed arrow is present.

Throughout this paper, we discuss the case where the causal effect of primary interest is the effect of E on D in the total population, including the subpopulations of C = 0 and C = 1. In the counterfactual framework, this causal effect is given by comparing P[D _e=0 = 1] and P[D _e=1 = 1], where P[D _e = 1] denotes the proportion of subjects that would have developed the outcome D = 1 had all subjects in the total population received exposure value e. The variables D _e=0 and D _e=1 are referred to as potential outcomes. Therefore, when we use risk ratios (RRs) as a measure of interest, a causal RR in the total population is given by

\frac{P [D_{e = 1} = 1]}{P [D_{e = 0} = 1]} .

(1)

Causal RRs in the total population can be consistently estimated under the assumption of (conditional) exchangeability, or, equivalently, no unmeasured confounding (i.e., E∐D _e for ∀e). Furthermore, in addition to the effect of E on D in the total population, we also discuss causal effect of E on D within the subsets of C. In this case, a causal RR within the subsets is given by

\frac{P [D_{e = 1} = 1 | C = c]}{P [D_{e = 0} = 1 | C = c]} .

(2)

Causal RRs in the subsets of C can be consistently estimated under the assumption of conditional exchangeability, or, equivalently, no unmeasured confounding given data on C (i.e., E∐D _e|C for ∀e). Note that, when the causal effect of interest is the effect of E on D either in the total population or in the subsets of C, intervening on E is of concern, and one does not consider intervening on C. Indeed, as outlined by VanderWeele [31], intervening on C would only be of concern if the joint effect of E and C on D was of interest. Therefore, under the situation in which C is being considered as a potential confounder as well as direct effect modifier (Figure 1), intervening on C is not of interest.

When we show how theoretical data frequencies can be described by using unobservable response types in observational studies, however, it is of great use to elucidate the relations between C and E in the counterfactual framework. By so doing, we demonstrate sufficient conditions to estimate effect measures in observational studies, which would provide a further understanding of the assumption of conditional exchangeability.

Response types

First, we elucidate the relations between C and E by hypothetically conceptualizing potential outcomes of E in the counterfactual framework. We let E _c(ω) denote the potential outcomes of E for individual ω if, possibly contrary to fact, there had been interventions to set C to c. (In the following sections, we explicitly show ω to discuss response types.) Then, for each individual ω, there would be 2 relevant potential outcomes of E, i.e., E ₁(ω) and E ₀(ω), which correspond to exposure status of that individual when C is present and absent, respectively. As a result, individuals can be classified into 4 (i.e., 2²) different E response types as enumerated in Table 1. We let E ^T(ω) denote E response type of individual ω. In some cases, the effect of C may be in the same direction for all individuals in the population. We say that C has a positive monotonic effect on E if E _c(ω) is non-decreasing in c for all individuals, i.e., E ₁(ω) ≥ E ₀(ω) for ∀ω, which excludes E response type 3. Note that this should be clearly distinguished from the assumptions of no preventive action or no preventive sequence [32, 33].

Table 1

Enumeration of 4 response types for exposure E and corresponding potential outcomes

Etype	Potential outcomes of E
E ^T( ω)	E _c( ω)
	E ₁( ω)	E ₀( ω)
1	1	1
2	1	0
3 ^a	0	1
4	0	0

^a Under the assumption of positive monotonicity (i.e., E ₁(ω) ≥ E ₀(ω) for ∀ω), this response type is excluded.

In a similar manner, we let D _ce(ω) denote the potential outcomes of D for individual ω if, possibly contrary to fact, there had been interventions to set C to c and to set E to e. For each individual ω, there would thus be 4 possible potential outcomes D ₁₁(ω), D ₀₁(ω), D ₁₀(ω), and D ₀₀(ω), resulting in 16 (i.e., 2⁴) different D response types as enumerated in Table 2[34]. We let D ^T(ω) denote D response type of individual ω. We say that C and E have positive monotonic effects on D if D _ce(ω) is non-decreasing in c and e for all individuals, i.e., D _ce(ω) ≥ D _{c ' e '}(ω) for ∀ω whenever c ≥ c ' and e ≥ e '. Under this assumption, the individuals of D response types 3, 5, 7, and 9 through 15 are excluded; and individuals of D response types 1, 2, 4, 6, 8, and 16 may remain [32].

Table 2

Enumeration of 16 response types for outcome D and corresponding potential outcomes

Dtype	Potential outcomes of D
D ^T( ω)	D _ce( ω)
	D ₁₁( ω)	D ₀₁( ω)	D ₁₀( ω)	D ₀₀( ω)
1	1	1	1	1
2 ^{b, c}	1	1	1	0
3 ^{a, b, c}	1	1	0	1
4	1	1	0	0
5 ^{a, b, c}	1	0	1	1
6	1	0	1	0
7 ^{a, b, c}	1	0	0	1
8 ^b	1	0	0	0
9 ^{a, b, c}	0	1	1	1
10 ^{a, b, c}	0	1	1	0
11 ^a	0	1	0	1
12 ^{a, b}	0	1	0	0
13 ^a	0	0	1	1
14 ^{a, b}	0	0	1	0
15 ^{a, b}	0	0	0	1
16	0	0	0	0

^a Under the assumption of positive monotonicity (i.e., D _ce(ω) ≥ D _{c ' e '}(ω) for ∀ω whenever c ≥ c ' and e ≥ e '), these response types are excluded.

^b Given no interaction at the individual level on the additive scale between C and E in the counterfactual framework (i.e., D ₁₁(ω) − D ₀₁(ω) − D ₁₀(ω) + D ₀₀(ω) = 0 for ∀ω), these response types are excluded.

^c Given no interaction at the individual level on the multiplicative scale between C and E in the counterfactual framework (i.e., D ₁₁(ω)D ₀₀(ω) = D ₀₁(ω)D ₁₀(ω) for ∀ω), these response types are excluded.

Likewise, we let S _ed(ω) denote the potential outcomes of S for individual ω if, possibly contrary to fact, there had been interventions to set E to e and to set D to d. For each individual ω, there would thus be 4 possible potential outcomes S ₁₁(ω), S ₀₁(ω), S ₁₀(ω), and S ₀₀(ω), resulting in 16 (i.e., 2⁴) different S response types as enumerated in Table 3. We let S ^T(ω) denote S response type of individual ω.

Table 3

Enumeration of 16 response types for selection variable S and corresponding potential outcomes

Stype	Potential outcomes of S
S ^T( ω)	S _ed( ω)
	S ₁₁( ω)	S ₀₁( ω)	S ₁₀( ω)	S ₀₀( ω)
1	1	1	1	1
2 ^{b, c}	1	1	1	0
3 ^{a, b, c}	1	1	0	1
4	1	1	0	0
5 ^{a, b, c}	1	0	1	1
6	1	0	1	0
7 ^{a, b, c}	1	0	0	1
8 ^b	1	0	0	0
9 ^{a, b, c}	0	1	1	1
10 ^{a, b, c}	0	1	1	0
11 ^a	0	1	0	1
12 ^{a, b}	0	1	0	0
13 ^a	0	0	1	1
14 ^{a, b}	0	0	1	0
15 ^{a, b}	0	0	0	1
16	0	0	0	0

^a Under the assumption of positive monotonicity (i.e., S _ed(ω) ≥ S _{e ' d '}(ω) for ∀ω whenever e ≥ e ' and d ≥ d '), these response types are excluded.

^b Given no interaction at the individual level on the additive scale between E and D in the counterfactual framework (i.e., S ₁₁(ω) − S ₀₁(ω) − S ₁₀(ω) + S ₀₀(ω) = 0 for ∀ω), these response types are excluded.

^c Given no interaction at the individual level on the multiplicative scale between E and D in the counterfactual framework (i.e., S ₁₁(ω)S ₀₀(ω) = S ₀₁(ω)S ₁₀(ω) for ∀ω), these response types are excluded.

Finally, we integrate information about the potential outcomes discussed above to produce 2 types of compound potential outcomes, which are also called nested counterfactuals [2]. (Note that compound potential outcomes have been extensively used in the issues of mediation and direct/indirect effects [35‐38].) First, we combine the potential outcomes of E and the potential outcomes of D to define

D_{c E_{c'}} (ω)

. In other words, the compound potential outcomes of D are defined by (i) confounder status (C(ω) = 1, C(ω) = 0) and (ii) potential exposure status following an intervention on confounder (E ₁(ω), E ₀(ω)). For each individual ω, there would thus be 4 possible compound potential outcomes

D_{1 E_{1}} (ω)

D_{1 E_{0}} (ω)

D_{0 E_{1}} (ω)

, and

D_{0 E_{0}} (ω)

. Second, we combine the potential outcomes of E, the potential outcomes of D, and the potential outcomes of S to define

S_{E_{c} D_{c' E_{c''}}} (ω)

. Note that the compound potential outcomes of S are defined by (i) potential exposure status following an intervention on confounder (E ₁(ω), E ₀(ω)) and (ii) the compound potential outcomes of D (

D_{1 E_{1}} (ω)

D_{1 E_{0}} (ω)

D_{0 E_{1}} (ω)

, and

D_{0 E_{0}} (ω)

). Thus, for each individual ω, there would be 8 possible compound potential outcomes

S_{E_{1} D_{1 E_{1}}} (ω)

S_{E_{1} D_{1 E_{0}}} (ω)

S_{E_{1} D_{0 E_{1}}} (ω)

S_{E_{1} D_{0 E_{0}}} (ω)

S_{E_{0} D_{1 E_{1}}} (ω)

S_{E_{0} D_{1 E_{0}}} (ω)

S_{E_{0} D_{0 E_{1}}} (ω)

, and

S_{E_{0} D_{0 E_{0}}} (ω)

Combination of 4 E response types, 16 D response types, and 16 S response types yields 1,024 (i.e., 4 × 16 × 16) EDS response types. As noted above, under the assumption of positive monotonic effect of C on E, the number of possible E response types is reduced from 4 to 3. Further, under the assumptions of both positive monotonic effects of C and E on D and no interaction at the individual level on the additive scale between C and E on D, the number of possible D response types is reduced from 16 to 4 (see footnote of Table 2). Analogous argument applies to S response types (see footnote of Table 3). Consequently, the number of possible EDS response types is reduced from 1,024 to 48 (i.e., 3 × 4 × 4). In Table 4, we show a complete enumeration of these 48 EDS response types. To enhance readability, Table 4 shows only selection status when C = 1 (i.e.,

S_{E_{1} D_{1 E_{1}}} (ω)

) and when C = 0 (i.e.,

S_{E_{0} D_{0 E_{0}}} (ω)

) among

S_{E_{c} D_{c' E_{c''}}} (ω)

. Note that we made these restrictive assumptions to show the correspondence between E response types, D response types, and S response types in Table 4, which would be of great help to understand the present findings. The following discussion however applies even without these assumptions. Thus, in the following sections, we use a total of 1,024 EDS response types, considering general cases in which these assumptions are not met.

Table 4

Enumeration of 48 EDS response types and corresponding potential outcomes

Etype	Dtype	Stype	Potential outcomes of E		Potential outcomes of D				Compound potential outcomes of D				Potential outcomes of S				Selection status
E ^T( ω)	D ^T( ω)	S ^T( ω)	E _c( ω)		D _ce( ω)				$D_{c E_{c'}} (ω)$				S _ed( ω)				$S_{E_{c} D_{c E_{c}}} (ω)$
			E ₁	E ₀	D ₁₁	D ₀₁	D ₁₀	D ₀₀	$D_{1 E_{1}}$	$D_{1 E_{0}}$	$D_{0 E_{1}}$	$D_{0 E_{0}}$	S ₁₁	S ₀₁	S ₁₀	S ₀₀	$S_{E_{1} D_{_{1} E_{_{1}}}}$	$S_{E_{0} D_{_{0} E_{_{0}}}}$
1	1	1	1	1	1	1	(1)^a	(1)	1	(1)	(1)	1	1	(1)	(1)	(1)	1	1
1	1	4	1	1	1	1	(1)	(1)	1	(1)	(1)	1	1	(1)	(0)	(0)	1	1
1	1	6	1	1	1	1	(1)	(1)	1	(1)	(1)	1	1	(0)	(1)	(0)	1	1
1	1	16	1	1	1	1	(1)	(1)	1	(1)	(1)	1	0	(0)	(0)	(0)	0	0
1	4	1	1	1	1	1	(0)	(0)	1	(1)	(1)	1	1	(1)	(1)	(1)	1	1
1	4	4	1	1	1	1	(0)	(0)	1	(1)	(1)	1	1	(1)	(0)	(0)	1	1
1	4	6	1	1	1	1	(0)	(0)	1	(1)	(1)	1	1	(0)	(1)	(0)	1	1
1	4	16	1	1	1	1	(0)	(0)	1	(1)	(1)	1	0	(0)	(0)	(0)	0	0
1	6	1	1	1	1	0	(1)	(0)	1	(1)	(0)	0	1	(1)	1	(1)	1	1
1	6	4	1	1	1	0	(1)	(0)	1	(1)	(0)	0	1	(1)	0	(0)	1	0
1	6	6	1	1	1	0	(1)	(0)	1	(1)	(0)	0	1	(0)	1	(0)	1	1
1	6	16	1	1	1	0	(1)	(0)	1	(1)	(0)	0	0	(0)	0	(0)	0	0
1	16	1	1	1	0	0	(0)	(0)	0	(0)	(0)	0	1	(1)	1	(1)	1	1
1	16	4	1	1	0	0	(0)	(0)	0	(0)	(0)	0	1	(1)	0	(0)	0	0
1	16	6	1	1	0	0	(0)	(0)	0	(0)	(0)	0	1	(0)	1	(0)	1	1
1	16	16	1	1	0	0	(0)	(0)	0	(0)	(0)	0	0	(0)	0	(0)	0	0
2	1	1	1	0	1	(1)	(1)	1	1	(1)	(1)	1	1	1	(1)	(1)	1	1
2	1	4	1	0	1	(1)	(1)	1	1	(1)	(1)	1	1	1	(0)	(0)	1	1
2	1	6	1	0	1	(1)	(1)	1	1	(1)	(1)	1	1	0	(1)	(0)	1	0
2	1	16	1	0	1	(1)	(1)	1	1	(1)	(1)	1	0	0	(0)	(0)	0	0
2	4	1	1	0	1	(1)	(0)	0	1	(0)	(1)	0	1	(1)	(1)	1	1	1
2	4	4	1	0	1	(1)	(0)	0	1	(0)	(1)	0	1	(1)	(0)	0	1	0
2	4	6	1	0	1	(1)	(0)	0	1	(0)	(1)	0	1	(0)	(1)	0	1	0
2	4	16	1	0	1	(1)	(0)	0	1	(0)	(1)	0	0	(0)	(0)	0	0	0
2	6	1	1	0	1	(0)	(1)	0	1	(1)	(0)	0	1	(1)	(1)	1	1	1
2	6	4	1	0	1	(0)	(1)	0	1	(1)	(0)	0	1	(1)	(0)	0	1	0
2	6	6	1	0	1	(0)	(1)	0	1	(1)	(0)	0	1	(0)	(1)	0	1	0
2	6	16	1	0	1	(0)	(1)	0	1	(1)	(0)	0	0	(0)	(0)	0	0	0
2	16	1	1	0	0	(0)	(0)	0	0	(0)	(0)	0	(1)	(1)	1	1	1	1
2	16	4	1	0	0	(0)	(0)	0	0	(0)	(0)	0	(1)	(1)	0	0	0	0
2	16	6	1	0	0	(0)	(0)	0	0	(0)	(0)	0	(1)	(0)	1	0	1	0
2	16	16	1	0	0	(0)	(0)	0	0	(0)	(0)	0	(0)	(0)	0	0	0	0
4	1	1	0	0	(1)	(1)	1	1	1	(1)	(1)	1	(1)	1	(1)	(1)	1	1
4	1	4	0	0	(1)	(1)	1	1	1	(1)	(1)	1	(1)	1	(0)	(0)	1	1
4	1	6	0	0	(1)	(1)	1	1	1	(1)	(1)	1	(1)	0	(1)	(0)	0	0
4	1	16	0	0	(1)	(1)	1	1	1	(1)	(1)	1	(0)	0	(0)	(0)	0	0
4	4	1	0	0	(1)	(1)	0	0	0	(0)	(0)	0	(1)	(1)	(1)	1	1	1
4	4	4	0	0	(1)	(1)	0	0	0	(0)	(0)	0	(1)	(1)	(0)	0	0	0
4	4	6	0	0	(1)	(1)	0	0	0	(0)	(0)	0	(1)	(0)	(1)	0	0	0
4	4	16	0	0	(1)	(1)	0	0	0	(0)	(0)	0	(0)	(0)	(0)	0	0	0
4	6	1	0	0	(1)	(0)	1	0	1	(1)	(0)	0	(1)	1	(1)	1	1	1
4	6	4	0	0	(1)	(0)	1	0	1	(1)	(0)	0	(1)	1	(0)	0	1	0
4	6	6	0	0	(1)	(0)	1	0	1	(1)	(0)	0	(1)	0	(1)	0	0	0
4	6	16	0	0	(1)	(0)	1	0	1	(1)	(0)	0	(0)	0	(0)	0	0	0
4	16	1	0	0	(0)	(0)	0	0	0	(0)	(0)	0	(1)	(1)	(1)	1	1	1
4	16	4	0	0	(0)	(0)	0	0	0	(0)	(0)	0	(1)	(1)	(0)	0	0	0
4	16	6	0	0	(0)	(0)	0	0	0	(0)	(0)	0	(1)	(0)	(1)	0	0	0
4	16	16	0	0	(0)	(0)	0	0	0	(0)	(0)	0	(0)	(0)	(0)	0	0	0

We consider 4 binary variables as follows: exposure E, outcome D, confounder C, and selection variable S (see Figure 1). We show the enumeration under the assumptions of positive monotonicity of E, D, and S and no interaction at the individual level on the additive scale between C and E on D and E and D on S in the counterfactual framework.

^a Parentheses indicate that this particular outcome will never be observed.

Four hypothetical situations

In Figure 2, we give an overview of 4 hypothetical situations by using DAGs. Figure 2A describes a situation in which researchers conduct an observational study and the information about a portion of subjects is unavailable due to loss to follow-up. Note that the square around S indicates that the analysis is restricted to those who do not drop out (i.e., S = 1). Investigators often encounter this situation in observational studies. Researchers should be concerned about both confounding bias and selection bias in this situation.

Subsequently, Figure 2B shows a situation in which researchers can obtain the information about the total population, including those who dropped out. In this situation, a possibility of selection bias can be ruled out since researchers do not condition on S.

In observational studies, researchers usually aim to eliminate confounding bias by employing some statistical procedures, e.g., standardization and inverse-probability weighting method. In other words, they aim to analytically block or remove the path between C and E by making an adequate adjustment. (Note that outcome modeling techniques such as disease risk scores focus on the path between C and D[39].) By contrast, in randomized controlled trials, researchers manipulate the value of E by employing certain interventions; they physically prevent E from varying in response to variations in C. Thus, as shown in Figure 2C and D, C would no longer have effects on E, and the arrow from C to E is erased or removed [14]. This should be clearly distinguished from analytic control of C in observational studies.

In the following sections, we demonstrate significant differences between these 4 hypothetical situations, by describing theoretical data frequencies in terms of response types.

Results

Describing data from observational studies in terms of response types

As demonstrated above, under the situation described in Figure 1, individuals can be classified into one of the maximum of 1,024 EDS response types. Despite its sophistication and usefulness, however, the response type of each individual is unobservable. Indeed, this is called a fundamental problem of causal inference [40]. Nonetheless, we can show the conceptual link between unobservable response types and observed, or observable, data frequencies in the population. In this respect, the concept of compound potential outcomes is quite useful.

In Figure 3, we describe theoretical data frequencies from observational studies in terms of the 1,024 possible EDS response types. We let EiDjSk denote the EDS response type of [E ^T = i, D ^T = j, S ^T = k] (i = 1, ⋯, 4, j = 1, ⋯, 16, k = 1, ⋯, 16), and let P _EiDjSk denote a prevalence of the individuals of EiDjSk response type in the total population. We also let P _C|EiDjSk and

P_{\bar{C} | EiDjSk}

denote probabilities of being exposed and unexposed to C among the individuals of EiDjSk response type, respectively. When no confusion occurs for a dichotomous variable X, we use the notations X and

\bar{X}

in the terminologies of events of X = 1 and X = 0, respectively. For example, C and

\bar{C}

mean C = 1 and C = 0, respectively. Further, N denotes the number of total population. Then, data frequencies in each “cell” in Figure 3 can be described either as N ∑ _ijk P _C|EiDjSk P _EiDjSk or

N \sum_{ijk} P_{\bar{C} | EiDjSk} P_{EiDjSk}

. (Note that the former can be also expressed as NP _C ∑ _ijk P _EiDjSk|C and that the latter can be expressed as

N P_{\bar{C}} \sum_{ijk} P_{EiDjSk | \bar{C}}

, where P _C and

P_{\bar{C}}

denote probabilities of C and

\bar{C}

in the total population, respectively). It should be noted that individuals can be classified into 16 “cells,” which is equivalent to a maximum possible number of 4 independent random events (i.e., E, D, C, and S). The upper and lower parts of Figure 3 show data frequencies among the subpopulation with C = 1 and C = 0, respectively. Those who are classified into inner dashed rectangles represent individuals selected for analyses (i.e., S = 1) while those who are not classified into the rectangles represent non-selected individuals (i.e., S = 0). In other words, the information about the individuals outside the rectangles is unavailable to researchers.

Notably, individuals of the same EDS response types can be potentially classified into 2 cells. For example, consider individual ω who is classified as E1D6S4 response type (see Table 4). This individual is, by definition, exposed to E = 1 irrespective of the value of C (i.e., E ₁(ω) = E ₀(ω) = 1). Further, individual ω is expected to experience outcome D if there had been interventions to set C to 1 (i.e.,

D_{1 E_{1}} (ω) = D_{11} (ω) = 1

), whereas this individual is expected not to experience outcome D if there had been interventions to set C to 0 (i.e.,

D_{0 E_{0}} (ω) = D_{01} (ω) = 0

). Finally, the information about this individual is, by definition, available to researchers had there been interventions to set C to 1 (i.e.,

S_{E_{1} D_{1 E_{1}}} (ω) = S_{11} (ω) = 1

), whereas this individual is lost to follow-up had there been interventions to set C to 0 (i.e.,

S_{E_{0} D_{0 E_{0}}} (ω) = S_{10} (ω) = 0

). Thus, in observational studies, individual ω of E1D6S4 response type can be classified into either one of the following 2 cells in Figure 3; one is E = 1, D = 1, C = 1, and S = 1 while the other is E = 1, D = 0, C = 0, and S = 0. Note that this depends on the probabilities that C is present or absent in individual ω (i.e., P _C|E1D6S4 and

P_{\bar{C} | E 1 D 6 S 4}

To summarize, Figure 3 shows theoretical data frequencies in an observational study (i.e., Figure 2A and B). The situation is, however, strikingly different when we conduct a randomized controlled trial, which will be demonstrated in the next section.

Describing data from randomized controlled trials in terms of response types

As noted above, researchers manipulate the value of E in randomized controlled trials. Since researchers physically prevent E from varying in response to variations in C, we do not need to consider E response types when describing theoretical data frequencies in ideal randomized controlled trials; rather we focus on D response types and S response types. In other words, observed exposure status and E response types become independent (i.e., E ∐ E ^T) when researchers marginally intervene on E. Thus, theoretical data frequencies from randomized controlled trials can be described in terms of 256 (i.e., 16 × 16) possible DS response types, in contrast with 1,024 possible EDS response types.

We let P _E and

P_{\bar{E}}

denote the probabilities of E and

\bar{E}

in the total population, respectively. (For simplicity, we describe the situation of marginal randomization of E. However, the following discussion can be extended to the situation of stratified randomization, in which P _E and

P_{\bar{E}}

may vary across the strata of C.) Figure 4 shows distributions of individuals of the 256 possible DS response types in a randomized controlled trial. Note that data frequencies in each “cell” in Figure 4 can be described as NP _E ∑ _jk P _C|DjSk P _DjSk,

N P_{\bar{E}} \sum_{jk} P_{C | DjSk} P_{DjSk}

N P_{E} \sum_{jk} P_{\bar{C} | DjSk} P_{DjSk}

, or

N P_{\bar{E}} \sum_{jk} P_{\bar{C} | DjSk} P_{DjSk}

. (Note that these can be also expressed as NP _E P _C ∑ _jk P _DjSk|C,

N P_{\bar{E}} P_{C} \sum_{jk} P_{DjSk | C}

N P_{E} P_{\bar{C}} \sum_{jk} P_{DjSk | \bar{C}}

, and

N P_{\bar{E}} P_{\bar{C}} \sum_{jk} P_{DjSk | \bar{C}}

, respectively.) As in the case of observational studies (Figure 3), individuals can be classified into 16 “cells” in Figure 4.

The theoretical data frequencies in Figure 4 can be explained as a re-distribution of individuals in Figure 3. For example, consider individual ω who is classified as either E1D6S4 response type or E2D6S4 response type. In observational studies, if the value of C of individual ω is 1, this individual is classified into an upper-left cell within the inner dashed rectangle of the upper part of Figure 3, i.e., E = 1, D = 1, C = 1, and S = 1. Then, when this individual is forced to be exposed to E in a randomized controlled trial, this individual would remain in the upper-left cell within the inner dashed rectangle of the upper part of Figure 4. Note that neither D response types nor S response types of this individual change by the intervention on E. By contrast, if individual ω is forced to be unexposed to E, this individual “moves” to a lower-left cell within the inner dashed rectangle of the upper part of Figure 4, i.e., E = 0, D = 1, C = 1, and S = 1. On the other hand, consider individual ω who is classified as either E3D6S4 response type or E4D6S4 response type. In observational studies, if the value of C of individual ω is 1, this individual is classified into a lower-left cell within the inner dashed rectangle of the upper part of Figure 3, i.e., E = 0, D = 1, C = 1, and S = 1. Then, in randomized controlled trials, if this individual is forced to be unexposed to E, this individual would remain in the lower-left cell within the inner dashed rectangle of the upper part of Figure 4. Meanwhile, if this individual is forced to be exposed to E, this individual “moves” to an upper-left cell within the inner dashed rectangle of the upper part of Figure 4, i.e., E = 1, D = 1, C = 1, and S = 1. These re-distributions can be summarized as

\begin{array}{l} P_{E} (N \sum_{i = 1, 2} P_{C | EiD 6 S 4} P_{EiD 6 S 4} + N \sum_{i = 3, 4} P_{C | EiD 6 S 4} P_{EiD 6 S 4}) \\ = N P_{E} \sum_{i = 1, 2, 3, 4} P_{C} P_{EiD 6 S 4 | C} \\ = N P_{E} P_{C} P_{D 6 S 4 | C} \\ = N P_{E} P_{C | D 6 S 4} P_{D 6 S 4}, \end{array}

(3)

and

\begin{array}{l} P_{\bar{E}} (N \sum_{i = 1, 2} P_{C | EiD 6 S 4} P_{EiD 6 S 4} + N \sum_{i = 3, 4} P_{C | EiD 6 S 4} P_{EiD 6 S 4}) \\ = N P_{\bar{E}} \sum_{i = 1, 2, 3, 4} P_{C} P_{EiD 6 S 4 | C} \\ = N P_{\bar{E}} P_{C} P_{D 6 S 4 | C} \\ = N P_{\bar{E}} P_{C | D 6 S 4} P_{D 6 S 4} . \end{array}

(4)

Note that the numbers in the parentheses of left-hand sides of equations 3 and 4 are based on the subpopulation of C = 1 in observational studies (i.e., upper part of Figure 3), whereas the right-hand sides of these equations are based on the subpopulation of C = 1 in randomized controlled trials (i.e., upper part of Figure 4). In other words, these equations explain how individuals of subpopulation of C = 1 are re-distributed as a result of intervention on E.

Analogous discussion applies when the value of C is 0 among the individuals of E1D6S4, E2D6S4, E3D6S4, or E4D6S4 response types. Note that, in observational studies, these individuals are classified in either an upper-right cell (i.e., E = 1, D = 0, C = 0, and S = 0) or a lower-right cell (i.e., E = 0, D = 0, C = 0, and S = 0) outside the inner dashed rectangle of the lower part of Figure 3. The re-distributions of these individuals as a result of intervention on E can be summarized as

\begin{array}{l} P_{E} (N \sum_{i = 1, 3} P_{\bar{C} | EiD 6 S 4} P_{EiD 6 S 4} + N \sum_{i = 2, 4} P_{\bar{C} | EiD 6 S 4} P_{EiD 6 S 4}) \\ = N P_{E} \sum_{i = 1, 2, 3, 4} P_{\bar{C}} P_{EiD 6 S 4 | \bar{C}} \\ = N P_{E} P_{\bar{C}} P_{D 6 S 4 | \bar{C}} \\ = N P_{E} P_{\bar{C} | D 6 S 4} P_{D 6 S 4}, \end{array}

(5)

and

\begin{array}{l} P_{\bar{E}} (N \sum_{i = 1, 3} P_{\bar{C} | EiD 6 S 4} P_{EiD 6 S 4} + N \sum_{i = 2, 4} P_{\bar{C} | EiD 6 S 4} P_{EiD 6 S 4}) \\ = N P_{\bar{E}} \sum_{i = 1, 2, 3, 4} P_{\bar{C}} P_{EiD 6 S 4 | \bar{C}} \\ = N P_{\bar{E}} P_{\bar{C}} P_{D 6 S 4 | \bar{C}} \\ = N P_{\bar{E}} P_{\bar{C} | D 6 S 4} P_{D 6 S 4} . \end{array}

(6)

Again, the numbers in the parentheses of left-hand sides of equations 5 and 6 are based on the subpopulation of C = 0 in observational studies (i.e., lower part of Figure 3), whereas the right-hand sides of these equations are based on the subpopulation of C = 0 in randomized controlled trials (i.e., lower part of Figure 4). In other words, these equations explain how individuals of subpopulation of C = 0 are re-distributed as a result of intervention on E. It should be noted that these re-distributions do not occur across the upper and lower parts of Figures 3 and 4 because C precedes E temporally and the value of C is, by definition, predetermined before intervention on E. These discussions also demonstrate that, in Figure 4, individuals of the same DS response types can be potentially classified into 4 cells, depending on the probability of being exposed or unexposed to C (i.e., P _C|DjSk or

P_{\bar{C} | DjSk}

) and the probability of being exposed or unexposed to E (i.e., P _E or

P_{\bar{E}}

Note that, when the information about the total population is available, both marginal and conditional exchangeability assumptions are met in Figure 4; the distributions of DS response types are comparable between the exposed and unexposed groups. However, when the information about those who dropped out is not available, exchangeability assumptions do not hold, either conditionally or unconditionally. See (Additional file 1: Appendix 1) for a discussion of positivity – another fundamental assumption for causal inference [41‐43].

Epidemiologic measures in terms of response types

The descriptions of data frequencies in Figures 3 and 4 have a crucial implication, demonstrating significant distinctions between association measures and effect measures [9]. In the following sections, we continue to focus our discussion on RRs, which can be extended to other measures. Note also that, although epidemiologic measures can be defined for a variety of target population (e.g., the exposed and the unexposed), the following discussion focuses on the situation in which target population is the total population. Furthermore, we also discuss epidemiologic measures in the subpopulation defined by C or S.

In observational studies (Figure 2A and B), researchers can readily calculate associational RRs by referring to the notations in Figure 3. In particular, when no information is available about those who dropped out (Figure 2A), one can calculate an associational RR_S=1 by using the information about individuals classified into the inner rectangles in Figure 3. Then, as shown in (Additional file 2: Table S1), associational RR_S=1 can be described in terms of a probability of being exposed or unexposed to C among the individuals of EiDjSk response type (i.e., P _C|EiDjSk or

P_{\bar{C} | EiDjSk}

) and a prevalence of the individuals of EiDjSk response type in the total population (i.e., P _EiDjSk) (equation A1). Meanwhile, when researchers are capable of gathering information about those who dropped out (Figure 2B), the information about individuals of S response types 1 through 16 is available, which yields an associational RR (equation A4).

By contrast, when researchers obtain data from randomized controlled trials (Figure 2C and D), their frequencies can be described in a different way, as shown in Figure 4. In these cases, researchers can calculate causal RRs to infer causality between E and D. When no information is available about those who dropped out (Figure 2C), one can calculate a causal RR_S=1 by using the information about individuals classified into the inner rectangles in Figure 4. Then, as shown in (Additional file 2: Table S2), causal RR_S=1 can be described in terms of a probability of being exposed or unexposed to C among the individuals of DjSk response type (i.e., P _C|DjSk or

P_{\bar{C} | DjSk}

) and a prevalence of the individuals of DjSk response type in the total population (i.e., P _DjSk) (equation A7). In ideal randomized controlled trials without loss to follow-up (Figure 2D), the information about individuals of S response types 1 through 16 is available, which yields a causal RR (equation A10). We should note that the causal RR shown in equation A10 is an alternative notation of the causal RR shown in equation 1 in terms of response types (see Additional file 1: Appendix 2).

Note that, even in ideal (either marginal or stratified) randomized controlled trials, one may observe a heterogeneity between stratum-specific causal RRs, which will be addressed in the section entitled “Modification of epidemiologic measures”.

Confounding bias

In this section, we aim to further clarify the concept of confounding bias in the counterfactual framework, by describing it in terms of response types.

We show a sufficient condition to estimate effect measures in observational studies by adjusting for confounding bias in terms of response types. In this case, we use effect measures in the total population in ideal randomized controlled trials (i.e., causal RR) as a gold standard. As noted above, confounding bias is induced by a common cause C of E and D. Thus, to show a sufficient condition to adjust for confounding bias, we need to compare association measures in the total population in observational studies (Figure 2B) and effect measures in the total population in randomized controlled trials (Figure 2D) In other words, a sufficient condition to adjust for confounding bias can be described as: adjusted associational RR = causal RR. Note that we here compare 2 distinct types of epidemiologic measures, which are obtained from distinct study designs.

To adjust for confounding bias in observational studies, one may calculate a weighted average of stratum-specific associational RRs, or standardization, expecting to estimate a causal RR. By using stratum-specific associational RRs (equations A5 and A6), this procedure can be described in terms of response types as follows:

\begin{array}{l} \frac{P_{C} (\frac{\sum_{\begin{array}{l} i = 1, 2 \\ j = 1, 2, 3, 4, 5, 6, 7, 8 \end{array}} P_{C | EiDj} P_{EiDj}}{\sum_{i = 1, 2} P_{C | Ei} P_{Ei}}) + P_{\bar{C}} (\frac{\sum_{\begin{array}{l} i = 1, 3 \\ j = 1, 2, 3, 4, 9, 10, 11, 12 \end{array}} P_{\bar{C} | EiDj} P_{EiDj}}{\sum_{i = 1, 3} P_{\bar{C} | Ei} P_{Ei}})}{P_{C} (\frac{\sum_{\begin{array}{l} i = 3, 4 \\ j = 1, 2, 5, 6, 9, 10, 13, 14 \end{array}} P_{C | EiDj} P_{EiDj}}{\sum_{i = 3, 4} P_{C | Ei} P_{Ei}}) + P_{\bar{C}} (\frac{\sum_{\begin{array}{l} i = 2, 4 \\ j = 1, 3, 5, 9, 11, 13, 15 \end{array}} P_{\bar{C} | EiDj} P_{EiDj}}{\sum_{i = 2, 4} P_{\bar{C} | Ei} P_{Ei}})} \\ = \frac{(\frac{\sum_{\begin{array}{l} i = 1, 2 \\ j = 1, 2, 3, 4, 5, 6, 7, 8 \end{array}} P_{C} P_{EiDj | C}}{\sum_{i = 1, 2} P_{Ei | C}} + \frac{\sum_{\begin{array}{l} i = 1, 3 \\ j = 1, 2, 3, 4, 9, 10, 11, 12 \end{array}} P_{\bar{C}} P_{EiDj | \bar{C}}}{\sum_{i = 1, 3} P_{Ei | \bar{C}}})}{(\frac{\sum_{\begin{array}{l} i = 3, 4 \\ j = 1, 2, 5, 6, 9, 10, 13, 14 \end{array}} P_{C} P_{EiDj | C}}{\sum_{i = 3, 4} P_{Ei | C}} + \frac{\sum_{\begin{array}{l} i = 2, 4 \\ j = 1, 3, 5, 9, 11, 13, 15 \end{array}} P_{\bar{C}} P_{EiDj | \bar{C}}}{\sum_{i = 2, 4} P_{Ei | \bar{C}}})} . \end{array}

(7)

Notably, this is not equivalent to a causal RR (equation A10). In other words, this stratification-based procedure does not “delete” the arrow from C to E in Figure 2B, yielding subtly different measures from effect measures. When one can assume conditional exchangeability (i.e., E∐D _e|C for ∀e), the weighted average of stratum-specific associational RR can be expressed as

\frac{\sum_{j = 1, 2, 3, 4, 5, 6, 7, 8} P_{C | Dj} P_{Dj} + \sum_{j = 1, 2, 3, 4, 9, 10, 11, 12} P_{\bar{C} | Dj} P_{Dj}}{\sum_{j = 1, 2, 5, 6, 9, 10, 13, 14} P_{C | Dj} P_{Dj} + \sum_{j = 1, 3, 5, 9, 11, 13, 15} P_{\bar{C} | Dj} P_{Dj}},

(8)

which is equivalent to a causal RR in equation A10 (see Additional file 1: Appendix 3). Indeed, the condition E∐D _e|C for ∀e is a sufficient condition to estimate effect measures in observational studies by adjusting for confounding bias, and the assumption of exchangeability often gets most of the attention in discussions about causal inference [44]. Unfortunately, however, the condition is not guaranteed in observational studies, and expert knowledge is required. Importantly, the assumption of conditional exchangeability, i.e., E∐D _e|C for ∀e, is subtly weaker than another sufficient condition to estimate effect measures in observational studies, i.e., full conditional exchangeability, or E∐(D _e=1, D _e=0)|C[45]. It may be difficult, however, to imagine a practical scenario where the former holds but not the latter [46], and the word “exchangeability” has been sometimes used interchangeably in the literature. (A combination of full exchangeability and positivity has been termed “strongly ignorable treatment assignment” assumption or “strong ignorability,” whereas a combination of exchangeability and positivity has been termed “weakly ignorable treatment assignment” assumption or “weak ignorability” [2, 47, 48].) By comparing equations 7 and 8, we can show that the conditions E ^T ∐ D ^T|C and E ∐ D ^T|C are also sufficient conditions to estimate effect measures in observational studies by adjusting for confounding bias (see Additional file 1: Appendix 3). In Additional file 1: Appendix 4, we show a proof of the following inclusion relation:

E^{T} ∐ D^{T} | C \Rightarrow E ∐ D^{T} | C \Rightarrow E ∐ (D_{e = 1}, D_{e = 0}) | C \Rightarrow E ∐ D_{e} | C for \forall e

The subtle differences between E ^T ∐ D ^T|C and E ∐ D ^T|C are described graphically in the section entitled “Extended causal diagrams integrating response types”. It is worthwhile to mention that the condition E ^T ∐ D ^T|C is not guaranteed in randomized controlled trials.

The above discussion implies that analytic adjustment for C in observational studies has consequences quite different from those of physical control in randomized controlled trials. Even when adequate analytic control of C may be envisaged in observational studies, researchers cannot estimate effect measures without the assumption external to data. See Additional file 1: Appendix 5 for a discussion of recently-introduced assumptions to compensate for a lack of randomization.

Selection bias

In this section, we aim to further clarify the concept of selection bias in the counterfactual framework, by describing it in terms of response types.

We show sufficient conditions for non-selection bias in terms of response types. As explained above, selection bias is induced by conditioning on a common effect of E and D (Figure 2A and C). Thus, to show sufficient conditions for non-selection bias, we need to specify epidemiologic measures, i.e., association measures or effect measures. With regard to association measures, a sufficient condition for non-selection bias is described as associational RR_S=1 = associational RR (see equations A1 and A4). Likewise, a sufficient condition for non-selection bias for effect measures is described as causal RR_S=1 = causal RR (see equations A7 and A10). It is worthwhile to mention that, when discussing selection bias, one need to specify a stratum of S[21]. In most cases, researchers are interested in the presence and the degree of selection bias among the subjects who do not drop out. Thus, we here show sufficient conditions for non-selection bias in a stratum S = 1. As explained later by using extended causal diagrams, selection bias results in violation of E ∐ D ^T even when exposure is randomly assigned.

Modification of epidemiologic measures

For decades, epidemiologists have used the term “effect modification” in a broad context, simply referring to a variation in the selected effect measure for the factor under study across levels of another factor [49]. In this respect, a recent paper clarified the distinction between interaction and effect modification within the counterfactual framework [31]. It is also well known that the presence, direction, and size of modification can be dependent on the choice of measure [50]. Since the term “effect modification” is ambiguous, it is now recommended to specify the measures more precisely, e.g., risk-difference modification [50]. The above discussion implies that researchers need to distinguish association-measure modification and effect-measure modification. For example, when the information about total population is available in a randomized controlled trial, causal-RR modification is defined to be present if stratum-specific causal RRs from each subpopulation varies across the strata of C, i.e., causal RR_C=1 ≠ causal RR_C=0 (see equations A11 and A12). When stratum-specific causal RRs are (approximately) homogeneous or uniform across strata, researchers usually pool the data to calculate a causal RR in the total population (i.e., causal RR). In a similar manner, one can define associational-RR modification (see equations A5 and A6). Only if it is appropriate to pool the data across the strata of C, one can validly interpret associational RRs in the total population.

Notably, the presence of association-measure modification does not necessarily imply the presence of effect-measure modification, and vice versa.

Extended causal diagrams integrating response types

In this section, we attempt to explain the concept of bias by extending causal diagrams, which integrate response types and observed variables. Although these causal diagrams, or extended DAGs, may appear less intuitive, they maintain the integrity of the original DAGs and would be of great use in graphically describing the findings discussed in this study. In particular, by integrating response types and observed variables, we can readily understand subtle differences between E ^T ∐ D ^T|C and E ∐ D ^T|C, demonstrating sufficient conditions to estimate effect measures in observational studies.

Figure 5 shows the hypothetical situation described in Figure 1 by integrating response types of E, D, and S (i.e., E ^T, D ^T, and S ^T, respectively). First, note that the only arrows emanating from the response types point to the corresponding observed variables, i.e., E ^T → E, D ^T → D, and S ^T → S. Then, to describe the underlying correlation between E ^T, D ^T, and S ^T, we use a total of 3 unmeasured common causes, U1, U2, and U3, which are independent of each other. In other words, these unmeasured common causes as a whole represent underlying personal characteristics determining his/her response types of E, D, and S. (Note that, unlike U1, U2, and U3, we assume that C does not determine his/her response types. In other words, we assume that U1, U2, and U3 precede response types, while C does not.) It would be worth to mention that Figure 5 well describes how the observed variables are determined in response to the corresponding response types and their measured parent(s). For example, each individual has 2 potential outcomes of E: the outcome that would occur if C is present in that individual, E ₁, and the outcome that would occur if C is absent in that individual, E ₀. Thus, we observe E = CE ₁ + (1 − C)E ₀. In Figure 5, this equation is illustrated by the only 2 arrows from E ^T and C to E.

In randomized controlled trials, in which E is marginally randomized, researchers physically prevent E from varying in response to variations in C by intervening on E. Thus, by applying the rule used in the standard DAG theory, the 2 arrows pointing to E in Figure 5 are removed. As a result, observed exposure status and E response types become independent (i.e., E ∐ E ^T), as shown in Figure 6. In addition, observed exposure status becomes independent of D response types and S response types (i.e.,

E ∐ D^{T}

and

E ∐ S^{T}

, respectively) because the value of E is, by definition, determined randomly. Note that

E ∐ D^{T}

implies an assumption of (full) exchangeability. Trivially, observed exposure status is also independent of D ^T given C, i.e.,

E ∐ D^{T} |C

, thus implying the assumption of (full) conditional exchangeability. Importantly, even when adjusting for C, marginal randomization of E does not result in independence between E response types and D response types due to 2 open paths, E ^T ← U1 → D ^T and E ^T ← U2 → D ^T. If we adjust for U1 and U2, they become independent. Finally, Figure 6 also clearly shows that selection bias results in violation of

E ∐ D^{T}

; when some of the subjects are lost to follow up, 7 marginally blocked paths between E and D ^T (i.e., E → S ← D ← D ^T, E → S ← S ^T ← U1 → D ^T, E → S ← S ^T ← U2 → D ^T, E → S ← S ^T ← U3 → D ^T, E → D → S ← S ^T ← U1 → D ^T, E → D → S ← S ^T ← U2 → D ^T, and E → D → S ← S ^T ← U3 → D ^T) become open because we condition on the collider S. Indeed, extended DAGs are of great use to demonstrate that both confounding bias and selection bias result in lack of (full) exchangeability of the exposed and unexposed groups.

Meanwhile, when using stratified randomization of E by C, researchers physically prevent E from varying in response to variations in E response types, but the probability of E may vary across the strata of C. Thus, unlike the marginal randomization of E, of the 2 arrows pointing to E in Figure 5, only the arrow from E ^T to E is removed (Figure 7). As a result, there is an open path between E and E ^T, i.e., E ← C ← U1 → E ^T, which can be blocked by adjusting for C (i.e.,

E ∐ E^{T} |C

). Further, although there is an open path between E and D ^T, i.e., E ← C ← U1 → D ^T, this path can be blocked by adjusting for C (i.e.,

E ∐ D^{T} |C

), which implies that the assumption of (full) conditional exchangeability can be readily met in stratified randomization of E. Finally, there is an open path between E and S ^T, i.e., E ← C ← U1 → S ^T, which can be also blocked by adjusting for C (i.e.,

E ∐ S^{T} |C

). Note that, like Figure 6, even when adjusting for C, stratified randomization of E does not result in independence between E response types and D response types, and we need to additionally adjust for U1 and U2.

Figure 8 shows the situation in observational studies, in which researchers stratify by C to calculate a weighted average of stratum-specific association measures. (A square around C means that we condition on C.) Unlike marginal or stratified randomization of E, observed exposure status is determined in response to E response types as well as the status of C. Therefore, no arrows pointing to E are removed in Figure 8. Note that, in Figure 8, E and D ^T would be marginally connected via the following 3 paths, i.e., E ← E ^T ← U1 → D ^T, E ← E ^T ← U2 → D ^T, and E ← C ← U1 → D ^T. When we condition on C, only the third path can be blocked, and E and D ^T remain connected via the first 2 paths. Notably, the 3 paths can be theoretically blocked by conditioning on U1 and U2. In other words, a sufficient condition of

E ∐ D^{T} |C

is to adjust for U1 and U2 in observational studies. Meanwhile, in equation 8, we demonstrate that the condition

E^{T} ∐ D^{T} |C

is a sufficient condition to estimate effect measures in observational studies without loss to follow-up. Indeed, E ^T and D ^T are not independent given C in Figure 8, and they are connected via the following 2 paths, conditional on C, i.e., E ^T ← U1 → D ^T and E ^T ← U2 → D ^T. Note that both paths can be theoretically blocked by conditioning on U1 and U2. To summarize, although

E^{T} ∐ D^{T} |C

and

E ∐ D^{T} |C

are sufficient conditions to estimate effect measures in observational studies, neither is guaranteed in observational studies, and expert knowledge is required. In particular, although

E^{T} ∐ D^{T} |C

is stronger than

E ∐ D^{T} |C

, we need to adjust for U1 and U2 to achieve either condition as shown in Figure 8.

Finally, it is worthwhile to mention that the perspectives of the extended DAGs are different from those of the twin network method, which has been developed to deal with counterfactual values in DAGs [2]. This graphical method uses two networks, one to represent the actual world and one to represent the hypothetical world. Thus, this method is used to represent the causal relations under intervention. The aim of our extended DAGs is to integrate response types and observed variables, which is thus applicable to observational studies as well as randomized controlled trials. As a consequence, we can use the extended DAGs to describe the sufficient conditions to infer causality in observational studies in terms of response types.

Discussion

We have clarified the concepts of bias and effect modification in the counterfactual framework, by describing theoretical data frequencies from observational studies and randomized controlled trials in terms of response types. Although these concepts have been extensively explained in the epidemiologic literature, most of the studies have discussed them separately. In this article, we have highlighted the relations between these concepts, by discussing them simultaneously. The present findings would somehow clarify the link between the assumptions for making causal inferences in observational studies and the counterfactual approach, demonstrating the inherent distinctions between observational studies and randomized controlled trials. The extension of DAGs using response types maintains the integrity of the original DAGs, which allows one to understand the underlying causal structure discussed in this study.

We have shown a hitherto unrecognized sufficient condition

E^{T} ∐ D^{T} |C

to estimate effect measures in observational studies by adjusting for confounding bias. This condition is stronger than the assumption of (full) conditional exchangeability, and it is not straightforward to discuss technical advantages of the hitherto unrecognized condition. Such consideration however would enable one to further understand the conceptual link between unobservable response types and observed, or observable, data frequencies in the population. This would also facilitate understanding of the underlying causal structures of bias and effect modification.

In this article, we use a simple hypothetical situation, including only 4 binary variables. Thus, it should be noted that the present study does not encompass more complicated situations, e.g., M-bias [51]. It is however worthwhile to mention that the condition

E^{T} ∐ D^{T} |C

is applicable even when an exposure and an outcome are polytomous variables, because our discussion based on the extended DAGs does not restrict the type of variables. When considering situations in which there are some confounders, the present finding would apply by defining and estimating a function of measured confounders that can be treated as a single confounder. It should be also noted that we focused only on direct effect modification, and thus, the present discussion does not necessarily apply to other types of effect modification, i.e., indirect effect modification, effect modification by proxy, and effect modification by a common cause [22]. Further, this study does not address the issue of information bias or measurement error. Recent studies have discussed how DAGs can be used to represent them [52‐55], which should be addressed further in future studies.

Conclusion

As shown in the present study, researchers should recognize inherent limitations of observational studies in estimating causal effects. It should be emphasized, however, that the recognition should come in the interpretation of the evidence when trying to draw conclusions, not in the statement of research goals or study design and conduct phases [56]. The data from observational studies yield measures of association and those who examine the data should strive to impose a meaning based on their expert knowledge on each occasion, which would improve causal interpretations.

Authors’ information

ES is Assistant Professor of Epidemiology at Okayama University. His primary research interest concerns improving causal interpretations of observational studies. TM was a Research Fellow of Epidemiology when this study was conducted. He is currently working as Assistant Professor in Center for Innovative Clinical Medicine at Okayama University Hospital. TT, as a Professor of Environmental Epidemiology, has evaluated a variety of health effects of environmental factors to advance the public’s health. EY, as a Professor of Statistics, is interested in contributing to the advancement of statistical theories necessary for causal inference.

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

Electronic supplementary material

Additional file 1: Appendices 1 to 5.(PDF 141 KB)

Additional file 2: Tables S1 and S2: Risk ratios in terms of response types. (PDF 41 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Little RJ, Rubin DB: Causal effects in clinical and epidemiological studies via potential outcomes: concepts and analytical approaches. Annu Rev Public Health. 2000, 21: 121-145. 10.1146/annurev.publhealth.21.1.121.CrossRefPubMed

Pearl J: Causality: Models, Reasoning, and Inference. 2009, New York, NY: Cambridge University Press, 2CrossRef

Greenland S, Robins JM: Identifiability, exchangeability, and epidemiological confounding. Int J Epidemiol. 1986, 15: 413-419. 10.1093/ije/15.3.413.CrossRefPubMed

Greenland S, Robins JM, Pearl J: Confounding and collapsibility in causal inference. Stat Sci. 1999, 14: 29-46. 10.1214/ss/1009211805.CrossRef

Kaufman JS, Poole C: Looking back on "causal thinking in the health sciences". Annu Rev Public Health. 2000, 21: 101-119. 10.1146/annurev.publhealth.21.1.101.CrossRefPubMed

Greenland S, Morgenstern H: Confounding in health research. Annu Rev Public Health. 2001, 22: 189-212. 10.1146/annurev.publhealth.22.1.189.CrossRefPubMed

Maldonado G, Greenland S: Estimating causal effects. Int J Epidemiol. 2002, 31: 422-429. 10.1093/ije/31.2.422.CrossRefPubMed

Hernán MA: A definition of causal effect for epidemiological research. J Epidemiol Community Health. 2004, 58: 265-271. 10.1136/jech.2002.006361.CrossRefPubMedPubMedCentral

Greenland S, Rothman KJ, Lash TL: Measures of effect and measures of association. Modern Epidemiology. Edited by: Rothman KJ, Greenland S, Lash TL. 2008, Philadelphia, PA: Lippincott Williams & Wilkins, 51-70. 3

10.

Weisberg HI: Bias and Causation: Models and Judgment for Valid Comparisons. 2010, Hoboken, NJ: WileyCrossRef

11.

Morabia A: History of the modern epidemiological concept of confounding. J Epidemiol Community Health. 2011, 65: 297-300. 10.1136/jech.2010.112565.CrossRefPubMed

12.

Höfler M: Causal inference based on counterfactuals. BMC Med Res Methodol. 2005, 5: 28-10.1186/1471-2288-5-28.CrossRefPubMedPubMedCentral

13.

Greenland S, Robins JM: Identifiability, exchangeability and confounding revisited. Epidemiol Perspect Innov. 2009, 6: 4-10.1186/1742-5573-6-4.CrossRefPubMedPubMedCentral

14.

Greenland S, Pearl J, Robins JM: Causal diagrams for epidemiologic research. Epidemiology. 1999, 10: 37-48. 10.1097/00001648-199901000-00008.CrossRefPubMed

15.

Glymour MM, Greenland S: Causal diagram. Modern Epidemiology. Edited by: Rothman KJ, Greenland S, Lash TL. 2008, Philadelphia, PA: Lippincott Williams & Wilkins, 183-209. 3

16.

Robins JM: Data, design, and background knowledge in etiologic inference. Epidemiology. 2001, 12: 313-320. 10.1097/00001648-200105000-00011.CrossRefPubMed

17.

Hernán MA, Hernández-Díaz S, Werler MM, Mitchell AA: Causal knowledge as a prerequisite for confounding evaluation: an application to birth defects epidemiology. Am J Epidemiol. 2002, 155: 176-184. 10.1093/aje/155.2.176.CrossRefPubMed

18.

VanderWeele TJ, Hernán MA, Robins JM: Causal directed acyclic graphs and the direction of unmeasured confounding bias. Epidemiology. 2008, 19: 720-728. 10.1097/EDE.0b013e3181810e29.CrossRefPubMedPubMedCentral

19.

Shrier I, Platt RW: Reducing bias through directed acyclic graphs. BMC Med Res Methodol. 2008, 8: 70-10.1186/1471-2288-8-70.CrossRefPubMedPubMedCentral

20.

Evans D, Chaix B, Lobbedez T, Verger C, Flahault A: Combining directed acyclic graphs and the change-in-estimate procedure as a novel approach to adjustment-variable selection in epidemiology. BMC Med Res Methodol. 2012, 12: 156-10.1186/1471-2288-12-156.CrossRefPubMedPubMedCentral

21.

Hernán MA, Hernández-Díaz S, Robins JM: A structural approach to selection bias. Epidemiology. 2004, 15: 615-625. 10.1097/01.ede.0000135174.63482.43.CrossRefPubMed

22.

VanderWeele TJ, Robins JM: Four types of effect modification: a classification based on directed acyclic graphs. Epidemiology. 2007, 18: 561-568. 10.1097/EDE.0b013e318127181b.CrossRefPubMed

23.

Greenland S, Brumback B: An overview of relations among causal modelling methods. Int J Epidemiol. 2002, 31: 1030-1037. 10.1093/ije/31.5.1030.CrossRefPubMed

24.

Cole SR, Frangakis CE: The consistency statement in causal inference: a definition or an assumption?. Epidemiology. 2009, 20: 3-5. 10.1097/EDE.0b013e31818ef366.CrossRefPubMed

25.

VanderWeele TJ: Concerning the consistency assumption in causal inference. Epidemiology. 2009, 20: 880-883. 10.1097/EDE.0b013e3181bd5638.CrossRefPubMed

26.

Pearl J: On the consistency rule in causal inference: axiom, definition, assumption, or theorem?. Epidemiology. 2010, 21: 872-875. 10.1097/EDE.0b013e3181f5d3fd.CrossRefPubMed

27.

Petersen ML: Compound treatments, transportability, and the structural causal model: the power and simplicity of causal graphs. Epidemiology. 2011, 22: 378-381. 10.1097/EDE.0b013e3182126127.CrossRefPubMed

28.

Hernán MA, VanderWeele TJ: Compound treatments and transportability of causal inference. Epidemiology. 2011, 22: 368-377. 10.1097/EDE.0b013e3182109296.CrossRefPubMedPubMedCentral

29.

VanderWeele TJ, Shpitser I: A new criterion for confounder selection. Biometrics. 2011, 67: 1406-1413. 10.1111/j.1541-0420.2011.01619.x.CrossRefPubMedPubMedCentral

30.

VanderWeele TJ, Shpitser I: On the definition of a confounder. Ann Stat. 2013, 41: 196-220. 10.1214/12-AOS1058.CrossRefPubMedPubMedCentral

31.

VanderWeele TJ: On the distinction between interaction and effect modification. Epidemiology. 2009, 20: 863-871. 10.1097/EDE.0b013e3181ba333c.CrossRefPubMed

32.

Suzuki E, Yamamoto E, Tsuda T: On the link between sufficient-cause model and potential-outcome model. Epidemiology. 2011, 22: 131-132. 10.1097/EDE.0b013e3181febc5c.CrossRefPubMed

33.

Suzuki E, Yamamoto E, Tsuda T: On the relations between excess fraction, attributable fraction, and etiologic fraction. Am J Epidemiol. 2012, 175: 567-575. 10.1093/aje/kwr333.CrossRefPubMed

34.

Greenland S, Poole C: Invariants and noninvariants in the concept of interdependent effects. Scand J Work Environ Health. 1988, 14: 125-129. 10.5271/sjweh.1945.CrossRefPubMed

35.

Hafeman DM: A sufficient cause based approach to the assessment of mediation. Eur J Epidemiol. 2008, 23: 711-721. 10.1007/s10654-008-9286-7.CrossRefPubMed

36.

VanderWeele TJ: Mediation and mechanism. Eur J Epidemiol. 2009, 24: 217-224. 10.1007/s10654-009-9331-1.CrossRefPubMed

37.

Suzuki E, Yamamoto E, Tsuda T: Identification of operating mediation and mechanism in the sufficient-component cause framework. Eur J Epidemiol. 2011, 26: 347-357. 10.1007/s10654-011-9568-3.CrossRefPubMed

38.

Hafeman DM, VanderWeele TJ: Alternative assumptions for the identification of direct and indirect effects. Epidemiology. 2011, 22: 753-764. 10.1097/EDE.0b013e3181c311b2.CrossRefPubMed

39.

Glynn RJ, Gagne JJ, Schneeweiss S: Role of disease risk scores in comparative effectiveness research with emerging therapies. Pharmacoepidemiol Drug Saf. 2012, 21 (Suppl 2): 138-147.CrossRefPubMedPubMedCentral

40.

Holland PW: Statistics and causal inference. J Am Stat Assoc. 1986, 81: 945-960. 10.1080/01621459.1986.10478354.CrossRef

41.

Hernán MA, Robins JM: Estimating causal effects from epidemiological data. J Epidemiol Community Health. 2006, 60: 578-586. 10.1136/jech.2004.029496.CrossRefPubMedPubMedCentral

42.

Westreich D, Cole SR: Invited commentary: positivity in practice. Am J Epidemiol. 2010, 171: 674-677. 10.1093/aje/kwp436.CrossRefPubMedPubMedCentral

43.

Petersen ML, Porter KE, Gruber S, Wang Y, van der Laan MJ: Diagnosing and responding to violations in the positivity assumption. Stat Methods Med Res. 2012, 21: 31-54. 10.1177/0962280210386207.CrossRefPubMed

44.

Hernán MA: Beyond exchangeability: the other conditions for causal inference in medical research. Stat Methods Med Res. 2012, 21: 3-5. 10.1177/0962280211398037.CrossRefPubMed

45.

Robins JM, Hernán MA: Estimation of the causal effects of time-varying exposures. Longitudinal Data Analysis. Edited by: Fitzmaurice GM, Davidian M, Verbeke G, Molenberghs G. 2009, Boca Raton, FL: CRC Press, 553-599.

46.

Sjölander A: The language of potential outcomes. Causality: Statistical Perspectives and Applications. Edited by: Berzuini C, Dawid P, Bernardinelli L. 2012, Hoboken, NJ: Wiley, 6-14.CrossRef

47.

Rosenbaum PR, Rubin DB: The central role of the propensity score in observational studies for causal effects. Biometrika. 1983, 70: 41-55. 10.1093/biomet/70.1.41.CrossRef

48.

Stone R: The assumptions on which causal inferences rest. J Roy Stat Soc B Met. 1993, 55: 455-466.

49.

A Dictionary of Epidemiology. Edited by: Porta MS. 2008, New York, NY: Oxford University Press, 5

50.

Greenland S, Lash TL, Rothman KJ: Concepts of interaction. Modern Epidemiology. Edited by: Rothman KJ, Greenland S, Lash TL. 2008, Philadelphia, PA: Lippincott Williams & Wilkins, 71-83. 3

51.

Greenland S: Quantifying biases in causal models: classical confounding vs collider-stratification bias. Epidemiology. 2003, 14: 300-306.PubMed

52.

Hernán MA, Cole SR: Invited commentary: causal diagrams and measurement bias. Am J Epidemiol. 2009, 170: 959-962. 10.1093/aje/kwp293.CrossRefPubMedPubMedCentral

53.

Shahar E: Causal diagrams for encoding and evaluation of information bias. J Eval Clin Pract. 2009, 15: 436-440. 10.1111/j.1365-2753.2008.01031.x.CrossRefPubMed

54.

Shahar E, Shahar DJ: On the causal structure of information bias and confounding bias in randomized trials. J Eval Clin Pract. 2009, 15: 1214-1216. 10.1111/j.1365-2753.2009.01347.x.CrossRefPubMed

55.

VanderWeele TJ, Hernán MA: Results on differential and dependent measurement error of the exposure and the outcome using signed directed acyclic graphs. Am J Epidemiol. 2012, 175: 1303-1310. 10.1093/aje/kwr458.CrossRefPubMedPubMedCentral

56.

Savitz DA: Interpreting Epidemiologic Evidence: Strategies for Study Design and Analysis. 2003, New York, NY: Oxford University PressCrossRef

The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2288/13/101/prepub

Titel: A counterfactual approach to bias and effect modification in terms of response types
verfasst von: Etsuji Suzuki
Toshiharu Mitsuhashi
Toshihide Tsuda
Eiji Yamamoto
Publikationsdatum: 01.12.2013
Verlag: BioMed Central
Erschienen in: BMC Medical Research Methodology / Ausgabe 1/2013
Elektronische ISSN: 1471-2288
DOI: https://doi.org/10.1186/1471-2288-13-101

Springer Medizin

Abstract

Background

Methods

Results

Conclusions

Electronic supplementary material

Competing interests

Authors’ contributions

Background

Methods

Definitions and notation

A causal diagram and causal effects

Response types

Four hypothetical situations

Results

Describing data from observational studies in terms of response types

Describing data from randomized controlled trials in terms of response types

Epidemiologic measures in terms of response types

Confounding bias

Selection bias

Modification of epidemiologic measures

Extended causal diagrams integrating response types

Discussion

Conclusion

Authors’ information

Competing interests

Authors’ contributions

Electronic supplementary material

Authors’ original submitted files for images

Weitere Artikel der Ausgabe 1/2013

OvidSP Medline-to-PubMed search filter translation: a methodology for extending search filter range to include PubMed's unique content

How experiences become data: the process of eliciting adverse event, medical history and concomitant medication reports in antimalarial and antiretroviral interaction trials

Annual acknowledgement of manuscript reviewers

Are concomitant treatments confounding factors in randomized controlled trials on intensive blood-glucose control in type 2 diabetes? a systematic review

Effectiveness of Pilates exercise in treating people with chronic low back pain: a systematic review of systematic reviews

Association between funding source, methodological quality and research outcomes in randomized controlled trials of synbiotics, probiotics and prebiotics added to infant formula: A Systematic Review