Introduction

In 1957, an anonymous editorial published in the American Journal of Public Health boldly titled itself, “Prediction and Theory in Epidemiology”, and raised the ante even higher in its subtitle: “‘Where there is no vision the people perish.’ Proverbs XXIX.18” [1]. Written to accompany an article focused on the changing epidemiology of tuberculosis, as it shifted from being a disease of younger to older persons, and also was becoming increasingly “prevalent among the economically underprivileged,” the editorial emphasized the resounding importance of epidemiologic theory for prediction and disease control. Observing that “as a method for advancing knowledge, the analysis of theoretical models has proved fruitful in other fields—vide theoretical physics—and there would appear to be no a priori reason why this should not be equally true in epidemiology,” it optimistically concluded: “As the present interest in the epidemiology of noncommunicable diseases continues to expand there is need for the further development of a theoretical epidemiology” [1, pp.1306-7].

Theory is indeed inextricably linked to vision—traced to its Greek roots, the word “theoria” refers to seeing inwards [2], such that to theorize is to use our mind’s eye systematically, following articulated principles, to discern meaningful patterns among both ideas and observations, and develop causal explanations [3••]. Until the last decade of the 20th c. CE, however, development or analysis of epidemiologic theories of disease distribution in the mainstream English-language literature was a rare event [3••, 4]. A shift is now underway. In this article, guided by the ecosocial theory of disease distribution, I review developments in contemporary use of explicit epidemiologic theories and provide critical suggestions for their further development and application.

Epidemiologic Theory: An Intrinsic Part of Epidemiology

But first: why bother about epidemiologic theory? Simply put, in epidemiology, as in any science, the causal theories it employs are key to its definition and practices [3••, 46]. A corollary is that methods, however vital for testing hypotheses, cannot by themselves define a scientific discipline [3••, 6]; think only of the shared use of randomized trials in the public health, biomedical, and social sciences [7, 8].

Moreover, in the case of epidemiology, two aspects of the discipline’s domain and practice, as a population science, require the use of epidemiologic theory [3••, 5]. The first is epidemiology’s intrinsic interest in elucidating causal explanations of disease rates in populations, not just disease mechanisms in individuals, with these causal explanations necessarily needing to bridge the individual-level and population-level occurrence of disease. The second, linked to this, is every single epidemiological study’s need, no matter what the study design, to grapple with (1) who is eligible to become, and who actually becomes, a study participant, (2) the implications of potential selection bias for affecting the range of exposures and outcomes observed, and (3) the subsequent risk estimates produced.

In both cases, there is no way to get around the need to think about the relationships between—and substantive features of —“individuals” and “populations,” and their implications for disease risk and rates, including their variation by time, place, society, and population group [5]. This thinking about explaining health, disease and well-being at the population level is, in the most basic terms, the stuff of epidemiologic theorizing [3••]. It can no more be eliminated from epidemiology than can the actual or simulated data that fuel epidemiologic research.

Dominant and Alternative Epidemiologic Theories of Disease Distribution: An Early 21st c. CE Appraisal

As I have written about in depth elsewhere [3••, 911], present-day epidemiologic theories of disease distribution belong to one of two groups: dominant (biomedical and lifestyle) and alternative (theories used in social epidemiology, broadly writ). Table 1 lists the key features of the theories within these two groups. Table 2, in turn, presents data on trends in the number of articles explicitly focused on epidemiologic theory, as indexed by the Web of Science [12] and PubMed [13]; also included is a selected list of 21st c. CE books from the global North and South addressing epidemiologic theory (nine US and European, all in English, and six Latin American: three in Spanish, three in Portuguese).

Table 1 Epidemiologic theories of disease distribution: key features
Table 2 Trends in explicit articulation of epidemiologic theories of disease distribution, 1900–2013 (mid-July): (a) scientific articles (Web of Science [12]; PubMed [13]) by (i) theory and (ii) field, and (b) 21st c. CE books (English-language and from Latin American social epidemiology) with explicit analysis of epidemiologic theories of disease distribution

Range of Theories: From Dominant to Alternative

In brief, the dominant theories are biomedical and lifestyle [3••]. Together, they respectively (1) focus on individual-level biological characteristics, exposures, and behaviors, and (2) emphasize individually-oriented medical treatment and behavioral interventions, (3) ignore sociodemographic and contextual variables or else treat them as “nuisance” variables whose effects can be controlled through statistical adjustment, and (4) conceptualize population rates as simply the aggregate manifestation of individual-level phenomena [3••, 4, 14, 15]. Dominant metaphors portray the body as a machine and/or output of a genetic program, and “choice” as simply the taste of individual “consumers,” independent of context or constraints. Disease rates and distributions are thus conceptualized and analyzed as a consequence of ostensibly individual-level biological phenomena and freely-made choices, including those affecting the prenatal milieu, with history reduced to a question of individual biological development. Differentials in disease rates by race/ethnicity and gender (for the majority of conditions that occur in both biological females and males) are portrayed as primarily reflecting innate biology and cultural preferences, with difference emphasized over similarity. Multiple causation is addressed through the invocation of a “web of causation” [16], which in effect is “spiderless” [9], and with the causal theory behind what and who appears in the web left unstated [9, 14, 17].

By contrast, the alternatives comprise the diverse array of epidemiologic theories of disease distribution that populate the sub-field of social epidemiology (Table 1). Although differing in their emphases, they nevertheless all (1) are explicit about their theoretical premises, and (2) posit that socially-structured exposures, ranging from macroeconomics to psychosocial stress, drive population patterns of health [3••, 911, 17]. While it is beyond the scope of this review to offer a nuanced analysis of each alternative theory listed, they can be usefully subdivided into three groups: sociopolitical, psychosocial, and ecosocial [3••].

The sociopolitical theories, although varied in their approach to social analysis, share the common thread of analyzing disease distribution primarily in relation to power, politics, economics, and rights; elucidating biological pathways is a secondary concern. Examples include the social production of disease, political economy of health, Latin American social medicine (LASM), health and human rights, and fundamental cause. The first three, which gained their names in the 1970s [3••], together have roots that extend back to the mid-1800 s (e.g., in the writings of Engels [18], Virchow [19], and Villermé [20]); although applicable to any type of society, they primarily focus on adverse the health consequences of capitalism, mindful of who benefits from and who bears the burden of economies dedicated to maximizing private profit [3••, 911, 1721, 22, 23••]. The health and human rights approach, developed in the mid-1990s in the wake of the HIV/AIDs epidemic, in turn calls attention to the health impacts of who is engaged in upholding and extending, versus violating and restricting, the five sets of indivisible human rights (economic, social, political, civil, and cultural) articulated in the United Nation’s 1948 Universal Declaration of Human Rights [3••, 24]. “Fundamental cause,” first proposed in 1998, emphasizes the socially-shaped capacity of people to use material and social resources flexibly to improve or protect their health, to the extent that knowledge and technology permits; the “surface” causes (i.e., disease mechanisms) put into play by these fundamental causes are accorded less importance [3••, 25].

The psychosocial theories, by contrast, tend to emphasize people’s psychological perceptions of—and their health-damaging or health-enhancing responses to—social conditions, social interactions, and social status [3••, 26]. Grounded in the early 20th c. CE work on stress response by Cannon [27] and Seyle [28], a major emphasis is on the body’s capacity to regulate biological responses to stress (as per the construct of “allostatic load” [29]); although attention is given to how the perception of and response to stressors are affected by the presence and absence of social and material resources, analysis of the sociopolitical determinants of these resources is typically absent.

The ecosocial theory of disease distribution, which I first proposed in 1994 and have developed since [3••, 5, 911, 3032], shares features of the handful of other recent social epidemiologic theories that engage explicitly and literally with ecology (i.e, not ecology as metaphor or as an abstract stand-in for thinking about levels and cross-level interactions and causation, but actual dynamic ecosystems), and with these other theories, including eco-epidemiology (proposed in 1996 [33]) and critical epidemiology (referring to its 21st c. CE articulation as a specific theory [34], as opposed to its prior use in Latin America as equivalent to “social epidemiology” [17]). Additional important shared features include a concern with history, in relation to not only multilevel societal determinants and trends in population health, but also to the intellectual history of, and conflicts between, diverse epidemiologic theories and the accountability of epidemiologists for the frameworks we employ. Some differences include their attention to the role of overt politics and political movements in shaping societal health (accorded less attention by eco-epidemiology) and to biological mechanisms of pathogenesis (accorded less attention by critical epidemiology). Ecosocial theory, in particular, pays heed to societal and ecologic context, to lifecourse and historical generation, to levels of analysis, and to interrelationships between diverse forms of social inequality, including racism, class, and gender. A central focus is on “embodiment,” referring to how we literally embody, biologically, our lived experience, in societal and ecological context, thereby creating population patterns of health and disease; another is on “accountability and agency,” both for social inequalities in health and for ways they are—or are not—monitored, analyzed, and addressed [3••, 5, 911, 3032].

As should be evident, although the dominant and alternative theories certainly can and do overlap in their recognition of specific exposures (e.g., viruses, diet, smoking) and also in their use of constructs (e.g., lifecourse, etiologic period), they nevertheless differ profoundly in how they theorize the relevant causal pathways. It is one thing, after all, to agree that HIV causes HIV/AIDS; it is another entirely to explain who, in which societies and which population groups, is most at risk of becoming infected and developing the disease, including via what pathways of infection and at what point in the lifecourse (e.g., as an infant, adolescent, or adult) [3••, 24, 35].

Trends: Dominant and Alternative Epidemiologic Theories

Before reviewing recent trends in the explicit use of the epidemiologic theories, it is important to underscore just how dominant the dominant approaches are. Figure 1 offers an illustration. Thus, whether evaluated in terms of grants awarded by the US National Institutes of Health [36], or by articles included in PubMed [13], scientific work indexed by the terms “biomedical OR lifestyle”—especially with a genetic focus—completely overshadows work indexed by “social epidemiology OR health disparities”. For grants actively funded as of mid-July 2013, the ratio was on the order of 230:1; for articles, it was on the order of 410:1.

Fig. 1
figure 1

Scientific prominence: active NIH funding (as of July 2013 [36]) and article citations in PubMed (for past 10 years, 2003–July 2013 [13]) for (a) dominant theories (biomedical and lifestyle), and (b) social epidemiologic alternatives

Although the mapping of index words to underlying theory might not be 1:1, nevertheless, work indexed by “biomedical” or “lifestyle” is unlikely to be informed by social epidemiologic theories of disease distribution, and vice versa. Relatedly, the extent of misclassification required to dilute the ratios reported above would need to be enormous. Thus, Fig. 1 offers good grounds for inferring that despite the recent growth of alternative theories of disease distribution, as reviewed below, their influence on the actual funding and products of present-day epidemiologic research is, at best, modest.

Common Trends Across all Theories

One of the more notable trends in Table 2, cutting across all theories, is the pronounced rise, beginning in the 1990s and accelerating in the 21st c. CE, of conceptual articles about epidemiologic theory. Indeed, for more than half of the different alternative social epidemiology theories listed, the number of such articles published between 2010 and July 2013 equals or exceeds the number appearing between 2000 and 2009. When interpreting Table 2, it is important to remember that the different rows are not mutually exclusive, because any given article might be indexed by one or more theories (see Table 2 footnote on the search strategy); even so, how often a specific theory is indexed is informative. Also of note is the increase in use of the term “epidemiologic theory of disease distribution,” with 16 (70 %) of the 23 articles indexed by this phrase being published since 2000.

Biomedical and Lifestyle

A naïve prediction would be that, on account of their dominance, biomedical and lifestyle articles would garner the lion’s share of citations for articles focused on epidemiologic theory. Table 2 shows this is not the case. The most plausible explanation is that, given their hegemony, the premises of these theories are so ingrained that they need not be the objects of intellectual analysis; they are “simply” the water in which we swim [3••, 14]. Underscoring this point, of the 19 biomedical articles indexed, fully 10 negatively critiqued the “biomedical model,” seven offered a mixed appraisal, and only two were positive.

Social Epidemiology Alternatives

Among the social epidemiology theories, the most commonly indexed were the ecologically-oriented theories (n = 55), followed by the sociopolitical theories (n = 44); fewer were devoted to the psychosocial theories (n = 26). The most frequently indexed specific theory was ecosocial theory (n = 30).

Additional Relevant (or Potentially Relevant) Theories

Also starting in the 1990s, a growing number of articles that explicitly focused on epidemiologic theory were indexed by additional theoretical constructs or by theories external to epidemiology, as shown in Table 2. For each construct and theory listed, in the following text I provide solely one citation (from among the articles included) to serve as an illustration and also as a gateway to the broader literature.

Two commonly employed constructs, sometimes treated as theories, pertain to the epidemiologic transition (n = 28) [37] and lifecourse (n = 19) [38]. As is true for theoretical constructs employed by different theories, how each is elaborated depends on the underlying theory used, e.g., for lifecourse, the discussion of prenatal milieu in relation to “lifestyle choices” (dominant approach) versus ways of living that are facilitated or constrained by, say, racial privilege or discrimination (alternative approach) [3••].

The most common sets of external theories brought into articles on epidemiologic theory were from evolutionary biology (n = 21) [39] and ecology (n = 20) [40]. Next most frequent were articles drawing on theories pertaining to genetics or epigenetics (n = 14) [41] and development (n = 13) [42]. More general scientific theories that were invoked included theories pertaining to complexity or system sciences (n = 10) [43] and to critical realism, albeit less often (n = 4) [44]. Among theories originating in the social sciences, those most frequently deployed with epidemiologic theory were feminist theories (n = 10) [45] and theories of justice (n = 10) [46], followed by critical race theory (n = 5) [47] and intersectionality, a framework arising out of black feminist theory that emphasizes connections between different forms of inequitable social relations (e.g., involving racism, sexism, sexuality, and social class; n = 2) [48]; no articles on epidemiologic theory, however, were indexed by queer theory [49]. Several conceptual papers focused on epidemiologic theory in relation to theoretical systems of indigenous knowledge (n = 6) [50].

Prominence by Field

Considered in relation to field, as also shown in Table 2, a large jump in the 1990s occurred in articles on epidemiologic theory also indexed by “social epidemiology” [51] or by disciplines in the social sciences (overall and by each classic social science) [52]. A similar rise occurred in the 2000s for “population health” [53] and “social determinants of health” [54]. By mid-2013, the respective number of such articles for these four fields equaled 122, 121, 50 and 52. A growing number of books in these fields have also begun to address epidemiologic theory (see Table 2), with the number published since 2000 equaling, if not exceeding, the number of such books published in the prior 50 years [3••, 55].

Interpretation of Trends and Implications

The late 20th c. CE rise of conceptual articles focused on epidemiologic theory, whether in articles or books, is evident across all the theories listed, despite being most prominent among the alternative theories associated with social epidemiology. A reasonable question to ask is: why this trend now?

Four likely reasons exist, two external and one internal to epidemiology as a discipline, and a fourth that bridges these three. The first, and probably most influential, pertains to growing global attention to links between society and health, as reflected by the work of the WHO Global Commission on the Social Determinants of Health [56, 57] and involved in myriad political and policy debates about economic growth, poverty, sustainability, climate change, the Millennium Development Goals, and the framework of “Health in All Policies” [58]. Population distributions of health and their determinants, and hence epidemiologic theory, are central to all of these discussions. Second, in parallel, there is a rising call across the natural and social sciences and the humanities for interdisciplinary, multidisciplinary, and transdisciplinary research—and such work requires, among other things, a clear appraisal of the distinctions and links between the core theories that define these diverse disciplines [3••, 59].

The third reason, internal to epidemiology, is methodological, and arises from developments in the use of explicit causal models (e.g., DAGs) and also analytic techniques such as instrumental variables [6063]. As increasingly recognized, the ultimate utility of such models and approaches hinges on the quality of reasoning as to the variables included, which in turn depends on the underlying epidemiologic theory leading to specification of the variables and how they might be causally connected [3••, 60, 63].

Finally, the fourth reason, bridging from external into internal, concerns the very basic issue of who does epidemiology. Until well into the 1980s, most students receiving academic training in the USA in epidemiology had, by design or preference, a prior medical degree, such that well into the 1990s, most leaders in the field were imbued with biomedical training; similar trends are apparent in other countries [3••, 64, 65]. The subsequent rise of epidemiologists with training in fields outside of medicine, including the social sciences (in which explicit attention to theory is the norm), thus has made possible a new generation of epidemiologists potentially more versed in—or at least open to—the use of epidemiologic theory. Also likely contributing are post-1980 gains, linked to the long overdue rise of affirmative action programs in the late 1960s and 1970s, in the admission of students from groups previously underrepresented in academic epidemiology (e.g., people of color in the USA, people from the global South, Indigenous people in both the global North and South, along with women in any country context) [3••]; also germane for Latin America is the rise of academics and other health professionals connected to Latin American social medicine who no longer are held back by repressive dictatorships [17, 21, 22, 23••, 66]. Science, after all, is done by real people in real times; as usual, context—and critical mass—matters.

If correct, the four proposed explanations for the recent rise in the explicit analysis and use of epidemiologic theory augur well for the trend to continue. Nothing, however, is guaranteed.

Conclusion: Next Steps for Developing Epidemiologic Theories of Disease Distribution

Consequently, conscious steps are required to develop epidemiologic theories of disease distribution in the 21st c. CE. Needed are both (a) theoretical writings that further flesh out the kinds of causal explanations and predictions these theories envision and how their constructs can be operationalized, and (b) empirical studies, using apt methods, to test the hypotheses generated.

In advocating greater testing of epidemiologic theories in relation to the hypothesis they generate and predictions they make, a caveat is in order. It is one thing to reject a specific hypothesis (assuming the study results are valid); it is another entirely to reject an entire theory. Two considerations are at play. The first, as elegantly argued by Lieberson [67], pertains to the extent to which any given empirical study can test all the conditions under which the theory’s particular hypothesis may or may not hold—with the short answer being, especially in analyses involving complex systems, it cannot [6, 67]. The simplified scenario of any given theory triumphantly being rejected on the basis of one apparently refuted hypothesis, as per the Popperian approach advocated in modern epidemiology texts [68], belies the complexity of scientific theorizing, empirical research, and its interpretation [6, 69]. Relatedly, Kunitz’s recent useful critique of general theories in population health that are inconsistent with empirically observed particular realities [70], requires the further recognition that conflicting particular realities can in fact be consonant with general theories, as long these theories wrestle with historical contingency and the conditions under which their predictions will and will not hold.

More profoundly, exemplified by Longino’s new philosophical work examining scientific pluralism in relation to the scientific study of human behavior [71••], including behaviors that affect health, is a growing appreciation that different theories not only produce partial knowledge but can also lead to incommensurable findings. Whereas partial knowledge can arise because each theory poses questions at different levels (e.g., micro versus macro level phenomena), incommensurable knowledge can result from competing theories differently parsing the relevant “causal space” within the same level(s) [71••, pp.126–129]. For example, as Longino observes, to the extent that “genetic” and “environmental” explanations “apportion variance between genetic and environmental factors, each must put uterine effect into what is for it the alternative category,” such that “the values assignable to G [genetic] or E [environmental] will vary depending on what is included in those categories.” Hence, “correctness must be relativized to the initial parsing of the causal space” [71••, pp. 147–148].

Beyond this, Longino calls attention to a key insight fundamental to epidemiologic theory and inquiry: determinants of differences between populations and variation within populations may often differ [71••, pp.136–137], such that, as Rose iconically argued in his classic 1985 essay “Sick Individuals, Sick Population” [72], asking why populations vary in their rates of disease is not equivalent to asking who is at high risk of disease within a given population [3••, 5, 53]. These distinctions and questions in turn hinge on conceptualizing who and what are “populations” [5]—a complex and contested issue that goes to the heart of epidemiology and kindred disciplines in the population sciences and their respective theories.

What additional steps might help epidemiologic theory to blossom? I’ll propose a few. The first, echoing calls of Latin American epidemiologists [17, 21, 22, 23••], is that there needs to be more conscious global exchange, discussion, and debate about epidemiologic theory. Leading English-language epidemiology textbooks, for example, continue to include little mention of epidemiologic theories [3••, 55], even as such theory is more prominent in diverse Latin American epidemiology textbooks [23, 73]. One welcome sign is that what until now has been the only epidemiologic meeting pulling together the different epidemiology organizations in the USA—the North American Congress of Epidemiology—is en route to becoming, in 2016, the Epidemiology Congress of the Americas, involving epidemiology associations in both North and South America [74]. This expansion builds, in part, upon work done for the 2011 Congress and via related workshops to build better regional ties [75, 76], and also reflects the fast-growing number of epidemiologists in Latin America [77]. Any such dialogue, however, of course must involve more than just epidemiologists of the global North and Latin America. There are likely important insights to be gained, for example, from aspects of more ecologically-attuned and integrative frameworks, as per the classic Hippocratic emphasis on “Airs, Waters, Places” and related forms of “traditional” medicine [3••, 78••], and also from diverse systems of Indigenous knowledge that currently are informing epidemiologic analyses [3••, 34, 50, 79, 80].

Imagine, moreover, if journals that publish epidemiologic research permitted articles that are explicit about theory an additional 250 words and five references regarding their use of theory, as well as a bonus increase of 5 % in total words permitted. Imagine, too, if one of the explicit criteria for grant reviews included the evaluation of explicit use of epidemiologic theory. Both steps, I venture, would increase the quality of epidemiologic thinking and research, as well as provide an incentive for teaching courses on epidemiologic theory. Together, they might make “got theory?” as common a query in epidemiology as the “got milk?” of the iconic US ad campaign [8183].

In 1995, Tony McMichael warned fellow epidemiologists of the dangers of being “prisoners of the proximate” and called for greater use of what he termed “a social-ecologic systems perspective,” including forecasts regarding possible impacts of “large-scale social and environmental changes,” such as global climate change [84]. The proverb had it right: without vision, the people will perish [1]. The need for epidemiologic theory for understanding and prediction, and for testing causal explanations using rigorous methods and measures, is greater than ever, to improve the odds of generating knowledge that promotes health equity and enables the people’s health – and our planet – to flourish.