Regression discontinuity designs: A guide to practice

doi:10.1016/j.jeconom.2007.05.001

Journal of Econometrics

Volume 142, Issue 2, February 2008, Pages 615-635

https://doi.org/10.1016/j.jeconom.2007.05.001

Abstract

In regression discontinuity (RD) designs for evaluating causal effects of interventions, assignment to a treatment is determined at least partly by the value of an observed covariate lying on either side of a fixed threshold. These designs were first introduced in the evaluation literature by Thistlethwaite and Campbell [1960. Regression-discontinuity analysis: an alternative to the ex-post facto experiment. Journal of Educational Psychology 51, 309–317]. With the exception of a few unpublished theoretical papers, these methods did not attract much attention in the economics literature until recently. Starting in the late 1990s, there has been a large number of studies in economics applying and extending RD methods. In this paper we review some of the practical and theoretical issues in implementation of RD methods.

Introduction

Since the late 1990s there has been a large number of studies in economics applying and extending regression discontinuity (RD) methods, including Van Der Klaauw (2002), Black (1999), Angrist and Lavy (1999), Lee (2007), Chay and Greenstone (2005), DiNardo and Lee (2004), Chay et al. (2005), and Card et al. (2006). Key theoretical and conceptual contributions include the interpretation of estimates for fuzzy regression discontinuity (FRD) designs allowing for general heterogeneity of treatment effects (Hahn et al., 2001; hereafter HTV), adaptive estimation methods (Sun, 2005), specific methods for choosing bandwidths (Ludwig and Miller, 2005), and various tests for discontinuities in means and distributions of non-affected variables (Lee, 2007; McCrary, 2007).

In this paper, we review some of the practical issues in implementation of RD methods. There is relatively little novel in this discussion. Our general goal is instead to address practical issues in implementing RD designs and review some of the new theoretical developments.

After reviewing some basic concepts in Section 2, the paper focuses on five specific issues in the implementation of RD designs. In Section 3 we stress graphical analyses as powerful methods for illustrating the design. In Section 4 we discuss estimation and suggest using local linear regression methods using only the observations close to the discontinuity point. In Section 5 we propose choosing the bandwidth using cross-validation. In Section 6 we provide a simple plug-in estimator for the asymptotic variance and a second estimator that exploits the link with instrumental variable methods derived by HTV. In Section 7 we discuss a number of specification tests and sensitivity analyses based on tests for (a) discontinuities in the average values for covariates, (b) discontinuities in the conditional density of the forcing variable, as suggested by McCrary, and (c) discontinuities in the average outcome at other values of the forcing variable.
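As a rough illustration of the local linear approach mentioned for Section 4, the following Python sketch estimates a sharp RD effect by fitting separate linear regressions on each side of the cutoff, using only observations within a bandwidth `h` (a rectangular kernel). The function name `local_linear_srd`, the simulated data, and the chosen bandwidth are assumptions made for illustration; this is not the authors' code.

```python
import numpy as np

def local_linear_srd(x, y, cutoff, h):
    """Sharp RD sketch: difference of two local linear fits at the cutoff.

    Rectangular kernel: only observations within h of the cutoff are used.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)

    def intercept_at_cutoff(mask):
        # Regress y on (1, x - cutoff); the intercept is the fit at the cutoff.
        design = np.column_stack([np.ones(mask.sum()), x[mask] - cutoff])
        coef, *_ = np.linalg.lstsq(design, y[mask], rcond=None)
        return coef[0]

    left = (x >= cutoff - h) & (x < cutoff)
    right = (x >= cutoff) & (x <= cutoff + h)
    return intercept_at_cutoff(right) - intercept_at_cutoff(left)

# Hypothetical simulated data with a true jump of 2.0 at a cutoff of 0.
rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, 2000)
y = 1.0 + 0.5 * x + 2.0 * (x >= 0) + rng.normal(0, 0.1, x.size)
tau_hat = local_linear_srd(x, y, cutoff=0.0, h=0.3)
```

Here `tau_hat` should land close to the simulated jump of 2.0; in applied work the result should be checked across several bandwidths.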

Section snippets

Basics

Our discussion will frame the RD design in the context of the modern literature on causal effects and treatment effects, using the Rubin Causal Model (RCM) set up with potential outcomes (Rubin, 1974, Holland, 1986, Imbens and Rubin, 2007), rather than the regression framework that was originally used in this literature. For a general discussion of the RCM and its use in the economic literature, see the survey by Imbens and Wooldridge (2007).

In the basic setting for the RCM (and for the RD

Nonparametric regression at the boundary

The practical estimation of the treatment effect $\tau$ in both the SRD and FRD designs is largely a standard nonparametric regression problem (e.g., Pagan and Ullah, 1999, Härdle, 1990, Li and Racine, 2007). However, there are two unusual features. In this case we are interested in the regression function at a single point, and in addition that single point is a boundary point. As a result, standard nonparametric kernel regression does not work very well. At boundary points, such estimators have a
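The boundary problem can be seen in a small noiseless example. The Python sketch below compares a local constant (rectangular kernel average) estimate with a local linear estimate of the regression function at a boundary point, using only observations to the right of that point. The data, bandwidth, and function names are hypothetical choices for illustration.

```python
import numpy as np

def kernel_mean_at_boundary(x, y, c, h):
    """Local constant estimate at c: average of y over c <= x <= c + h."""
    mask = (x >= c) & (x <= c + h)
    return y[mask].mean()

def local_linear_at_boundary(x, y, c, h):
    """Local linear estimate at c: intercept of a line fit on c <= x <= c + h."""
    mask = (x >= c) & (x <= c + h)
    slope, intercept = np.polyfit(x[mask] - c, y[mask], deg=1)
    return intercept

# Noiseless linear regression function with slope 2; the true value at 0 is 0.
x = np.linspace(0.0, 1.0, 1001)
y = 2.0 * x
nw = kernel_mean_at_boundary(x, y, 0.0, 0.2)
ll = local_linear_at_boundary(x, y, 0.0, 0.2)
```

With a slope of 2 and a bandwidth of 0.2, the kernel average `nw` is off by roughly slope times half the bandwidth (about 0.2), while the local linear fit `ll` recovers the boundary value up to rounding error.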

Bandwidth selection

An important issue in practice is the selection of the smoothing parameter, the bandwidth $h$. In general there are two approaches to choosing bandwidths. A first approach consists of characterizing the optimal bandwidth in terms of the unknown joint distribution of all variables. The relevant components of this distribution can then be estimated, and plugged into the optimal bandwidth function. The second approach, on which we focus here, is based on a cross-validation procedure. The specific
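The snippet is cut off before the procedure is specified, so the Python sketch below shows only one plausible cross-validation variant, not necessarily the authors' criterion. It assumes that each observation is predicted by a local linear fit that uses only observations on the far side of it from the cutoff, so that every prediction is made at a boundary point, as in the RD problem itself. All function names, the simulated data, and the bandwidth grid are hypothetical.

```python
import numpy as np

def one_sided_fit(x, y, x0, h, side):
    """Local linear prediction at x0 from points strictly on one side, within h."""
    if side == "left":
        mask = (x < x0) & (x > x0 - h)
    else:
        mask = (x > x0) & (x < x0 + h)
    if mask.sum() < 2 or np.ptp(x[mask]) == 0:
        return np.nan  # not enough points to fit a line
    slope, intercept = np.polyfit(x[mask] - x0, y[mask], deg=1)
    return intercept

def cv_criterion(x, y, cutoff, h):
    """Mean squared prediction error; fits never use data across the cutoff."""
    preds = np.array([
        one_sided_fit(x, y, xi, h, "left" if xi < cutoff else "right")
        for xi in x
    ])
    return np.nanmean((y - preds) ** 2)

def choose_bandwidth(x, y, cutoff, grid):
    """Return the grid value with the smallest criterion, plus all scores."""
    scores = [cv_criterion(x, y, cutoff, h) for h in grid]
    return grid[int(np.nanargmin(scores))], scores

# Hypothetical simulated data with curvature and a jump at 0.
rng = np.random.default_rng(1)
x = rng.uniform(-1, 1, 400)
y = np.sin(3 * x) + 1.0 * (x >= 0) + rng.normal(0, 0.2, x.size)
grid = [0.1, 0.2, 0.4, 0.8]
h_star, scores = choose_bandwidth(x, y, 0.0, grid)
```

Observations without enough neighbours on the relevant side are dropped from the criterion here; a practical implementation might instead restrict the criterion to observations near the cutoff.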

Inference

We now discuss some asymptotic properties for the estimator for the FRD case given in (4.7) or its alternative representation in (4.9). More general results are given in HTV. We continue to make some
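Equations (4.7) and (4.9) are not reproduced in this excerpt. For orientation only, the standard FRD estimand can be written as a ratio of two discontinuities, which is what links it to instrumental variable methods. The notation below ($Y_i$ for the outcome, $W_i$ for the treatment indicator, $X_i$ for the forcing variable, $c$ for the cutoff) is assumed here and may differ from the paper's.

```latex
\tau_{\mathrm{FRD}}
  = \frac{\lim_{x \downarrow c} \mathbb{E}[Y_i \mid X_i = x]
          - \lim_{x \uparrow c} \mathbb{E}[Y_i \mid X_i = x]}
         {\lim_{x \downarrow c} \mathbb{E}[W_i \mid X_i = x]
          - \lim_{x \uparrow c} \mathbb{E}[W_i \mid X_i = x]}
```

In a sharp design the denominator equals one, because the probability of treatment jumps from zero to one at the cutoff, and the expression reduces to the jump in the outcome regression.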

Specification testing

There are generally two main conceptual concerns in the application of RD designs, sharp or fuzzy. A first concern about RD designs is the possibility of other changes at the same cutoff value of the covariate. Such changes may affect the outcome, and these effects may be attributed erroneously to the treatment of interest. For example, at age 65 individuals become eligible for discounts at many cultural institutions. However, if one finds that there is a discontinuity in the number of hours

Conclusion: a summary guide to practice

In this paper, we reviewed the literature on RD designs and discussed the implications for applied researchers interested in implementing these methods. We end the paper by providing a summary guide of steps to be followed when implementing RD designs. We start with the case of SRD, and then add a number of details specific to the case of FRD.

Case 1: SRD designs

1. Graph the data (Section 3) by computing the average value of the outcome variable over a set of bins. The binwidth has to be large
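A minimal Python sketch of the binning in step 1, under the assumption that bins have equal width and that the cutoff is a bin edge so that no bin mixes observations from both sides. The function name `binned_means` and the toy data are hypothetical.

```python
import numpy as np

def binned_means(x, y, cutoff, binwidth, n_bins_each_side):
    """Average y within equal-width bins; the cutoff is always a bin edge."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    edges = cutoff + binwidth * np.arange(-n_bins_each_side, n_bins_each_side + 1)
    idx = np.digitize(x, edges) - 1  # bin k covers [edges[k], edges[k + 1])
    n_bins = len(edges) - 1
    mids = (edges[:-1] + edges[1:]) / 2
    means = np.full(n_bins, np.nan)
    counts = np.zeros(n_bins, dtype=int)
    for k in range(n_bins):
        in_bin = idx == k
        counts[k] = in_bin.sum()
        if counts[k] > 0:
            means[k] = y[in_bin].mean()
    return mids, means, counts

# Toy data: two bins of width 0.1 on each side of a cutoff at 0.
x = np.array([-0.15, -0.05, 0.0, 0.05, 0.15])
y = np.array([1.0, 2.0, 10.0, 12.0, 20.0])
mids, means, counts = binned_means(x, y, cutoff=0.0, binwidth=0.1, n_bins_each_side=2)
```

Plotting `means` against `mids` gives the kind of graph described in step 1, and `counts` shows how many observations support each point.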

Acknowledgments

We are grateful for discussions with David Card and Wilbert Van Der Klaauw. Financial support for this research was generously provided through NSF Grant SES 0452590 and the SSHRC of Canada.

References (42)

W. Trochim. Regression-discontinuity design.
J.D. Angrist et al. Does compulsory school attendance affect schooling and earnings? Quarterly Journal of Economics (1991).
J.D. Angrist et al. Using Maimonides’ rule to estimate the effect of class size on scholastic achievement. Quarterly Journal of Economics (1999).
J.D. Angrist et al. Identification of causal effects using instrumental variables. Journal of the American Statistical Association (1996).
Battistin, E., Rettore, E., 2007. Ineligibles and eligible non-participants as a double comparison group in...
S. Black. Do better schools matter? Parental valuation of elementary education. Quarterly Journal of Economics (1999).
Card, D., Dobkin, C., Maestas, N., 2004. The impact of nearly universal insurance coverage on health care utilization...
Card, D., Mas, A., Rothstein, J., 2006. Tipping and the dynamics of segregation in neighborhoods and schools....
K. Chay et al. Does air quality matter? Evidence from the housing market. Journal of Political Economy (2005).
K. Chay et al. The central role of noise in evaluating interventions that use test scores to rank schools. American Economic Review (2005).
J. DiNardo et al. Economic impacts of new unionization on private sector employers: 1984–2001. Quarterly Journal of Economics (2004).
J. Fan et al. Local Polynomial Modelling and its Applications (1996).
Hahn, J., Todd, P., Van Der Klaauw, W., 1999. Evaluating the effect of an anti discrimination law using a...
J. Hahn et al. Identification and estimation of treatment effects with a regression discontinuity design. Econometrica (2001).
W. Härdle. Applied Nonparametric Regression (1990).
J.J. Heckman et al. Alternative methods for evaluating the impact of training programs (with discussion). Journal of the American Statistical Association (1989).
P. Holland. Statistics and causal inference (with discussion). Journal of the American Statistical Association (1986).
G. Imbens. Nonparametric estimation of average treatment effects under exogeneity: a review. Review of Economics and Statistics (2004).
G. Imbens et al. Identification and estimation of local average treatment effects. Econometrica (1994).
G. Imbens et al. Causal Inference: Statistical Methods for Estimating Causal Effects in Biomedical, Social, and Behavioral Sciences (2007).
G. Imbens et al. Evaluating the cost of conscription in The Netherlands. Journal of Business and Economic Statistics (1995).

Cited by (2434)

Impact of higher capital buffers on banks’ lending and risk-taking in the short- and medium-term: Evidence from the euro area experiments
2024, Journal of Financial Stability
We study the impact of higher capital buffers on bank lending and risk-taking behaviour, at different time horizons following the initial policy decision. Employing a regression discontinuity design and confidential centralised supervisory data for euro area banks from 2014 to 2017, our research uniquely explores the effects of the EU policy on other systemically important institutions (O-SIIs) through a quasi-randomised experiment, exploiting the induced policy change and discontinuity of the O-SII identification process. Our findings show that the introduction of the O-SII buffers resulted in a short-term reduction in credit supply to households and financial sector, followed by a medium-term shift towards less risky borrowers, particularly in the household sector. We find a temporary cut in loan growth post-capital hikes, succeeded by a rebound in the medium-term. Our results substantiate the hypothesis that higher capital buffers can positively discipline banks by reducing risk-taking in the medium-term. At the same time, evidence suggests a limited adverse impact on the real economy, characterised by a temporary reduction in credit supply restricted to instances of macroprudential policy tightening.
The effect of female leadership on contracting from Capitol Hill to Main Street
2024, Journal of Financial Economics
This paper provides novel evidence that female politicians increase the proportion of US government procurement contracts allocated to women-owned firms. For identification, we use a regression discontinuity design on a sample of mixed-gender elections in the US House of Representatives. The effect grows over a female representative's tenure and concentrates in female representatives who are on powerful congressional committees. Changes in the pool of and behavior by government contractors cannot explain the result. The more gender-balanced representation in government contracting is not associated with economic costs.
Reducing carbon emissions at the expense of firm physical capital investments and growing financialization? Impacts of carbon trading policy from a regression discontinuity design
2024, Journal of Environmental Management
This study examines the effects of China's carbon trading policy on firm emissions and explores its impact mechanisms through financial and physical asset investments. The empirical analysis utilizes a fuzzy regression discontinuity design based on a sample of 427 industrial firms in China between 2014 and 2019. The results indicate that China's carbon trading policy incentivized firms to increase their financial investments while simultaneously discouraging physical capital investments. These shifts in investment patterns helped firms achieve their emission reduction targets. The study reveals that carbon trading policy in China has contributed to the financialization of firms, resulting in the erosion of firm assets and a decline in their overall competitiveness. Based on these findings, some policy recommendations are put forward.
Fertility responses to cash transfers in Uruguay
2024, World Development Perspectives
Conditional cash transfer (CCT) programs have been the most used tool to reduce poverty and inequality in developing countries in the last decades. In addition to the objectives pursued by these programs, it has been shown that they can have unintended effects on different dimensions. Particularly, they can have an impact on fertility due to an increase in the household's income. This paper examines the relationship between non-labor income and women's childbearing behavior in a developing country. The assignment mechanism of the Uruguayan cash transfer program Asignaciones Familiares – Plan de Equidad (AFAM-PE) alters non-labor incomes across the applicant’s households. I estimate the impact of this program on women's fertility and teenage pregnancy. The identification strategy exploits the discontinuity present in the program eligibility criteria. I combined longitudinal vital statistics provided by the Ministry of Public Health and administrative data to assemble a panel of AFAM-PE applicants aged between 15 and 49 (in 2008 and 2009). The study finds no statistically significant impact of AFAM-PE on fertility rates or teenage pregnancy. These results are robust to different specifications and women samples. This provides evidence against the idea that transfer programs targeting disadvantaged individuals generate a direct effect on fertility.
The impact of subsidies on house prices in Mexico's mortgage market for low-income households 2008–2019
2024, Journal of Housing Economics
We estimate the effect of Mexico's primary house-purchase subsidy program for low-income individuals on house prices between 2008 and 2019, using administrative records from Infonavit, the nation's largest mortgage originator. We employ a fuzzy regression discontinuity design that leverages the existence of a threshold on the borrower's income that determined access to the subsidy program to identify the effect on house prices. Our estimations yield statistically significant evidence that the subsidy led to an average increase in house prices of 863 US dollars for the program participants at the threshold during those years. This effect represents 28.9 % of the average subsidy amount and 5.4 % of the average house price. The estimations control for individual, house, and location characteristics. Furthermore, we find evidence that when an intermediary is involved in the mortgage application process, there is a statistically significant price difference of 867 dollars for subsidy recipients. On the contrary, this impact disappears when no external broker is involved. These intermediaries are primarily real estate developers that build and sell the houses associated with the mortgages. These findings shed light on how market structure could have nonnegligible impacts on equilibrium outcomes and on the welfare effects of economic policy.
Sibling spillovers and the choice to get vaccinated: Evidence from a regression discontinuity design
2024, Journal of Health Economics
We investigate the effects of introducing population-wide free-of-charge Human Papillomavirus (HPV) vaccination programs on the targeted adolescent cohorts and their siblings. For identification, we rely on regression discontinuity designs and high-quality Danish administrative data to exploit that date of birth determines program eligibility. We find that the programs increased the HPV vaccine take-up of both the targeted children (53.2 percentage points for girls and 36.0 percentage points for boys) and their older same-sex siblings (4.5 percentage points for sisters and 3.5 percentage points for brothers). We show that while the direct effects of the programs reduced HPV vaccine take-up inequality, the spillover effects, in contrast, contributed to an increase in vaccine take-up inequality highlighting the potential importance of spillover effects in the determination of distributional consequences of public health programs. Finally, we find some evidence of cross-vaccine spillovers.
