CREAT - Census Bureau

Estimation and Inference in Regression Discontinuity Designs with Clustered Sampling

August 2015

Written by: Quentin Brummet, Otávio Bartalotti

Working Paper Number:

carra-2015-06

Abstract

Regression Discontinuity (RD) designs have become popular in empirical studies due to their attractive properties for estimating causal effects under transparent assumptions. Nonetheless, most popular procedures assume i.i.d. data, which is not reasonable in many common applications. To relax this assumption, we derive the properties of traditional non-parametric estimators in a setting that incorporates potential clustering at the level of the running variable, and propose an accompanying optimal-MSE bandwidth selection rule. Simulation results demonstrate that falsely assuming data are i.i.d. when selecting the bandwidth may lead to the choice of bandwidths that are too small relative to the optimal-MSE bandwidth. Last, we apply our procedure using person-level microdata that exhibits clustering at the census tract level to analyze the impact of the Low-Income Housing Tax Credit program on neighborhood characteristics and low-income housing supply.

Document Tags and Keywords

Keywords:

estimation, econometric, estimating, data census, microdata, estimator, regression, cluster, metropolitan, subsidy, disadvantaged, housing, neighborhood, inference, household income, income housing

Tags:

Department of Economics, Journal of Economic Literature, United States Census Bureau, Census 2000, Center for Administrative Records Research

Similar Working Papers

The 10 most similar working papers to the working paper 'Estimation and Inference in Regression Discontinuity Designs with Clustered Sampling' are listed below in order of similarity.

Working Paper
🔥

Interactions, Neighborhood Selection, and Housing Demand

August 2002

Authors: Yannis M Ioannides, Jeffrey Zabel

Working Paper Number:

CES-02-19

This paper contributes to the growing literature that identifies and measures the impact of social context on individual economic behavior. We develop a model of housing demand with neighborhood e'ects and neighborhood choice. Modelling neighborhood choice is of fundamental importance in estimating and understanding endogenous and exogenous neighborhood effects. That is, to obtain unbiased estimates of neighborhood effects, it is necessary to control for non-random sorting into neighborhoods. Estimation of the model exploits a unique data set of household data that has been augmented with contextual information at two di'erent levels ('scales') of aggregation. One is at the neighborhood cluster level, of about ten neighbors, with the data coming from a special sample of the American Housing Survey. A second level is the census tract to which these dwelling units belong. Tract-level data are available in the Summary Tape Files of the decennial Census data. We merge these two data sets by gaining access to confidential data of the U.S. Bureau of the Census. We overcome some limitations of these data by implementing some significant methodological advances in estimating discrete choice models. Our results for the neighborhood choice model indicate that individuals prefer to live near others like themselves. This can perpetuate income inequality since those with the best opportunities at economic success will cluster together. The results for the housing demand equation are similar to those in our earlier work [Ioannides and Zabel (2000] where we find evidence of significant endogenous and contextual neighborhood effects.
View Full Paper PDF
Working Paper
🔥

MISCLASSIFICATION IN BINARY CHOICE MODELS

May 2013

Authors: Bruce Meyer, Nikolas Mittag

Working Paper Number:

CES-13-27

We derive the asymptotic bias from misclassification of the dependent variable in binary choice models. Measurement error is necessarily non-classical in this case, which leads to bias in linear and non-linear models even if only the dependent variable is mismeasured. A Monte Carlo study and an application to food stamp receipt show that the bias formulas are useful to analyze the sensitivity of substantive conclusions, to interpret biased coefficients and imply features of the estimates that are robust to misclassification. Using administrative records linked to survey data as validation data, we examine estimators that are consistent under misclassification. They can improve estimates if their assumptions hold, but can aggravate the problem if the assumptions are invalid. The estimators differ in their robustness to such violations, which can be improved by incorporating additional information. We propose tests for the presence and nature of misclassification that can help to choose an estimator.
View Full Paper PDF
Working Paper

The Effect of Low-Income Housing on Neighborhood Mobility: Evidence from Linked Micro-Data

May 2016

Authors: Quentin Brummet, Otávio Bartalotti

Working Paper Number:

carra-2016-02

While subsidized low-income housing construction provides affordable living conditions for poor households, many observers worry that building low-income housing in poor communities induces individuals to move to poor neighborhoods. We examine this issue using detailed, nationally representative microdata constructed from linked decennial censuses. Our analysis exploits exogenous variation in low-income housing supply induced by program eligibility rules for Low-Income Housing Tax Credits to estimate the effect of subsidized housing on neighborhood mobility patterns. The results indicate little evidence to suggest a causal effect of additional low-income housing construction on the characteristics of neighborhoods to which households move. This result is true for households across the income distribution, and supports the hypothesis that subsidized housing provides affordable living conditions without encouraging households to move to less-affluent neighborhoods than they would have otherwise.
View Full Paper PDF
Working Paper

The Work Disincentive Effects of the Disability Insurance Program in the 1990s

February 2006

Authors: Susan Chen, Wilbert van der Klaauw

Working Paper Number:

CES-06-05

In this paper we evaluate the work disincentive effects of the Disability Insurance program during the 1990s. To accomplish this we construct a new large data set with detailed information on DI application and award decisions and use two different econometric evaluation methods. First, we apply a comparison group approach proposed by John Bound to estimate an upper bound for the work disincentive effect of the current DI program. Second, we adopt a Regression-Discontinuity approach that exploits a particular feature of the DI eligibility determination process to provide a credible point estimate of the impact of the DI program on labor supply for an important subset of DI applicants. Our estimates indicate that during the 1990s the labor force participation rate of DI beneficiaries would have been at most 20 percentage points higher had none received benefits. In addition, we find even smaller labor supply responses for the subset of 'marginal' applicants whose disability determination is based on vocational factors.
View Full Paper PDF
Working Paper

Assessing the Incidence and Efficiency of a Prominent Place Based Policy

February 2011

Authors: Matias Busso, Jesse Gregory, Patrick Kline

Working Paper Number:

CES-11-07

This paper empirically assesses the incidence and efficiency of Round I of the federal urban Empowerment Zone (EZ) program using confidential microdata from the Decennial Census and the Longitudinal Business Database. Using rejected and future applicants to the EZ program as controls, we find that EZ designation substantially increased employment in zone neighborhoods and generated wage increases for local workers without corresponding increases in population or the local cost of living. The results suggest the efficiency costs of first Round EZs were relatively small.
View Full Paper PDF
Working Paper

Revisiting the Effects of Unemployment Insurance Extensions on Unemployment: A Measurement Error-Corrected Regression Discontinuity Approach

March 2016

Authors: Quentin Brummet, Otávio Bartalotti, Steven Dieterle

Working Paper Number:

carra-2016-01

The extension of Unemployment Insurance (UI) benefits was a key policy response to the Great Recession. However, these benefit extensions may have had detrimental labor market effects. While evidence on the individual labor supply response indicates small effects on unemployment, recent work by Hagedorn et al. (2015) uses a county border pair identification strategy to find that the total effects inclusive of effects on labor demand are substantially larger. By focusing on variation within border county pairs, this identification strategy requires counties in the pairs to be similar in terms of unobservable factors. We explore this assumption using an alternative regression discontinuity approach that controls for changes in unobservables by distance to the border. To do so, we must account for measurement error induced by using county-level aggregates. These new results provide no evidence of a large change in unemployment induced by differences in UI generosity across state boundaries. Further analysis suggests that individuals respond to UI benefit differences across boundaries by targeting job search in high-benefit states, thereby raising concerns of treatment spillovers in this setting. Taken together, these two results suggest that the effect of UI benefit extensions on unemployment remains an open question.
View Full Paper PDF
Working Paper

A METHOD OF CORRECTING FOR MISREPORTING APPLIED TO THE FOOD STAMP PROGRAM

May 2013

Authors: Nikolas Mittag

Working Paper Number:

CES-13-28

Survey misreporting is known to be pervasive and bias common statistical analyses. In this paper, I first use administrative data on SNAP receipt and amounts linked to American Community Survey data from New York State to show that survey data can misrepresent the program in important ways. For example, more than 1.4 billion dollars received are not reported in New York State alone. 46 percent of dollars received by house- holds with annual income above the poverty line are not reported in the survey data, while only 19 percent are missing below the poverty line. Standard corrections for measurement error cannot remove these biases. I then develop a method to obtain consistent estimates by combining parameter estimates from the linked data with publicly available data. This conditional density method recovers the correct estimates using public use data only, which solves the problem that access to linked administrative data is usually restricted. I examine the degree to which this approach can be used to extrapolate across time and geography, in order to solve the problem that validation data is often based on a convenience sample. I present evidence from within New York State that the extent of heterogeneity is small enough to make extrapolation work well across both time and geography. Extrapolation to the entire U.S. yields substantive differences to survey data and reduces deviations from official aggregates by a factor of 4 to 9 compared to survey aggregates.
View Full Paper PDF
Working Paper

Recalculating... : How Uncertainty in Local Labor Market Definitions Affects Empirical Findings

January 2017

Authors: Lars Vilhuber, Mark J. Kutzbach, Andrew Foote

Working Paper Number:

CES-17-49R

This paper evaluates the use of commuting zones as a local labor market definition. We revisit Tolbert and Sizer (1996) and demonstrate the sensitivity of definitions to two features of the methodology: a cluster dissimilarity cutoff, or the count of clusters, and uncertainty in the input data. We show how these features impact empirical estimates using a standard application of commuting zones and an example from related literature. We conclude with advice to researchers on how to demonstrate the robustness of empirical findings to uncertainty in the definition of commuting zones
View Full Paper PDF
Working Paper

BIAS IN FOOD STAMPS PARTICIPATION ESTIMATES IN THE PRESENCE OF MISREPORTING ERROR

March 2013

Authors: Cathleen Li

Working Paper Number:

CES-13-13

This paper focuses on how survey misreporting of food stamp receipt can bias demographic estimation of program participation. Food stamps is a federally funded program which subsidizes the nutrition of low-income households. In order to improve the reach of this program, studies on how program participation varies by demographic groups have been conducted using census data. Census data are subject to a lot of misreporting error, both underreporting and over-reporting, which can bias the estimates. The impact of misreporting error on estimate bias is examined by calculating food stamp participation rates, misreporting rates, and bias for select household characteristics (covariates).
View Full Paper PDF
Working Paper

Location, Location, Location: The 3L Approach to House Price Determination

May 2004

Authors: Jeffrey Zabel, Katherine Kiel

Working Paper Number:

CES-04-06

The immobility of houses means that their location affects their values. This explains the common belief that three things determine the price of a house: location, location, and location. We use this notion to develop the 3L Approach to house price determination. That is, prices are determined by the Metropolitan Statistical Area (MSA), town, and street where the house is located. This study creates a unique data set based on data from the American Housing Survey (AHS) consisting of small 'clusters' of housing units with information on their housing characteristics and resident characteristics that is merged with census tract-level attributes. We use this data to verify the 3L Approach: we find that all three levels of location are significant when estimating the house price hedonic equation. This indicates that individuals care about their local neighborhood, i.e. the general upkeep of their street and possibly their neighbors' characteristics (cluster variables), a broader area such as the school district and/or the town (tract variables) that account for school quality and crime rates, and the particular amenities found in their MSA.
View Full Paper PDF

Estimation and Inference in Regression Discontinuity Designs with Clustered Sampling

August 2015

Working Paper Number:

carra-2015-06

Abstract

Document Tags and Keywords

The 10 most similar working papers to the working paper 'Estimation and Inference in Regression Discontinuity Designs with Clustered Sampling' are listed below in order of similarity.

August 2002

Working Paper Number:

CES-02-19

May 2013

Working Paper Number:

CES-13-27

May 2016

Working Paper Number:

carra-2016-02

February 2006

Working Paper Number:

CES-06-05

February 2011

Working Paper Number:

CES-11-07

March 2016

Working Paper Number:

carra-2016-01

May 2013

Working Paper Number:

CES-13-28

January 2017

Working Paper Number:

CES-17-49R

March 2013

Working Paper Number:

CES-13-13

May 2004

Working Paper Number:

CES-04-06