CREAT: Census Research Exploration and Analysis Tool

Papers Containing Keywords(s): 'estimator'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

Viewing papers 11 through 20 of 32


  • Working Paper

    Estimation and Inference in Regression Discontinuity Designs with Clustered Sampling

    August 2015

    Working Paper Number:

    carra-2015-06

    Regression Discontinuity (RD) designs have become popular in empirical studies due to their attractive properties for estimating causal effects under transparent assumptions. Nonetheless, most popular procedures assume i.i.d. data, which is not reasonable in many common applications. To relax this assumption, we derive the properties of traditional non-parametric estimators in a setting that incorporates potential clustering at the level of the running variable, and propose an accompanying optimal-MSE bandwidth selection rule. Simulation results demonstrate that falsely assuming data are i.i.d. when selecting the bandwidth may lead to the choice of bandwidths that are too small relative to the optimal-MSE bandwidth. Last, we apply our procedure using person-level microdata that exhibits clustering at the census tract level to analyze the impact of the Low-Income Housing Tax Credit program on neighborhood characteristics and low-income housing supply.
    View Full Paper PDF
  • Working Paper

    USING THE PARETO DISTRIBUTION TO IMPROVE ESTIMATES OF TOPCODED EARNINGS

    April 2014

    Working Paper Number:

    CES-14-21

    Inconsistent censoring in the public-use March Current Population Survey (CPS) limits its usefulness in measuring labor earnings trends. Using Pareto estimation methods with less-censored internal CPS data, we create an enhanced cell-mean series to capture top earnings in the public-use CPS. We find that previous approaches for imputing topcoded earnings systematically understate top earnings. Annual earnings inequality trends since 1963 using our series closely approximate those found by Kopczuk, Saez, & Song (2010) using Social Security Administration data for commerce and industry workers. However, when we consider all workers, earnings inequality levels are higher but earnings growth is more modest
    View Full Paper PDF
  • Working Paper

    RANDOMIZED SAFETY INSPECTIONS AND RISK EXPOSURE ON THE JOB: QUASI-EXPERIMENTAL ESTIMATES OF THE VALUE OF A STATISTICAL LIFE

    January 2014

    Working Paper Number:

    CES-14-05

    Compensating wages for workplace fatality and accident risks are used to infer the value of a statistical life (VSL), which in turn is used to assess the benefits of human health and safety regulations. The estimation of these wage differentials, however, has been plagued by measurement error and omitted variables. This paper employs the first quasi-experimental design within a labor market setting to overcome such limitations in the ex-tant literature. Specifically, randomly assigned, exogenous federal safety inspections are used to instrument for plant-level risks and combined with confidential U.S. Census data on manufacturing employment to estimate the VSL using a difference-in-differences framework. The VSL is estimated to be between $2 and $4 million ($2011), suggesting prior studies may substantially overstate the value workers place on safety, and therefore, the benefits of health and safety regulations.
    View Full Paper PDF
  • Working Paper

    A METHOD OF CORRECTING FOR MISREPORTING APPLIED TO THE FOOD STAMP PROGRAM

    May 2013

    Authors: Nikolas Mittag

    Working Paper Number:

    CES-13-28

    Survey misreporting is known to be pervasive and bias common statistical analyses. In this paper, I first use administrative data on SNAP receipt and amounts linked to American Community Survey data from New York State to show that survey data can misrepresent the program in important ways. For example, more than 1.4 billion dollars received are not reported in New York State alone. 46 percent of dollars received by house- holds with annual income above the poverty line are not reported in the survey data, while only 19 percent are missing below the poverty line. Standard corrections for measurement error cannot remove these biases. I then develop a method to obtain consistent estimates by combining parameter estimates from the linked data with publicly available data. This conditional density method recovers the correct estimates using public use data only, which solves the problem that access to linked administrative data is usually restricted. I examine the degree to which this approach can be used to extrapolate across time and geography, in order to solve the problem that validation data is often based on a convenience sample. I present evidence from within New York State that the extent of heterogeneity is small enough to make extrapolation work well across both time and geography. Extrapolation to the entire U.S. yields substantive differences to survey data and reduces deviations from official aggregates by a factor of 4 to 9 compared to survey aggregates.
    View Full Paper PDF
  • Working Paper

    MISCLASSIFICATION IN BINARY CHOICE MODELS

    May 2013

    Working Paper Number:

    CES-13-27

    We derive the asymptotic bias from misclassification of the dependent variable in binary choice models. Measurement error is necessarily non-classical in this case, which leads to bias in linear and non-linear models even if only the dependent variable is mismeasured. A Monte Carlo study and an application to food stamp receipt show that the bias formulas are useful to analyze the sensitivity of substantive conclusions, to interpret biased coefficients and imply features of the estimates that are robust to misclassification. Using administrative records linked to survey data as validation data, we examine estimators that are consistent under misclassification. They can improve estimates if their assumptions hold, but can aggravate the problem if the assumptions are invalid. The estimators differ in their robustness to such violations, which can be improved by incorporating additional information. We propose tests for the presence and nature of misclassification that can help to choose an estimator.
    View Full Paper PDF
  • Working Paper

    ENTREPRENEURSHIP AND URBAN GROWTH:AN EMPIRICAL ASSESSMENT WITH HISTORICAL MINES

    April 2013

    Working Paper Number:

    CES-13-15

    Measures of entrepreneurship, such as average establishment size and the prevalence of start-ups, correlate strongly with employment growth across and within metropolitan areas, but the endogeneity of these measures bedevils interpretation. Chinitz (1961) hypothesized that coal mines near Pittsburgh led that city to specialization in industries, like steel, with significant scale economies and that those big firms led to a dearth of entrepreneurial human capital across several generations. We test this idea by looking at the spatial location of past mines across the United States: proximity to historical mining deposits is associated with bigger firms and fewer start-ups in the middle of the 20th century. We use mines as an instrument for our entrepreneurship measures and find a persistent link between entrepreneurship and city employment growth; this connection works primarily through lower employment growth of start- ups in cities that are closer to mines. These effects hold in cold and warm regions alike and in industries that are not directly related to mining, such as trade, finance and services. We use quantile instrumental variable regression techniques and identify mostly homogeneous effects throughout the conditional city growth distribution.
    View Full Paper PDF
  • Working Paper

    Dynamically Consistent Noise Infusion and Partially Synthetic Data as Confidentiality Protection Measures for Related Time Series

    July 2012

    Working Paper Number:

    CES-12-13

    The Census Bureau's Quarterly Workforce Indicators (QWI) provide detailed quarterly statistics on employment measures such as worker and job flows, tabulated by worker characteristics in various combinations. The data are released for several levels of NAICS industries and geography, the lowest aggregation of the latter being counties. Disclosure avoidance methods are required to protect the information about individuals and businesses that contribute to the underlying data. The QWI disclosure avoidance mechanism we describe here relies heavily on the use of noise infusion through a permanent multiplicative noise distortion factor, used for magnitudes, counts, differences and ratios. There is minimal suppression and no complementary suppressions. To our knowledge, the release in 2003 of the QWI was the first large-scale use of noise infusion in any official statistical product. We show that the released statistics are analytically valid along several critical dimensions { measures are unbiased and time series properties are preserved. We provide an analysis of the degree to which confidentiality is protected. Furthermore, we show how the judicious use of synthetic data, injected into the tabulation process, can completely eliminate suppressions, maintain analytical validity, and increase the protection of the underlying confidential data.
    View Full Paper PDF
  • Working Paper

    Using the Survey of Plant Capacity to Measure Capital Utilization

    July 2011

    Working Paper Number:

    CES-11-19

    Most capital in the United States is idle much of the time. By some measures, the average workweek of capital in U.S. manufacturing is as low as 55 hours per 168 hour week. The level and variability of capital utilization has important implications for understanding both the level of production and its cyclical fluctuations. This paper investigates a number of issues relating to aggregation of capital utilization measures from the Survey of Plant Capacity and makes recommendations on expanding and improving the published statistics deriving from the Survey of Plant Capacity. The paper documents a number of facts about properties of capital utilization. First, after growing for decades, capital utilization started to fall in mid 1990s. Second, capital utilization is a useful predictor of changes in capacity utilization and other factors of production. Third, adjustment of productivity measures for variable capital utilization improves statistical and economic properties of these measures. Fourth, the paper constructs weights to aggregate firm level capital utilization rates to industry and economy level, which is the major enhancement to available data.
    View Full Paper PDF
  • Working Paper

    Plant-Level Productivity and Imputation of Missing Data in the Census of Manufactures

    January 2011

    Working Paper Number:

    CES-11-02

    In the U.S. Census of Manufactures, the Census Bureau imputes missing values using a combination of mean imputation, ratio imputation, and conditional mean imputation. It is wellknown that imputations based on these methods can result in underestimation of variability and potential bias in multivariate inferences. We show that this appears to be the case for the existing imputations in the Census of Manufactures. We then present an alternative strategy for handling the missing data based on multiple imputation. Specifically, we impute missing values via sequences of classification and regression trees, which offer a computationally straightforward and flexible approach for semi-automatic, large-scale multiple imputation. We also present an approach to evaluating these imputations based on posterior predictive checks. We use the multiple imputations, and the imputations currently employed by the Census Bureau, to estimate production function parameters and productivity dispersions. The results suggest that the two approaches provide quite different answers about productivity.
    View Full Paper PDF
  • Working Paper

    Euler-Equation Estimation for Discrete Choice Models: A Capital Accumulation Application

    January 2010

    Working Paper Number:

    CES-10-02

    This paper studies capital adjustment at the establishment level. Our goal is to characterize capital adjustment costs, which are important for understanding both the dynamics of aggregate investment and the impact of various policies on capital accumulation. Our estimation strategy searches for parameters that minimize ex post errors in an Euler equation. This strategy is quite common in models for which adjustment occurs in each period. Here, we extend that logic to the estimation of parameters of dynamic optimization problems in which non-convexities lead to extended periods of investment inactivity. In doing so, we create a method to take into account censored observations stemming from intermittent investment. This methodology allows us to take the structural model directly to the data, avoiding time-consuming simulation based methods. To study the effectiveness of this methodology, we first undertake several Monte Carlo exercises using data generated by the structural model. We then estimate capital adjustment costs for U.S. manufacturing establishments in two sectors.
    View Full Paper PDF