CREAT: Census Research Exploration and Analysis Tool

Papers Containing Keywords(s): 'imputed'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

Viewing papers 1 through 8 of 8


  • Working Paper

    Revisions to the LEHD Establishment Imputation Procedure and Applications to Administrative Job Frame

    September 2024

    Working Paper Number:

    CES-24-51

    The Census Bureau is developing a 'job frame' to provide detailed job-level employment data across the U.S. through linked administrative records such as unemployment insurance and IRS W-2 filings. This working paper summarizes the research conducted by the job frame development team on modifying and extending the LEHD Unit-to-Worker (U2W) imputation procedure for the job frame prototype. It provides a conceptual overview of the U2W imputation method, highlighting key challenges and tradeoffs in its current application. The paper then presents four imputation methodologies and evaluates their performance in areas such as establishment assignment accuracy, establishment size matching, and job separation rates. The results show that all methodologies perform similarly in assigning workers to the correct establishment. Non-spell-based methodologies excel in matching establishment sizes, while spell-based methodologies perform better in accurately tracking separation rates.
    View Full Paper PDF
  • Working Paper

    Revisions to the LEHD Establishment Imputation Procedure and Applications to Administrative Jobs Frame

    September 2024

    Working Paper Number:

    CES-24-51

    The Census Bureau is developing a 'jobs frame' to provide detailed job-level employment data across the U.S. through linked administrative records such as unemployment insurance and IRS W-2 filings. This working paper summarizes the research conducted by the jobs frame development team on modifying and extending the LEHD Unit-to-Worker (U2W) imputation procedure for the jobs frame prototype. It provides a conceptual overview of the U2W imputation method, highlighting key challenges and tradeoffs in its current application. The paper then presents four imputation methodologies and evaluates their performance in areas such as establishment assignment accuracy, establishment size matching, and job separation rates. The results show that all methodologies perform similarly in assigning workers to the correct establishment. Non-spell-based methodologies excel in matching establishment sizes, while spell-based methodologies perform better in accurately tracking separation rates.
    View Full Paper PDF
  • Working Paper

    Missing Growth from Creative Destruction

    April 2018

    Working Paper Number:

    CES-18-18

    Statistical agencies typically impute inflation for disappearing products based on surviving products, which may result in overstated inflation and understated growth. Using U.S. Census data, we apply two ways of assessing the magnitude of 'missing growth' for private nonfarm businesses from 1983'2013. The first approach exploits information on the market share of surviving plants. The second approach applies indirect inference to firm-level data. We find: (i) missing growth from imputation is substantial ' at least 0.6 percentage points per year; and (ii) most of the missing growth is due to creative destruction (as opposed to new varieties).
    View Full Paper PDF
  • Working Paper

    R&D, Attrition and Multiple Imputation in BRDIS

    January 2017

    Working Paper Number:

    CES-17-13

    Multiple imputation in business establishment surveys like BRDIS, an annual business survey in which some companies are sampled every year or multiple years, may enhance the estimates of total R&D in addition to helping researchers estimate models with subpopulations of small sample size. Considering a panel of BRDIS companies throughout the years 2008 to 2013 linked to LBD data, this paper uses the conclusions obtained with missing data visualization and other explorations to come up with a strategy to conduct multiple imputation appropriate to address the item nonresponse in R&D expenditures. Because survey design characteristics are behind much of the item and unit nonresponse, multiple imputation of missing data in BRDIS changes the estimates of total R&D significantly and alters the conclusions reached by models of the determinants of R&D investment obtained with complete case analysis.
    View Full Paper PDF
  • Working Paper

    USING IMPUTATION TECHNIQUES TO EVALUATE STOPPING RULES IN ADAPTIVE SURVEY DESIGN

    October 2014

    Working Paper Number:

    CES-14-40

    Adaptive Design methods for social surveys utilize the information from the data as it is collected to make decisions about the sampling design. In some cases, the decision is either to continue or stop the data collection. We evaluate this decision by proposing measures to compare the collected data with follow-up samples. The options are assessed by imputation of the nonrespondents under different missingness scenarios, including Missing Not at Random. The variation in the utility measures is compared to the cost induced by the follow-up sample sizes. We apply the proposed method to the 2007 U.S. Census of Manufacturers.
    View Full Paper PDF
  • Working Paper

    RECOVERING THE ITEM-LEVEL EDIT AND IMPUTATION FLAGS IN THE 1977-1997 CENSUSES OF MANUFACTURES

    September 2014

    Authors: T. Kirk White

    Working Paper Number:

    CES-14-37

    As part of processing the Census of Manufactures, the Census Bureau edits some data items and imputes for missing data and some data that is deemed erroneous. Until recently it was difficult for researchers using the plant-level microdata to determine which data items were changed or imputed during the editing and imputation process, because the edit/imputation processing flags were not available to researchers. This paper describes the process of reconstructing the edit/imputation flags for variables in the 1977, 1982, 1987, 1992, and 1997 Censuses of Manufactures using recently recovered Census Bureau files. Thepaper also reports summary statistics for the percentage of cases that are imputed for key variables. Excluding plants with fewer than 5 employees, imputation rates for several key variables range from 8% to 54% for the manufacturing sector as a whole, and from 1% to 72% at the 2-digit SIC industry level.
    View Full Paper PDF
  • Working Paper

    COMPARING METHODS FOR IMPUTING EMPLOYER HEALTH INSURANCE CONTRIBUTIONS IN THE CURRENT POPULATION SURVEY

    August 2013

    Working Paper Number:

    CES-13-41

    The degree to which firms contribute to the payment of workers' health insurance premiums is an important consideration in the measurement of income and for understanding the potential impact of the 2010 Affordable Care Act on employment-based health insurance participation. Currently the U.S. Census Bureau imputes employer contributions in the Annual Social and Economic Supplement of the Current Population Survey based on data from the 1977 National Medical Care Expenditure Survey. The goal of this paper is to assess the extent to which this imputation methodology produces estimates reflective of the current distribution of employer contributions. The paper uses recent contributions data from the Medical Expenditure Panel Survey-Insurance Component to estimate a new model to inform the imputation procedure and to compare the resulting distribution of contributions. These new estimates are compared with those produced under current production methods across employee and employer characteristics.
    View Full Paper PDF
  • Working Paper

    Plant-Level Productivity and Imputation of Missing Data in the Census of Manufactures

    January 2011

    Working Paper Number:

    CES-11-02

    In the U.S. Census of Manufactures, the Census Bureau imputes missing values using a combination of mean imputation, ratio imputation, and conditional mean imputation. It is wellknown that imputations based on these methods can result in underestimation of variability and potential bias in multivariate inferences. We show that this appears to be the case for the existing imputations in the Census of Manufactures. We then present an alternative strategy for handling the missing data based on multiple imputation. Specifically, we impute missing values via sequences of classification and regression trees, which offer a computationally straightforward and flexible approach for semi-automatic, large-scale multiple imputation. We also present an approach to evaluating these imputations based on posterior predictive checks. We use the multiple imputations, and the imputations currently employed by the Census Bureau, to estimate production function parameters and productivity dispersions. The results suggest that the two approaches provide quite different answers about productivity.
    View Full Paper PDF