-
The Promise and Potential of Linked Employer-Employee Data for Entrepreneurship Research
September 2015
Working Paper Number:
CES-15-29
In this paper, we highlight the potential for linked employer-employee data to be used in entrepreneurship research, describing new data on business start-ups, their founders and early employees, and providing examples of how they can be used in entrepreneurship research. Linked employer-employee data provides a unique perspective on new business creation by combining information on the business, workforce, and individual. By combining data on both workers and firms, linked data can investigate many questions that owner-level or firm-level data cannot easily answer alone - such as composition of the workforce at start-ups and their role in explaining business dynamics, the flow of workers across new and established firms, and the employment paths of the business owners themselves.
View Full
Paper PDF
-
Employer-Sim Microsimulation Model:
Model Development and Application to Estimation of Tax Subsidies to Health Insurance
December 2014
Working Paper Number:
CES-14-46
Employment-related health coverage is the predominant form of health insurance in the nonelderly, US population. Developing sound policies regarding the tax treatment of employer-sponsored insurance requires detailed information on the insurance benefits offered by employers as well as detailed information on the characteristics of employees and their familes. Unfortunately, no nationally representative data set contains all of the necessary elements. This paper describes the development of the Employer-Sim model which models tax-based health policies by using data on workers from the Medical Expenditure Panel Survey Household Component (MEPS HC) to form synthetic workforces for each establishment in the Medical Expenditure Panel Survey Insurance Component (MEPS IC). This paper describes the application of Employer-Sim to estimating tax subsidies to employer-sponsored health insurance and presents estimates of the cost and indcidence of the subsidy for 2008. The paper concludes by discussing other potential applications of the Employer-Sim model.
View Full
Paper PDF
-
Design Comparison of LODES and ACS Commuting Data Products
October 2014
Working Paper Number:
CES-14-38
The Census Bureau produces two complementary data products, the American Community Survey (ACS) commuting and workplace data and the Longitudinal Employer-Household Dynamics (LEHD) Origin-Destination Employment Statistics (LODES), which can be used to answer questions about spatial, economic, and demographic questions relating to workplaces and home-to-work flows. The products are complementary in the sense that they measure similar activities but each has important unique characteristics that provide information that the other measure cannot. As a result of questions from data users, the Census Bureau has created this document to highlight the major design differences between these two data products. This report guides users on the relative advantages of each data product for various analyses and helps explain differences that may arise when using the products.2,3
As an overview, these two data products are sourced from different inputs, cover different populations and time periods, are subject to different sets of edits and imputations, are released under different confidentiality protection mechanisms, and are tabulated at different geographic and characteristic levels. As a general rule, the two data products should not be expected to match exactly for arbitrary queries and may differ substantially for some queries.
Within this document, we compare the two data products by the design elements that were deemed most likely to contribute to differences in tabulated data. These elements are: Collection, Coverage, Geographic and Longitudinal Scope, Job Definition and Reference Period, Job and Worker Characteristics, Location Definitions (Workplace and Residence), Completeness of Geographic Information and Edits/Imputations, Geographic Tabulation Levels, Control Totals, Confidentiality Protection and Suppression, and Related
Public-Use Data Products.
An in-depth data analysis'in aggregate or with the microdata'between the two data products will be the subject of a future technical report. The Census Bureau has begun a pilot project to integrate ACS microdata with LEHD administrative data to develop an enhanced frame of employment status, place of work, and commuting. The Census Bureau will publish quality metrics for person match rates, residence and workplace match rates, and commute distance comparisons.
View Full
Paper PDF
-
NOISE INFUSION AS A CONFIDENTIALITY PROTECTION MEASURE FOR GRAPH-BASED STATISTICS
September 2014
Working Paper Number:
CES-14-30
We use the bipartite graph representation of longitudinally linked em-ployer-employee data, and the associated projections onto the employer and em-ployee nodes, respectively, to characterize the set of potential statistical summar-ies that the trusted custodian might produce. We consider noise infusion as the primary confidentiality protection method. We show that a relatively straightfor-ward extension of the dynamic noise-infusion method used in the U.S. Census Bureau's Quarterly Workforce Indicators can be adapted to provide the same confidentiality guarantees for the graph-based statistics: all inputs have been modified by a minimum percentage deviation (i.e., no actual respondent data are used) and, as the number of entities contributing to a particular statistic increases, the accuracy of that statistic approaches the unprotected value. Our method also ensures that the protected statistics will be identical in all releases based on the same inputs.
View Full
Paper PDF
-
LEHD Infrastructure files in the Census RDC - Overview
June 2014
Working Paper Number:
CES-14-26
The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau, with the support of several national research agencies, maintains a set of infrastructure files using administrative data provided by state agencies, enhanced with information from other administrative data sources, demographic and economic (business) surveys and censuses. The LEHD Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their interaction in the U.S. economy. This document describes the structure and content of the 2011 Snapshot of the LEHD Infrastructure files as they are made available in the Census Bureaus secure and restricted-access Research Data Center network. The document attempts to provide a comprehensive description of all researcher-accessible files, of their creation, and of any modifcations made to the files to facilitate researcher access.
View Full
Paper PDF
-
FIRM AGE AND SIZE IN THE LONGITUDINAL EMPLOYER-HOUSEHOLD DYNAMICS DATA
March 2014
Working Paper Number:
CES-14-16
The Census Bureau's Quarterly Workforce Dynamics (QWI) and OnTheMap now provide detailed workforce statistics by employer age and size. These data allow a first look at the demographics of workers at small and young businesses as well as detailed analysis of how hiring, turnover, job creation/destruction vary throughout a firm's lifespan. Both the QWI and OnTheMap are tabulated from the Longitudinal Employer-Household Dynamics (LEHD) linked employer-employee data. Firm age and size information was added to the LEHD data through integration of Business Dynamics Statistics (BDS) microdata into the LEHD jobs frame. This paper describes how these two new firm characteristics were added to the microdata and how they are tabulated in QWI and OnTheMap
View Full
Paper PDF
-
COMPARING METHODS FOR IMPUTING EMPLOYER HEALTH INSURANCE CONTRIBUTIONS IN THE CURRENT POPULATION SURVEY
August 2013
Working Paper Number:
CES-13-41
The degree to which firms contribute to the payment of workers' health insurance premiums is an important consideration in the measurement of income and for understanding the potential impact of the 2010 Affordable Care Act on employment-based health insurance participation. Currently the U.S. Census Bureau imputes employer contributions in the Annual Social and Economic Supplement of the Current Population Survey based on data from the 1977 National Medical Care Expenditure Survey. The goal of this paper is to assess the extent to which this imputation methodology produces estimates reflective of the current distribution of employer contributions. The paper uses recent contributions data from the Medical Expenditure Panel Survey-Insurance Component to estimate a new model to inform the imputation procedure and to compare the resulting distribution of contributions. These new estimates are compared with those produced under current production methods across employee and employer characteristics.
View Full
Paper PDF
-
The LEHD Infrastructure Files and the Creation of the Quarterly Workforce Indicators
January 2006
Working Paper Number:
tp-2006-01
The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau,
with the support of several national research agencies, has built a set of infrastructure files
using administrative data provided by state agencies, enhanced with information from other administrative
data sources, demographic and economic (business) surveys and censuses. The LEHD
Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their
interaction in the U.S. economy. Beginning in 2003 and building on this infrastructure, the Census
Bureau has published the Quarterly Workforce Indicators (QWI), a new collection of data series
that offers unprecedented detail on the local dynamics of labor markets. Despite the fine detail,
confidentiality is maintained due to the application of state-of-the-art confidentiality protection
methods. This article describes how the input files are compiled and combined to create the infrastructure
files. We describe the multiple imputation methods used to impute in missing data and
the statistical matching techniques used to combine and edit data when a direct identifier match
requires improvement. Both of these innovations are crucial to the success of the final product. Finally,
we pay special attention to the details of the confidentiality protection system used to protect
the identity and micro data values of the underlying entities used to form the published estimates.
We provide a brief description of public-use and restricted-access data files with pointers to further
documentation for researchers interested in using these data.
View Full
Paper PDF