In this paper, we describe the sensitivity of small-cell flow statistics
to coding errors in the identity of the underlying entities. Specifically,
we present results based on a comparison of the U.S. Census Bureau's
Quarterly Workforce Indicators (QWI) before and after correcting for
such errors in SSN-based identifiers in the underlying individual wage
records. The correction used involves a novel application of existing
statistical matching techniques. It is found that even a very conservative
correction procedure has a sizable impact on the statistics. The
average bias ranges from 0.25 percent up to 15 percent for flow statistics,
and up to 5 percent for payroll aggregates.
-
The LEHD Infrastructure Files and the Creation of the Quarterly Workforce Indicators
January 2006
Working Paper Number:
tp-2006-01
The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau,
with the support of several national research agencies, has built a set of infrastructure files
using administrative data provided by state agencies, enhanced with information from other administrative
data sources, demographic and economic (business) surveys and censuses. The LEHD
Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their
interaction in the U.S. economy. Beginning in 2003 and building on this infrastructure, the Census
Bureau has published the Quarterly Workforce Indicators (QWI), a new collection of data series
that offers unprecedented detail on the local dynamics of labor markets. Despite the fine detail,
confidentiality is maintained due to the application of state-of-the-art confidentiality protection
methods. This article describes how the input files are compiled and combined to create the infrastructure
files. We describe the multiple imputation methods used to impute in missing data and
the statistical matching techniques used to combine and edit data when a direct identifier match
requires improvement. Both of these innovations are crucial to the success of the final product. Finally,
we pay special attention to the details of the confidentiality protection system used to protect
the identity and micro data values of the underlying entities used to form the published estimates.
We provide a brief description of public-use and restricted-access data files with pointers to further
documentation for researchers interested in using these data.
View Full
Paper PDF
-
The Creation of the Employment Dynamics Estimates
July 2002
Working Paper Number:
tp-2002-13
View Full
Paper PDF
-
Confidentiality Protection in the Census Bureau Quarterly Workforce Indicators
February 2006
Working Paper Number:
tp-2006-02
The QuarterlyWorkforce Indicators are new estimates developed by the Census Bureau's Longitudinal
Employer-Household Dynamics Program as a part of its Local Employment Dynamics
partnership with 37 state Labor Market Information offices. These data provide detailed quarterly
statistics on employment, accessions, layoffs, hires, separations, full-quarter employment
(and related flows), job creations, job destructions, and earnings (for flow and stock categories of
workers). The data are released for NAICS industries (and 4-digit SICs) at the county, workforce
investment board, and metropolitan area levels of geography. The confidential microdata - unemployment
insurance wage records, ES-202 establishment employment, and Title 13 demographic
and economic information - are protected using a permanent multiplicative noise distortion factor.
This factor distorts all input sums, counts, differences and ratios. The released statistics are analytically
valid - measures are unbiased and time series properties are preserved. The confidentiality
protection is manifested in the release of some statistics that are flagged as "significantly distorted
to preserve confidentiality." These statistics differ from the undistorted statistics by a significant
proportion. Even for the significantly distorted statistics, the data remain analytically valid for
time series properties. The released data can be aggregated; however, published aggregates are
less distorted than custom postrelease aggregates. In addition to the multiplicative noise distortion,
confidentiality protection is provided by the estimation process for the QWIs, which multiply imputes
all missing data (including missing establishment, given UI account, in the UI wage record
data) and dynamically re-weights the establishment data to provide state-level comparability with
the BLS's Quarterly Census of Employment and Wages.
View Full
Paper PDF
-
LEHD Infrastructure S2014 files in the FSRDC
September 2018
Working Paper Number:
CES-18-27R
The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau, with the support of several national research agencies, maintains a set of infrastructure files using administrative data provided by state agencies, enhanced with information from other administrative data sources, demographic and economic (business) surveys and censuses. The LEHD Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their interaction in the U.S. economy. This document describes the structure and content of the 2014 Snapshot of the LEHD Infrastructure files as they are made available in the Census Bureau's secure and restricted-access Research Data Center network. The document attempts to provide a comprehensive description of all researcher-accessible files, of their creation, and of any modifications made to the files to facilitate researcher access.
View Full
Paper PDF
-
Displaced workers, early leavers, and re-employment wages
November 2002
Working Paper Number:
tp-2002-18
In this paper, we lay out a search model that takes explicitly into account the
information flow prior to a mass layoff. Using universal wage data files that allow
us to identify individuals working with healthy and displacing firms both at
the time of displacement as well as any other time period, we test the predictions
of the model on re-employment wage differentials. Workers leaving a "distressed"
firm have higher re-employment wages than workers who stay with the
distressed firm until displacement. This result is robust to the inclusion of controls
for worker quality and unobservable firm characteristics.
View Full
Paper PDF
-
Optimal Probabilistic Record Linkage: Best Practice for Linking Employers in Survey and Administrative Data
March 2019
Working Paper Number:
CES-19-08
This paper illustrates an application of record linkage between a household-level survey and an establishment-level frame in the absence of unique identifiers. Linkage between frames in this setting is challenging because the distribution of employment across firms is highly asymmetric. To address these difficulties, this paper uses a supervised machine learning model to probabilistically link survey respondents in the Health and Retirement Study (HRS) with employers and establishments in the Census Business Register (BR) to create a new data source which we call the CenHRS. Multiple imputation is used to propagate uncertainty from the linkage step into subsequent analyses of the linked data. The linked data reveal new evidence that survey respondents' misreporting and selective nonresponse about employer characteristics are systematically correlated with wages.
View Full
Paper PDF
-
Estimating Measurement Error in SIPP Annual Job Earnings: A Comparison of Census Survey and SSA Administrative Data
September 2002
Working Paper Number:
tp-2002-24
The third chapter investigates measurement error in SIPP annual job
earnings data linked to SSA administrative earnings data. The multiple
earnings measures provided by the survey and administrative data enable
the identification of components of true variation and variation due to
measurement error. We find that 18% of the variation in SIPP annual job
earnings can be attributed to measurement error. We also find that in
both the SIPP and the DER, measurement error is persistent over time.
A lower level of auto-correlation in the SIPP measurement error than in
the economic error component leads to a lower reliability ratio of .62 for
first-differenced earnings.
View Full
Paper PDF
-
National Estimates of Gross Employment and Job Flows from the Quarterly Workforce Indicators with Demographic and Industry Detail
June 2010
Working Paper Number:
CES-10-11
The Quarterly Workforce Indicators (QWI) are local labor market data produced and released every quarter by the United States Census Bureau. Unlike any other local labor market series produced in the U.S. or the rest of the world, the QWI measure employment flows for workers (accession and separations), jobs (creations and destructions) and earnings for demographic subgroups (age and gender), economic industry (NAICS industry groups), detailed geography (block (experimental), county, Core- Based Statistical Area, and Workforce Investment Area), and ownership (private, all) with fully interacted publication tables. The current QWI data cover 47 states, about 98% of the private workforce in those states, and about 92% of all private employment in the entire economy. State participation is sufficiently extensive to permit us to present the first national estimates constructed from these data. We focus on worker, job, and excess (churning) reallocation rates, rather than on levels of the basic variables. This permits comparison to existing series from the Job Openings and Labor Turnover Survey and the Business Employment Dynamics Series from the Bureau of Labor Statistics. The national estimates from the QWI are an important enhancement to existing series because they include demographic and industry detail for both worker and job flow data compiled from underlying micro-data that have been integrated at the job and establishment levels by the Longitudinal Employer-Household Dynamics Program at the Census Bureau. The estimates presented herein were compiled exclusively from public-use data series and are available for download.
View Full
Paper PDF
-
Two Perspectives on Commuting: A Comparison of Home to Work Flows Across Job-Linked Survey and Administrative Files
January 2017
Working Paper Number:
CES-17-34
Commuting flows and workplace employment data have a wide constituency of users including urban and regional planners, social science and transportation researchers, and businesses. The U.S. Census Bureau releases two, national data products that give the magnitude and characteristics of home to work flows. The American Community Survey (ACS) tabulates households' responses on employment, workplace, and commuting behavior. The Longitudinal Employer-Household Dynamics (LEHD) program tabulates administrative records on jobs in the LEHD Origin-Destination Employment Statistics (LODES). Design differences across the datasets lead to divergence in a comparable statistic: county-to-county aggregate commute flows. To understand differences in the public use data, this study compares ACS and LEHD source files, using identifying information and probabilistic matching to join person and job records. In our assessment, we compare commuting statistics for job frames linked on person, employment status, employer, and workplace and we identify person and job characteristics as well as design features of the data frames that explain aggregate differences. We find a lower rate of within-county commuting and farther commutes in LODES. We attribute these greater distances to differences in workplace reporting and to uncertainty of establishment assignments in LEHD for workers at multi-unit employers. Minor contributing factors include differences in residence location and ACS workplace edits. The results of this analysis and the data infrastructure developed will support further work to understand and enhance commuting statistics in both datasets.
View Full
Paper PDF
-
Earnings Growth, Job Flows and Churn
April 2020
Working Paper Number:
CES-20-15
How much do workers making job-to-job transitions benefit from moving away from a shrinking and towards a growing firm? We show that earnings growth in the transition increases with net employment growth at the destination firm and, to a lesser extent, decreases if the origin firm is shrinking. So, we sum the effect of leaving a shrinking and entering a growing firm and remove the excess turnover-related hires because gross hiring has a much smaller association with earnings growth than net employment growth. We find that job-to-job transitions with the cross-firm job flow have 23% more earnings growth than average.
View Full
Paper PDF