CREAT - Census Bureau

LEHD Infrastructure S2014 files in the FSRDC

September 2018

Written by: Lars Vilhuber

Working Paper Number:

CES-18-27R

Abstract

The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau, with the support of several national research agencies, maintains a set of infrastructure files using administrative data provided by state agencies, enhanced with information from other administrative data sources, demographic and economic (business) surveys and censuses. The LEHD Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their interaction in the U.S. economy. This document describes the structure and content of the 2014 Snapshot of the LEHD Infrastructure files as they are made available in the Census Bureau's secure and restricted-access Research Data Center network. The document attempts to provide a comprehensive description of all researcher-accessible files, of their creation, and of any modifications made to the files to facilitate researcher access.

Document Tags and Keywords

Keywords:

work census, payroll, data census, census data, census research, survey, agency, linked census, employee, employ, employed, employment data, department, hiring, workplace, workforce, worker, occupation, employment dynamics, clerical, census bureau, employment statistics, census file, worker demographics, employer household, longitudinal employer, research census, censuses surveys, employee data, census employment

Tags:

American Economic Association, Standard Statistical Establishment List, Internal Revenue Service, Standard Industrial Classification, Bureau of Labor Statistics, Social Security Administration, Service Annual Survey, National Science Foundation, Center for Economic Studies, Department of Defense, Review of Economics and Statistics, University of Maryland, American Economic Review, University of Chicago, Current Population Survey, Longitudinal Business Database, Bureau of Labor, Decennial Census, Employer Identification Numbers, Cornell University, Journal of Labor Economics, Business Master File, Social Security, Research Data Center, Department of Homeland Security, North American Industry Classification System, American Community Survey, Social Security Number, Alfred P Sloan Foundation, Longitudinal Employer Household Dynamics, Business Register, Protected Identification Key, Sloan Foundation, Employment History File, Employer Characteristics File, Individual Characteristics File, American Housing Survey, Quarterly Workforce Indicators, Core Based Statistical Area, Quarterly Census of Employment and Wages, Composite Person Record, Business Employment Dynamics, Local Employment Dynamics, Office of Personnel Management, Master Address File, Business Register Bridge, Probability Density Function, Disclosure Review Board, North American Industry Classi, SSA Numident, Federal Statistical Research Data Center, Federal Tax Information, HHS, Successor Predecessor File, DOB, LEHD Origin-Destination Employment Statistics, Federal Emergency Management Agency

Similar Working Papers

The 10 most similar working papers to the working paper 'LEHD Infrastructure S2014 files in the FSRDC' are listed below in order of similarity.

Working Paper
🔥

The LEHD Infrastructure Files and the Creation of the Quarterly Workforce Indicators

January 2006

Authors: Lars Vilhuber, John M. Abowd, Kevin L. McKinney, Bryce Stephens, Fredrik Andersson, Marc Roemer, Simon Woodcock

Working Paper Number:

tp-2006-01

The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau, with the support of several national research agencies, has built a set of infrastructure files using administrative data provided by state agencies, enhanced with information from other administrative data sources, demographic and economic (business) surveys and censuses. The LEHD Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their interaction in the U.S. economy. Beginning in 2003 and building on this infrastructure, the Census Bureau has published the Quarterly Workforce Indicators (QWI), a new collection of data series that offers unprecedented detail on the local dynamics of labor markets. Despite the fine detail, confidentiality is maintained due to the application of state-of-the-art confidentiality protection methods. This article describes how the input files are compiled and combined to create the infrastructure files. We describe the multiple imputation methods used to impute in missing data and the statistical matching techniques used to combine and edit data when a direct identifier match requires improvement. Both of these innovations are crucial to the success of the final product. Finally, we pay special attention to the details of the confidentiality protection system used to protect the identity and micro data values of the underlying entities used to form the published estimates. We provide a brief description of public-use and restricted-access data files with pointers to further documentation for researchers interested in using these data.
View Full Paper PDF
Working Paper
🔥

LEHD Infrastructure files in the Census RDC - Overview

June 2014

Authors: Lars Vilhuber, Kevin L. McKinney

Working Paper Number:

CES-14-26

The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau, with the support of several national research agencies, maintains a set of infrastructure files using administrative data provided by state agencies, enhanced with information from other administrative data sources, demographic and economic (business) surveys and censuses. The LEHD Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their interaction in the U.S. economy. This document describes the structure and content of the 2011 Snapshot of the LEHD Infrastructure files as they are made available in the Census Bureaus secure and restricted-access Research Data Center network. The document attempts to provide a comprehensive description of all researcher-accessible files, of their creation, and of any modifcations made to the files to facilitate researcher access.
View Full Paper PDF
Working Paper
🔥

LEHD Snapshot Documentation, Release S2021_R2022Q4

November 2022

Authors: Kevin L. McKinney, Erika McEntarfer, Matthew R. Graham, Stephen Tibbets, Lee Tucker

Working Paper Number:

CES-22-51

The Longitudinal Employer-Household Dynamics (LEHD) data at the U.S. Census Bureau is a quarterly database of linked employer-employee data covering over 95% of employment in the United States. These data are used to produce a number of public-use tabulations and tools, including the Quarterly Workforce Indicators (QWI), LEHD Origin-Destination Employment Statistics (LODES), Job-to-Job Flows (J2J), and Post-Secondary Employment Outcomes (PSEO) data products. Researchers on approved projects may also access the underlying LEHD microdata directly, in the form of the LEHD Snapshot restricted-use data product. This document provides a detailed overview of the LEHD Snapshot as of release S2021_R2022Q4, including user guidance, variable codebooks, and an overview of the approvals needed to obtain access. Updates to the documentation for this and future snapshot releases will be made available in HTML format on the LEHD website.
View Full Paper PDF
Working Paper
🔥

LEHD Infrastructure Files in the Census RDC: Overview of S2004 Snapshot

April 2011

Authors: Lars Vilhuber, Kevin L. McKinney

Working Paper Number:

CES-11-13

The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau, with the support of several national research agencies, has built a set of infrastructure files using administrative data provided by state agencies, enhanced with information from other administrative data sources, demographic and economic (business) surveys and censuses. The LEHD Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their interaction in the U.S. economy. This document describes the structure and content of the 2004 Snapshot of the LEHD Infrastructure files as they are made available in the Census Bureau's Research Data Center network.
View Full Paper PDF
Working Paper
🔥

LEHD Data Documentation LEHD-OVERVIEW-S2008-rev1

December 2011

Authors: Lars Vilhuber, Kevin L. McKinney

Working Paper Number:

CES-11-43

View Full Paper PDF
Working Paper
🔥

The Creation of the Employment Dynamics Estimates

July 2002

Authors: Lars Vilhuber, John M. Abowd, Paul A. Lengermann

Working Paper Number:

tp-2002-13

View Full Paper PDF
Working Paper
🔥

The Sensitivity of Economic Statistics to Coding Errors in Personal Identifiers

October 2002

Authors: Lars Vilhuber, John M. Abowd

Working Paper Number:

tp-2002-17

In this paper, we describe the sensitivity of small-cell flow statistics to coding errors in the identity of the underlying entities. Specifically, we present results based on a comparison of the U.S. Census Bureau's Quarterly Workforce Indicators (QWI) before and after correcting for such errors in SSN-based identifiers in the underlying individual wage records. The correction used involves a novel application of existing statistical matching techniques. It is found that even a very conservative correction procedure has a sizable impact on the statistics. The average bias ranges from 0.25 percent up to 15 percent for flow statistics, and up to 5 percent for payroll aggregates.
View Full Paper PDF
Working Paper
🔥

Confidentiality Protection in the Census Bureau Quarterly Workforce Indicators

February 2006

Authors: Lars Vilhuber, John M. Abowd, Bryce Stephens

Working Paper Number:

tp-2006-02

The QuarterlyWorkforce Indicators are new estimates developed by the Census Bureau's Longitudinal Employer-Household Dynamics Program as a part of its Local Employment Dynamics partnership with 37 state Labor Market Information offices. These data provide detailed quarterly statistics on employment, accessions, layoffs, hires, separations, full-quarter employment (and related flows), job creations, job destructions, and earnings (for flow and stock categories of workers). The data are released for NAICS industries (and 4-digit SICs) at the county, workforce investment board, and metropolitan area levels of geography. The confidential microdata - unemployment insurance wage records, ES-202 establishment employment, and Title 13 demographic and economic information - are protected using a permanent multiplicative noise distortion factor. This factor distorts all input sums, counts, differences and ratios. The released statistics are analytically valid - measures are unbiased and time series properties are preserved. The confidentiality protection is manifested in the release of some statistics that are flagged as "significantly distorted to preserve confidentiality." These statistics differ from the undistorted statistics by a significant proportion. Even for the significantly distorted statistics, the data remain analytically valid for time series properties. The released data can be aggregated; however, published aggregates are less distorted than custom postrelease aggregates. In addition to the multiplicative noise distortion, confidentiality protection is provided by the estimation process for the QWIs, which multiply imputes all missing data (including missing establishment, given UI account, in the UI wage record data) and dynamically re-weights the establishment data to provide state-level comparability with the BLS's Quarterly Census of Employment and Wages.
View Full Paper PDF
Working Paper
🔥

LODES Design and Methodology Report: Methodology Version 7

August 2025

Authors: Matthew R. Graham, Mark J. Kutzbach, Andrew Foote

Working Paper Number:

CES-25-52

The purpose of this report is to document the important features of Version 7 of the LEHD Origin-Destination Employment Statistics (LODES) processing system. This includes data sources, data processing methodology, confidentiality protection methodology, some quality measures, and a high-level description of the published data. The intended audience for this document includes LODES data users, Local Employment Dynamics (LED) Partnership members, U.S. Census Bureau management, program quality auditors, and current and future research and development staff members.
View Full Paper PDF
Working Paper

Developing a Residence Candidate File for Use With Employer-Employee Matched Data

January 2017

Authors: Matthew R. Graham, Mark J. Kutzbach, Danielle H. Sandler

Working Paper Number:

CES-17-40

This paper describes the Longitudinal Employer-Household Dynamics (LEHD) program's ongoing efforts to use administrative records in a predictive model that describes residence locations for workers. This project was motivated by the discontinuation of a residence file produced elsewhere at the U.S. Census Bureau. The goal of the Residence Candidate File (RCF) process is to provide the LEHD Infrastructure Files with residence information that maintains currency with the changing state of administrative sources and represents uncertainty in location as a probability distribution. The discontinued file provided only a single residence per person/year, even when contributing administrative data may have contained multiple residences. This paper describes the motivation for the project, our methodology, the administrative data sources, the model estimation and validation results, and the file specifications. We find that the best prediction of the person-place model provides similar, but superior, accuracy compared with previous methods and performs well for workers in the LEHD jobs frame. We outline possibilities for further improvement in sources and modeling as well as recommendations on how to use the preference weights in downstream processing.
View Full Paper PDF

LEHD Infrastructure S2014 files in the FSRDC

September 2018

Working Paper Number:

CES-18-27R

Abstract

Document Tags and Keywords

The 10 most similar working papers to the working paper 'LEHD Infrastructure S2014 files in the FSRDC' are listed below in order of similarity.

January 2006

Working Paper Number:

tp-2006-01

June 2014

Working Paper Number:

CES-14-26

November 2022

Working Paper Number:

CES-22-51

April 2011

Working Paper Number:

CES-11-13

December 2011

Working Paper Number:

CES-11-43

July 2002

Working Paper Number:

tp-2002-13

October 2002

Working Paper Number:

tp-2002-17

February 2006

Working Paper Number:

tp-2006-02

August 2025

Working Paper Number:

CES-25-52

January 2017

Working Paper Number:

CES-17-40