CREAT - Census Bureau

LEHD Snapshot Documentation, Release S2021_R2022Q4

November 2022

Written by: Kevin L. McKinney, Erika McEntarfer, Matthew R. Graham, Stephen Tibbets, Lee Tucker

Working Paper Number:

CES-22-51

Abstract

The Longitudinal Employer-Household Dynamics (LEHD) data at the U.S. Census Bureau is a quarterly database of linked employer-employee data covering over 95% of employment in the United States. These data are used to produce a number of public-use tabulations and tools, including the Quarterly Workforce Indicators (QWI), LEHD Origin-Destination Employment Statistics (LODES), Job-to-Job Flows (J2J), and Post-Secondary Employment Outcomes (PSEO) data products. Researchers on approved projects may also access the underlying LEHD microdata directly, in the form of the LEHD Snapshot restricted-use data product. This document provides a detailed overview of the LEHD Snapshot as of release S2021_R2022Q4, including user guidance, variable codebooks, and an overview of the approvals needed to obtain access. Updates to the documentation for this and future snapshot releases will be made available in HTML format on the LEHD website.

Document Tags and Keywords

Keywords:

work census, payroll, microdata, survey, employee, employ, employed, workforce, employment count, employment statistics, worker demographics, employer household, longitudinal employer, employee data, census employment, workforce indicators

Tags:

Metropolitan Statistical Area, Internal Revenue Service, Standard Industrial Classification, Bureau of Labor Statistics, Social Security Administration, Service Annual Survey, National Science Foundation, Center for Economic Studies, Census Bureau Longitudinal Business Database, University of Chicago, Longitudinal Business Database, Employer Identification Numbers, Unemployment Insurance, Research Data Center, North American Industry Classification System, American Community Survey, Social Security Number, National Institute on Aging, Alfred P Sloan Foundation, Longitudinal Employer Household Dynamics, Protected Identification Key, National Opinion Research Center, Employer-Household Dynamics, Employment History File, Employer Characteristics File, Individual Characteristics File, Quarterly Workforce Indicators, Core Based Statistical Area, Quarterly Census of Employment and Wages, Composite Person Record, Local Employment Dynamics, Office of Personnel Management, Master Address File, Person Validation System, Census Numident, Federal Statistical Research Data Center, MAF-ARF, Federal Tax Information, Successor Predecessor File, DOB

Similar Working Papers

The 10 most similar working papers to the working paper 'LEHD Snapshot Documentation, Release S2021_R2022Q4' are listed below in order of similarity.

Working Paper
🔥

LEHD Infrastructure S2014 files in the FSRDC

September 2018

Authors: Lars Vilhuber

Working Paper Number:

CES-18-27R

The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau, with the support of several national research agencies, maintains a set of infrastructure files using administrative data provided by state agencies, enhanced with information from other administrative data sources, demographic and economic (business) surveys and censuses. The LEHD Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their interaction in the U.S. economy. This document describes the structure and content of the 2014 Snapshot of the LEHD Infrastructure files as they are made available in the Census Bureau's secure and restricted-access Research Data Center network. The document attempts to provide a comprehensive description of all researcher-accessible files, of their creation, and of any modifications made to the files to facilitate researcher access.
View Full Paper PDF
Working Paper
🔥

The LEHD Infrastructure Files and the Creation of the Quarterly Workforce Indicators

January 2006

Authors: Lars Vilhuber, John M. Abowd, Kevin L. McKinney, Bryce Stephens, Fredrik Andersson, Marc Roemer, Simon Woodcock

Working Paper Number:

tp-2006-01

The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau, with the support of several national research agencies, has built a set of infrastructure files using administrative data provided by state agencies, enhanced with information from other administrative data sources, demographic and economic (business) surveys and censuses. The LEHD Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their interaction in the U.S. economy. Beginning in 2003 and building on this infrastructure, the Census Bureau has published the Quarterly Workforce Indicators (QWI), a new collection of data series that offers unprecedented detail on the local dynamics of labor markets. Despite the fine detail, confidentiality is maintained due to the application of state-of-the-art confidentiality protection methods. This article describes how the input files are compiled and combined to create the infrastructure files. We describe the multiple imputation methods used to impute in missing data and the statistical matching techniques used to combine and edit data when a direct identifier match requires improvement. Both of these innovations are crucial to the success of the final product. Finally, we pay special attention to the details of the confidentiality protection system used to protect the identity and micro data values of the underlying entities used to form the published estimates. We provide a brief description of public-use and restricted-access data files with pointers to further documentation for researchers interested in using these data.
View Full Paper PDF
Working Paper
🔥

LEHD Infrastructure files in the Census RDC - Overview

June 2014

Authors: Lars Vilhuber, Kevin L. McKinney

Working Paper Number:

CES-14-26

The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau, with the support of several national research agencies, maintains a set of infrastructure files using administrative data provided by state agencies, enhanced with information from other administrative data sources, demographic and economic (business) surveys and censuses. The LEHD Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their interaction in the U.S. economy. This document describes the structure and content of the 2011 Snapshot of the LEHD Infrastructure files as they are made available in the Census Bureaus secure and restricted-access Research Data Center network. The document attempts to provide a comprehensive description of all researcher-accessible files, of their creation, and of any modifcations made to the files to facilitate researcher access.
View Full Paper PDF
Working Paper
🔥

LEHD Infrastructure Files in the Census RDC: Overview of S2004 Snapshot

April 2011

Authors: Lars Vilhuber, Kevin L. McKinney

Working Paper Number:

CES-11-13

The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau, with the support of several national research agencies, has built a set of infrastructure files using administrative data provided by state agencies, enhanced with information from other administrative data sources, demographic and economic (business) surveys and censuses. The LEHD Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their interaction in the U.S. economy. This document describes the structure and content of the 2004 Snapshot of the LEHD Infrastructure files as they are made available in the Census Bureau's Research Data Center network.
View Full Paper PDF
Working Paper
🔥

The Creation of the Employment Dynamics Estimates

July 2002

Authors: Lars Vilhuber, John M. Abowd, Paul A. Lengermann

Working Paper Number:

tp-2002-13

View Full Paper PDF
Working Paper

FIRM AGE AND SIZE IN THE LONGITUDINAL EMPLOYER-HOUSEHOLD DYNAMICS DATA

March 2014

Authors: John Haltiwanger, Erika McEntarfer, Henry Hyatt, Liliana Sousa, Stephen Tibbets

Working Paper Number:

CES-14-16

The Census Bureau's Quarterly Workforce Dynamics (QWI) and OnTheMap now provide detailed workforce statistics by employer age and size. These data allow a first look at the demographics of workers at small and young businesses as well as detailed analysis of how hiring, turnover, job creation/destruction vary throughout a firm's lifespan. Both the QWI and OnTheMap are tabulated from the Longitudinal Employer-Household Dynamics (LEHD) linked employer-employee data. Firm age and size information was added to the LEHD data through integration of Business Dynamics Statistics (BDS) microdata into the LEHD jobs frame. This paper describes how these two new firm characteristics were added to the microdata and how they are tabulated in QWI and OnTheMap
View Full Paper PDF
Working Paper

Design Comparison of LODES and ACS Commuting Data Products

October 2014

Authors: Matthew R. Graham, Mark J. Kutzbach, Brian McKenzie

Working Paper Number:

CES-14-38

The Census Bureau produces two complementary data products, the American Community Survey (ACS) commuting and workplace data and the Longitudinal Employer-Household Dynamics (LEHD) Origin-Destination Employment Statistics (LODES), which can be used to answer questions about spatial, economic, and demographic questions relating to workplaces and home-to-work flows. The products are complementary in the sense that they measure similar activities but each has important unique characteristics that provide information that the other measure cannot. As a result of questions from data users, the Census Bureau has created this document to highlight the major design differences between these two data products. This report guides users on the relative advantages of each data product for various analyses and helps explain differences that may arise when using the products.2,3 As an overview, these two data products are sourced from different inputs, cover different populations and time periods, are subject to different sets of edits and imputations, are released under different confidentiality protection mechanisms, and are tabulated at different geographic and characteristic levels. As a general rule, the two data products should not be expected to match exactly for arbitrary queries and may differ substantially for some queries. Within this document, we compare the two data products by the design elements that were deemed most likely to contribute to differences in tabulated data. These elements are: Collection, Coverage, Geographic and Longitudinal Scope, Job Definition and Reference Period, Job and Worker Characteristics, Location Definitions (Workplace and Residence), Completeness of Geographic Information and Edits/Imputations, Geographic Tabulation Levels, Control Totals, Confidentiality Protection and Suppression, and Related Public-Use Data Products. An in-depth data analysis'in aggregate or with the microdata'between the two data products will be the subject of a future technical report. The Census Bureau has begun a pilot project to integrate ACS microdata with LEHD administrative data to develop an enhanced frame of employment status, place of work, and commuting. The Census Bureau will publish quality metrics for person match rates, residence and workplace match rates, and commute distance comparisons.
View Full Paper PDF
Working Paper

LEHD Data Documentation LEHD-OVERVIEW-S2008-rev1

December 2011

Authors: Lars Vilhuber, Kevin L. McKinney

Working Paper Number:

CES-11-43

View Full Paper PDF
Working Paper

Successor/Predecessor Firms

March 2002

Authors: Kevin L. McKinney

Working Paper Number:

tp-2002-04

The goal of this research was to investigate the value added from using worker flows to identify the spurious births and deaths of businesses. We identify four types of "at risk" businesses from ES202 using the successor/predecessor flag and mimic the same categories using UI wage record data. We use two critical decision rules in the analysis: a successor firm has to have at least 80% of employment coming from the donor firm and (in two of the four categories) at least 5 employees have to come from the donor firm. We examine the sensitivity of the categories based on the percentage definition, and find that the results stay very similar, with the exception of the identification of the pure successor. We examine the sensitivity based on the count threshold, and find that there are enormous differences, particularly with identifying spinoff businesses.
View Full Paper PDF
Working Paper

JOB-TO-JOB (J2J) Flows: New Labor Market Statistics From Linked Employer-Employee Data

September 2014

Authors: Kevin L. McKinney, Erika McEntarfer, Henry Hyatt, Stephen Tibbets, Doug Walton

Working Paper Number:

CES-14-34

Flows of workers across jobs are a principal mechanism by which labor markets allocate workers to optimize productivity. While these job flows are both large and economically important, they represent a significant gap in available economic statistics. A soon to be released data product from the U.S. Census Bureau will fill this gap. The Job-to-Job (J2J) flow statistics provide estimates of worker flows across jobs, across different geographic labor markets, by worker and firm characteristics, including direct job-to-job flows as well as job changes with intervening nonemployment. In this paper, we describe the creation of the public-use data product on job-to-job flows. The data underlying the statistics are the matched employer-employee data from the U.S. Census Bureau's Longitudinal Employer-Household Dynamics program. We describe definitional issues and the identification strategy for tracing worker movements between employers in administrative data. We then compare our data with related series and discuss similarities and differences. Lastly, we describe disclosure avoidance techniques for the public use file, and our methodology for estimating national statistics when there is partially missing geography.
View Full Paper PDF

LEHD Snapshot Documentation, Release S2021_R2022Q4

November 2022

Working Paper Number:

CES-22-51

Abstract

Document Tags and Keywords

The 10 most similar working papers to the working paper 'LEHD Snapshot Documentation, Release S2021_R2022Q4' are listed below in order of similarity.

September 2018

Working Paper Number:

CES-18-27R

January 2006

Working Paper Number:

tp-2006-01

June 2014

Working Paper Number:

CES-14-26

April 2011

Working Paper Number:

CES-11-13

July 2002

Working Paper Number:

tp-2002-13

March 2014

Working Paper Number:

CES-14-16

October 2014

Working Paper Number:

CES-14-38

December 2011

Working Paper Number:

CES-11-43

March 2002

Working Paper Number:

tp-2002-04

September 2014

Working Paper Number:

CES-14-34