-
The Composition of Firm Workforces from 2006'2022: Findings from the Business Dynamics Statistics of Human Capital Experimental Product
April 2025
Working Paper Number:
CES-25-20
We introduce the Business Dynamics Statistics of Human Capital (BDS-HC) tables, a new Census Bureau experimental product that provides public-use statistics on the workforce composition of firms and its relationship to business dynamics. We use administrative W-2 filings to combine population-level worker demographic data with longitudinal business data to estimate the demographic and educational composition of nearly all non-farm employer businesses in the United States between 2006 and 2022. We use this newly constructed data to document the evolution of employment, entry, and exit of employers based on their workforce compositions. We also provide new statistics on the interaction between firm and worker characteristics, including the composition of workers at startup firms. We find substantial changes between 2006 and 2022 in the distribution of employers along several dimensions, primarily driven by changing workforce compositions within continuing firms rather than the reallocation of employment between firms. We also highlight systematic differences in the business dynamics of firms by their workforce compositions, suggesting that different groups of workers face different economic environments due to their employers.
View Full
Paper PDF
-
Financing, Ownership, and Performance: A Novel, Longitudinal Firm-Level Database
December 2024
Working Paper Number:
CES-24-73
The Census Bureau's Longitudinal Business Database (LBD) underpins many studies of firm-level behavior. It tracks longitudinally all employers in the nonfarm private sector but lacks information about business financing and owner characteristics. We address this shortcoming by linking LBD observations to firm-level data drawn from several large Census Bureau surveys. The resulting Longitudinal Employer, Owner, and Financing (LEOF) database contains more than 3 million observations at the firm-year level with information about start-up financing, current financing, owner demographics, ownership structure, profitability, and owner aspirations ' all linked to annual firm-level employment data since the firm hired its first employee. Using the LEOF database, we document trends in owner demographics and financing patterns and investigate how these business characteristics relate to firm-level employment outcomes.
View Full
Paper PDF
-
Revisions to the LEHD Establishment Imputation Procedure and Applications to Administrative Job Frame
September 2024
Working Paper Number:
CES-24-51
The Census Bureau is developing a 'job frame' to provide detailed job-level employment data across the U.S. through linked administrative records such as unemployment insurance and IRS W-2 filings. This working paper summarizes the research conducted by the job frame development team on modifying and extending the LEHD Unit-to-Worker (U2W) imputation procedure for the job frame prototype. It provides a conceptual overview of the U2W imputation method, highlighting key challenges and tradeoffs in its current application. The paper then presents four imputation methodologies and evaluates their performance in areas such as establishment assignment accuracy, establishment size matching, and job separation rates. The results show that all methodologies perform similarly in assigning workers to the correct establishment. Non-spell-based methodologies excel in matching establishment sizes, while spell-based methodologies perform better in accurately tracking separation rates.
View Full
Paper PDF
-
Trade Liberalization and Labor-Market Outcomes: Evidence from US Matched Employer-Employee Data
September 2022
Working Paper Number:
CES-22-42
We use matched employer-employee data to examine outcomes among workers initially employed within and outside manufacturing after trade liberalization with China. We find that exposure to this shock operates predominantly through workers' counties (versus industries), that larger own industry and downstream exposure typically reduce relative earnings, and that greater upstream exposure often raises them. The latter is particularly important outside manufacturing: while we find substantial and persistent predicted declines in relative earnings among manufacturing workers, those outside manufacturing are generally predicted to experience relative earnings gains. Investigation of employment reactions indicates they account for a small share of the earnings effect.
View Full
Paper PDF
-
Introducing the Medical Expenditure Panel Survey-Insurance Component with Administrative Records (MEPS-ICAR): Description, Data Construction Methodology, and Quality Assessment
August 2022
Working Paper Number:
CES-22-29
This report introduces a new dataset, the Medical Expenditure Panel Survey-Insurance Component with Administrative Records (MEPS-ICAR), consisting of MEPS-IC survey data on establishments and their health insurance benefits packages linked to Decennial Census data and administrative tax records on MEPS-IC establishments' workforces. These data include new measures of the characteristics of MEPS-IC establishments' parent firms, employee turnover, the full distribution of MEPS-IC workers' personal and family incomes, the geographic locations where those workers live, and improved workforce demographic detail. Next, this report details the methods used for producing the MEPS-ICAR. Broadly, the linking process begins by matching establishments' parent firms to their workforces using identifiers appearing in tax records. The linking process concludes by matching establishments to their own workforces by identifying the subset of their parent firm's workforce that best matches the expected size, total payroll, and residential geographic distribution of the establishment's workforce. Finally, this report presents statistics characterizing the match rate and the MEPS-ICAR data itself. Key results include that match rates are consistently high (exceeding 90%) across nearly all data subgroups and that the matched data exhibit a reasonable distribution of employment, payroll, and worker commute distances relative to expectations and external benchmarks. Notably, employment measures derived from tax records, but not used in the match itself, correspond with high fidelity to the employment levels that establishments report in the MEPS-IC. Cumulatively, the construction of the MEPS-ICAR significantly expands the capabilities of the MEPS-IC and presents many opportunities for analysts.
View Full
Paper PDF
-
Business Applications as a Leading Economic Indicator?
May 2021
Working Paper Number:
CES-21-09R
How are applications to start new businesses related to aggregate economic activity? This paper explores the properties of three monthly business application series from the U.S. Census Bureau's Business Formation Statistics as economic indicators: all business applications, business applications that are relatively likely to turn into new employer businesses ('likely employers'), and the residual series -- business applications that have a relatively low rate of becoming employers ('likely non-employers'). Growth in applications for likely employers significantly leads total nonfarm employment growth and has a strong positive correlation with it. Furthermore, growth in applications for likely employers leads growth in most of the monthly Principal Federal Economic Indicators (PFEIs). Motivated by our findings, we estimate a dynamic factor model (DFM) to forecast nonfarm employment growth over a 12-month period using the PFEIs and the likely employers series. The latter improves the model's forecast, especially in the years following the turning points of the Great Recession and the COVID-19 pandemic. Overall, applications for likely employers are a strong leading indicator of monthly PFEIs and aggregate economic activity, whereas applications for likely non-employers provide early information about changes in increasingly prevalent self-employment activity in the U.S. economy.
View Full
Paper PDF
-
Redesigning the Longitudinal Business Database
May 2021
Working Paper Number:
CES-21-08
In this paper we describe the U.S. Census Bureau's redesign and production implementation of the Longitudinal Business Database (LBD) first introduced by Jarmin and Miranda (2002). The LBD is used to create the Business Dynamics Statistics (BDS), tabulations describing the entry, exit, expansion, and contraction of businesses. The new LBD and BDS also incorporate information formerly provided by the Statistics of U.S. Businesses program, which produced similar year-to-year measures of employment and establishment flows. We describe in detail how the LBD is created from curation of the input administrative data, longitudinal matching, retiming of economic census-year births and deaths, creation of vintage consistent industry codes and noise factors, and the creation and cleaning of each year of LBD data. This documentation is intended to facilitate the proper use and understanding of the data by both researchers with approved projects accessing the LBD microdata and those using the BDS tabulations.
View Full
Paper PDF
-
LEHD Infrastructure S2014 files in the FSRDC
September 2018
Working Paper Number:
CES-18-27R
The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau, with the support of several national research agencies, maintains a set of infrastructure files using administrative data provided by state agencies, enhanced with information from other administrative data sources, demographic and economic (business) surveys and censuses. The LEHD Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their interaction in the U.S. economy. This document describes the structure and content of the 2014 Snapshot of the LEHD Infrastructure files as they are made available in the Census Bureau's secure and restricted-access Research Data Center network. The document attempts to provide a comprehensive description of all researcher-accessible files, of their creation, and of any modifications made to the files to facilitate researcher access.
View Full
Paper PDF
-
Does Federally-Funded Job Training Work? Nonexperimental Estimates of WIA Training Impacts Using Longitudinal Data on Workers and Firms
January 2018
Working Paper Number:
CES-18-02
We study the job training provided under the US Workforce Investment Act (WIA) to adults and dislocated workers in two states. Our substantive contributions center on impacts estimated non-experimentally using administrative data. These impacts compare WIA participants who do and do not receive training. In addition to the usual impacts on earnings and employment, we link our state data to the Longitudinal Employer-Household Dynamics (LEHD) data at the US Census Bureau, which allows us to estimate impacts on the characteristics of the firms at which participants find employment. We find moderate positive impacts on employment, earnings and desirable firm characteristics for adults, but not for dislocated workers. Our primary methodological contribution consists of assessing the value of the additional conditioning information provided by the LEHD relative to the data available in state Unemployment Insurance (UI) earnings records. We find that value to be zero.
View Full
Paper PDF
-
The Potential for Using Combined Survey and Administrative Data Sources to Study Internal Labor Migration
January 2017
Working Paper Number:
CES-17-55
This paper introduces a novel data set combining survey data from the American Community Survey (ACS) with administrative data on employment from the Longitudinal Employer-Household Dynamics program, in order to study geographic labor mobility. With its rich set of information about individuals at the time of the migration decision, large sample size, and near-comprehensive ability to detect labor mobility, the new combined ACS-LEHD data offers several advantages over the existing data sets that are typically used in the study of migration, such as the Decennial Census, Current Population Survey, and Internal Revenue Service data. An overview of how these different data sets can be employed, and examples demonstrating the usefulness of the newly proposed data set, are provided.
Aggregate statistics and stylized facts are generated from the ACS-LEHD data which reveal many of the same features as the existing data sets, including the decline of aggregate mobility throughout the past decade, as well as many of the known demographic differences in migration propensity.
View Full
Paper PDF