Computing Person and Firm Effects Using Linked Longitudinal Employer-Employee Data
March 2002
Working Paper Number:
tp-2002-06
Abstract
Document Tags and Keywords
Keywords
Keywords are automatically generated using KeyBERT, a keyword extraction tool that uses BERT embeddings to select contextually relevant keywords.
By analyzing the content of working papers, KeyBERT identifies terms and phrases that capture the essence of the text, highlighting the most significant topics and trends. This approach not only enhances searchability but also provides connections that go beyond potentially domain-specific author-defined keywords.
estimation,
analysis,
data,
payroll,
statistical,
agency,
model,
employee,
employed,
labor,
longitudinal,
regression,
employing,
worker,
regressors,
associate,
census bureau,
employer household,
longitudinal employer,
unemployment insurance,
employee data
Tags
Tags are automatically generated using a pretrained language model from spaCy, which performs several tasks, including entity tagging.
The model labels words and phrases by entity type, including "organizations." Filtering for frequent words and phrases labeled as "organizations" identifies references in each paper to specific institutions, datasets, and other organizations.
Service Annual Survey,
National Science Foundation,
National Bureau of Economic Research,
Cornell University,
Longitudinal Employer Household Dynamics,
AKM,
United States Census Bureau
Similar Working Papers
Similarity between working papers is determined by an unsupervised neural network model known as Doc2Vec.
Doc2Vec is a model that represents entire documents as fixed-length vectors, allowing for the
capture of semantic meaning in a way that relates to the context of words within the document. The model learns to
associate a unique vector with each document while simultaneously learning word vectors, enabling tasks such as
document classification, clustering, and similarity detection by preserving the order and structure of words. The
document vectors are compared using cosine similarity/distance to determine the most similar working papers.
Papers identified with 🔥 are in the top 20% of similarity.
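As a rough illustration of the comparison step described above, the following sketch ranks documents by cosine similarity to a query document. It is not the site's actual pipeline: the function names `cosine_similarity` and `most_similar`, the paper identifiers, and the three-dimensional toy vectors are all hypothetical stand-ins for real Doc2Vec output, which would typically have many more dimensions.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # Convention chosen for this sketch: a zero vector is similar to nothing.
        return 0.0
    return dot / (norm_u * norm_v)


def most_similar(query_id, doc_vectors, top_n=10):
    """Return the top_n (doc_id, similarity) pairs, most similar first."""
    query = doc_vectors[query_id]
    scores = [
        (doc_id, cosine_similarity(query, vec))
        for doc_id, vec in doc_vectors.items()
        if doc_id != query_id  # do not compare the query paper with itself
    ]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:top_n]


# Hypothetical toy vectors standing in for learned document embeddings.
toy_vectors = {
    "paper_a": [1.0, 0.0, 0.0],
    "paper_b": [0.9, 0.1, 0.0],
    "paper_c": [0.0, 1.0, 0.0],
    "paper_d": [-1.0, 0.0, 0.0],
}

ranking = most_similar("paper_a", toy_vectors, top_n=3)
for doc_id, score in ranking:
    print(f"{doc_id}: {score:.3f}")
```

Because cosine similarity depends only on the angle between vectors, two documents whose vectors point in the same direction score 1 regardless of vector length, and the ranking is unaffected by rescaling any single vector.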
The 10 most similar working papers to the working paper 'Computing Person and Firm Effects Using Linked Longitudinal Employer-Employee Data' are listed below in order of similarity.
-
Working Paper: Mixed-Effects Methods For Search and Matching Research 🔥
September 2023
Working Paper Number:
CES-23-43
We study mixed-effects methods for estimating equations containing person and firm effects. In economics such models are usually estimated using fixed-effects methods. Recent enhancements to those fixed-effects methods include corrections to the bias in estimating the covariance matrix of the person and firm effects, which we also consider.
-
Working Paper: Sorting Between and Within Industries: A Testable Model of Assortative Matching
January 2017
Working Paper Number:
CES-17-43
We test Shimer's (2005) theory of the sorting of workers between and within industrial sectors based on directed search with coordination frictions, deliberately maintaining its static general equilibrium framework. We fit the model to sector-specific wage, vacancy and output data, including publicly-available statistics that characterize the distribution of worker and employer wage heterogeneity across sectors. Our empirical method is general and can be applied to a broad class of assignment models. The results indicate that industries are the loci of sorting: more productive workers are employed in more productive industries. The evidence confirms that strong assortative matching can be present even when worker and employer components of wage heterogeneity are weakly correlated.
-
Working Paper: Unlocking the Information in Integrated Social Data
May 2002
Working Paper Number:
tp-2002-21
-
Working Paper: A Formal Test of Assortative Matching in the Labor Market
November 2009
Working Paper Number:
CES-09-40
We estimate a structural model of job assignment in the presence of coordination frictions due to Shimer (2005). The coordination friction model places restrictions on the joint distribution of worker and firm effects from a linear decomposition of log labor earnings. These restrictions permit estimation of the unobservable ability and productivity differences between workers and their employers as well as the way workers sort into jobs on the basis of these unobservable factors. The estimation is performed on matched employer-employee data from the LEHD program of the U.S. Census Bureau. The estimated correlation between worker and firm effects from the earnings decomposition is close to zero, a finding that is often interpreted as evidence that there is no sorting by comparative advantage in the labor market. Our estimates suggest that this finding actually results from a lack of sufficient heterogeneity in the workforce and available jobs. Workers do sort into jobs on the basis of productive differences, but the effects of sorting are not visible because of the composition of workers and employers.
-
Working Paper: The Measurement of Human Capital in the U.S. Economy
April 2002
Working Paper Number:
tp-2002-09
We develop a new approach to measuring human capital that permits the distinction of both observable and unobservable dimensions of skill by associating human capital with the portable part of an individual's wage rate. Using new large-scale, integrated employer-employee data containing information on 68 million individuals and 3.6 million firms, we explain a very large proportion (84%) of the total variation in wage rates and attribute substantial variation to both individual and employer heterogeneity. While the wage distribution remained largely unchanged between 1992 and 1997, we document a pronounced right shift in the overall distribution of human capital. Most workers entering our sample, while less experienced, were otherwise more highly skilled, a difference which can be attributed almost exclusively to unobservables. Nevertheless, compared to exiters and continuers, entrants exhibited a greater tendency to match to firms paying below average internal wages. Firms reduced employment shares of low skilled workers and increased employment shares of high skilled workers in virtually every industry. Our results strongly suggest that the distribution of human capital will continue to shift to the right, implying a continuing up-skilling of the employed labor force.
-
Working Paper: Modeling Endogenous Mobility in Wage Determination
June 2015
Working Paper Number:
CES-15-18
We evaluate the bias from endogenous job mobility in fixed-effects estimates of worker- and firm-specific earnings heterogeneity using longitudinally linked employer-employee data from the LEHD infrastructure file system of the U.S. Census Bureau. First, we propose two new residual diagnostic tests of the assumption that mobility is exogenous to unmodeled determinants of earnings. Both tests reject exogenous mobility. We relax the exogenous mobility assumptions by modeling the evolution of the matched data as an evolving bipartite graph using a Bayesian latent class framework. Our results suggest that endogenous mobility biases estimated firm effects toward zero. To assess validity, we match our estimates of the wage components to out-of-sample estimates of revenue per worker. The corrected estimates attribute much more of the variation in revenue per worker to variation in match quality and worker quality than the uncorrected estimates.
-
Working Paper: NOISE INFUSION AS A CONFIDENTIALITY PROTECTION MEASURE FOR GRAPH-BASED STATISTICS
September 2014
Working Paper Number:
CES-14-30
We use the bipartite graph representation of longitudinally linked employer-employee data, and the associated projections onto the employer and employee nodes, respectively, to characterize the set of potential statistical summaries that the trusted custodian might produce. We consider noise infusion as the primary confidentiality protection method. We show that a relatively straightforward extension of the dynamic noise-infusion method used in the U.S. Census Bureau's Quarterly Workforce Indicators can be adapted to provide the same confidentiality guarantees for the graph-based statistics: all inputs have been modified by a minimum percentage deviation (i.e., no actual respondent data are used) and, as the number of entities contributing to a particular statistic increases, the accuracy of that statistic approaches the unprotected value. Our method also ensures that the protected statistics will be identical in all releases based on the same inputs.
-
Working Paper: Job Referral Networks and the Determination of Earnings in Local Labor Markets
December 2010
Working Paper Number:
CES-10-40
Referral networks may affect the efficiency and equity of labor market outcomes, but few studies have been able to identify earnings effects empirically. To make progress, I set up a model of on-the-job search in which referral networks channel information about high-paying jobs. I evaluate the model using employer-employee matched data for the U.S. linked to the Census block of residence for each worker. The referral effect is identified by variations in the quality of local referral networks within narrowly defined neighborhoods. I find, consistent with the model, a positive and significant role for local referral networks on the full distribution of earnings outcomes from job search.
-
Working Paper: Total Error and Variability Measures for the Quarterly Workforce Indicators and LEHD Origin Destination Employment Statistics in OnTheMap
September 2020
Working Paper Number:
CES-20-30
We report results from the first comprehensive total quality evaluation of five major indicators in the U.S. Census Bureau's Longitudinal Employer-Household Dynamics (LEHD) Program Quarterly Workforce Indicators (QWI): total flow-employment, beginning-of-quarter employment, full quarter employment, average monthly earnings of full-quarter employees, and total quarterly payroll. Beginning-of-quarter employment is also the main tabulation variable in the LEHD Origin-Destination Employment Statistics (LODES) workplace reports as displayed in OnTheMap (OTM), including OnTheMap for Emergency Management. We account for errors due to coverage; record-level nonresponse; edit and imputation of item missing data; and statistical disclosure limitation. The analysis reveals that the five publication variables under study are estimated very accurately for tabulations involving at least 10 jobs. Tabulations involving three to nine jobs are a transition zone, where cells may be fit for use with caution. Tabulations involving one or two jobs, which are generally suppressed on fitness-for-use criteria in the QWI and synthesized in LODES, have substantial total variability but can still be used to estimate statistics for untabulated aggregates as long as the job count in the aggregate is more than 10.
-
Working Paper: Male Earnings Volatility in LEHD before, during, and after the Great Recession
September 2020
Working Paper Number:
CES-20-31
This paper is part of a coordinated collection of papers on prime-age male earnings volatility. Each paper produces a similar set of statistics for the same reference population using a different primary data source. Our primary data source is the Census Bureau's Longitudinal Employer-Household Dynamics (LEHD) infrastructure files. Using LEHD data from 1998 to 2016, we create a well-defined population frame to facilitate accurate estimation of temporal changes comparable to designed longitudinal samples of people. We show that earnings volatility, excluding increases during recessions, has declined over the analysis period, a finding robust to various sensitivity analyses. Although we find volatility is declining, the effect is not homogeneous, particularly for workers with tenuous labor force attachment for whom volatility is increasing. These 'not stable' workers have earnings volatility approximately 30 times larger than stable workers, but, more important for earnings volatility trends, we observe a large increase in the share of stable employment from 60% in 1998 to 67% in 2016, which we show to be largely responsible for the decline in overall earnings volatility. To further emphasize the importance of not stable and/or low earning workers, we also conduct comparisons with the PSID and show how changes over time in the share of workers at the bottom tail of the cross-sectional earnings distributions can produce either declining or increasing earnings volatility trends.