CREAT: Census Research Exploration and Analysis Tool

Using Worker Flows in the Analysis of the Firm

August 2003

Working Paper Number: tp-2003-09

Abstract

This paper uses a novel approach to measure firm entry and exit, mergers, and acquisitions. It uses information about the flows of clusters of workers across business units to identify longitudinal linkage relationships in longitudinal business data. These longitudinal relationships may be the result of either administrative or economic changes, and we explore both types of newly identified relationships. In particular, we develop a set of criteria based on worker flows to identify changes in firm relationships, such as mergers and acquisitions, administrative identifier changes, and outsourcing. We demonstrate how this new data infrastructure and this cluster flow methodology can be used to better differentiate true firm entry/exit from simple changes in administrative identifiers. We explore the role of outsourcing in a variety of ways, in particular the outsourcing of workers to the temporary help industry. While the primary focus is on developing the data infrastructure and the methodology to identify and interpret these clustered flows of workers, we conclude the paper with an analysis of the impact of these changes on the earnings of workers.
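As a rough illustration of the cluster flow idea described in the abstract, the sketch below counts workers who move together from one employer identifier to another between two periods and reports what share of each employer's workforce the cluster represents. It is a minimal sketch, not the paper's method: the function name `clustered_flows`, the input layout, and the `min_cluster` threshold of 5 are all assumptions made for illustration, and the abstract does not state the actual criteria.

```python
from collections import Counter


def clustered_flows(employment, min_cluster=5):
    """Find employer pairs linked by a cluster of workers moving together.

    employment: dict mapping worker_id -> (employer_before, employer_after),
                where None means the worker has no employer in that period.
    min_cluster: hypothetical minimum number of co-moving workers required
                 to report a link (an assumption, not a value from the paper).
    """
    flows = Counter()        # (employer_before, employer_after) -> movers
    size_before = Counter()  # employer -> headcount in the first period
    size_after = Counter()   # employer -> headcount in the second period

    for before, after in employment.values():
        if before is not None:
            size_before[before] += 1
        if after is not None:
            size_after[after] += 1
        if before is not None and after is not None and before != after:
            flows[(before, after)] += 1

    links = []
    for (src, dst), n in flows.items():
        if n >= min_cluster:
            links.append({
                "predecessor": src,
                "successor": dst,
                "workers": n,
                # Share of the predecessor's first-period workforce that moved.
                "share_of_predecessor": n / size_before[src],
                # Share of the successor's second-period workforce that arrived.
                "share_of_successor": n / size_after[dst],
            })
    return links


# Toy data: 8 of employer A's 10 workers reappear together at employer B.
employment = {f"w{i}": ("A", "B") for i in range(8)}
employment["w8"] = ("A", None)
employment["w9"] = ("A", None)
employment["w10"] = ("C", "D")  # a single mover, below the threshold
employment["w11"] = ("C", "C")  # a stayer

links = clustered_flows(employment)
```

In this toy data, most of A's workforce moves to B and makes up all of B's workforce, a pattern that could reflect either an identifier change or an economic event such as an acquisition. Telling those cases apart is what the paper's criteria address, and this sketch does not attempt it.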

Document Tags and Keywords

Keywords

Keywords are automatically generated using KeyBERT, a keyword extraction tool that uses BERT embeddings to select contextually relevant keywords.

By analyzing the content of working papers, KeyBERT identifies terms and phrases that capture the essence of the text, highlighting the most significant topics and trends. This approach not only enhances searchability but also provides connections that go beyond potentially domain-specific author-defined keywords.
analysis, statistical, data census, report, census data, survey, study, respondent, research, empirical, yearly, firms census, longitudinal, measure, population, census business, census bureau, aging

Tags

Tags are automatically generated using a pretrained language model from spaCy, which excels at several tasks, including entity tagging.

The model is able to label words and phrases by part of speech and by entity type, including "organizations." By filtering for frequent words and phrases labeled as "organizations", the tool identifies the specific institutions, datasets, and other organizations that each paper references.
Internal Revenue Service, Standard Industrial Classification, National Science Foundation, University of Maryland, Financial, Insurance and Real Estate Industries, Current Population Survey, Employer Identification Numbers, Cornell University, Longitudinal Employer Household Dynamics, LEHD Program, Census Bureau Business Register

Similar Working Papers

Similarity between working papers is determined by an unsupervised neural network model known as Doc2Vec.

Doc2Vec is a model that represents entire documents as fixed-length vectors, allowing for the capture of semantic meaning in a way that relates to the context of words within the document. The model learns to associate a unique vector with each document while simultaneously learning word vectors, enabling tasks such as document classification, clustering, and similarity detection by preserving the order and structure of words. The document vectors are compared using cosine similarity/distance to determine the most similar working papers. Papers identified with 🔥 are in the top 20% of similarity.
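The comparison step can be sketched as follows, assuming the document vectors have already been produced by some model. The vectors, paper identifiers, and function names here are made up for illustration, and the sketch does not reproduce the site's actual pipeline or its top-20% flag.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_u * norm_v)


def rank_similar(target_id, vectors, top_n=10):
    """Return up to top_n (paper_id, similarity) pairs, most similar first."""
    target = vectors[target_id]
    scored = [
        (other_id, cosine_similarity(target, vec))
        for other_id, vec in vectors.items()
        if other_id != target_id
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]


# Hypothetical 3-dimensional document vectors; real ones are much longer.
vectors = {
    "paper_a": [1.0, 0.0, 0.0],
    "paper_b": [0.9, 0.1, 0.0],
    "paper_c": [0.0, 1.0, 0.0],
    "paper_d": [-1.0, 0.0, 0.0],
}

ranking = rank_similar("paper_a", vectors)
```

Because cosine similarity depends only on the angle between vectors, two papers of very different length can still score as close if their vectors point in nearly the same direction.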

The 10 most similar working papers to the working paper 'Using Worker Flows in the Analysis of the Firm' are listed below in order of similarity.