CREAT: Census Research Exploration and Analysis Tool

Industry Wage Differentials: A Firm-Based Approach

August 2023

Working Paper Number:

CES-23-40

Abstract

We revisit the estimation of industry wage differentials using linked employer-employee data from the U.S. LEHD program. Building on recent advances in the measurement of employer wage premiums, we define the industry wage effect as the employment-weighted average workplace premium in that industry. We show that cross-sectional estimates of industry differentials overstate the pay premiums due to unmeasured worker heterogeneity. Conversely, estimates based on industry movers understate the true premiums, due to unmeasured heterogeneity in pay premiums within industries. Industry movers who switch to higher-premium industries tend to leave firms in the origin sector that pay above-average premiums and move to firms in the destination sector with below-average premiums (and vice versa), attenuating the measured industry effects. Our preferred estimates reveal substantial heterogeneity in narrowly-defined industry premiums, with a standard deviation of 12%. On average, workers in higher-paying industries have higher observed and unobserved skills, widening between-industry wage inequality. There are also small but systematic differences in industry premiums across cities, with a wider distribution of pay premiums and more worker sorting in cities with more high-premium firms and high-skilled workers.
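
The abstract's definition of the industry wage effect can be written compactly. The notation here is an assumption made for illustration, since the abstract gives no symbols: \psi_f is the pay premium of workplace f, N_f is its employment, and F(k) is the set of workplaces in industry k.

```latex
% Illustrative notation only; these symbols do not come from the abstract.
\Psi_k \;=\; \frac{\sum_{f \in F(k)} N_f \, \psi_f}{\sum_{f \in F(k)} N_f}
```

Under this reading, \Psi_k is the premium an average worker in industry k receives, so larger workplaces count for more than smaller ones.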

Document Tags and Keywords

Keywords

Keywords are automatically generated using KeyBERT, a keyword extraction tool that uses BERT embeddings to select contextually relevant keywords.

By analyzing the content of working papers, KeyBERT identifies terms and phrases that capture the essence of the text, highlighting the most significant topics and trends. This approach not only enhances searchability but also surfaces connections that go beyond potentially domain-specific, author-defined keywords.

economist, industrial, earnings, employee, employed, labor, heterogeneity, unobserved, bias, workplace, workforce, worker, wage effects, industry wages, wage differences, wage industries, wage data, premium, disparity

Tags

Tags are automatically generated using a pretrained language model from spaCy, which supports several tasks, including entity tagging.

The model labels words and phrases by part of speech and entity type, including "organizations." By filtering for frequent words and phrases labeled as "organizations", papers are identified as containing references to specific institutions, datasets, and other organizations.

Ordinary Least Squares, National Longitudinal Survey of Youth, Census Industry Code, North American Industry Classification System, American Community Survey, Longitudinal Employer Household Dynamics, AKM, Census Bureau Disclosure Review Board

Similar Working Papers

Similarity between working papers is determined by an unsupervised neural network model known as Doc2Vec.

Doc2Vec is a model that represents entire documents as fixed-length vectors, allowing for the capture of semantic meaning in a way that relates to the context of words within the document. The model learns to associate a unique vector with each document while simultaneously learning word vectors, enabling tasks such as document classification, clustering, and similarity detection by preserving the order and structure of words. The document vectors are compared using cosine similarity/distance to determine the most similar working papers. Papers identified with 🔥 are in the top 20% of similarity.
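
The comparison step described above can be sketched in a few lines. This is a minimal illustration of ranking by cosine similarity, not the site's actual implementation: the function names, the toy three-dimensional vectors, and the paper labels are all hypothetical, and real Doc2Vec vectors would come from a trained model with many more dimensions.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        # Similarity is undefined for a zero vector; treat it as 0 here.
        return 0.0
    return dot / (norm_u * norm_v)


def most_similar(target, candidates, top_n=10):
    """Return the top_n (name, similarity) pairs, most similar first."""
    scored = [(name, cosine_similarity(target, vec))
              for name, vec in candidates.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]


# Hypothetical document vectors, standing in for Doc2Vec output.
target = [1.0, 0.0, 1.0]
candidates = {
    "paper_a": [1.0, 0.0, 1.0],
    "paper_b": [0.0, 1.0, 0.0],
    "paper_c": [1.0, 1.0, 1.0],
}
ranked = most_similar(target, candidates, top_n=2)
```

Here `ranked` lists `paper_a` first (same direction as the target) and `paper_c` second, while `paper_b`, which is orthogonal to the target, is dropped by the `top_n` cutoff.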

The 10 most similar working papers to the working paper 'Industry Wage Differentials: A Firm-Based Approach' are listed below in order of similarity.