CREAT: Census Research Exploration and Analysis Tool

Trends in Earnings Volatility using Linked Administrative and Survey Data

August 2020

Working Paper Number:

CES-20-24

Abstract

We document trends in earnings volatility separately by gender in combination with other characteristics such as race, educational attainment, and employment status using unique linked survey and administrative data for the tax years spanning 1995-2015. We also decompose the variance of trend volatility into within- and between-group contributions, as well as transitory and permanent shocks. Our results for continuously working men suggest that trend earnings volatility was stable over our period in both survey and tax data, though with a substantial countercyclical business-cycle component. Trend earnings volatility among women declined over the period in both survey and administrative data, but unlike for men, there was no change over the Great Recession. The variance decompositions indicate that nonresponders, low-educated, racial minorities, and part-year workers have the greatest group specific earnings volatility, but with the exception of part-year workers, they contribute least to the level and trend of volatility owing to their small share of the population. There is evidence of stable transitory volatility, but rising permanent volatility over the past two decades in male and female earnings.

Document Tags and Keywords

Keywords Keywords are automatically generated using KeyBERT, a powerful and innovative keyword extraction tool that utilizes BERT embeddings to ensure high-quality and contextually relevant keywords.

By analyzing the content of working papers, KeyBERT identifies terms and phrases that capture the essence of the text, highlighting the most significant topics and trends. This approach not only enhances searchability but provides connections that go beyond potentially domain-specific author-defined keywords.
:
economist, econometric, macroeconomic, earnings, employed, employ, minority, recession, trend, heterogeneity, retirement, workforce, percentile, household, recessionary, poverty, socioeconomic, earner, volatility

Tags Tags are automatically generated using a pretrained language model from spaCy, which excels at several tasks, including entity tagging.

The model is able to label words and phrases by part-of-speech, including "organizations." By filtering for frequent words and phrases labeled as "organizations", papers are identified to contain references to specific institutions, datasets, and other organizations.
:
National Science Foundation, Social Security Administration, Current Population Survey, Chicago Census Research Data Center, Survey of Income and Program Participation, Social Security, Social Security Number, PSID, Detailed Earnings Records, Federal Insurance Contributions Act, Protected Identification Key, Master Earnings File, Census Bureau Disclosure Review Board, Disclosure Review Board, ASEC, Person Validation System, Society of Labor Economists, Federal Statistical Research Data Center, Current Population Survey Annual Social and Economic Supplement

Similar Working Papers Similarity between working papers are determined by an unsupervised neural network model know as Doc2Vec.

Doc2Vec is a model that represents entire documents as fixed-length vectors, allowing for the capture of semantic meaning in a way that relates to the context of words within the document. The model learns to associate a unique vector with each document while simultaneously learning word vectors, enabling tasks such as document classification, clustering, and similarity detection by preserving the order and structure of words. The document vectors are compared using cosine similarity/distance to determine the most similar working papers. Papers identified with 🔥 are in the top 20% of similarity.

The 10 most similar working papers to the working paper 'Trends in Earnings Volatility using Linked Administrative and Survey Data' are listed below in order of similarity.