Using Administrative Earnings Records to Assess Wage Data Quality in the March Current Population Survey and the Survey of Income and Program Participation
November 2002
Working Paper Number:
tp-2002-22
Abstract
Document Tags and Keywords
Keywords
Keywords are automatically generated using KeyBERT, a powerful and innovative
keyword extraction tool that utilizes BERT embeddings to ensure high-quality and contextually relevant
keywords.
By analyzing the content of working papers, KeyBERT identifies terms and phrases that capture the essence of the
text, highlighting the most significant topics and trends. This approach not only enhances searchability but
provides connections that go beyond potentially domain-specific author-defined keywords.
:
statistical,
survey,
respondent,
earnings,
employ,
employed,
yearly,
discrepancy,
salary,
percentile,
population,
labor statistics,
wage data,
income survey,
earn,
earner,
survey income,
assessing,
income year
Tags
Tags are automatically generated using a pretrained language model from spaCy, which excels at
several tasks, including entity tagging.
The model is able to label words and phrases by part-of-speech,
including "organizations." By filtering for frequent words and phrases labeled as "organizations", papers are
identified to contain references to specific institutions, datasets, and other organizations.
:
Bureau of Labor Statistics,
Social Security Administration,
Current Population Survey,
Employer Identification Numbers,
Survey of Income and Program Participation,
Social Security,
Social Security Number,
LEHD Program,
Detailed Earnings Records
Similar Working Papers
Similarity between working papers are determined by an unsupervised neural
network model
know as Doc2Vec.
Doc2Vec is a model that represents entire documents as fixed-length vectors, allowing for the
capture of semantic meaning in a way that relates to the context of words within the document. The model learns to
associate a unique vector with each document while simultaneously learning word vectors, enabling tasks such as
document classification, clustering, and similarity detection by preserving the order and structure of words. The
document vectors are compared using cosine similarity/distance to determine the most similar working papers.
Papers identified with 🔥 are in the top 20% of similarity.
The 10 most similar working papers to the working paper 'Using Administrative Earnings Records to Assess Wage Data Quality in the March Current Population Survey and the Survey of Income and Program Participation' are listed below in order of similarity.
-
Working PaperFurther Evidence from Census 2000 About Earnings by Detailed Occupation for Men and Women: The Role of Race and Hispanic Origin🔥
November 2011
Working Paper Number:
CES-11-37
A 2004 report by the author reviewed data from Census 2000 and concluded "There is a substantial gap in median earnings between men and women that is unexplained, even after controlling for work experience (to the extent it can be represented by age and presence of children), education, and occupati...View Full Paper PDF
-
Working PaperOccupation Inflation in the Current Population Survey
September 2012
Working Paper Number:
CES-12-26
A common caveat often accompanying results relying on household surveys regards respondent error. There is research using independent, presumably error-free administrative data, to estimate the extent of error in the data, the correlates of error, and potential corrections for the error. We investig...View Full Paper PDF
-
Working PaperAn Evaluation of the Gender Wage Gap Using Linked Survey and Administrative Data
November 2020
Working Paper Number:
CES-20-34
The narrowing of the gender wage gap has slowed in recent decades. However, current estimates show that, among full-time year-round workers, women earn approximately 18 to 20 percent less than men at the median. Women's human capital and labor force characteristics that drive wages increasingly rese...View Full Paper PDF
-
Working PaperAn Analysis of Sample Selection and the Reliability of Using Short-term Earnings Averages in SIPP-SSA Matched Data
December 2011
Working Paper Number:
CES-11-39
In this paper, we document the extent to which the sample of the Survey of Income and Program Participation that is matched to the Social Security Administration's administrative earnings records is nationally representative. We conclude that the match bias is small, so selection is not a serious co...View Full Paper PDF
-
Working PaperUnderstanding Earnings Instability: How Important are Employment Fluctuations and Job Changes?
August 2009
Working Paper Number:
CES-09-20
Using three panel datasets (the matched CPS, the SIPP, and the newly available Longitudinal Employment and Household Dynamics (LEHD) data), we examine trends in male earnings instability in recent decades. In contrast to several papers that find a recent upward trend in earnings instability using th...View Full Paper PDF
-
Working PaperSocial, Economic, Spatial, and Commuting Patterns of Informal Jobholders
April 2007
Working Paper Number:
tp-2007-02
A significant number of employees within the United States can be considered "informal" or "off-the-books" workers. These workers, who by definition do not appear in administrative wage records, are distinct from the larger group of private jobholders who do appear in administrative records. However...View Full Paper PDF
-
Working PaperSocial, Economic, Spatial, and Commuting Patterns of Self-Employed Jobholders
April 2007
Working Paper Number:
tp-2007-03
A significant number of employees within the United States identify themselves as selfemployed, and they are distinct from the larger group identified as private jobholders. While socioeconomic and spatial information on these individuals is readily available in standard datasets, such as the 2000 D...View Full Paper PDF
-
Working PaperEarnings Through the Stages: Using Tax Data to Test for Sources of Error in CPS ASEC Earnings and Inequality Measures
September 2024
Working Paper Number:
CES-24-52
In this paper, I explore the impact of generalized coverage error, item non-response bias, and measurement error on measures of earnings and earnings inequality in the CPS ASEC. I match addresses selected for the CPS ASEC to administrative data from 1040 tax returns. I then compare earnings statisti...View Full Paper PDF
-
Working PaperLong-Run Earnings Volatility and Health Insurance Coverage: Evidence from the SIPP Gold Standard File
October 2011
Working Paper Number:
CES-11-35
Despite the notable increase in earnings volatility and the attention paid to the growing ranks of the uninsured, the relationship between career earnings and short- and mediumrun health insurance status has been ignored due to a lack of data. I use a new dataset, the SIPP Gold Standard File, that m...View Full Paper PDF
-
Working PaperSocial, Economic, Spatial, and Commuting Patterns of Dual Jobholders
April 2007
Working Paper Number:
tp-2007-01
Individuals who hold multiple jobs have complex working lives and complex commuting patterns. Economic and spatial information on these individuals is not readily available in standard datasets, such as the 2000 Decennial Census Long Form, because the survey questions were not designed to collect de...View Full Paper PDF