The March Current Population Survey (CPS) and the Survey of Income and Program
Participation (SIPP) produce different aggregates and distributions of annual wages. An excess of
high wages and shortage of low wages occurs in the March CPS. SIPP shows the opposite, an
excess of low wages and shortage of high wages. Exactly-matched Detailed Earnings Records
(DER) from the Social Security Administration allow comparing March CPS and SIPP people's
wages using data independent of the surveys. Findings include the following. March CPS and
SIPP people differ little in their true wage characteristics. March CPS and SIPP represent a
worker's percentile rank better than the dollar amount of wages. Workers with one job and low
work effort have underestimated March CPS wages. March CPS has a higher level of
"underground" wages than SIPP, and increasingly so in the 1990s. March CPS has a higher level
of self-employment income "misclassified" as wages than SIPP, and increasingly so in the 1990s.
These trends may explain one-third of March CPS's 6-percentage-point increase in aggregate
wages relative to independent estimates from 1993 to 1995. Finally, the paper delineates March
CPS occupations disproportionately likely to be absent from the administrative data entirely or to
"misclassify" self-employment income as wages.
-
Further Evidence from Census 2000 About Earnings by Detailed Occupation for Men and Women: The Role of Race and Hispanic Origin
November 2011
Working Paper Number:
CES-11-37
A 2004 report by the author reviewed data from Census 2000 and concluded "There is a substantial gap in median earnings between men and women that is unexplained, even after controlling for work experience (to the extent it can be represented by age and presence of children), education, and occupation." This paper extends the analysis and concludes that once those characteristics are controlled for, no further explanatory power is attributable to race or Hispanic origin.
View Full
Paper PDF
-
Occupation Inflation in the Current Population Survey
September 2012
Working Paper Number:
CES-12-26
A common caveat often accompanying results relying on household surveys regards respondent error. There is research using independent, presumably error-free administrative data, to estimate the extent of error in the data, the correlates of error, and potential corrections for the error. We investigate measurement error in occupation in the Current Population Survey (CPS) using the panel component of the CPS to identify those that incorrectly report changing occupation. We find evidence that individuals are inflating their occupation to higher skilled and higher paying occupations than the ones they actually perform. Occupation inflation biases the education and race coefficients in standard Mincer equation results within occupations.
View Full
Paper PDF
-
An Evaluation of the Gender Wage Gap Using Linked Survey and Administrative Data
November 2020
Working Paper Number:
CES-20-34
The narrowing of the gender wage gap has slowed in recent decades. However, current estimates show that, among full-time year-round workers, women earn approximately 18 to 20 percent less than men at the median. Women's human capital and labor force characteristics that drive wages increasingly resemble men's, so remaining differences in these characteristics explain less of the gender wage gap now than in the past. As these factors wane in importance, studies show that others like occupational and industrial segregation explain larger portions of the gender wage gap. However, a major limitation of these studies is that the large datasets required to analyze occupation and industry effectively lack measures of labor force experience. This study combines survey and administrative data to analyze and improve estimates of the gender wage gap within detailed occupations, while also accounting for gender differences in work experience. We find a gender wage gap of 18 percent among full-time, year-round workers across 316 detailed occupation categories. We show the wage gap varies significantly by occupation: while wages are at parity in some occupations, gaps are as large as 45 percent in others. More competitive and hazardous occupations, occupations that reward longer hours of work, and those that have a larger proportion of women workers have larger gender wage gaps. The models explain less of the wage gap in occupations with these attributes. Occupational characteristics shape the conditions under which men and women work and we show these characteristics can make for environments that are more or less conducive to gender parity in earnings.
View Full
Paper PDF
-
An Analysis of Sample Selection and the Reliability of Using Short-term Earnings Averages in SIPP-SSA Matched Data
December 2011
Working Paper Number:
CES-11-39
In this paper, we document the extent to which the sample of the Survey of Income and Program Participation that is matched to the Social Security Administration's administrative earnings records is nationally representative. We conclude that the match bias is small, so selection is not a serious concern. The matched sample over-represents individuals who are wealthy, who have financial assets or who have received a government-transfer and under-represents individuals who attrited from the SIPP. We use this matched sample to examine the relationship between short-term averages of earnings from the SIPP earnings and average lifetime earnings from the administrative records. Our estimates suggest that using short averages of earnings may understate the effects of permanent income on particular outcomes of interest.
View Full
Paper PDF
-
Earnings Through the Stages: Using Tax Data to Test for Sources of Error in CPS ASEC Earnings and Inequality Measures
September 2024
Working Paper Number:
CES-24-52
In this paper, I explore the impact of generalized coverage error, item non-response bias, and measurement error on measures of earnings and earnings inequality in the CPS ASEC. I match addresses selected for the CPS ASEC to administrative data from 1040 tax returns. I then compare earnings statistics in the tax data for wage and salary earnings in samples corresponding to seven stages of the CPS ASEC survey production process. I also compare the statistics using the actual survey responses. The statistics I examine include mean earnings, the Gini coefficient, percentile earnings shares, and shares of the survey weight for a range of percentiles. I examine how the accuracy of the statistics calculated using the survey data is affected by including imputed responses for both those who did not respond to the full CPS ASEC and those who did not respond to the earnings question. I find that generalized coverage error and item nonresponse bias are dominated by measurement error, and that an important aspect of measurement error is households reporting no wage and salary earnings in the CPS ASEC when there are such earnings in the tax data. I find that the CPS ASEC sample misses earnings at the high end of the distribution from the initial selection stage and that the final survey weights exacerbate this.
View Full
Paper PDF
-
Social, Economic, Spatial, and Commuting Patterns of Self-Employed Jobholders
April 2007
Working Paper Number:
tp-2007-03
A significant number of employees within the United States identify themselves as selfemployed,
and they are distinct from the larger group identified as private jobholders. While
socioeconomic and spatial information on these individuals is readily available in standard
datasets, such as the 2000 Decennial Census Long Form, it is possible to gain further information
on their wage earnings by using data from administrative wage records. This study takes
advantage of firm-based data from Unemployment Insurance administrative wage records linked
with the Census Bureau's household-based data in order to examine self-employed jobholders -
both as a whole and as subgroups defined according to their earned wage status - by their
demographic characteristics as well as their economic, commuting, and spatial location
outcomes. Additionally, this report evaluates whether self-employed jobholders and the defined
subgroups should be included explicitly in future labor-workforce analyses and transportation
modeling. The analyses in this report use the sample of self-employed workers who lived in Los
Angeles County, California.
View Full
Paper PDF
-
Long-Run Earnings Volatility and Health Insurance Coverage: Evidence from the SIPP Gold Standard File
October 2011
Working Paper Number:
CES-11-35
Despite the notable increase in earnings volatility and the attention paid to the growing ranks of the uninsured, the relationship between career earnings and short- and mediumrun health insurance status has been ignored due to a lack of data. I use a new dataset, the SIPP Gold Standard File, that merges health insurance status and demographics from the Survey of Income and Program Participation with career earnings records from the Social Security Administration (SSA) and the Internal Revenue Service (IRS) to examine the relationship between long-run family earnings volatility and health insurance coverage. I find that more volatile career earnings are associated with an increased probability of experiencing an uninsured episode, with larger effects for men, young workers, and the unmarried. These findings are consistent with the 'scarring' literature, and suggest the importance of safety-net measures for job losses and health insurance coverage.
View Full
Paper PDF
-
Incorporating Administrative Data in Survey Weights for the Basic Monthly Current Population Survey
January 2024
Working Paper Number:
CES-24-02
Response rates to the Current Population Survey (CPS) have declined over time, raising the potential for nonresponse bias in key population statistics. A potential solution is to leverage administrative data from government agencies and third-party data providers when constructing survey weights. In this paper, we take two approaches. First, we use administrative data to build a non-parametric nonresponse adjustment step while leaving the calibration to population estimates unchanged. Second, we use administratively linked data in the calibration process, matching income data from the Internal Return Service and state agencies, demographic data from the Social Security Administration and the decennial census, and industry data from the Census Bureau's Business Register to both responding and nonresponding households. We use the matched data in the household nonresponse adjustment of the CPS weighting algorithm, which changes the weights of respondents to account for differential nonresponse rates among subpopulations.
After running the experimental weighting algorithm, we compare estimates of the unemployment rate and labor force participation rate between the experimental weights and the production weights. Before March 2020, estimates of the labor force participation rates using the experimental weights are 0.2 percentage points higher than the original estimates, with minimal effect on unemployment rate. After March 2020, the new labor force participation rates are similar, but the unemployment rate is about 0.2 percentage points higher in some months during the height of COVID-related interviewing restrictions. These results are suggestive that if there is any nonresponse bias present in the CPS, the magnitude is comparable to the typical margin of error of the unemployment rate estimate. Additionally, the results are overall similar across demographic groups and states, as well as using alternative weighting methodology. Finally, we discuss how our estimates compare to those from earlier papers that calculate estimates of bias in key CPS labor force statistics.
This paper is for research purposes only. No changes to production are being implemented at this time.
View Full
Paper PDF
-
Errors in Survey Reporting and Imputation and Their Effects on Estimates of Food Stamp Program Participation
April 2011
Working Paper Number:
CES-11-14
Benefit receipt in major household surveys is often underreported. This misreporting leads to biased estimates of the economic circumstances of disadvantaged populations, program takeup, and the distributional effects of government programs, and other program effects. We use administrative data on Food Stamp Program (FSP) participation matched to American Community Survey (ACS) and Current Population Survey (CPS) household data. We show that nearly thirty-five percent of true recipient households do not report receipt in the ACS and fifty percent do not report receipt in the CPS. Misreporting, both false negatives and false positives, varies with individual characteristics, leading to complicated biases in FSP analyses. We then directly examine the determinants of program receipt using our combined administrative and survey data. The combined data allow us to examine accurate participation using individual characteristics missing in administrative data. Our results differ from conventional estimates using only survey data, as such estimates understate participation by single parents, non-whites, low income households, and other groups. To evaluate the use of Census Bureau imputed ACS and CPS data, we also examine whether our estimates using survey data alone are closer to those using the accurate combined data when imputed survey observations are excluded. Interestingly, excluding the imputed observations leads to worse ACS estimates, but has less effect on the CPS estimates.
View Full
Paper PDF
-
The Recent Decline of Single Quarter Jobs
January 2015
Working Paper Number:
CES-15-05
Rates of hiring and job separation fell by as much as a third in the U.S. between the late 1990s and the early 2010s. Half of this decline is associated with the declining incidence of jobs that start and end in the same calendar quarter, employment events that we call 'single quarter jobs.' We investigate this unique subset of jobs and its decline using matched employer-employee data for the years 1996-2012. We characterize the worker demographics and employer characteristics of single quarter jobs, and demonstrate that changes over time in workforce and employer composition explain little of the decline in these jobs. We find that the decline in these jobs accounts for about a third of the decline in the fraction of the population that holds a job in the private sector that occurred from the mid 2000s to the early 2010s. We also find little evidence that single quarter jobs are stepping stones into longer-term employment. Finally, we show that the inclusion or exclusion of these single quarter jobs creates divergent trends in average earnings and the dispersion of earnings for the years 1996-2012. To the extent that administrative records measure the volatile tail of the employment distribution better thanconventional household surveys, these findings show that measurement of short duration jobs matters for economic analysis.
View Full
Paper PDF