Using the Current Population Survey Annual Social and Economic Supplement matched to Social Security Administration Detailed Earnings Records, we link observations across consecutive years to investigate a relationship between item nonresponse and measurement error in the earnings questions. Linking individuals across consecutive years allows us to observe switching from response to nonresponse and vice versa. We estimate OLS, IV, and finite mixture models that allow for various assumptions separately for men and women. We find that those who respond in both years of the survey exhibit less measurement error than those who respond in one year. Our findings suggest a trade-off between survey response and data quality that should be considered by survey designers, data collectors, and data users.
-
Trends in Earnings Volatility using Linked Administrative and Survey Data
August 2020
Working Paper Number:
CES-20-24
We document trends in earnings volatility separately by gender in combination with other characteristics such as race, educational attainment, and employment status using unique linked survey and administrative data for the tax years spanning 1995-2015. We also decompose the variance of trend volatility into within- and between-group contributions, as well as transitory and permanent shocks. Our results for continuously working men suggest that trend earnings volatility was stable over our period in both survey and tax data, though with a substantial countercyclical business-cycle component. Trend earnings volatility among women declined over the period in both survey and administrative data, but unlike for men, there was no change over the Great Recession. The variance decompositions indicate that nonresponders, low-educated, racial minorities, and part-year workers have the greatest group specific earnings volatility, but with the exception of part-year workers, they contribute least to the level and trend of volatility owing to their small share of the population. There is evidence of stable transitory volatility, but rising permanent volatility over the past two decades in male and female earnings.
-
The Antipoverty Impact of the EITC: New Estimates from Survey and Administrative Tax Records
April 2019
Working Paper Number:
CES-19-14R
We reassess the antipoverty effects of the EITC using unique data linking the CPS Annual Social and Economic Supplement to IRS data for the same individuals spanning years 2005-2016. We compare EITC benefits from standard simulators to administrative EITC payments and find that significantly more actual EITC payments flow to childless tax units than predicted, and to those whose family income places them above official poverty thresholds. However, actual EITC payments appear to be target efficient at the tax unit level. In 2016, about 3.1 million persons were lifted out of poverty by the EITC, substantially less than prior estimates.
-
Interpreting Cohort Profiles of Lifecycle Earnings Volatility
April 2024
Working Paper Number:
CES-24-21
We present new estimates of earnings volatility over time and the lifecycle for men and women by race and human capital. Using a long panel of restricted-access administrative Social Security earnings linked to the Current Population Survey, we estimate volatility with both transparent summary measures and decompositions into permanent and transitory components. From the late 1970s to the mid 1990s there is a strong negative trend in earnings volatility for both men and women. We show this is driven by a reduction in transitory variance. Starting in the mid 1990s there is relative stability in trends of male earnings volatility because of an increase in the variance of permanent shocks, especially among workers without a college education, and a more attenuated trend decline among women. Cohort analyses indicate a strong U-shape pattern of volatility over the working life, which comes from large permanent shocks early and later in the lifecycle. However, this U-shape shifted downward and leftward in more recent cohorts, the latter from the fanning out of lifecycle transitory volatility in younger cohorts. These patterns are more pronounced among White men and women compared to Black workers.
-
Estimating Measurement Error in SIPP Annual Job Earnings: A Comparison of Census Bureau Survey and SSA Administrative Data
July 2011
Working Paper Number:
CES-11-20
We quantify sources of variation in annual job earnings data collected by the Survey of Income and Program Participation (SIPP) to determine how much of the variation is the result of measurement error. Jobs reported in the SIPP are linked to jobs reported in an administrative database, the Detailed Earnings Records (DER) drawn from the Social Security Administration's Master Earnings File, a universe file of all earnings reported on W-2 tax forms. As a result of the match, each job potentially has two earnings observations per year: survey and administrative. Unlike previous validation studies, both of these earnings measures are viewed as noisy measures of some underlying true amount of annual earnings. While the existence of survey error resulting from respondent mistakes or misinterpretation is widely accepted, the idea that administrative data are also error-prone is new. Possible sources of employer reporting error, employee under-reporting of compensation such as tips, and general differences between how earnings may be reported on tax forms and in surveys necessitate discarding the assumption that administrative data are a true measure of the quantity that the survey was designed to collect. In addition, errors in matching SIPP and DER jobs, a necessary task in any use of administrative data, also contribute to measurement error in both earnings variables. We begin by comparing SIPP and DER earnings for different demographic and education groups of SIPP respondents. We also calculate different measures of changes in earnings for individuals switching jobs. We estimate a standard earnings equation model using SIPP and DER earnings and compare the resulting coefficients. Finally, exploiting the presence of individuals with multiple jobs and shared employers over time, we estimate an econometric model that includes random person and firm effects, a common error component shared by SIPP and DER earnings, and two independent error components that represent the variation unique to each earnings measure. We compare the variance components from this model and consider how the DER and SIPP differ across unobservable components.
-
Response Error & the Medicaid undercount in the CPS
December 2016
Working Paper Number:
carra-2016-11
The Current Population Survey Annual Social and Economic Supplement (CPS ASEC) is an important source for estimates of the uninsured population. Previous research has shown that survey estimates produce an undercount of beneficiaries compared to Medicaid enrollment records. We extend past work by examining the Medicaid undercount in the 2007-2011 CPS ASEC compared to enrollment data from the Medicaid Statistical Information System for calendar years 2006-2010. By linking individuals across datasets, we analyze two types of response error regarding Medicaid enrollment - false negative error and false positive error. We use regression analysis to identify factors associated with these two types of response error in the 2011 CPS ASEC. We find that the Medicaid undercount was between 22 and 31 percent from 2007 to 2011. In 2011, the false negative rate was 40 percent, and 27 percent of Medicaid reports in CPS ASEC were false positives. False negative error is associated with the duration of enrollment in Medicaid, enrollment in Medicare and private insurance, and Medicaid enrollment in the survey year. False positive error is associated with enrollment in Medicare and shared Medicaid coverage in the household. We discuss implications for survey reports of health insurance coverage and for estimating the uninsured population.
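The two response-error measures named in this abstract can be illustrated with a minimal sketch. It assumes each linked person is represented by a pair (survey_report, admin_enrolled); the function name and the toy records are hypothetical and are not taken from the paper. Following the abstract's wording, the false negative rate is computed among administratively enrolled persons, and the false positive share is computed among survey reports of Medicaid.

```python
def response_error_rates(records):
    """Return (false_negative_rate, false_positive_share) for linked records.

    Each record is a pair (survey_report, admin_enrolled) of booleans.
    This layout is an illustrative assumption, not the paper's data format.
    """
    records = list(records)
    enrolled = [r for r in records if r[1]]   # enrolled per administrative data
    reported = [r for r in records if r[0]]   # reported Medicaid in the survey

    # False negative: enrolled in the records but no Medicaid report in the survey.
    false_neg = sum(1 for survey, _ in enrolled if not survey)
    # False positive: Medicaid reported in the survey but no enrollment record.
    false_pos = sum(1 for _, admin in reported if not admin)

    fn_rate = false_neg / len(enrolled) if enrolled else float("nan")
    fp_share = false_pos / len(reported) if reported else float("nan")
    return fn_rate, fp_share


# Hypothetical toy data: 6 correct reports, 4 false negatives,
# 2 false positives, 8 correct non-reports.
toy = (
    [(True, True)] * 6
    + [(False, True)] * 4
    + [(True, False)] * 2
    + [(False, False)] * 8
)
fn_rate, fp_share = response_error_rates(toy)
```

Note that the two measures use different denominators, so they need not sum to anything meaningful.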
-
Earnings Mobility in the US: A New Look at Intergenerational Inequality
May 2002
Working Paper Number:
CES-02-11
This study uses a new data set that contains the Social Security earnings histories of parents and children in the 1984 Survey of Income and Program Participation, to measure the intergenerational elasticity in earnings in the United States. Earlier studies that found an intergenerational elasticity of 0.4 have typically used only up to five-year averages of fathers' earnings to measure fathers' permanent earnings. However, dynamic earnings models that allow for serial correlation in transitory shocks to earnings imply that using such a short time span may lead to estimates that are biased down by nearly 30 percent. Indeed, by using many more years of fathers' earnings than earlier studies, the intergenerational elasticity between fathers and sons is estimated to be around 0.6 implying significantly less mobility in the U.S. than previous research indicated. The elasticity in earnings between fathers and daughters is of a similar magnitude. The evidence also suggests that family income has an even larger effect than fathers' earnings on children's future labor market success. The elasticity of earnings is higher for families with low net worth, offering some empirical support for theoretical models that predict differences due to borrowing constraints. Some evidence of a higher elasticity among blacks is found but the results are not conclusive.
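The downward bias from short averages of fathers' earnings can be illustrated with the textbook errors-in-variables attenuation factor. The sketch below assumes transitory shocks are independent across years, which is simpler than the serially correlated dynamics the abstract refers to; the function name and the variance values are hypothetical and are not estimates from the paper.

```python
def attenuation_factor(var_permanent, var_transitory, n_years):
    """Share of the true elasticity recovered by OLS when fathers' permanent
    earnings are proxied by an n_years average of annual log earnings.

    Assumes transitory shocks are i.i.d. across years (an illustrative
    simplification), so averaging shrinks their variance by 1 / n_years.
    """
    noise = var_transitory / n_years
    return var_permanent / (var_permanent + noise)


# Hypothetical variances, chosen only to show the direction of the effect.
one_year = attenuation_factor(1.0, 1.0, 1)
five_year = attenuation_factor(1.0, 1.0, 5)
fifteen_year = attenuation_factor(1.0, 1.0, 15)
```

Under these assumptions the factor rises toward one as more years are averaged. With serially correlated transitory shocks, averaging removes noise more slowly, so the remaining bias for a given number of years would be larger than this sketch implies.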
-
The Measurement of Medicaid Coverage in the SIPP: Evidence from California, 1990-1996
September 2002
Working Paper Number:
CES-02-21
This paper studies the accuracy of reported Medicaid coverage in the Survey of Income and Program Participation (SIPP) using a unique data set formed by matching SIPP survey responses to administrative records from the State of California. Overall, we estimate that the SIPP underestimates Medicaid coverage in the California population by about 10 percent. Among SIPP respondents who can be matched to administrative records, we estimate that the probability someone reports Medicaid coverage in a month when they are actually covered is around 85 percent. The corresponding probability for low-income children is even higher: at least 90 percent. These estimates suggest that the SIPP provides reasonably accurate coverage reports for those who are actually in the Medicaid system. On the other hand, our estimate of the false positive rate (the rate of reported coverage for those who are not covered in the administrative records) is relatively high: 2.5 percent for the sample as a whole, and up to 20 percent for poor children. Some of this is due to errors in the recording of Social Security numbers in the administrative system, rather than to problems in the SIPP.
-
The Mis-Measurement of Permanent Earnings: New Evidence from Social Security Earnings Data
May 2002
Working Paper Number:
CES-02-12
This study investigates the reliability of using short-term averages of earnings as a proxy for permanent earnings in empirical research. An earnings dynamics model is estimated on a large sample of men covering the period from 1983 to 1997 following the cohort-based methodology of Baker and Solon (1999). The analysis uses a unique dataset that matches men in the 1984, 1990 and 1996 Surveys of Income and Program Participation (SIPP) to the Social Security Administration's Summary Earnings Records (SER). The results confirm that using a short-term average of earnings can lead to spurious estimates of the effect of lifetime earnings on a particular outcome. In addition, the transitory variance appears to vary considerably over the lifecycle. The share of earnings variance due to transitory factors is higher among blacks and the persistence of transitory shocks appears to be greater for this group as well. Finally, the transitory variance appears to be a more important factor in explaining the overall earnings variance of college educated men than those without college.
-
Using the P90/P10 Index to Measure U.S. Inequality Trends with Current Population Survey Data: A View From Inside the Census Bureau Vaults
June 2007
Working Paper Number:
CES-07-17
The March Current Population Survey (CPS) is the primary data source for estimation of levels and trends in labor earnings and income inequality in the USA. Time-inconsistency problems related to top coding in these data have led many researchers to use the ratio of the 90th and 10th percentiles of these distributions (P90/P10) rather than a more traditional summary measure of inequality. With access to public use and restricted-access internal CPS data, and bounding methods, we show that using P90/P10 does not completely obviate time-inconsistency problems, especially for household income inequality trends. Using internal data, we create consistent cell mean values for all top-coded public use values that, when used with public use data, closely track inequality trends in labor earnings and household income using internal data. But estimates of longer-term inequality trends with these corrected data based on P90/P10 differ from those based on the Gini coefficient. The choice of inequality measure matters.
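The motivation for preferring P90/P10 under top coding can be seen in a small sketch. It assumes a toy income vector and a hypothetical top-code threshold of 150, neither of which comes from the paper, and it uses a simple linear-interpolation percentile rather than any CPS weighting scheme. It shows only the idealized case in which the top code lies above the 90th percentile; the abstract's finding is that in practice P90/P10 does not fully escape the problem.

```python
def percentile(values, q):
    """q-th percentile (0-100) with linear interpolation between order statistics."""
    xs = sorted(values)
    pos = (len(xs) - 1) * q / 100
    lo = int(pos)
    hi = min(lo + 1, len(xs) - 1)
    return xs[lo] + (xs[hi] - xs[lo]) * (pos - lo)


def p90_p10(values):
    """Ratio of the 90th to the 10th percentile."""
    return percentile(values, 90) / percentile(values, 10)


def gini(values):
    """Gini coefficient from the rank-weighted sum of sorted values."""
    xs = sorted(values)
    n = len(xs)
    weighted = sum(rank * x for rank, x in enumerate(xs, start=1))
    return 2 * weighted / (n * sum(xs)) - (n + 1) / n


# Hypothetical incomes with one very high value, then top-coded at 150.
incomes = [10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 1000]
top_coded = [min(x, 150) for x in incomes]

ratio_raw, ratio_capped = p90_p10(incomes), p90_p10(top_coded)
gini_raw, gini_capped = gini(incomes), gini(top_coded)
```

In this toy case the ratio is unchanged by the cap while the Gini coefficient falls, because only the Gini coefficient uses the values above the 90th percentile.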
-
Occupation Inflation in the Current Population Survey
September 2012
Working Paper Number:
CES-12-26
A common caveat accompanying results that rely on household surveys concerns respondent error. Existing research uses independent, presumably error-free administrative data to estimate the extent of error in the data, the correlates of error, and potential corrections for the error. We investigate measurement error in occupation in the Current Population Survey (CPS) using the panel component of the CPS to identify those who incorrectly report changing occupation. We find evidence that individuals are inflating their occupation to higher skilled and higher paying occupations than the ones they actually perform. Occupation inflation biases the education and race coefficients in standard Mincer equation results within occupations.