CREAT - Census Bureau

Earnings Through the Stages: Using Tax Data to Test for Sources of Error in CPS ASEC Earnings and Inequality Measures

September 2024

Written by: Ethan Krohn

Working Paper Number:

CES-24-52

Abstract

In this paper, I explore the impact of generalized coverage error, item non-response bias, and measurement error on measures of earnings and earnings inequality in the CPS ASEC. I match addresses selected for the CPS ASEC to administrative data from 1040 tax returns. I then compare earnings statistics in the tax data for wage and salary earnings in samples corresponding to seven stages of the CPS ASEC survey production process. I also compare the statistics using the actual survey responses. The statistics I examine include mean earnings, the Gini coefficient, percentile earnings shares, and shares of the survey weight for a range of percentiles. I examine how the accuracy of the statistics calculated using the survey data is affected by including imputed responses for both those who did not respond to the full CPS ASEC and those who did not respond to the earnings question. I find that generalized coverage error and item nonresponse bias are dominated by measurement error, and that an important aspect of measurement error is households reporting no wage and salary earnings in the CPS ASEC when there are such earnings in the tax data. I find that the CPS ASEC sample misses earnings at the high end of the distribution from the initial selection stage and that the final survey weights exacerbate this.

Document Tags and Keywords

Keywords:

survey, respondent, earnings, imputation, revenue, insurance, bias, salary, percentile, tax, coverage, filing, 1040

Tags:

Internal Revenue Service, Social Security Administration, Current Population Survey, Detailed Earnings Records, Master Address File, Census Bureau Disclosure Review Board, ASEC, MAFID, Adjusted Gross Income

Similar Working Papers

The 10 most similar working papers to the working paper 'Earnings Through the Stages: Using Tax Data to Test for Sources of Error in CPS ASEC Earnings and Inequality Measures' are listed below in order of similarity.

Working Paper

National Experimental Wellbeing Statistics - Version 1

February 2023

Authors: Nikolas Mittag, Joshua Mitchell, Adam Bee, Jonathan Rothbaum, Carl Sanders, Lawrence Schmidt, Matthew Unrath

Working Paper Number:

CES-23-04

This is the U.S. Census Bureau's first release of the National Experimental Wellbeing Statistics (NEWS) project. The NEWS project aims to produce the best possible estimates of income and poverty given all available survey and administrative data. We link survey, decennial census, administrative, and third-party data to address measurement error in income and poverty statistics. We estimate improved (pre-tax money) income and poverty statistics for 2018 by addressing several possible sources of bias documented in prior research. We address biases from 1) unit nonresponse through improved weights, 2) missing income information in both survey and administrative data through improved imputation, and 3) misreporting by combining or replacing survey responses with administrative information. Reducing survey error substantially affects key measures of well-being: We estimate median household income is 6.3 percent higher than in survey estimates, and poverty is 1.1 percentage points lower. These changes are driven by subpopulations for which survey error is particularly relevant. For house holders aged 65 and over, median household income is 27.3 percent higher and poverty is 3.3 percentage points lower than in survey estimates. We do not find a significant impact on median household income for householders under 65 or on child poverty. Finally, we discuss plans for future releases: addressing other potential sources of bias, releasing additional years of statistics, extending the income concepts measured, and including smaller geographies such as state and county.
View Full Paper PDF
Working Paper

Investigating the Use of Administrative Records in the Consumer Expenditure Survey

March 2018

Authors: Quentin Brummet, Joshua Mitchell, John Voorheis, Denise Flanagan-Doyle, Laura Erhard, Brett McBride

Working Paper Number:

carra-2018-01

In this paper, we investigate the potential of applying administrative records income data to the Consumer Expenditure (CE) survey to inform measurement error properties of CE estimates, supplement respondent-collected data, and estimate the representativeness of the CE survey by income level. We match individual responses to Consumer Expenditure Quarterly Interview Survey data collected from July 2013 through December 2014 to IRS administrative data in order to analyze CE questions on wages, social security payroll deductions, self-employment income receipt and retirement income. We find that while wage amounts are largely in alignment between the CE and administrative records in the middle of the wage distribution, there is evidence that wages are over-reported to the CE at the bottom of the wage distribution and under-reported at the top of the wage distribution. We find mixed evidence for alignment between the CE and administrative records on questions covering payroll deductions and self-employment income receipt, but find substantial divergence between CE responses and administrative records when examining retirement income. In addition to the analysis using person-based linkages, we also match responding and non-responding CE sample units to the universe of IRS 1040 tax returns by address to examine non-response bias. We find that non-responding households are substantially richer than responding households, and that very high income households are less likely to respond to the CE.
View Full Paper PDF
Working Paper

Using Linked Survey and Administrative Data to Better Measure Income: Implications for Poverty, Program Effectiveness and Holes in the Safety Net

October 2015

Authors: Bruce Meyer, Nikolas Mittag

Working Paper Number:

CES-15-35

We examine the consequences of underreporting of transfer programs in household survey data for several prototypical analyses of low-income populations. We focus on the Current Population Survey (CPS), the source of official poverty and inequality statistics, but provide evidence that our qualitative conclusions are likely to apply to other surveys. We link administrative data for food stamps, TANF, General Assistance, and subsidized housing from New York State to the CPS at the individual level. Program receipt in the CPS is missed for over one-third of housing assistance recipients, 40 percent of food stamp recipients and 60 percent of TANF and General Assistance recipients. Dollars of benefits are also undercounted for reporting recipients, particularly for TANF, General Assistance and housing assistance. We find that the survey data sharply understate the income of poor households, as conjectured in past work by one of the authors. Underreporting in the survey data also greatly understates the effects of anti-poverty programs and changes our understanding of program targeting, often making it seem that welfare programs are less targeted to both the very poorest and middle income households than they are. Using the combined data rather than survey data alone, the poverty reducing effect of all programs together is nearly doubled while the effect of housing assistance is tripled. We also re-examine the coverage of the safety net, specifically the share of people without work or program receipt. Using the administrative measures of program receipt rather than the survey ones often reduces the share of single mothers falling through the safety net by one-half or more.
View Full Paper PDF
Working Paper

Incorporating Administrative Data in Survey Weights for the 2018-2022 Survey of Income and Program Participation

October 2024

Authors: Jonathan Eggleston, Julia Yang

Working Paper Number:

CES-24-58

Response rates to the Survey of Income and Program Participation (SIPP) have declined over time, raising the potential for nonresponse bias in survey estimates. A potential solution is to leverage administrative data from government agencies and third-party data providers when constructing survey weights. In this paper, we modify various parts of the SIPP weighting algorithm to incorporate such data. We create these new weights for the 2018 through 2022 SIPP panels and examine how the new weights affect survey estimates. Our results show that before weighting adjustments, SIPP respondents in these panels have higher socioeconomic status than the general population. Existing weighting procedures reduce many of these differences. Comparing SIPP estimates between the production weights and the administrative data-based weights yields changes that are not uniform across the joint income and program participation distribution. Unlike other Census Bureau household surveys, there is no large increase in nonresponse bias in SIPP due to the COVID-19 Pandemic. In summary, the magnitude and sign of nonresponse bias in SIPP is complicated, and the existing weighting procedures may change the sign of nonresponse bias for households with certain incomes and program benefit statuses.
View Full Paper PDF
Working Paper

Alternative Measures of Income Poverty and the Anti-Poverty Effects of Taxes and Transfers

June 2005

Authors: Daniel Weinberg

Working Paper Number:

CES-05-08

The Census Bureau prepared a number of alternative income-based measures of poverty to illustrate the distributional impacts of several alternatives to the official measure. The paper examines five income variants for two different units of analysis (families and households) for two different assumptions about inflation (the historical Consumer Price Index and a 'Research Series' alternative that uses current methods) for two different sets of thresholds (official and a formula-based alternative base on three parameters). The poverty rate effects are analyzed for the total population, the distributional effects are analyzed using poverty shares, and the anti-poverty effects of taxes and transfers are analyzed using a percentage reduction in poverty rates. Suggestions for future research are included.
View Full Paper PDF
Working Paper

The Antipoverty Impact of the EITC: New Estimates from Survey and Administrative Tax Records

April 2019

Authors: Maggie R. Jones, James P. Ziliak

Working Paper Number:

CES-19-14R

We reassess the antipoverty effects of the EITC using unique data linking the CPS Annual Social and Economic Supplement to IRS data for the same individuals spanning years 2005-2016. We compare EITC benefits from standard simulators to administrative EITC payments and find that significantly more actual EITC payments flow to childless tax units than predicted, and to those whose family income places them above official poverty thresholds. However, actual EITC payments appear to be target efficient at the tax unit level. In 2016, about 3.1 million persons were lifted out of poverty by the EITC, substantially less than prior estimates.
View Full Paper PDF
Working Paper

Errors in Survey Reporting and Imputation and Their Effects on Estimates of Food Stamp Program Participation

April 2011

Authors: Bruce Meyer, Robert Goerge

Working Paper Number:

CES-11-14

Benefit receipt in major household surveys is often underreported. This misreporting leads to biased estimates of the economic circumstances of disadvantaged populations, program takeup, and the distributional effects of government programs, and other program effects. We use administrative data on Food Stamp Program (FSP) participation matched to American Community Survey (ACS) and Current Population Survey (CPS) household data. We show that nearly thirty-five percent of true recipient households do not report receipt in the ACS and fifty percent do not report receipt in the CPS. Misreporting, both false negatives and false positives, varies with individual characteristics, leading to complicated biases in FSP analyses. We then directly examine the determinants of program receipt using our combined administrative and survey data. The combined data allow us to examine accurate participation using individual characteristics missing in administrative data. Our results differ from conventional estimates using only survey data, as such estimates understate participation by single parents, non-whites, low income households, and other groups. To evaluate the use of Census Bureau imputed ACS and CPS data, we also examine whether our estimates using survey data alone are closer to those using the accurate combined data when imputed survey observations are excluded. Interestingly, excluding the imputed observations leads to worse ACS estimates, but has less effect on the CPS estimates.
View Full Paper PDF
Working Paper

Estimating Trends in U.S. Income Inequality Using the Current Population Survey: The Importance of Controlling for Censoring

August 2008

Authors: Richard Burkhauser, Shuaizhang Feng, Stephen Jenkins, Jeff Larrimore

Working Paper Number:

CES-08-25

Using internal and public use March Current Population Survey (CPS) data, we analyze trends in US income inequality (1975'2004). We find that the upward trend in income inequality prior to 1993 significantly slowed thereafter once we control for top coding in the public use data and censoring in the internal data. Because both series do not capture trends at the very top of the income distribution, we use a multiple imputation approach in which values for censored observations are imputed using draws from a Generalized Beta distribution of the Second Kind (GB2) fitted to internal data. Doing so, we find income inequality trends similar to those derived from unadjusted internal data. Our trend results are generally robust to the choice of inequality index, whether Gini coefficient or other commonly-used indices. When we compare our best estimates of the income shares held by the richest tenth with those reported by Piketty and Saez (2003), our trends fairly closely match their trends, except for the top 1 percent of the distribution. Thus, we argue that if United States income inequality has been substantially increasing since 1993, such increases are confined to this very high income group.
View Full Paper PDF
Working Paper

Incorporating Administrative Data in Survey Weights for the Basic Monthly Current Population Survey

January 2024

Authors: John Voorheis, Jonathan Eggleston, Carl Lieberman, Yarissa Gonzalez, Tim Trudell

Working Paper Number:

CES-24-02

Response rates to the Current Population Survey (CPS) have declined over time, raising the potential for nonresponse bias in key population statistics. A potential solution is to leverage administrative data from government agencies and third-party data providers when constructing survey weights. In this paper, we take two approaches. First, we use administrative data to build a non-parametric nonresponse adjustment step while leaving the calibration to population estimates unchanged. Second, we use administratively linked data in the calibration process, matching income data from the Internal Return Service and state agencies, demographic data from the Social Security Administration and the decennial census, and industry data from the Census Bureau's Business Register to both responding and nonresponding households. We use the matched data in the household nonresponse adjustment of the CPS weighting algorithm, which changes the weights of respondents to account for differential nonresponse rates among subpopulations. After running the experimental weighting algorithm, we compare estimates of the unemployment rate and labor force participation rate between the experimental weights and the production weights. Before March 2020, estimates of the labor force participation rates using the experimental weights are 0.2 percentage points higher than the original estimates, with minimal effect on unemployment rate. After March 2020, the new labor force participation rates are similar, but the unemployment rate is about 0.2 percentage points higher in some months during the height of COVID-related interviewing restrictions. These results are suggestive that if there is any nonresponse bias present in the CPS, the magnitude is comparable to the typical margin of error of the unemployment rate estimate. Additionally, the results are overall similar across demographic groups and states, as well as using alternative weighting methodology. Finally, we discuss how our estimates compare to those from earlier papers that calculate estimates of bias in key CPS labor force statistics. This paper is for research purposes only. No changes to production are being implemented at this time.
View Full Paper PDF
Working Paper

Comparing Earnings Outcome Differences Between All Graduates and Title IV Graduates

August 2021

Authors: Andrew Foote

Working Paper Number:

CES-21-19

Recently, two public data products have been released that publish earnings outcomes for college graduates by program of study and institution: Post-Secondary Employment Outcomes and College Scorecard, from the Census Bureau and U.S. Department of Education, respectively. While the earnings data underlying the data products is similar, persons eligible for the frames of the two products is different, with College Scorecard restricted to only students that receive Title IV aid. This paper documents how these differences in the population studied affect the published earnings outcomes. I show that at an institution, of the institutions in my sample, an average of sixty percent of baccalaureate graduates receive Title IV aid, and that the lower the coverage, the large the difference in earnings measurement. Additionally, I show that short-run earnings outcomes are very similar for these two samples, while longer-run outcomes (10 years after graduation) are significantly lower for the Title IV population. I also show that program ranking can change significantly when considering the Title IV population rather than the entire graduate population.
View Full Paper PDF

Earnings Through the Stages: Using Tax Data to Test for Sources of Error in CPS ASEC Earnings and Inequality Measures

September 2024

Working Paper Number:

CES-24-52

Abstract

Document Tags and Keywords

The 10 most similar working papers to the working paper 'Earnings Through the Stages: Using Tax Data to Test for Sources of Error in CPS ASEC Earnings and Inequality Measures' are listed below in order of similarity.

February 2023

Working Paper Number:

CES-23-04

March 2018

Working Paper Number:

carra-2018-01

October 2015

Working Paper Number:

CES-15-35

October 2024

Working Paper Number:

CES-24-58

June 2005

Working Paper Number:

CES-05-08

April 2019

Working Paper Number:

CES-19-14R

April 2011

Working Paper Number:

CES-11-14

August 2008

Working Paper Number:

CES-08-25

January 2024

Working Paper Number:

CES-24-02

August 2021

Working Paper Number:

CES-21-19