CREAT - Census Bureau

Measuring Inequality Using Censored Data: A Multiple Imputation Approach

April 2009

Written by: Richard Burkhauser, Shuaizhang Feng, Stephen Jenkins, Jeff Larrimore

Working Paper Number:

CES-09-05

Abstract

To measure income inequality with right censored (topcoded) data, we propose multiple imputation for censored observations using draws from Generalized Beta of the Second Kind distributions to provide partially synthetic datasets analyzed using complete data methods. Estimation and inference uses Reiter's (Survey Methodology 2003) formulae. Using Current Population Survey (CPS) internal data, we find few statistically significant differences in income inequality for pairs of years between 1995 and 2004. We also show that using CPS public use data with cell mean imputations may lead to incorrect inferences about inequality differences. Multiply-imputed public use data provide an intermediate solution.

Document Tags and Keywords

Keywords:

estimating, data, statistical, census research, survey, estimator, earnings, imputation, bias, salary, percentile, inference, distribution, datasets, poorer, income data, income distributions

Tags:

Internal Revenue Service, National Science Foundation, Organization for Economic Cooperation and Development, Current Population Survey, Cornell University, Cornell Institute for Social and Economic Research, Special Sworn Status

Similar Working Papers

The 10 most similar working papers to the working paper 'Measuring Inequality Using Censored Data: A Multiple Imputation Approach' are listed below in order of similarity.

Working Paper
🔥

Estimating Trends in U.S. Income Inequality Using the Current Population Survey: The Importance of Controlling for Censoring

August 2008

Authors: Richard Burkhauser, Shuaizhang Feng, Stephen Jenkins, Jeff Larrimore

Working Paper Number:

CES-08-25

Using internal and public use March Current Population Survey (CPS) data, we analyze trends in US income inequality (1975'2004). We find that the upward trend in income inequality prior to 1993 significantly slowed thereafter once we control for top coding in the public use data and censoring in the internal data. Because both series do not capture trends at the very top of the income distribution, we use a multiple imputation approach in which values for censored observations are imputed using draws from a Generalized Beta distribution of the Second Kind (GB2) fitted to internal data. Doing so, we find income inequality trends similar to those derived from unadjusted internal data. Our trend results are generally robust to the choice of inequality index, whether Gini coefficient or other commonly-used indices. When we compare our best estimates of the income shares held by the richest tenth with those reported by Piketty and Saez (2003), our trends fairly closely match their trends, except for the top 1 percent of the distribution. Thus, we argue that if United States income inequality has been substantially increasing since 1993, such increases are confined to this very high income group.
View Full Paper PDF
Working Paper
🔥

Using the P90/P10 Index to Measure U.S. Inequality Trends with Current Population Survey Data: A View From Inside the Census Bureau Vaults

June 2007

Authors: Richard Burkhauser, Shuaizhang Feng, Stephen Jenkins

Working Paper Number:

CES-07-17

The March Current Population Survey (CPS) is the primary data source for estimation of levels and trends in labor earnings and income inequality in the USA. Time-inconsistency problems related to top coding in theses data have led many researchers to use the ratio of the 90th and 10th percentiles of these distributions (P90/P10) rather than a more traditional summary measure of inequality. With access to public use and restricted-access internal CPS data, and bounding methods, we show that using P90/P10 does not completely obviate time inconsistency problems, especially for household income inequality trends. Using internal data, we create consistent cell mean values for all top-coded public use values that, when used with public use data, closely track inequality trends in labor earnings and household income using internal data. But estimates of longer-term inequality trends with these corrected data based on P90/P10 differ from those based on the Gini coefficient. The choice of inequality measure matters.
View Full Paper PDF
Working Paper
🔥

Recent Trends in Top Income Shares in the USA: Reconciling Estimates from March CPS and IRS Tax Return Data

September 2009

Authors: Richard Burkhauser, Shuaizhang Feng, Stephen Jenkins, Jeff Larrimore

Working Paper Number:

CES-09-26

Although the vast majority of US research on trends in the inequality of family income is based on public-use March Current Population Survey (CPS) data, a new wave of research based on Internal Revenue Service (IRS) tax return data reports substantially higher levels of inequality and faster growing trends. We show that these apparently inconsistent estimates can largely be reconciled once one uses internal CPS data (which better captures the top of the income distribution than public-use CPS data) and defines the income distribution in the same way. Using internal CPS data for 1967'2006, we closely match the IRS data-based estimates of top income shares reported by Piketty and Saez (2003), with the exception of the share of the top 1 percent of the distribution during 1993'2000. Our results imply that, if inequality has increased substantially since 1993, the increase is confined to income changes for those in the top 1 percent of the distribution.
View Full Paper PDF
Working Paper
🔥

Measuring Labor Earnings Inequality Using Public-Use March Current Population Survey Data: The Value of Including Variances and Cell Means When Imputing Topcoded Values

November 2008

Authors: Richard Burkhauser, Shuaizhang Feng, Jeff Larrimore

Working Paper Number:

CES-08-38

Using the Census Bureau's internal March Current Population Surveys (CPS) file, we construct and make available variances and cell means for all topcoded income values in the publicuse version of these data. We then provide a procedure that allows researchers with access only to the public-use March CPS data to take advantage of this added information when imputing its topcoded income values. As an example of its value we show how our new procedure improves on existing imputation methods in the labor earnings inequality literature.
View Full Paper PDF
Working Paper
🔥

Consistent Cell Means for Topcoded Incomes in the Public Use March CPS (1976-2007)

March 2008

Authors: Richard Burkhauser, Shuaizhang Feng, Laura Zayatz, Jeff Larrimore

Working Paper Number:

CES-08-06

Using the internal March CPS, we create and in this paper distribute to the larger research community a cell mean series that provides the mean of all income values above the topcode for any income source of any individual in the public use March CPS that has been topcoded since 1976. We also describe our construction of this series. When we use this series together with the public use March CPS, we closely match the yearly mean income levels and income inequalities of the U.S. population found using the internal March CPS data.
View Full Paper PDF
Working Paper
🔥

USING THE PARETO DISTRIBUTION TO IMPROVE ESTIMATES OF TOPCODED EARNINGS

April 2014

Authors: Richard Burkhauser, Jeff Larrimore, Philip Armour

Working Paper Number:

CES-14-21

Inconsistent censoring in the public-use March Current Population Survey (CPS) limits its usefulness in measuring labor earnings trends. Using Pareto estimation methods with less-censored internal CPS data, we create an enhanced cell-mean series to capture top earnings in the public-use CPS. We find that previous approaches for imputing topcoded earnings systematically understate top earnings. Annual earnings inequality trends since 1963 using our series closely approximate those found by Kopczuk, Saez, & Song (2010) using Social Security Administration data for commerce and industry workers. However, when we consider all workers, earnings inequality levels are higher but earnings growth is more modest
View Full Paper PDF
Working Paper
🔥

Using Internal Current Population Survey Data to Reevaluate Trends in Labor Earnings Gaps by Gender, Race, and Education Level

July 2008

Authors: Richard Burkhauser, Jeff Larrimore

Working Paper Number:

CES-08-18

Most empirical studies of trends in labor earnings gaps by gender, race or education level are based on data from the public use March Current Population Survey (CPS). Using the internal March CPS, we show that inconsistent topcoding in the public use data will understate these gaps and inaccurately capture their trends. We create a cell mean series beginning in 1975 that provides the mean of all values above the topcode for each income source in the public use March CPS and better approximate earnings gaps found in the internal March CPS than was previously possible using publically available data.
View Full Paper PDF
Working Paper

Trends in the Relative Household Income of Working-Age Men with Work Limitations: Correcting the Record Using Internal Current Population Survey Data

March 2008

Authors: Richard Burkhauser, Jeff Larrimore

Working Paper Number:

CES-08-05

Previous research measuring the economic well-being of working-age men with work limitations relative to such men without work limitations in the public use March Current Population Survey (CPS) systematically understates the mean household income of both groups; overstates the relative household income of those with work limitations; and understates the decline in their relative household income over time. Using the internal March CPS, we demonstrate this by creating a cell mean series beginning in 1975 that provides the mean reported income of all topcoded persons for each source of income in the public use March CPS data. Using our cell mean series with the public use March CPS, we closely match the yearly mean income of working-age men with and without work limitations over the period 1987-2004 in the internal data and show that this match is superior to ones using alternative methods of correcting for topcoding currently used in the disability literature. We then provide levels and trends in the relative income of working-age men with work limitations from 1980-2006, the earliest year in the March CPS that such comparisons can be made.
View Full Paper PDF
Working Paper

Using Linked Survey and Administrative Data to Better Measure Income: Implications for Poverty, Program Effectiveness and Holes in the Safety Net

October 2015

Authors: Bruce Meyer, Nikolas Mittag

Working Paper Number:

CES-15-35

We examine the consequences of underreporting of transfer programs in household survey data for several prototypical analyses of low-income populations. We focus on the Current Population Survey (CPS), the source of official poverty and inequality statistics, but provide evidence that our qualitative conclusions are likely to apply to other surveys. We link administrative data for food stamps, TANF, General Assistance, and subsidized housing from New York State to the CPS at the individual level. Program receipt in the CPS is missed for over one-third of housing assistance recipients, 40 percent of food stamp recipients and 60 percent of TANF and General Assistance recipients. Dollars of benefits are also undercounted for reporting recipients, particularly for TANF, General Assistance and housing assistance. We find that the survey data sharply understate the income of poor households, as conjectured in past work by one of the authors. Underreporting in the survey data also greatly understates the effects of anti-poverty programs and changes our understanding of program targeting, often making it seem that welfare programs are less targeted to both the very poorest and middle income households than they are. Using the combined data rather than survey data alone, the poverty reducing effect of all programs together is nearly doubled while the effect of housing assistance is tripled. We also re-examine the coverage of the safety net, specifically the share of people without work or program receipt. Using the administrative measures of program receipt rather than the survey ones often reduces the share of single mothers falling through the safety net by one-half or more.
View Full Paper PDF
Working Paper

Granular Income Inequality and Mobility using IDDA: Exploring Patterns across Race and Ethnicity

November 2023

Authors: Abigail Wozniak, John Voorheis, Kevin Rinz, Illenin Kondo, Natalie Gubbay, Brandon Hawkins

Working Paper Number:

CES-23-55

Shifting earnings inequality among U.S. workers over the last five decades has been widely stud ied, but understanding how these shifts evolve across smaller groups has been difficult. Publicly available data sources typically only ensure representative data at high levels of aggregation, so they obscure many details of earnings distributions for smaller populations. We define and construct a set of granular statistics describing income distributions, income mobility and con ditional income growth for a large number of subnational groups in the U.S. for a two-decade period (1998-2019). In this paper, we use the resulting data to explore the evolution of income inequality and mobility for detailed groups defined by race and ethnicity. We find that patterns identified from the universe of tax filers and W-2 recipients that we observe differ in important ways from those that one might identify in public sources. The full set of statistics that we construct is available publicly as the Income Distributions and Dynamics in America, or IDDA, data set.
View Full Paper PDF

Measuring Inequality Using Censored Data: A Multiple Imputation Approach

April 2009

Working Paper Number:

CES-09-05

Abstract

Document Tags and Keywords

The 10 most similar working papers to the working paper 'Measuring Inequality Using Censored Data: A Multiple Imputation Approach' are listed below in order of similarity.

August 2008

Working Paper Number:

CES-08-25

June 2007

Working Paper Number:

CES-07-17

September 2009

Working Paper Number:

CES-09-26

November 2008

Working Paper Number:

CES-08-38

March 2008

Working Paper Number:

CES-08-06

April 2014

Working Paper Number:

CES-14-21

July 2008

Working Paper Number:

CES-08-18

March 2008

Working Paper Number:

CES-08-05

October 2015

Working Paper Number:

CES-15-35

November 2023

Working Paper Number:

CES-23-55