Shifting earnings inequality among U.S. workers over the last five decades has been widely stud ied, but understanding how these shifts evolve across smaller groups has been difficult. Publicly available data sources typically only ensure representative data at high levels of aggregation, so they obscure many details of earnings distributions for smaller populations. We define and construct a set of granular statistics describing income distributions, income mobility and con ditional income growth for a large number of subnational groups in the U.S. for a two-decade period (1998-2019). In this paper, we use the resulting data to explore the evolution of income inequality and mobility for detailed groups defined by race and ethnicity. We find that patterns identified from the universe of tax filers and W-2 recipients that we observe differ in important ways from those that one might identify in public sources. The full set of statistics that we construct is available publicly as the Income Distributions and Dynamics in America, or IDDA, data set.
-
Mobility, Opportunity, and Volatility Statistics (MOVS):
Infrastructure Files and Public Use Data
April 2024
Working Paper Number:
CES-24-23
Federal statistical agencies and policymakers have identified a need for integrated systems of household and personal income statistics. This interest marks a recognition that aggregated measures of income, such as GDP or average income growth, tell an incomplete story that may conceal large gaps in well-being between different types of individuals and families. Until recently, longitudinal income data that are rich enough to calculate detailed income statistics and include demographic characteristics, such as race and ethnicity, have not been available. The Mobility, Opportunity, and Volatility Statistics project (MOVS) fills this gap in comprehensive income statistics. Using linked demographic and tax records on the population of U.S. working-age adults, the MOVS project defines households and calculates household income, applying an equivalence scale to create a personal income concept, and then traces the progress of individuals' incomes over time. We then output a set of intermediate statistics by race-ethnicity group, sex, year, base-year state of residence, and base-year income decile. We select the intermediate statistics most useful in developing more complex intragenerational income mobility measures, such as transition matrices, income growth curves, and variance-based volatility statistics. We provide these intermediate statistics as part of a publicly released data tool with downloadable flat files and accompanying documentation. This paper describes the data build process and the output files, including a brief analysis highlighting the structure and content of our main statistics.
View Full
Paper PDF
-
Recent Trends in Top Income Shares in the USA: Reconciling Estimates from March CPS and IRS Tax Return Data
September 2009
Working Paper Number:
CES-09-26
Although the vast majority of US research on trends in the inequality of family income is based on public-use March Current Population Survey (CPS) data, a new wave of research based on Internal Revenue Service (IRS) tax return data reports substantially higher levels of inequality and faster growing trends. We show that these apparently inconsistent estimates can largely be reconciled once one uses internal CPS data (which better captures the top of the income distribution than public-use CPS data) and defines the income distribution in the same way. Using internal CPS data for 1967'2006, we closely match the IRS data-based estimates of top income shares reported by Piketty and Saez (2003), with the exception of the share of the top 1 percent of the distribution during 1993'2000. Our results imply that, if inequality has increased substantially since 1993, the increase is confined to income changes for those in the top 1 percent of the distribution.
View Full
Paper PDF
-
Trends in Earnings Volatility using Linked Administrative and Survey Data
August 2020
Working Paper Number:
CES-20-24
We document trends in earnings volatility separately by gender in combination with other characteristics such as race, educational attainment, and employment status using unique linked survey and administrative data for the tax years spanning 1995-2015. We also decompose the variance of trend volatility into within- and between-group contributions, as well as transitory and permanent shocks. Our results for continuously working men suggest that trend earnings volatility was stable over our period in both survey and tax data, though with a substantial countercyclical business-cycle component. Trend earnings volatility among women declined over the period in both survey and administrative data, but unlike for men, there was no change over the Great Recession. The variance decompositions indicate that nonresponders, low-educated, racial minorities, and part-year workers have the greatest group specific earnings volatility, but with the exception of part-year workers, they contribute least to the level and trend of volatility owing to their small share of the population. There is evidence of stable transitory volatility, but rising permanent volatility over the past two decades in male and female earnings.
View Full
Paper PDF
-
Re-examining Regional Income Convergence: A Distributional Approach
February 2023
Working Paper Number:
CES-23-05
We re-examine recent trends in regional income convergence, considering the full distribution of income rather than focusing on the mean. Measuring similarity by comparing each percentile of state
distributions to the corresponding percentile of the national distribution, we find that state incomes have become less similar (i.e. they have diverged) within the top 20 percent of the income distribution since 1969. The top percentile alone accounts for more than half of aggregate divergence across states over this period by our measure, and the top five percentiles combine to account for 93 percent. Divergence in top incomes across states appears to be driven largely by changes in top incomes among White people, while top incomes among Black people have experienced relatively little divergence.
View Full
Paper PDF
-
Male Earnings Volatility in LEHD before, during, and after the Great Recession
September 2020
Working Paper Number:
CES-20-31
This paper is part of a coordinated collection of papers on prime-age male earnings volatility. Each paper produces a similar set of statistics for the same reference population using a different primary data source. Our primary data source is the Census Bureau's Longitudinal Employer-Household Dynamics (LEHD) infrastructure files. Using LEHD data from 1998 to 2016, we create a well-defined population frame to facilitate accurate estimation of temporal changes comparable to designed longitudinal samples of people. We show that earnings volatility, excluding increases during recessions, has declined over the analysis period, a finding robust to various sensitivity analyses. Although we find volatility is declining, the effect is not homogeneous, particularly for workers with tenuous labor force attachment for whom volatility is increasing. These 'not stable' workers have earnings volatility approximately 30 times larger than stable workers, but more important for earnings volatility trends we observe a large increase in the share of stable employment from 60% in 1998 to 67% in 2016, which we show to largely be responsible for the decline in overall earnings volatility. To further emphasize the importance of not stable and/or low earning workers we also conduct comparisons with the PSID and show how changes over time in the share of workers at the bottom tail of the cross-sectional earnings distributions can produce either declining or increasing earnings volatility trends.
View Full
Paper PDF
-
Estimating Trends in U.S. Income Inequality Using the Current Population Survey: The Importance of Controlling for Censoring
August 2008
Working Paper Number:
CES-08-25
Using internal and public use March Current Population Survey (CPS) data, we analyze trends in US income inequality (1975'2004). We find that the upward trend in income inequality prior to 1993 significantly slowed thereafter once we control for top coding in the public use data and censoring in the internal data. Because both series do not capture trends at the very top of the income distribution, we use a multiple imputation approach in which values for censored observations are imputed using draws from a Generalized Beta distribution of the Second Kind (GB2) fitted to internal data. Doing so, we find income inequality trends similar to those derived from unadjusted internal data. Our trend results are generally robust to the choice of inequality index, whether Gini coefficient or other commonly-used indices. When we compare our best estimates of the income shares held by the richest tenth with those reported by Piketty and Saez (2003), our trends fairly closely match their trends, except for the top 1 percent of the distribution. Thus, we argue that if United States income inequality has been substantially increasing since 1993, such increases are confined to this very high income group.
View Full
Paper PDF
-
Using the P90/P10 Index to Measure U.S. Inequality Trends with Current Population Survey Data: A View From Inside the Census Bureau Vaults
June 2007
Working Paper Number:
CES-07-17
The March Current Population Survey (CPS) is the primary data source for estimation of levels and trends in labor earnings and income inequality in the USA. Time-inconsistency problems related to top coding in theses data have led many researchers to use the ratio of the 90th and 10th percentiles of these distributions (P90/P10) rather than a more traditional summary measure of inequality. With access to public use and restricted-access internal CPS data, and bounding methods, we show that using P90/P10 does not completely obviate time inconsistency problems, especially for household income inequality trends. Using internal data, we create consistent cell mean values for all top-coded public use values that, when used with public use data, closely track inequality trends in labor earnings and household income using internal data. But estimates of longer-term inequality trends with these corrected data based on P90/P10 differ from those based on the Gini coefficient. The choice of inequality measure matters.
View Full
Paper PDF
-
United States Earnings Dynamics: Inequality, Mobility, and Volatility
September 2020
Working Paper Number:
CES-20-29
Using data from the Census Bureau's Longitudinal Employer-Household Dynamics (LEHD) infrastructure files, we study changes over time and across sub-national populations in the distribution of real labor earnings. We consider four large MSAs (Detroit, Los Angeles, New York, and San Francisco) for the period 1998 to 2017, with particular attention paid to the subperiods before, during, and after the Great Recession. For the four large MSAs we analyze, there are clear national trends represented in each of the local areas, the most prominent of which is the increase in the share of earnings accruing to workers at the top of the earnings distribution in 2017 compared with 1998. However, the magnitude of these trends varies across MSAs, with New York and San Francisco showing relatively large increases and Los Angeles somewhere in the middle relative to Detroit whose total real earnings distribution is relatively stable over the period. Our results contribute to the emerging literature on differences between national and regional economic outcomes, exemplifying what will be possible with a new data exploration tool'the Earnings and Mobility Statistics (EAMS) web application'currently under development at the U.S. Census Bureau.
View Full
Paper PDF
-
Geographic Immobility in the United States: Assessing the Prevalence and Characteristics of Those Who Never Migrate Across State Lines Using Linked Federal Tax Microdata
March 2025
Working Paper Number:
CES-25-19
This paper explores the prevalence and characteristics of those who never migrate at the state scale in the U.S. Studying people who never migrate requires regular and frequent observation of their residential location for a lifetime, or at least for many years. A novel U.S. population-sized longitudinal dataset that links individual level Internal Revenue Service (IRS) and Social Security Administration (SSA) administrative records supplies this information annually, along with information on income and socio-demographic characteristics. We use these administrative microdata to follow a cohort aged between 15 and 50 in 2001 from 2001 to 2016, differentiating those who lived in the same state every year during this period (i.e., never made an interstate move) from those who lived in more than one state (i.e., made at least one interstate move). We find those who never made an interstate move comprised 75 percent of the total population of this age cohort. This percentage varies by year of age but never falls below 62 percent even for those who were teenagers or young adults in 2001. There are also variations in these percentages by sex, race, nativity, and income, with the latter having the largest effects. We also find substantial variation in these percentages across states. Our findings suggest a need for more research on geographically immobile populations in U.S.
View Full
Paper PDF
-
Trends in the Relative Household Income of Working-Age Men with Work Limitations: Correcting the Record Using Internal Current Population Survey Data
March 2008
Working Paper Number:
CES-08-05
Previous research measuring the economic well-being of working-age men with work limitations relative to such men without work limitations in the public use March Current Population Survey (CPS) systematically understates the mean household income of both groups; overstates the relative household income of those with work limitations; and understates the decline in their relative household income over time. Using the internal March CPS, we demonstrate this by creating a cell mean series beginning in 1975 that provides the mean reported income of all topcoded persons for each source of income in the public use March CPS data. Using our cell mean series with the public use March CPS, we closely match the yearly mean income of working-age men with and without work limitations over the period 1987-2004 in the internal data and show that this match is superior to ones using alternative methods of correcting for topcoding currently used in the disability literature. We then provide levels and trends in the relative income of working-age men with work limitations from 1980-2006, the earliest year in the March CPS that such comparisons can be made.
View Full
Paper PDF