Trends in the Relative Household Income of Working-Age Men with Work Limitations: Correcting the Record Using Internal Current Population Survey Data
March 2008
Working Paper Number:
CES-08-05
Abstract
Document Tags and Keywords
Keywords
Keywords are automatically generated using KeyBERT, a powerful and innovative
keyword extraction tool that utilizes BERT embeddings to ensure high-quality and contextually relevant
keywords.
By analyzing the content of working papers, KeyBERT identifies terms and phrases that capture the essence of the
text, highlighting the most significant topics and trends. This approach not only enhances searchability but
also provides connections that go beyond potentially domain-specific author-defined keywords.
respondent,
earnings,
average,
yearly,
salary,
percentile,
income individuals,
household,
poverty,
earn,
earner,
household income,
income year,
prevalence,
disability,
income data,
income households,
income distributions
Tags
Tags are automatically generated using a pretrained language model from spaCy, which excels at
several tasks, including entity tagging.
The model is able to label words and phrases by entity type,
including "organizations." By filtering for frequent words and phrases labeled as "organizations," papers are
identified as containing references to specific institutions, datasets, and other organizations.
Bureau of Labor Statistics,
National Science Foundation,
Current Population Survey,
Survey of Income and Program Participation,
Cornell University,
Social Security,
National Health Interview Survey
Similar Working Papers
Similarity between working papers is determined by an unsupervised neural
network model
known as Doc2Vec.
Doc2Vec is a model that represents entire documents as fixed-length vectors, allowing for the
capture of semantic meaning in a way that relates to the context of words within the document. The model learns to
associate a unique vector with each document while simultaneously learning word vectors, enabling tasks such as
document classification, clustering, and similarity detection by preserving the order and structure of words. The
document vectors are compared using cosine similarity/distance to determine the most similar working papers.
Papers identified with 🔥 are in the top 20% of similarity.
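The comparison step described above can be sketched in a few lines of Python. This is a minimal illustration, not the site's actual pipeline: the function names, the paper IDs, and the two-dimensional toy vectors are hypothetical, and real Doc2Vec vectors would have many more dimensions.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        # Assumption: treat a zero vector as having no similarity.
        return 0.0
    return dot / (norm_u * norm_v)


def rank_similar(target_vec, candidate_vecs, top_n=10):
    """Return the top_n (paper_id, score) pairs, most similar first."""
    scored = [
        (paper_id, cosine_similarity(target_vec, vec))
        for paper_id, vec in candidate_vecs.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]


# Hypothetical toy document vectors keyed by made-up paper IDs.
candidates = {
    "paper-a": [1.0, 0.1],
    "paper-b": [0.0, 1.0],
    "paper-c": [-1.0, 0.0],
}
ranking = rank_similar([1.0, 0.0], candidates, top_n=2)
```

Cosine similarity depends only on the angle between vectors, so documents of very different lengths can still score as close when their vectors point in a similar direction.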
The 10 most similar working papers to the working paper 'Trends in the Relative Household Income of Working-Age Men with Work Limitations: Correcting the Record Using Internal Current Population Survey Data' are listed below in order of similarity.
-
Working Paper: Using Internal Current Population Survey Data to Reevaluate Trends in Labor Earnings Gaps by Gender, Race, and Education Level 🔥
July 2008
Working Paper Number:
CES-08-18
Most empirical studies of trends in labor earnings gaps by gender, race or education level are based on data from the public use March Current Population Survey (CPS). Using the internal March CPS, we show that inconsistent topcoding in the public use data will understate these gaps and inaccurately capture their trends. We create a cell mean series beginning in 1975 that provides the mean of all values above the topcode for each income source in the public use March CPS and better approximate earnings gaps found in the internal March CPS than was previously possible using publicly available data.
-
Working Paper: Consistent Cell Means for Topcoded Incomes in the Public Use March CPS (1976-2007) 🔥
March 2008
Working Paper Number:
CES-08-06
Using the internal March CPS, we create and in this paper distribute to the larger research community a cell mean series that provides the mean of all income values above the topcode for any income source of any individual in the public use March CPS that has been topcoded since 1976. We also describe our construction of this series. When we use this series together with the public use March CPS, we closely match the yearly mean income levels and income inequalities of the U.S. population found using the internal March CPS data.
-
Working Paper: Estimating Trends in U.S. Income Inequality Using the Current Population Survey: The Importance of Controlling for Censoring 🔥
August 2008
Working Paper Number:
CES-08-25
Using internal and public use March Current Population Survey (CPS) data, we analyze trends in US income inequality (1975-2004). We find that the upward trend in income inequality prior to 1993 significantly slowed thereafter once we control for top coding in the public use data and censoring in the internal data. Because both series do not capture trends at the very top of the income distribution, we use a multiple imputation approach in which values for censored observations are imputed using draws from a Generalized Beta distribution of the Second Kind (GB2) fitted to internal data. Doing so, we find income inequality trends similar to those derived from unadjusted internal data. Our trend results are generally robust to the choice of inequality index, whether Gini coefficient or other commonly-used indices. When we compare our best estimates of the income shares held by the richest tenth with those reported by Piketty and Saez (2003), our trends fairly closely match their trends, except for the top 1 percent of the distribution. Thus, we argue that if United States income inequality has been substantially increasing since 1993, such increases are confined to this very high income group.
-
Working Paper: Measuring Labor Earnings Inequality Using Public-Use March Current Population Survey Data: The Value of Including Variances and Cell Means When Imputing Topcoded Values 🔥
November 2008
Working Paper Number:
CES-08-38
Using the Census Bureau's internal March Current Population Surveys (CPS) file, we construct and make available variances and cell means for all topcoded income values in the public-use version of these data. We then provide a procedure that allows researchers with access only to the public-use March CPS data to take advantage of this added information when imputing its topcoded income values. As an example of its value we show how our new procedure improves on existing imputation methods in the labor earnings inequality literature.
-
Working Paper: Recent Trends in Top Income Shares in the USA: Reconciling Estimates from March CPS and IRS Tax Return Data 🔥
September 2009
Working Paper Number:
CES-09-26
Although the vast majority of US research on trends in the inequality of family income is based on public-use March Current Population Survey (CPS) data, a new wave of research based on Internal Revenue Service (IRS) tax return data reports substantially higher levels of inequality and faster growing trends. We show that these apparently inconsistent estimates can largely be reconciled once one uses internal CPS data (which better captures the top of the income distribution than public-use CPS data) and defines the income distribution in the same way. Using internal CPS data for 1967-2006, we closely match the IRS data-based estimates of top income shares reported by Piketty and Saez (2003), with the exception of the share of the top 1 percent of the distribution during 1993-2000. Our results imply that, if inequality has increased substantially since 1993, the increase is confined to income changes for those in the top 1 percent of the distribution.
-
Working Paper: Using the P90/P10 Index to Measure U.S. Inequality Trends with Current Population Survey Data: A View From Inside the Census Bureau Vaults 🔥
June 2007
Working Paper Number:
CES-07-17
The March Current Population Survey (CPS) is the primary data source for estimation of levels and trends in labor earnings and income inequality in the USA. Time-inconsistency problems related to top coding in these data have led many researchers to use the ratio of the 90th and 10th percentiles of these distributions (P90/P10) rather than a more traditional summary measure of inequality. With access to public use and restricted-access internal CPS data, and bounding methods, we show that using P90/P10 does not completely obviate time inconsistency problems, especially for household income inequality trends. Using internal data, we create consistent cell mean values for all top-coded public use values that, when used with public use data, closely track inequality trends in labor earnings and household income using internal data. But estimates of longer-term inequality trends with these corrected data based on P90/P10 differ from those based on the Gini coefficient. The choice of inequality measure matters.
-
Working Paper: USING THE PARETO DISTRIBUTION TO IMPROVE ESTIMATES OF TOPCODED EARNINGS 🔥
April 2014
Working Paper Number:
CES-14-21
Inconsistent censoring in the public-use March Current Population Survey (CPS) limits its usefulness in measuring labor earnings trends. Using Pareto estimation methods with less-censored internal CPS data, we create an enhanced cell-mean series to capture top earnings in the public-use CPS. We find that previous approaches for imputing topcoded earnings systematically understate top earnings. Annual earnings inequality trends since 1963 using our series closely approximate those found by Kopczuk, Saez, & Song (2010) using Social Security Administration data for commerce and industry workers. However, when we consider all workers, earnings inequality levels are higher but earnings growth is more modest.
-
Working Paper: Measuring Inequality Using Censored Data: A Multiple Imputation Approach
April 2009
Working Paper Number:
CES-09-05
To measure income inequality with right censored (topcoded) data, we propose multiple imputation for censored observations using draws from Generalized Beta of the Second Kind distributions to provide partially synthetic datasets analyzed using complete data methods. Estimation and inference use Reiter's (Survey Methodology 2003) formulae. Using Current Population Survey (CPS) internal data, we find few statistically significant differences in income inequality for pairs of years between 1995 and 2004. We also show that using CPS public use data with cell mean imputations may lead to incorrect inferences about inequality differences. Multiply-imputed public use data provide an intermediate solution.
-
Working Paper: Errors in Survey Reporting and Imputation and Their Effects on Estimates of Food Stamp Program Participation
April 2011
Working Paper Number:
CES-11-14
Benefit receipt in major household surveys is often underreported. This misreporting leads to biased estimates of the economic circumstances of disadvantaged populations, program takeup, the distributional effects of government programs, and other program effects. We use administrative data on Food Stamp Program (FSP) participation matched to American Community Survey (ACS) and Current Population Survey (CPS) household data. We show that nearly thirty-five percent of true recipient households do not report receipt in the ACS and fifty percent do not report receipt in the CPS. Misreporting, both false negatives and false positives, varies with individual characteristics, leading to complicated biases in FSP analyses. We then directly examine the determinants of program receipt using our combined administrative and survey data. The combined data allow us to examine accurate participation using individual characteristics missing in administrative data. Our results differ from conventional estimates using only survey data, as such estimates understate participation by single parents, non-whites, low income households, and other groups. To evaluate the use of Census Bureau imputed ACS and CPS data, we also examine whether our estimates using survey data alone are closer to those using the accurate combined data when imputed survey observations are excluded. Interestingly, excluding the imputed observations leads to worse ACS estimates, but has less effect on the CPS estimates.
-
Working Paper: Measuring Income of the Aged in Household Surveys: Evidence from Linked Administrative Records
June 2024
Working Paper Number:
CES-24-32
Research has shown that household survey estimates of retirement income (defined benefit pensions and defined contribution account withdrawals) suffer from substantial underreporting which biases downward measures of financial well-being among the aged. Using data from both the redesigned 2016 Current Population Survey Annual Social and Economic Supplement (CPS ASEC) and the Health and Retirement Study (HRS), each matched with administrative records, we examine to what extent underreporting of retirement income affects key statistics such as reliance on Social Security benefits and poverty among the aged. We find that underreporting of retirement income is still prevalent in the CPS ASEC. While the HRS does a better job than the CPS ASEC in terms of capturing retirement income, it still falls considerably short compared to administrative records. Consequently, the relative importance of Social Security income remains overstated in household surveys: 53 percent of elderly beneficiaries in the CPS ASEC and 49 percent in the HRS rely on Social Security for the majority of their incomes compared to 42 percent in the linked administrative data. The poverty rate for those aged 65 and over is also overstated: 8.8 percent in the CPS ASEC and 7.4 percent in the HRS compared to 6.4 percent in the linked administrative data. Our results illustrate the effects of using alternative data sources in producing key statistics from the Social Security Administration's Income of the Aged publication.