CREAT - Census Bureau

Incorporating Administrative Data in Survey Weights for the Basic Monthly Current Population Survey

January 2024

Written by: John Voorheis, Jonathan Eggleston, Carl Lieberman, Yarissa Gonzalez, Tim Trudell

Working Paper Number:

CES-24-02

Abstract

Response rates to the Current Population Survey (CPS) have declined over time, raising the potential for nonresponse bias in key population statistics. A potential solution is to leverage administrative data from government agencies and third-party data providers when constructing survey weights. In this paper, we take two approaches. First, we use administrative data to build a non-parametric nonresponse adjustment step while leaving the calibration to population estimates unchanged. Second, we use administratively linked data in the calibration process, matching income data from the Internal Return Service and state agencies, demographic data from the Social Security Administration and the decennial census, and industry data from the Census Bureau's Business Register to both responding and nonresponding households. We use the matched data in the household nonresponse adjustment of the CPS weighting algorithm, which changes the weights of respondents to account for differential nonresponse rates among subpopulations. After running the experimental weighting algorithm, we compare estimates of the unemployment rate and labor force participation rate between the experimental weights and the production weights. Before March 2020, estimates of the labor force participation rates using the experimental weights are 0.2 percentage points higher than the original estimates, with minimal effect on unemployment rate. After March 2020, the new labor force participation rates are similar, but the unemployment rate is about 0.2 percentage points higher in some months during the height of COVID-related interviewing restrictions. These results are suggestive that if there is any nonresponse bias present in the CPS, the magnitude is comparable to the typical margin of error of the unemployment rate estimate. Additionally, the results are overall similar across demographic groups and states, as well as using alternative weighting methodology. Finally, we discuss how our estimates compare to those from earlier papers that calculate estimates of bias in key CPS labor force statistics. This paper is for research purposes only. No changes to production are being implemented at this time.

Document Tags and Keywords

Keywords:

information census, statistical, data census, census data, survey, agency, respondent, estimator, population, socioeconomic, census bureau, sampling, use census, census employment, unemployed, survey income, assessed, income data, population survey, propensity

Tags:

Metropolitan Statistical Area, Standard Statistical Establishment List, Internal Revenue Service, Bureau of Labor Statistics, Social Security Administration, Service Annual Survey, New England County Metropolitan, Current Population Survey, Decennial Census, Housing and Urban Development, Employer Identification Numbers, Social Security, Department of Housing and Urban Development, American Community Survey, Longitudinal Employer Household Dynamics, Census Bureau Business Register, Master Beneficiary Record, Disability Insurance, Protected Identification Key, Computer Assisted Personal Interview, W-2, Quarterly Census of Employment and Wages, Master Address File, Census Bureau Disclosure Review Board, 2010 Census, Person Validation System, Supplemental Nutrition Assistance Program, Census Bureau Person Identification Validation System, Social Science Research Institute, Adjusted Gross Income, MAF-ARF

Similar Working Papers

The 10 most similar working papers to the working paper 'Incorporating Administrative Data in Survey Weights for the Basic Monthly Current Population Survey' are listed below in order of similarity.

Working Paper
🔥

Incorporating Administrative Data in Survey Weights for the 2018-2022 Survey of Income and Program Participation

October 2024

Authors: Jonathan Eggleston, Julia Yang

Working Paper Number:

CES-24-58

Response rates to the Survey of Income and Program Participation (SIPP) have declined over time, raising the potential for nonresponse bias in survey estimates. A potential solution is to leverage administrative data from government agencies and third-party data providers when constructing survey weights. In this paper, we modify various parts of the SIPP weighting algorithm to incorporate such data. We create these new weights for the 2018 through 2022 SIPP panels and examine how the new weights affect survey estimates. Our results show that before weighting adjustments, SIPP respondents in these panels have higher socioeconomic status than the general population. Existing weighting procedures reduce many of these differences. Comparing SIPP estimates between the production weights and the administrative data-based weights yields changes that are not uniform across the joint income and program participation distribution. Unlike other Census Bureau household surveys, there is no large increase in nonresponse bias in SIPP due to the COVID-19 Pandemic. In summary, the magnitude and sign of nonresponse bias in SIPP is complicated, and the existing weighting procedures may change the sign of nonresponse bias for households with certain incomes and program benefit statuses.
View Full Paper PDF
Working Paper
🔥

Nonresponse and Coverage Bias in the Household Pulse Survey: Evidence from Administrative Data

October 2024

Authors: Jonathan Eggleston, Carl Lieberman

Working Paper Number:

CES-24-60

The Household Pulse Survey (HPS) conducted by the U.S. Census Bureau is a unique survey that provided timely data on the effects of the COVID-19 Pandemic on American households and continues to provide data on other emergent social and economic issues. Because the survey has a response rate in the single digits and only has an online response mode, there are concerns about nonresponse and coverage bias. In this paper, we match administrative data from government agencies and third-party data to HPS respondents to examine how representative they are of the U.S. population. For comparison, we create a benchmark of American Community Survey (ACS) respondents and nonrespondents and include the ACS respondents as another point of reference. Overall, we find that the HPS is less representative of the U.S. population than the ACS. However, performance varies across administrative variables, and the existing weighting adjustments appear to greatly improve the representativeness of the HPS. Additionally, we look at household characteristics by their email domain to examine the effects on coverage from limiting email messages in 2023 to addresses from the contact frame with at least 90% deliverability rates, finding no clear change in the representativeness of the HPS afterwards.
View Full Paper PDF
Working Paper
🔥

The Impact of Household Surveys on 2020 Census Self-Response

July 2022

Authors: Jonathan Eggleston

Working Paper Number:

CES-22-24

Households who were sampled in 2019 for the American Community Survey (ACS) had lower self-response rates to the 2020 Census. The magnitude varied from -1.5 percentage point for household sampled in January 2019 to -15.1 percent point for households sampled in December 2019. Similar effects are found for the Current Population Survey (CPS) as well.
View Full Paper PDF
Working Paper

National Experimental Wellbeing Statistics - Version 1

February 2023

Authors: Nikolas Mittag, Joshua Mitchell, Adam Bee, Jonathan Rothbaum, Carl Sanders, Lawrence Schmidt, Matthew Unrath

Working Paper Number:

CES-23-04

This is the U.S. Census Bureau's first release of the National Experimental Wellbeing Statistics (NEWS) project. The NEWS project aims to produce the best possible estimates of income and poverty given all available survey and administrative data. We link survey, decennial census, administrative, and third-party data to address measurement error in income and poverty statistics. We estimate improved (pre-tax money) income and poverty statistics for 2018 by addressing several possible sources of bias documented in prior research. We address biases from 1) unit nonresponse through improved weights, 2) missing income information in both survey and administrative data through improved imputation, and 3) misreporting by combining or replacing survey responses with administrative information. Reducing survey error substantially affects key measures of well-being: We estimate median household income is 6.3 percent higher than in survey estimates, and poverty is 1.1 percentage points lower. These changes are driven by subpopulations for which survey error is particularly relevant. For house holders aged 65 and over, median household income is 27.3 percent higher and poverty is 3.3 percentage points lower than in survey estimates. We do not find a significant impact on median household income for householders under 65 or on child poverty. Finally, we discuss plans for future releases: addressing other potential sources of bias, releasing additional years of statistics, extending the income concepts measured, and including smaller geographies such as state and county.
View Full Paper PDF
Working Paper

Connected and Uncooperative: The Effects of Homogenous and Exclusive Social Networks on Survey Response Rates and Nonresponse Bias

January 2024

Authors: Jonathan Eggleston, Chase Sawyer

Working Paper Number:

CES-24-01

Social capital, the strength of people's friendship networks and community ties, has been hypothesized as an important determinant of survey participation. Investigating this hypothesis has been difficult given data constraints. In this paper, we provide insights by investigating how response rates and nonresponse bias in the American Community Survey are correlated with county-level social network data from Facebook. We find that areas of the United States where people have more exclusive and homogenous social networks have higher nonresponse bias and lower response rates. These results provide further evidence that the effects of social capital may not be simply a matter of whether people are socially isolated or not, but also what types of social connections people have and the sociodemographic heterogeneity of their social networks.
View Full Paper PDF
Working Paper

Understanding Earnings Instability: How Important are Employment Fluctuations and Job Changes?

August 2009

Authors: Kristin McCue, Sule Celik, Chinhui Juhn, Jesse Thompson

Working Paper Number:

CES-09-20

Using three panel datasets (the matched CPS, the SIPP, and the newly available Longitudinal Employment and Household Dynamics (LEHD) data), we examine trends in male earnings instability in recent decades. In contrast to several papers that find a recent upward trend in earnings instability using the PSID data, we find that earnings instability has been remarkably stable in the 1990s and the 2000s. We find that job changing rates remained relatively constant casting doubt on the importance of labor market 'churning.' We find some evidence that earnings instability increased among job stayers which lends credence to the view that greater reliance on incentive pay increased instability of worker pay. We also find an offsetting decrease in earnings instability among job changers due largely to declining unemployment associated with job changes. One caveat to our findings is that we focus on men who have positive earnings in two adjacent years and thus ignore men who exit the labor force or re-enter after an extended period. Preliminary investigation suggests that ignoring these transitions understates the rise in earnings instability over the past two decades.
View Full Paper PDF
Working Paper

Investigating the Use of Administrative Records in the Consumer Expenditure Survey

March 2018

Authors: Quentin Brummet, Joshua Mitchell, John Voorheis, Denise Flanagan-Doyle, Laura Erhard, Brett McBride

Working Paper Number:

carra-2018-01

In this paper, we investigate the potential of applying administrative records income data to the Consumer Expenditure (CE) survey to inform measurement error properties of CE estimates, supplement respondent-collected data, and estimate the representativeness of the CE survey by income level. We match individual responses to Consumer Expenditure Quarterly Interview Survey data collected from July 2013 through December 2014 to IRS administrative data in order to analyze CE questions on wages, social security payroll deductions, self-employment income receipt and retirement income. We find that while wage amounts are largely in alignment between the CE and administrative records in the middle of the wage distribution, there is evidence that wages are over-reported to the CE at the bottom of the wage distribution and under-reported at the top of the wage distribution. We find mixed evidence for alignment between the CE and administrative records on questions covering payroll deductions and self-employment income receipt, but find substantial divergence between CE responses and administrative records when examining retirement income. In addition to the analysis using person-based linkages, we also match responding and non-responding CE sample units to the universe of IRS 1040 tax returns by address to examine non-response bias. We find that non-responding households are substantially richer than responding households, and that very high income households are less likely to respond to the CE.
View Full Paper PDF
Working Paper

Measuring the Impact of COVID-19 on Businesses and People: Lessons from the Census Bureau's Experience

January 2021

Authors: Lucia Foster, Catherine Buffington, Jason Fields

Working Paper Number:

CES-21-02

We provide an overview of Census Bureau activities to enhance the consistency, timeliness, and relevance of our data products in response to the COVID-19 pandemic. We highlight new data products designed to provide timely and granular information on the pandemic's impact: the Small Business Pulse Survey, weekly Business Formation Statistics, the Household Pulse Survey, and Community Resilience Estimates. We describe pandemic-related content introduced to existing surveys such as the Annual Business Survey and the Current Population Survey. We discuss adaptations to ensure the continuity and consistency of existing data products such as principal economic indicators and the American Community Survey.
View Full Paper PDF
Working Paper

Measuring Income of the Aged in Household Surveys: Evidence from Linked Administrative Records

June 2024

Authors: Joshua Mitchell, Adam Bee, Irena Dushi, Brad Trenkamp

Working Paper Number:

CES-24-32

Research has shown that household survey estimates of retirement income (defined benefit pensions and defined contribution account withdrawals) suffer from substantial underreporting which biases downward measures of financial well-being among the aged. Using data from both the redesigned 2016 Current Population Survey Annual Social and Economic Supplement (CPS ASEC) and the Health and Retirement Study (HRS), each matched with administrative records, we examine to what extent underreporting of retirement income affects key statistics such as reliance on Social Security benefits and poverty among the aged. We find that underreporting of retirement income is still prevalent in the CPS ASEC. While the HRS does a better job than the CPS ASEC in terms of capturing retirement income, it still falls considerably short compared to administrative records. Consequently, the relative importance of Social Security income remains overstated in household surveys'53 percent of elderly beneficiaries in the CPS ASEC and 49 percent in the HRS rely on Social Security for the majority of their incomes compared to 42 percent in the linked administrative data. The poverty rate for those aged 65 and over is also overstated'8.8 percent in the CPS ASEC and 7.4 percent in the HRS compared to 6.4 percent in the linked administrative data. Our results illustrate the effects of using alternative data sources in producing key statistics from the Social Security Administration's Income of the Aged publication.
View Full Paper PDF
Working Paper

The Design of Sampling Strata for the National Household Food Acquisition and Purchase Survey

February 2025

Authors: Jonathan Eggleston, Mark Klee, Linden McBride

Working Paper Number:

CES-25-13

The National Household Food Acquisition and Purchase Survey (FoodAPS), sponsored by the United States Department of Agriculture's (USDA) Economic Research Service (ERS) and Food and Nutrition Service (FNS), examines the food purchasing behavior of various subgroups of the U.S. population. These subgroups include participants in the Supplemental Nutrition Assistance Program (SNAP) and the Special Supplemental Nutrition Program for Women, Infants, and Children (WIC), as well as households who are eligible for but don't participate in these programs. Participants in these social protection programs constitute small proportions of the U.S. population; obtaining an adequate number of such participants in a survey would be challenging absent stratified sampling to target SNAP and WIC participating households. This document describes how the U.S. Census Bureau (which is planning to conduct future versions of the FoodAPS survey on behalf of USDA) created sampling strata to flag the FoodAPS targeted subpopulations using machine learning applications in linked survey and administrative data. We describe the data, modeling techniques, and how well the sampling flags target low-income households and households receiving WIC and SNAP benefits. We additionally situate these efforts in the nascent literature on the use of big data and machine learning for the improvement of survey efficiency.
View Full Paper PDF

Incorporating Administrative Data in Survey Weights for the Basic Monthly Current Population Survey

January 2024

Working Paper Number:

CES-24-02

Abstract

Document Tags and Keywords

The 10 most similar working papers to the working paper 'Incorporating Administrative Data in Survey Weights for the Basic Monthly Current Population Survey' are listed below in order of similarity.

October 2024

Working Paper Number:

CES-24-58

October 2024

Working Paper Number:

CES-24-60

July 2022

Working Paper Number:

CES-22-24

February 2023

Working Paper Number:

CES-23-04

January 2024

Working Paper Number:

CES-24-01

August 2009

Working Paper Number:

CES-09-20

March 2018

Working Paper Number:

carra-2018-01

January 2021

Working Paper Number:

CES-21-02

June 2024

Working Paper Number:

CES-24-32

February 2025

Working Paper Number:

CES-25-13