-
The Design of Sampling Strata for the National Household Food Acquisition and Purchase Survey
February 2025
Working Paper Number:
CES-25-13
The National Household Food Acquisition and Purchase Survey (FoodAPS), sponsored by the United States Department of Agriculture's (USDA) Economic Research Service (ERS) and Food and Nutrition Service (FNS), examines the food purchasing behavior of various subgroups of the U.S. population. These subgroups include participants in the Supplemental Nutrition Assistance Program (SNAP) and the Special Supplemental Nutrition Program for Women, Infants, and Children (WIC), as well as households who are eligible for but don't participate in these programs. Participants in these social protection programs constitute small proportions of the U.S. population; obtaining an adequate number of such participants in a survey would be challenging absent stratified sampling to target SNAP and WIC participating households. This document describes how the U.S. Census Bureau (which is planning to conduct future versions of the FoodAPS survey on behalf of USDA) created sampling strata to flag the FoodAPS targeted subpopulations using machine learning applications in linked survey and administrative data. We describe the data, modeling techniques, and how well the sampling flags target low-income households and households receiving WIC and SNAP benefits. We additionally situate these efforts in the nascent literature on the use of big data and machine learning for the improvement of survey efficiency.
View Full
Paper PDF
-
Geographic Disparities in Alzheimer's Disease and Related Dementia Mortality in the US: Comparing Impacts of Place of Birth and Place of Residence
January 2025
Working Paper Number:
CES-25-11
Objective: Building on the hypothesis that early-life exposures might influence the onset of Alzheimer's Disease and Related Dementia (ADRD), this study delves into geographic variations in ADRD mortality in the US. By considering both state of residence and state of birth, we aim to discern the comparative significance of these geospatial factors.
Methods: We conducted a secondary data analysis of the National Longitudinal Mortality Study (NLMS), that has 3.5 million records from 1973-2011 and over 0.5 million deaths. We focused on individuals born in or before 1930, tracked in NLMS cohorts from 1979-2000. Employing multi-level logistic regression, with individuals nested within states of residence and/or states of birth, we assessed the role of geographical factors in ADRD mortality variation.
Results: We found that both state of birth and state of residence account for a modest portion of ADRD mortality variation. Specifically, state of residence explains 1.19% of the total variation in ADRD mortality, whereas state of birth explains only 0.6%. When combined, both state of residence and state of birth account for only 1.05% of the variation, suggesting state of residence could matter more in ADRD mortality outcomes.
Conclusion: Findings of this study suggest that state of residence explains more variation in ADRD mortality than state of birth. These results indicate that factors in later life may present more impactful intervention points for curbing ADRD mortality. While early-life environmental exposures remain relevant, their role as primary determinants of ADRD in later life appears to be less pronounced in this study.
View Full
Paper PDF
-
Potential Bias When Using Administrative Data to Measure the Family Income of School-Aged Children
January 2025
Working Paper Number:
CES-25-03
Researchers and practitioners increasingly rely on administrative data sources to measure family income. However, administrative data sources are often incomplete in their coverage of the population, giving rise to potential bias in family income measures, particularly if coverage deficiencies are not well understood. We focus on the school-aged child population, due to its particular import to research and policy, and because of the unique challenges of linking children to family income information. We find that two of the most significant administrative sources of family income information that permit linking of children and parents'IRS Form 1040 and SNAP participation records'usefully complement each other, potentially reducing coverage bias when used together. In a case study considering how best to measure economic disadvantage rates in the public school student population, we demonstrate the sensitivity of family income statistics to assumptions about individuals who do not appear in administrative data sources.
View Full
Paper PDF
-
Food Security Status Across the Rural-Urban Continuum Before and During the COVID-19 Pandemic
January 2025
Working Paper Number:
CES-25-01
Background: Food security, defined as consistent access to sufficient food to support an active life, is a crucial social determinant of health. A key dimension affecting food security is position along the rural-urban continuum, as there are important socio-economic and environmental differences between communities related to urbanicity or rurality that impact food access. The COVID-19 pandemic created social and economic shocks that altered financial and food security, which may have had differential effects by rurality and urbanicity. However, there has been limited research on how food security differs across the shades of the rural-urban community spectrum, as most often researchers have characterized communities as either urban or rural.
Methods: In this study, which linked restricted use Current Population Survey Food Security Supplement data to census-tract level United States Department of Agriculture Rural-Urban Commuting Area codes, we estimated the prevalence of household food security across temporal (2015-2019 versus 2020-2021) and socio-spatial (urban, large rural city/town, small rural town, or isolated rural town/area) dimensions in order to characterize variations before and during the COVID-19 pandemic by urbanicity/rurality. We report prevalences as point estimates with 95% confidence intervals.
Results: The prevalence of food security was 87.7% (87.5-88.0%) in 2015-2019 and 88.8% (88.4-89.3%) in 2020-2021 for urban areas, 85.5% (84.7-86.2%) in 2015-2019 and 87.1% (85.7-88.3%) in 2020-2021 for large rural towns/cities, 82.8% (81.5-84.1%) in 2015-2019 and 87.3% (85.7-89.2%) in 2020-2021 for small rural towns, and 87.6% (86.3-88.8%) in 2015-2019 and 90.9% (88.7-92.7%) in 2020-2021 for isolated rural towns/areas.
Conclusion: These findings show that rural communities experiences of food security vary and aggregating households in these environments may mask areas of concern and concentrated need.
View Full
Paper PDF
-
Nonresponse and Coverage Bias in the Household Pulse Survey: Evidence from Administrative Data
October 2024
Working Paper Number:
CES-24-60
The Household Pulse Survey (HPS) conducted by the U.S. Census Bureau is a unique survey that provided timely data on the effects of the COVID-19 Pandemic on American households and continues to provide data on other emergent social and economic issues. Because the survey has a response rate in the single digits and only has an online response mode, there are concerns about nonresponse and coverage bias. In this paper, we match administrative data from government agencies and third-party data to HPS respondents to examine how representative they are of the U.S. population. For comparison, we create a benchmark of American Community Survey (ACS) respondents and nonrespondents and include the ACS respondents as another point of reference. Overall, we find that the HPS is less representative of the U.S. population than the ACS. However, performance varies across administrative variables, and the existing weighting adjustments appear to greatly improve the representativeness of the HPS. Additionally, we look at household characteristics by their email domain to examine the effects on coverage from limiting email messages in 2023 to addresses from the contact frame with at least 90% deliverability rates, finding no clear change in the representativeness of the HPS afterwards.
View Full
Paper PDF
-
Incorporating Administrative Data in Survey Weights for the 2018-2022 Survey of Income and Program Participation
October 2024
Working Paper Number:
CES-24-58
Response rates to the Survey of Income and Program Participation (SIPP) have declined over time, raising the potential for nonresponse bias in survey estimates. A potential solution is to leverage administrative data from government agencies and third-party data providers when constructing survey weights. In this paper, we modify various parts of the SIPP weighting algorithm to incorporate such data. We create these new weights for the 2018 through 2022 SIPP panels and examine how the new weights affect survey estimates. Our results show that before weighting adjustments, SIPP respondents in these panels have higher socioeconomic status than the general population. Existing weighting procedures reduce many of these differences. Comparing SIPP estimates between the production weights and the administrative data-based weights yields changes that are not uniform across the joint income and program participation distribution. Unlike other Census Bureau household surveys, there is no large increase in nonresponse bias in SIPP due to the COVID-19 Pandemic. In summary, the magnitude and sign of nonresponse bias in SIPP is complicated, and the existing weighting procedures may change the sign of nonresponse bias for households with certain incomes and program benefit statuses.
View Full
Paper PDF
-
Comparison of Child Reporting in the American Community Survey and Federal Income Tax Returns Based on California Birth Records
September 2024
Working Paper Number:
CES-24-55
This paper takes advantage of administrative records from California, a state with a large child population and a significant historical undercount of children in Census Bureau data, dependent information in the Internal Revenue Service (IRS) Form 1040 records, and the American Community Survey to characterize undercounted children and compare child reporting. While IRS Form 1040 records offer potential utility for adjusting child undercounting in Census Bureau surveys, this analysis finds overlapping reporting issues among various demographic and economic groups. Specifically, older children, those of Non-Hispanic Black mothers and Hispanic mothers, children or parents with lower English proficiency, children whose mothers did not complete high school, and families with lower income-to-poverty ratio were less frequently reported in IRS 1040 records than other groups. Therefore, using IRS 1040 dependent records may have limitations for accurately representing populations with characteristics associated with the undercount of children in surveys.
View Full
Paper PDF
-
The Effect of Food Assistance Work Requirements on Labor Market Outcomes
September 2024
Working Paper Number:
CES-24-54
The Supplemental Nutrition Assistance Program (SNAP), formerly named the Food Stamp Program, has long been an integral part of the US social safety net. During US welfare reforms in the mid-1990s, SNAP eligibility became more restrictive with legislation citing a need to improve self-sufficiency of participating households. As a result, legislatures created two of these eligibility requirements: the General Work Requirement (GWR), which forces an adult to work to receive benefits, and the Able-Bodied Adult Without Dependents (ABAWD) work requirement, which requires certain adults to work a certain number of hours to receive benefits. Using restricted-access SNAP microdata from nine states, we exploit age cutoffs of the ABAWD work requirement and General Work Requirement (GWR) to estimate the effect of these policies on labor outcomes. We find that at the ABAWD age cutoff, there is no statistically significant evidence of a discontinuity across static and dynamic employment outcomes. At the GWR age cutoff, unemployed SNAP users and SNAP-eligible adults are on average more likely to leave the labor force than to continue to search for work.
View Full
Paper PDF
-
The Icing on the Cake: The Effects of Monetary Incentives on Income Data Quality in the SIPP
January 2024
Working Paper Number:
CES-24-03
Accurate measurement of key income variables plays a crucial role in economic research and policy decision-making. However, the presence of item nonresponse and measurement error in survey data can cause biased estimates. These biases can subsequently lead to sub-optimal policy decisions and inefficient allocation of resources. While there have been various studies documenting item nonresponse and measurement error in economic data, there have not been many studies investigating interventions that could reduce item nonresponse and measurement error. In our research, we investigate the impact of monetary incentives on reducing item nonresponse and measurement error for labor and investment income in the Survey of Income and Program Participation (SIPP). Our study utilizes a randomized incentive experiment in Waves 1 and 2 of the 2014 SIPP, which allows us to assess the effectiveness of incentives in reducing item nonresponse and measurement error. We find that households receiving incentives had item nonresponse rates that are 1.3 percentage points lower for earnings and 1.5 percentage points lower for Social Security income. Measurement error was 6.31 percentage points lower at the intensive margin for interest income, and 16.48 percentage points lower for dividend income compared to non-incentive recipient households. These findings provide valuable insights for data producers and users and highlight the importance of implementing strategies to improve data quality in economic research.
View Full
Paper PDF
-
Incorporating Administrative Data in Survey Weights for the Basic Monthly Current Population Survey
January 2024
Working Paper Number:
CES-24-02
Response rates to the Current Population Survey (CPS) have declined over time, raising the potential for nonresponse bias in key population statistics. A potential solution is to leverage administrative data from government agencies and third-party data providers when constructing survey weights. In this paper, we take two approaches. First, we use administrative data to build a non-parametric nonresponse adjustment step while leaving the calibration to population estimates unchanged. Second, we use administratively linked data in the calibration process, matching income data from the Internal Return Service and state agencies, demographic data from the Social Security Administration and the decennial census, and industry data from the Census Bureau's Business Register to both responding and nonresponding households. We use the matched data in the household nonresponse adjustment of the CPS weighting algorithm, which changes the weights of respondents to account for differential nonresponse rates among subpopulations.
After running the experimental weighting algorithm, we compare estimates of the unemployment rate and labor force participation rate between the experimental weights and the production weights. Before March 2020, estimates of the labor force participation rates using the experimental weights are 0.2 percentage points higher than the original estimates, with minimal effect on unemployment rate. After March 2020, the new labor force participation rates are similar, but the unemployment rate is about 0.2 percentage points higher in some months during the height of COVID-related interviewing restrictions. These results are suggestive that if there is any nonresponse bias present in the CPS, the magnitude is comparable to the typical margin of error of the unemployment rate estimate. Additionally, the results are overall similar across demographic groups and states, as well as using alternative weighting methodology. Finally, we discuss how our estimates compare to those from earlier papers that calculate estimates of bias in key CPS labor force statistics.
This paper is for research purposes only. No changes to production are being implemented at this time.
View Full
Paper PDF