-
Experimental Capture/recapture Estimation Using Census and Administrative Data
June 2026
Working Paper Number:
CES-26-38
This report expands upon the innovation of utilizing administrative records and third-party data implemented in the 2020 Census. The 2020 Census used administrative records and third-party data in address canvassing and nonresponse followup operations. The Census Bureau also has a long history of using administrative records of births, deaths, and other information to produce Demographic Analysis coverage estimates. Since 1980, the Census Bureau has produced capture-recapture coverage estimates by conducting an independent post-enumeration survey and utilizing dual system estimation approaches. This report presents the research results of attempting to see if administrative records and third-party data could be utilized to produce capture-recapture coverage estimates. This work uses an Expectation Maximization Log Linear Modeling approach previously researched by Statistics Netherlands and Statistics New Zealand. This report documents some of the experimental results from an evaluation that was part of the 2020 Census Program for Evaluation, Experiments, and Assessments.
View Full
Paper PDF
-
The Impact of Expanding Public Health Insurance on Safety Net Program Participation: Evidence From the ACA Medicaid Expansion
May 2026
Working Paper Number:
CES-26-32
We examine spillover effects from the ACA Medicaid expansion to public programs providing cash and food assistance. We consider program participation in contiguous county pairs crossing state borders, where one state took up the Medicaid expansion and the other did not, allowing us to better control for local economic trends that could affect program participation. We find that the Medicaid expansion increased participation in food assistance and one of the cash programs, with impacts mainly due to participation conditional on eligibility, rather than from labor supply responses. Our results demonstrate the potential for spillovers across safety net programs.
View Full
Paper PDF
-
The Mortality Risk of Raising Grandchildren in the United States
February 2026
Working Paper Number:
CES-26-13
In the United States, grandparents who live with and provide primary care to their grandchildren have emerged as a particularly vulnerable group since the 1990s. Using confidential data from the U.S. Census Bureau and Social Security Administration, this study linked individuals aged 50 years or older from the 2000 census long-form sample to their death records from 2000'2019 (weighted n = 64,027,000) and examined the longitudinal association between coresident grandparenting status and mortality for non-Hispanic Whites, non-Hispanic Blacks, Hispanics, and Asians. We found consistently higher rates of mortality for White coresident grandparents and lower rates for Asian coresident grandparents, regardless of the duration of primary caregiving, compared to their peers without coresident grandchildren. We also found increased risks of mortality among Hispanic long-term primary caregivers but reduced risks among Black short-term primary caregivers, compared to their peers without coresident grandchildren.
View Full
Paper PDF
-
The Design of Sampling Strata for the National Household Food Acquisition and Purchase Survey
February 2025
Working Paper Number:
CES-25-13
The National Household Food Acquisition and Purchase Survey (FoodAPS), sponsored by the United States Department of Agriculture's (USDA) Economic Research Service (ERS) and Food and Nutrition Service (FNS), examines the food purchasing behavior of various subgroups of the U.S. population. These subgroups include participants in the Supplemental Nutrition Assistance Program (SNAP) and the Special Supplemental Nutrition Program for Women, Infants, and Children (WIC), as well as households who are eligible for but don't participate in these programs. Participants in these social protection programs constitute small proportions of the U.S. population; obtaining an adequate number of such participants in a survey would be challenging absent stratified sampling to target SNAP and WIC participating households. This document describes how the U.S. Census Bureau (which is planning to conduct future versions of the FoodAPS survey on behalf of USDA) created sampling strata to flag the FoodAPS targeted subpopulations using machine learning applications in linked survey and administrative data. We describe the data, modeling techniques, and how well the sampling flags target low-income households and households receiving WIC and SNAP benefits. We additionally situate these efforts in the nascent literature on the use of big data and machine learning for the improvement of survey efficiency.
View Full
Paper PDF
-
Measuring Income of the Aged in Household Surveys: Evidence from Linked Administrative Records
June 2024
Working Paper Number:
CES-24-32
Research has shown that household survey estimates of retirement income (defined benefit pensions and defined contribution account withdrawals) suffer from substantial underreporting which biases downward measures of financial well-being among the aged. Using data from both the redesigned 2016 Current Population Survey Annual Social and Economic Supplement (CPS ASEC) and the Health and Retirement Study (HRS), each matched with administrative records, we examine to what extent underreporting of retirement income affects key statistics such as reliance on Social Security benefits and poverty among the aged. We find that underreporting of retirement income is still prevalent in the CPS ASEC. While the HRS does a better job than the CPS ASEC in terms of capturing retirement income, it still falls considerably short compared to administrative records. Consequently, the relative importance of Social Security income remains overstated in household surveys'53 percent of elderly beneficiaries in the CPS ASEC and 49 percent in the HRS rely on Social Security for the majority of their incomes compared to 42 percent in the linked administrative data. The poverty rate for those aged 65 and over is also overstated'8.8 percent in the CPS ASEC and 7.4 percent in the HRS compared to 6.4 percent in the linked administrative data. Our results illustrate the effects of using alternative data sources in producing key statistics from the Social Security Administration's Income of the Aged publication.
View Full
Paper PDF
-
Citizenship Question Effects on Household Survey Response
June 2024
Working Paper Number:
CES-24-31
Several small-sample studies have predicted that a citizenship question in the 2020 Census would cause a large drop in self-response rates. In contrast, minimal effects were found in Poehler et al.'s (2020) analysis of the 2019 Census Test randomized controlled trial (RCT). We reconcile these findings by analyzing associations between characteristics about the addresses in the 2019 Census Test and their response behavior by linking to independently constructed administrative data. We find significant heterogeneity in sensitivity to the citizenship question among households containing Hispanics, naturalized citizens, and noncitizens. Response drops the most for households containing noncitizens ineligible for a Social Security number (SSN). It falls more for households with Latin American-born immigrants than those with immigrants from other countries. Response drops less for households with U.S.-born Hispanics than households with noncitizens from Latin America. Reductions in responsiveness occur not only through lower unit self-response rates, but also by increased household roster omissions and internet break-offs. The inclusion of a citizenship question increases the undercount of households with noncitizens. Households with noncitizens also have much higher citizenship question item nonresponse rates than those only containing citizens. The use of tract-level characteristics and significant heterogeneity among Hispanics, the foreign-born, and noncitizens help explain why the effects found by Poehler et al. were so small. Linking administrative microdata with the RCT data expands what we can learn from the RCT.
View Full
Paper PDF
-
The Long-Term Effects of Income for At-Risk Infants: Evidence from Supplemental Security Income
March 2024
Working Paper Number:
CES-24-10
This paper examines whether a generous cash intervention early in life can "undo" some of the long-term disadvantage associated with poor health at birth. We use new linkages between several large-scale administrative datasets to examine the short-, medium-, and long-term effects of providing low-income families with low birthweight infants support through the Supplemental Security Income (SSI) program. This program uses a birthweight cutoff at 1200 grams to determine eligibility. We find that families of infants born just below this cutoff experience a large increase in cash benefits totaling about 27%of family income in the first three years of the infant's life. These cash benefits persist at lower amounts through age 10. Eligible infants also experience a small but statistically significant increase in Medicaid enrollment during childhood. We examine whether this support affects health care use and mortality in infancy, educational performance in high school, post-secondary school attendance and college degree attainment, and earnings, public assistance use, and mortality in young adulthood for all infants born in California to low-income families whose birthweight puts them near the cutoff. We also examine whether these payments had spillover effects onto the older siblings of these infants who may have also benefited from the increase in family resources. Despite the comprehensive nature of this early life intervention, we detect no improvements in any of the study outcomes, nor do we find improvements among the older siblings of these infants. These null effects persist across several subgroups and alternative model specifications, and, for some outcomes, our estimates are precise enough to rule out published estimates of the effect of early life cash transfers in other settings.
View Full
Paper PDF
-
Estimating the U.S. Citizen Voting-Age Population (CVAP) Using Blended Survey Data, Administrative Record Data, and Modeling: Technical Report
April 2023
Authors:
J. David Brown,
Danielle H. Sandler,
Lawrence Warren,
Moises Yi,
Misty L. Heggeness,
Joseph L. Schafer,
Matthew Spence,
Marta Murray-Close,
Carl Lieberman,
Genevieve Denoeux,
Lauren Medina
Working Paper Number:
CES-23-21
This report develops a method using administrative records (AR) to fill in responses for nonresponding American Community Survey (ACS) housing units rather than adjusting survey weights to account for selection of a subset of nonresponding housing units for follow-up interviews and for nonresponse bias. The method also inserts AR and modeling in place of edits and imputations for ACS survey citizenship item nonresponses. We produce Citizen Voting-Age Population (CVAP) tabulations using this enhanced CVAP method and compare them to published estimates. The enhanced CVAP method produces a 0.74 percentage point lower citizen share, and it is 3.05 percentage points lower for voting-age Hispanics. The latter result can be partly explained by omissions of voting-age Hispanic noncitizens with unknown legal status from ACS household responses. Weight adjustments may be less effective at addressing nonresponse bias under those conditions.
View Full
Paper PDF
-
Age, Sex, and Racial/Ethnic Disparities and Temporal-Spatial Variation in
Excess All-Cause Mortality During the COVID-19 Pandemic: Evidence from Linked Administrative and Census Bureau Data
May 2022
Working Paper Number:
CES-22-18
Research on the impact of the COVID-19 pandemic in the United States has highlighted substantial racial/ethnic disparities in excess mortality, but reports often differ in the details with respect to the size of these disparities. We suggest that these inconsistencies stem from differences in the temporal scope and measurement of race/ethnicity in existing data. We address these issues using death records for 2010 through 2021 from the Social Security Administration, covering the universe of individuals ever issued a Social Security Number, linked to race/ethnicity responses from the decennial census and American Community Survey. We use these data to (1) estimate excess all-cause mortality at the national level and for age-, sex-, and race/ethnicity-specific subgroups, (2) examine racial/ethnic variation in excess mortality over the course of the pandemic, and (3) explore whether and how racial/ethnic mortality disparities vary across states.
View Full
Paper PDF
-
Disclosure Limitation and Confidentiality Protection in Linked Data
January 2018
Working Paper Number:
CES-18-07
Confidentiality protection for linked administrative data is a combination of access modalities and statistical disclosure limitation. We review traditional statistical disclosure limitation methods and newer methods based on synthetic data, input noise infusion and formal privacy. We discuss how these methods are integrated with access modalities by providing three detailed examples. The first example is the linkages in the Health and Retirement Study to Social Security Administration data. The second example is the linkage of the Survey of Income and Program Participation to administrative data from the Internal Revenue Service and the Social Security Administration. The third example is the Longitudinal Employer-Household Dynamics data, which links state unemployment insurance records for workers and firms to a wide variety of censuses and surveys at the U.S. Census Bureau. For examples, we discuss access modalities, disclosure limitation methods, the effectiveness of those methods, and the resulting analytical validity. The final sections discuss recent advances in access modalities for linked administrative data.
View Full
Paper PDF