-
Non-Random Assignment of Individual Identifiers and Selection into Linked Data: Implications for Research
January 2026
Working Paper Number:
CES-26-06
The U.S. Census Bureau's Person Identification Validation System facilitates anonymous linkages between survey and administrative records by assigning Protected Identification Keys (PIKs) to person records. While PIK assignment is generally accurate, some person records are not successfully assigned a PIK, which can lead to sample selection bias in analyses of linked data. Using the American Community Survey (ACS) and the Current Population Survey Annual Social and Economic Supplement (CPS ASEC) between 2005 and 2022, we corroborate and extend existing findings on the drivers of PIK assignment, showing that the rate of PIK assignment varies widely across socio-demographic subgroups. Using earnings as a test case, we then show that limiting a survey sample of wage earners to person records with PIKs or successful linkages to W-2 wage records tends to overestimate self-reported wage earnings, on average, indicative of linkage-induced selection bias. In a validation exercise, we demonstrate that reweighting methods, such as inverse probability weighting or entropy balancing, can mitigate this bias.
View Full
Paper PDF
-
Integrating Multiple U.S. Census Bureau Data Assets to Create Standardized Profiles of Program Participants
January 2026
Working Paper Number:
CES-26-01
The Foundations for Evidence-Based Policymaking Act of 2018 (Evidence Act) directed federal agencies to systematically use data when making policy decisions. In response, the U.S. Census Bureau established the Evidence Group within its Center for Economic Studies (CES). With an interdisciplinary team of economists, sociologists, and statisticians, the Evidence Group can support the broader federal government in their efforts to use existing data to improve program operations without increasing respondent burden. For federal agencies administering social safety net and business assistance programs in particular, the team provides a no-cost evidence-building service that links program records to Census Bureau data assets and creates a series of standardized tables describing participants, their economic outcomes prior to program entry, and the communities where they live. These tables provide partner agencies with the detailed information they need to better understand their participants and potentially make their programs more accountable and effective in reaching their target populations. In this working paper, we describe the standardized tables themselves as well as the data assets available at the Census Bureau to create these tables, the data files produced by the table production process, and the methodology used to merge and harmonize data on participants and subsequently calculate unbiased and accurate estimates. We conclude with a brief discussion of steps taken to ensure confidentiality and data security. This documentation is intended to facilitate proper use and understanding of the standardized tables by partner agencies as well as researchers who are interested in leveraging these tools to explore characteristics of their samples of interest.
View Full
Paper PDF
-
Parental Death, Inheritance, and Labor Supply in the United States
December 2025
Working Paper Number:
CES-25-71
We are the first to study how inheritances affect labor supply in the U.S. using large-scale administrative data. Leveraging federal tax and Social Security records, we estimate event studies around parental death to investigate impacts on adult children. Our results indicate that the death of a last parent causes sizable gains in investment income'our main proxy for inheritances'and proportionate reductions in labor supply. On average, annual per-adult investment income at the tax unit level increases by about $300 (45 percent) and annual per-adult wage earnings decrease by $600 (2 percent). These earnings responses are large relative to the implied wealth transfer. Income effects are the dominant channel through which parental death reduces earnings, with children of wealthier parents exhibiting larger earnings reductions. Over six years, inheritances slightly equalize the distribution of investment income.
View Full
Paper PDF
-
Estimating the Graduate Coverage of Post-Secondary Employment Outcomes
September 2025
Working Paper Number:
CES-25-61
This paper proposes a new methodology for estimating the coverage rate of the Post-Secondary Employment Outcomes data product (PSEO), both as a share of new graduates and as a share of total working-age degree holders in the United States. This paper also assesses how representative PSEO is of the broader population of college graduates across an array of institutional and individual characteristics.
View Full
Paper PDF
-
Business Owners and the Self-Employed: 33 Million (and Counting!)
September 2025
Working Paper Number:
CES-25-60
Entrepreneurs are known to be key drivers of economic growth, and the rise of online platforms and the broader 'gig economy' has led self-employment to surge in recent decades. Yet the young and small businesses associated with this activity are often absent from economic data. In this paper, we explore a novel longitudinal dataset that covers the owners of tens of millions of the smallest businesses: those without employees. We produce three new sets of statistics on the rapidly growing set of nonemployer businesses. First, we measure transitions between self-employment and wage and salary jobs. Second, we describe nonemployer business entry and exit, as well as transitions between legal form (e.g., sole proprietorship to S corporation). Finally, we link owners to their nonemployer businesses and examine the dynamics of business ownership.
View Full
Paper PDF
-
A Simulated Reconstruction and Reidentification Attack on the 2010 U.S. Census
August 2025
Authors:
Lars Vilhuber,
John M. Abowd,
Ethan Lewis,
Nathan Goldschlag,
Michael B. Hawes,
Robert Ashmead,
Daniel Kifer,
Philip Leclerc,
Rolando A. Rodríguez,
Tamara Adams,
David Darais,
Sourya Dey,
Simson L. Garfinkel,
Scott Moore,
Ramy N. Tadros
Working Paper Number:
CES-25-57
For the last half-century, it has been a common and accepted practice for statistical agencies, including the United States Census Bureau, to adopt different strategies to protect the confidentiality of aggregate tabular data products from those used to protect the individual records contained in publicly released microdata products. This strategy was premised on the assumption that the aggregation used to generate tabular data products made the resulting statistics inherently less disclosive than the microdata from which they were tabulated. Consistent with this common assumption, the 2010 Census of Population and Housing in the U.S. used different disclosure limitation rules for its tabular and microdata publications. This paper demonstrates that, in the context of disclosure limitation for the 2010 Census, the assumption that tabular data are inherently less disclosive than their underlying microdata is fundamentally flawed. The 2010 Census published more than 150 billion aggregate statistics in 180 table sets. Most of these tables were published at the most detailed geographic level'individual census blocks, which can have populations as small as one person. Using only 34 of the published table sets, we reconstructed microdata records including five variables (census block, sex, age, race, and ethnicity) from the confidential 2010 Census person records. Using only published data, an attacker using our methods can verify that all records in 70% of all census blocks (97 million people) are perfectly reconstructed. We further confirm, through reidentification studies, that an attacker can, within census blocks with perfect reconstruction accuracy, correctly infer the actual census response on race and ethnicity for 3.4 million vulnerable population uniques (persons with race and ethnicity different from the modal person on the census block) with 95% accuracy. Having shown the vulnerabilities inherent to the disclosure limitation methods used for the 2010 Census, we proceed to demonstrate that the more robust disclosure limitation framework used for the 2020 Census publications defends against attacks that are based on reconstruction. Finally, we show that available alternatives to the 2020 Census Disclosure Avoidance System would either fail to protect confidentiality, or would overly degrade the statistics' utility for the primary statutory use case: redrawing the boundaries of all of the nation's legislative and voting districts in compliance with the 1965 Voting Rights Act.
View Full
Paper PDF
-
Differences in Disability Insurance Allowance Rates
August 2025
Working Paper Number:
CES-25-54
Allowance rates for disability insurance applications vary by race and ethnicity, but it is unclear to what extent these differences are artifacts of other differing socio-economic and health characteristics, or selection issues in SSA's race and ethnicity data. This paper uses the 2015 American Community Survey linked to 2015-2019 SSA administrative data to investigate DI application allowance rates among non-Hispanic White, non-Hispanic Black, non-Hispanic Asian, non-Hispanic American Indian/Alaska Native, and Hispanic applicants aged 25-65. The analysis uses regression, propensity score matching, and inverse probability weighting to estimate differences in allowance rates among applicants who are similar on observable characteristics. Relative to raw comparisons, differences by race and ethnicity in multivariate analyses are substantially smaller in magnitude and are generally not statistically significant.
View Full
Paper PDF
-
Credit Access in the United States
July 2025
Working Paper Number:
CES-25-45
We construct new population-level linked administrative data to study households' access to credit in the United States. These data reveal large differences in credit access by race, class, and hometown. By age 25, Black individuals, those who grew up in low-income families, and those who grew up in certain areas (including the Southeast and Appalachia) have significantly lower credit scores than other groups. Consistent with lower scores generating credit constraints, these individuals have smaller balances, more credit inquiries, higher credit card utilization rates, and greater use of alternative higher-cost forms of credit. Tests for alternative definitions of algorithmic bias in credit scores yield results in opposite directions. From a calibration perspective, group-level differences in credit scores understate differences in delinquency: conditional on a given credit score, Black individuals and those from low-income families fall delinquent at relatively higher rates. From a balance perspective, these groups receive lower credit scores even when comparing those with the same future repayment behavior. Addressing both of these biases and expanding credit access to groups with lower credit scores requires addressing group-level differences in delinquency rates. These delinquencies emerge soon after individuals access credit in their early twenties, often due to missed payments on credit cards, student loans, and other bills. Comprehensive measures of individuals' income profiles, income volatility, and observed wealth explain only a small portion of these repayment gaps. In contrast, we find that the large variation in repayment across hometowns mostly reflects the causal effect of childhood exposure to these places. Places that promote upward income mobility also promote repayment and expand credit access even conditional on income, suggesting that common place-level factors may drive behaviors in both credit and labor markets. We discuss suggestive evidence for several mechanisms that drive our results, including the role of social and cultural capital. We conclude that gaps in credit access by race, class, and hometown have roots in childhood environments.
View Full
Paper PDF
-
Consequences of Eviction for Parenting and Non-parenting College Students
June 2025
Working Paper Number:
CES-25-35
Amidst rising and increasingly unaffordable rents, 7.6 million people are threatened with eviction each year across the United States'and eviction rates are twice as high for renters with children. One important and neglected population who may experience unique levels of housing insecurity is college students, especially given that one in five college students are parents. In this study, we link 11.9 million student records to eviction filings from housing courts, demographic characteristics reported in decennial census and survey data, incomes reported on tax returns by students and their parents, and dates of birth and death from the Social Security Administration. Parenting students are more likely than non-parenting students to identify as female (62.81% vs. 55.94%) and Black (19.66% vs. 14.30%), be over 30 years old (42.73% vs. 20.25%), and have parents with lower household incomes ($100,000 vs. $140,000). Parenting students threatened with eviction (i.e., had an eviction filed against them) are much more likely than non-threatened parenting students to identify as female (81.18% vs. 62.81%) and Black (56.84% vs. 19.66%). In models adjusted for individual and institutional characteristics, we find that being threatened with an eviction was significantly associated with reduced likelihood of degree completion, reduced post-enrollment income, reduced likelihood of being married post-enrollment, and increased post-enrollment mortality. Among parenting students, 38.38% (95% confidence interval (CI): 32.50-44.26%) of non-threatened students completed a bachelor's degree compared to just 15.36% (CI: 11.61-19.11%) of students threatened with eviction. Our findings highlight the long-term economic and health impacts of housing insecurity during college, especially for parenting students. Housing stability for parenting students may have substantial multigenerational benefits for economic mobility and population health.
View Full
Paper PDF
-
Divorce, Family Arrangements, and Children's Adult Outcomes
May 2025
Working Paper Number:
CES-25-28
Nearly a third of American children experience parental divorce before adulthood. To understand its consequences, we use linked tax and Census records for over 5 million children to examine how divorce affects family arrangements and children's long-term outcomes. Following divorce, parents move apart, household income falls, parents work longer hours, families move more frequently, and households relocate to poorer neighborhoods with less economic opportunity. This bundle of changes in family circumstances suggests multiple channels through which divorce may affect children's development and outcomes. In the years following divorce, we observe sharp increases in teen births and child mortality. To examine long-run effects on children, we compare siblings with different lengths of exposure to the same divorce. We find that parental divorce reduces children's adult earnings and college residence while increasing incarceration, mortality, and teen births. Changes in household income, neighborhood quality, and parent proximity account for 25 to 60 percent of these divorce effects.
View Full
Paper PDF