-
Revisiting the Unintended Consequences of Ban the Box
August 2025
Working Paper Number:
CES-25-58
Ban-the-Box (BTB) policies intend to help formerly incarcerated individuals find employment by delaying when employers can ask about criminal records. We revisit the finding in Doleac and Hansen (2020) that BTB causes statistical discrimination against minority men. We correct miscoded BTB laws and show that estimates from the Current Population Survey (CPS) remain quantitatively similar, while those from the American Community Survey (ACS) now fail to reject the null hypothesis of no effect of BTB on employment. In contrast to the published estimates, these ACS results are statistically significantly different from the CPS results, indicating a lack of robustness across datasets. We do not find evidence that these differences are due to sample composition or survey weights. There is limited evidence that these divergent results are explained by the different frequencies of these surveys. Differences in sample sizes may also lead to different estimates; the ACS has a much larger sample and more statistical power to detect effects near the corrected CPS estimates.
View Full
Paper PDF
-
A Simulated Reconstruction and Reidentification Attack on the 2010 U.S. Census
August 2025
Authors:
Lars Vilhuber,
John M. Abowd,
Ethan Lewis,
Nathan Goldschlag,
Michael B. Hawes,
Robert Ashmead,
Daniel Kifer,
Philip Leclerc,
Rolando A. Rodríguez,
Tamara Adams,
David Darais,
Sourya Dey,
Simson L. Garfinkel,
Scott Moore,
Ramy N. Tadros
Working Paper Number:
CES-25-57
For the last half-century, it has been a common and accepted practice for statistical agencies, including the United States Census Bureau, to adopt different strategies to protect the confidentiality of aggregate tabular data products from those used to protect the individual records contained in publicly released microdata products. This strategy was premised on the assumption that the aggregation used to generate tabular data products made the resulting statistics inherently less disclosive than the microdata from which they were tabulated. Consistent with this common assumption, the 2010 Census of Population and Housing in the U.S. used different disclosure limitation rules for its tabular and microdata publications. This paper demonstrates that, in the context of disclosure limitation for the 2010 Census, the assumption that tabular data are inherently less disclosive than their underlying microdata is fundamentally flawed. The 2010 Census published more than 150 billion aggregate statistics in 180 table sets. Most of these tables were published at the most detailed geographic level'individual census blocks, which can have populations as small as one person. Using only 34 of the published table sets, we reconstructed microdata records including five variables (census block, sex, age, race, and ethnicity) from the confidential 2010 Census person records. Using only published data, an attacker using our methods can verify that all records in 70% of all census blocks (97 million people) are perfectly reconstructed. We further confirm, through reidentification studies, that an attacker can, within census blocks with perfect reconstruction accuracy, correctly infer the actual census response on race and ethnicity for 3.4 million vulnerable population uniques (persons with race and ethnicity different from the modal person on the census block) with 95% accuracy. Having shown the vulnerabilities inherent to the disclosure limitation methods used for the 2010 Census, we proceed to demonstrate that the more robust disclosure limitation framework used for the 2020 Census publications defends against attacks that are based on reconstruction. Finally, we show that available alternatives to the 2020 Census Disclosure Avoidance System would either fail to protect confidentiality, or would overly degrade the statistics' utility for the primary statutory use case: redrawing the boundaries of all of the nation's legislative and voting districts in compliance with the 1965 Voting Rights Act.
View Full
Paper PDF
-
Locating Hispanic Americans, 1900-2020
July 2025
Working Paper Number:
CES-25-50
This study examines Hispanic Americans' residential settlement patterns nationwide in the last 120 years. Drawing on newly available neighborhood data for the whole country as early as 1900, it documents the direction and timing of changes in two aspects of their location. First, it charts Hispanics' transition from a predominantly rural population to majority metropolitan by 1930 and also their growing presence in all regions of the U.S. while still maintaining a predominance in the West and Texas. Second, it provides the first evidence of the long-term trajectory of their segregation from whites in the metropolitan areas where they were settling. As shown by studies of more recent decades, Hispanics were never as segregated as African Americans. Nonetheless, similar to African Americans, their segregation from whites increased to high levels through the middle of the century, followed by slow decline. For both groups metropolitan segregation was driven mainly by segregation among central city neighborhoods prior to the 1940s. But new forms of segregation ' a growing city/suburb divide and increasing segregation among suburban places ' have become the largest contributors to segregation today.
View Full
Paper PDF
-
Earnings Measurement Error, Nonresponse and Administrative Mismatch in the CPS
July 2025
Working Paper Number:
CES-25-48
Using the Current Population Survey Annual Social and Economic Supplement matched to Social Security Administration Detailed Earnings Records, we link observations across consecutive years to investigate a relationship between item nonresponse and measurement error in the earnings questions. Linking individuals across consecutive years allows us to observe switching from response to nonresponse and vice versa. We estimate OLS, IV, and finite mixture models that allow for various assumptions separately for men and women. We find that those who respond in both years of the survey exhibit less measurement error than those who respond in one year. Our findings suggest a trade-off between survey response and data quality that should be considered by survey designers, data collectors, and data users.
View Full
Paper PDF
-
Credit Access in the United States
July 2025
Working Paper Number:
CES-25-45
We construct new population-level linked administrative data to study households' access to credit in the United States. These data reveal large differences in credit access by race, class, and hometown. By age 25, Black individuals, those who grew up in low-income families, and those who grew up in certain areas (including the Southeast and Appalachia) have significantly lower credit scores than other groups. Consistent with lower scores generating credit constraints, these individuals have smaller balances, more credit inquiries, higher credit card utilization rates, and greater use of alternative higher-cost forms of credit. Tests for alternative definitions of algorithmic bias in credit scores yield results in opposite directions. From a calibration perspective, group-level differences in credit scores understate differences in delinquency: conditional on a given credit score, Black individuals and those from low-income families fall delinquent at relatively higher rates. From a balance perspective, these groups receive lower credit scores even when comparing those with the same future repayment behavior. Addressing both of these biases and expanding credit access to groups with lower credit scores requires addressing group-level differences in delinquency rates. These delinquencies emerge soon after individuals access credit in their early twenties, often due to missed payments on credit cards, student loans, and other bills. Comprehensive measures of individuals' income profiles, income volatility, and observed wealth explain only a small portion of these repayment gaps. In contrast, we find that the large variation in repayment across hometowns mostly reflects the causal effect of childhood exposure to these places. Places that promote upward income mobility also promote repayment and expand credit access even conditional on income, suggesting that common place-level factors may drive behaviors in both credit and labor markets. We discuss suggestive evidence for several mechanisms that drive our results, including the role of social and cultural capital. We conclude that gaps in credit access by race, class, and hometown have roots in childhood environments.
View Full
Paper PDF
-
Impact Investing and Worker Outcomes
May 2025
Working Paper Number:
CES-25-30
Impact investors claim to distinguish themselves from traditional venture capital and growth equity investors by also pursuing environmental, social, and governance (ESG) objectives. Whether they successfully do so in practice is unclear. We use confidential Census Bureau microdata to assess worker outcomes across portfolio companies. Impact investors are more likely than other private equity firms to fund businesses in economically disadvantaged areas, and the performance of these companies lags behind those held by traditional private investors. We show that post-funding impact-backed firms are more likely to hire minorities, unskilled workers, and individuals with lower historical earnings, perhaps reflecting the higher representation of minorities in top positions. They also allocate wage increases more favorably to minorities and rank-and-file workers than VC-backed firms. Our results are consistent with impact investors and their portfolio companies acting according to non-pecuniary social goals and thus are not consistent with mere window dressing or cosmetic changes.
View Full
Paper PDF
-
Exploratory Report: Annual Business Survey Ownership Diversity and Its Association with Patenting and Venture Capital Success
October 2024
Working Paper Number:
CES-24-62
The Annual Business Survey (ABS) as the replacement for the Survey of Business Owners (SBO) serves as the principal data source for investigating business ownership of minorities, women, and immigrants. As a combination of SBO, the innovation questions formerly collected in the Business R&D and Innovation Survey (BRDIS), and an R&D module for microbusinesses with fewer than 10 employees, ABS opens new research opportunities investigating how ownership demographics are associated with innovation. One critical issue that ABS is uniquely able to investigate is the role that diversity among ownership teams plays in facilitating innovation or intermediate innovation outcomes in R&D-performing microbusinesses. Earlier research using ABS identified both demographic and disciplinary diversity as strong correlates to new-to-market innovation. This research investigates the extent to which the various forms of diversity also impact tangible innovation related intermediate outcomes such as the awarding of patents or securing venture capital financing for R&D. The other major difference with the earlier work is the focus on R&D-performing microbusinesses that are an essential input to radical innovation through the division of innovative labor. Evidence that disciplinary and/or demographic diversity affect the likelihood of receiving a patent or securing venture capital financing by small, high-tech start-ups may have implications for higher education, affirmative action, and immigration policy.
View Full
Paper PDF
-
Nonresponse and Coverage Bias in the Household Pulse Survey: Evidence from Administrative Data
October 2024
Working Paper Number:
CES-24-60
The Household Pulse Survey (HPS) conducted by the U.S. Census Bureau is a unique survey that provided timely data on the effects of the COVID-19 Pandemic on American households and continues to provide data on other emergent social and economic issues. Because the survey has a response rate in the single digits and only has an online response mode, there are concerns about nonresponse and coverage bias. In this paper, we match administrative data from government agencies and third-party data to HPS respondents to examine how representative they are of the U.S. population. For comparison, we create a benchmark of American Community Survey (ACS) respondents and nonrespondents and include the ACS respondents as another point of reference. Overall, we find that the HPS is less representative of the U.S. population than the ACS. However, performance varies across administrative variables, and the existing weighting adjustments appear to greatly improve the representativeness of the HPS. Additionally, we look at household characteristics by their email domain to examine the effects on coverage from limiting email messages in 2023 to addresses from the contact frame with at least 90% deliverability rates, finding no clear change in the representativeness of the HPS afterwards.
View Full
Paper PDF
-
Income, Wealth, and Environmental Inequality in the United States
October 2024
Working Paper Number:
CES-24-57
This paper explores the relationships between air pollution, income, wealth, and race by combining administrative data from U.S. tax returns between 1979'2016, various measures of air pollution, and sociodemographic information from linked survey and administrative data. In the first year of our data, the relationship between income and ambient pollution levels nationally is approximately zero for both non-Hispanic White and Black individuals. However, at every single percentile of the national income distribution, Black individuals are exposed to, on average, higher levels of pollution than White individuals. By 2016, the relationship between income and air pollution had steepened, primarily for Black individuals, driven by changes in where rich and poor Black individuals live. We utilize quasi-random shocks to income to examine the causal effect of changes in income and wealth on pollution exposure over a five year horizon, finding that these income'pollution elasticities map closely to the values implied by our descriptive patterns. We calculate that Black-White differences in income can explain ~10 percent of the observed gap in air pollution levels in 2016.
View Full
Paper PDF
-
Separate but Not Equal: The Uneven Cost of Residential Segregation for Network-Based Hiring
October 2024
Working Paper Number:
CES-24-56
This paper studies how residential segregation by race and by education affects job search via neighbor networks. Using confidential microdata from the US Census Bureau, I measure segregation for each characteristic at both the individual level and the neighborhood level. My findings are manifold. At the individual level, future coworkership with new neighbors on the same block is less likely among segregated individuals than among integrated workers, irrespective of races and levels of schooling. The impacts are most adverse for the most socioeconomically disadvantaged demographics: Blacks and those without a high school education. At the block level, however, higher segregation along either dimension raises the likelihood of any future coworkership on the block for all racial or educational groups. My identification strategy, capitalizing on data granularity, allows a causal interpretation of these results. Together, they point to the coexistence of homophily and in-group competition for job opportunities in linking residential segregation to neighbor-based informal hiring. My subtle findings have important implications for policy-making.
View Full
Paper PDF