CREAT - Census Bureau

Papers Containing Keywords(s): 'census disclosure'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.

Click here to search again

Filter Working Papers By Year:

Frequently Occurring Concepts within this Search

Viewing papers 1 through 10 of 13

Working Paper

What Happens to Contractors After States Ban Affirmative Action?

July 2026

Authors: Benjamin V. Rosa

Working Paper Number:

CES-26-41

Using restricted Census business records, I explore how banning affirmative action in state contracting affects minority- and women-owned business enterprises (MWBEs). I find that ending affirmative action led MWBE contractors to gradually downsize, with the most pronounced reductions in force experienced by Black-owned businesses and larger MWBEs. Despite these workforce changes, existing MWBEs were no more likely to shut down than other businesses. New MWBEs were relatively less common after a state's ban, highlighting how bans can shift the demographic composition of new contractors. A calibrated model suggests bans are equivalent to considerable reductions in MWBE productivity and scrap values.
View Full Paper PDF
Working Paper

School-Based Disability Identification Varies by Student Family Income

December 2025

Authors: Quentin Brummet, Andrew Penner, Emily Penner, Leah R. Clark, Michelle Spiegel, Paul Y. Yoo, Paul Hanselman, Nicholas J. Ainsworth, Christopher Cleveland, Jacob Hibel, Andrew Saultz, Juan Camilo Cristancho

Working Paper Number:

CES-25-74

Currently, 18 percent of K-12 students in the United States receive additional supports through the identification of a disability. Socioeconomic status is viewed as central to understanding who gets identified as having a disability, yet limited large-scale evidence examines how disability identification varies for students from different income backgrounds. Using unique data linking information on Oregon students and their family income, we document pronounced income-based differences in how students are categorized for two school-based disability supports: special education services and Section 504 plans. We find that a quarter of students in the lowest income percentile receive supports through special education, compared with less than seven percent of students in the top income percentile. This pattern may partially reflect differences in underlying disability-related needs caused by poverty. However, we find the opposite pattern for 504 plans, where students in the top income percentiles are two times more likely to receive 504 plan supports. We further document substantial variation in these income-based differences by disability category, by race/ethnicity, and by grade level. Together, these patterns suggest that disability-related needs alone cannot account for the income-based differences that we observe and highlight the complex ways that income shapes the school and family processes that lead to variability in disability classification and services.
View Full Paper PDF
Working Paper

A Simulated Reconstruction and Reidentification Attack on the 2010 U.S. Census

August 2025

Authors: Lars Vilhuber, John M. Abowd, Ethan Lewis, Nathan Goldschlag, Michael B. Hawes, Robert Ashmead, Daniel Kifer, Philip Leclerc, Rolando A. Rodríguez, Tamara Adams, David Darais, Sourya Dey, Simson L. Garfinkel, Scott Moore, Ramy N. Tadros

Working Paper Number:

CES-25-57

For the last half-century, it has been a common and accepted practice for statistical agencies, including the United States Census Bureau, to adopt different strategies to protect the confidentiality of aggregate tabular data products from those used to protect the individual records contained in publicly released microdata products. This strategy was premised on the assumption that the aggregation used to generate tabular data products made the resulting statistics inherently less disclosive than the microdata from which they were tabulated. Consistent with this common assumption, the 2010 Census of Population and Housing in the U.S. used different disclosure limitation rules for its tabular and microdata publications. This paper demonstrates that, in the context of disclosure limitation for the 2010 Census, the assumption that tabular data are inherently less disclosive than their underlying microdata is fundamentally flawed. The 2010 Census published more than 150 billion aggregate statistics in 180 table sets. Most of these tables were published at the most detailed geographic level'individual census blocks, which can have populations as small as one person. Using only 34 of the published table sets, we reconstructed microdata records including five variables (census block, sex, age, race, and ethnicity) from the confidential 2010 Census person records. Using only published data, an attacker using our methods can verify that all records in 70% of all census blocks (97 million people) are perfectly reconstructed. We further confirm, through reidentification studies, that an attacker can, within census blocks with perfect reconstruction accuracy, correctly infer the actual census response on race and ethnicity for 3.4 million vulnerable population uniques (persons with race and ethnicity different from the modal person on the census block) with 95% accuracy. Having shown the vulnerabilities inherent to the disclosure limitation methods used for the 2010 Census, we proceed to demonstrate that the more robust disclosure limitation framework used for the 2020 Census publications defends against attacks that are based on reconstruction. Finally, we show that available alternatives to the 2020 Census Disclosure Avoidance System would either fail to protect confidentiality, or would overly degrade the statistics' utility for the primary statutory use case: redrawing the boundaries of all of the nation's legislative and voting districts in compliance with the 1965 Voting Rights Act.
View Full Paper PDF
Working Paper

Earnings Measurement Error, Nonresponse and Administrative Mismatch in the CPS

July 2025

Authors: James P. Ziliak, Charles Hokayem, Christopher R. Bollinger

Working Paper Number:

CES-25-48

Using the Current Population Survey Annual Social and Economic Supplement matched to Social Security Administration Detailed Earnings Records, we link observations across consecutive years to investigate a relationship between item nonresponse and measurement error in the earnings questions. Linking individuals across consecutive years allows us to observe switching from response to nonresponse and vice versa. We estimate OLS, IV, and finite mixture models that allow for various assumptions separately for men and women. We find that those who respond in both years of the survey exhibit less measurement error than those who respond in one year. Our findings suggest a trade-off between survey response and data quality that should be considered by survey designers, data collectors, and data users.
View Full Paper PDF
Working Paper

Empirical Distribution of the Plant-Level Components of Energy and Carbon Intensity at the Six-digit NAICS Level Using a Modified KAYA Identity

September 2024

Authors: Gale Boyd, Matthew Doolin, Yu Ma

Working Paper Number:

CES-24-46

Three basic pillars of industry-level decarbonization are energy efficiency, decarbonization of energy sources, and electrification. This paper provides estimates of a decomposition of these three components of carbon emissions by industry: energy intensity, carbon intensity of energy, and energy (fuel) mix. These estimates are constructed at the six-digit NAICS level from non-public, plant-level data collected by the Census Bureau. Four quintiles of the distribution of each of the three components are constructed, using multiple imputation (MI) to deal with non-reported energy variables in the Census data. MI allows the estimates to avoid non-reporting bias. MI also allows more six-digit NAICS to be estimated under Census non-disclosure rules, since dropping non-reported observations may have reduced the sample sizes unnecessarily. The estimates show wide variation in each of these three components of emissions (intensity) and provide a first empirical look into the plant-level variation that underlies carbon emissions.
View Full Paper PDF
Working Paper

Connected and Uncooperative: The Effects of Homogenous and Exclusive Social Networks on Survey Response Rates and Nonresponse Bias

January 2024

Authors: Jonathan Eggleston, Chase Sawyer

Working Paper Number:

CES-24-01

Social capital, the strength of people's friendship networks and community ties, has been hypothesized as an important determinant of survey participation. Investigating this hypothesis has been difficult given data constraints. In this paper, we provide insights by investigating how response rates and nonresponse bias in the American Community Survey are correlated with county-level social network data from Facebook. We find that areas of the United States where people have more exclusive and homogenous social networks have higher nonresponse bias and lower response rates. These results provide further evidence that the effects of social capital may not be simply a matter of whether people are socially isolated or not, but also what types of social connections people have and the sociodemographic heterogeneity of their social networks.
View Full Paper PDF
Working Paper

A Simulated Reconstruction and Reidentification Attack on the 2010 U.S. Census: Full Technical Report

December 2023

Authors: Lars Vilhuber, John M. Abowd, Ethan Lewis, Nathan Goldschlag, Robert Ashmead, Daniel Kifer, Philip Leclerc, Rolando A. Rodríguez, Tamara Adams, David Darais, Sourya Dey, Simson L. Garfinkel, Scott Moore, Ramy N. Tadros

Working Paper Number:

CES-23-63R

For the last half-century, it has been a common and accepted practice for statistical agencies, including the United States Census Bureau, to adopt different strategies to protect the confidentiality of aggregate tabular data products from those used to protect the individual records contained in publicly released microdata products. This strategy was premised on the assumption that the aggregation used to generate tabular data products made the resulting statistics inherently less disclosive than the microdata from which they were tabulated. Consistent with this common assumption, the 2010 Census of Population and Housing in the U.S. used different disclosure limitation rules for its tabular and microdata publications. This paper demonstrates that, in the context of disclosure limitation for the 2010 Census, the assumption that tabular data are inherently less disclosive than their underlying microdata is fundamentally flawed. The 2010 Census published more than 150 billion aggregate statistics in 180 table sets. Most of these tables were published at the most detailed geographic level'individual census blocks, which can have populations as small as one person. Using only 34 of the published table sets, we reconstructed microdata records including five variables (census block, sex, age, race, and ethnicity) from the confidential 2010 Census person records. Using only published data, an attacker using our methods can verify that all records in 70% of all census blocks (97 million people) are perfectly reconstructed. We further confirm, through reidentification studies, that an attacker can, within census blocks with perfect reconstruction accuracy, correctly infer the actual census response on race and ethnicity for 3.4 million vulnerable population uniques (persons with race and ethnicity different from the modal person on the census block) with 95% accuracy. Having shown the vulnerabilities inherent to the disclosure limitation methods used for the 2010 Census, we proceed to demonstrate that the more robust disclosure limitation framework used for the 2020 Census publications defends against attacks that are based on reconstruction. Finally, we show that available alternatives to the 2020 Census Disclosure Avoidance System would either fail to protect confidentiality, or would overly degrade the statistics' utility for the primary statutory use case: redrawing the boundaries of all of the nation's legislative and voting districts in compliance with the 1965 Voting Rights Act. You are reading the full technical report. For the summary paper see https://doi.org/10.1162/99608f92.4a1ebf70.
View Full Paper PDF
Working Paper

An In-Depth Examination of Requirements for Disclosure Risk Assessment

October 2023

Authors: Ron Jarmin, John M. Abowd, Ian M. Schmutte, Jerome P. Reiter, Nathan Goldschlag, Victoria A. Velkoff, Michael B. Hawes, Robert Ashmead, Ryan Cumings-Menon, Sallie Ann Keller, Daniel Kifer, Philip Leclerc, Rolando A. Rodríguez, Pavel Zhuravlev

Working Paper Number:

CES-23-49

The use of formal privacy to protect the confidentiality of responses in the 2020 Decennial Census of Population and Housing has triggered renewed interest and debate over how to measure the disclosure risks and societal benefits of the published data products. Following long-established precedent in economics and statistics, we argue that any proposal for quantifying disclosure risk should be based on pre-specified, objective criteria. Such criteria should be used to compare methodologies to identify those with the most desirable properties. We illustrate this approach, using simple desiderata, to evaluate the absolute disclosure risk framework, the counterfactual framework underlying differential privacy, and prior-to-posterior comparisons. We conclude that satisfying all the desiderata is impossible, but counterfactual comparisons satisfy the most while absolute disclosure risk satisfies the fewest. Furthermore, we explain that many of the criticisms levied against differential privacy would be levied against any technology that is not equivalent to direct, unrestricted access to confidential data. Thus, more research is needed, but in the near-term, the counterfactual approach appears best-suited for privacy-utility analysis.
View Full Paper PDF
Working Paper

Universal Preschool Lottery Admissions and Its Effects on Long-Run Earnings and Outcomes

March 2023

Authors: Randall Akee, Leah R. Clark

Working Paper Number:

CES-23-09

We use an admissions lottery to estimate the effect of a universal (non-means tested) preschool program on students' long-run earnings, income, marital status, fertility and geographic mobility. We observe long-run outcomes by linking both admitted and non-admitted individuals to confidential administrative data including tax records. Funding for this preschool program comes from an Indigenous organization, which grants Indigenous students admissions preference and free tuition. We find treated children have between 5 to 6 percent higher earnings as young adults. The results are strongest for individuals from the lower half of the household income distribution in childhood. Likely mechanisms include high-quality teachers and curriculum.
View Full Paper PDF
Working Paper

Using Small-Area Estimation (SAE) to Estimate Prevalence of Child Health Outcomes at the Census Regional-, State-, and County-Levels

November 2022

Authors: Rachel M. Hantman, Anja Zgodic, Jan M. Eberth, Alexander C. McLain

Working Paper Number:

CES-22-48

In this study, we implement small-area estimation to assess the prevalence of child health outcomes at the county, state, and regional levels, using national survey data.
View Full Paper PDF

1 2 Next Total Results: 13

Papers Containing Keywords(s): 'census disclosure'

See Working Papers by Tag(s), Keywords(s), Author(s), or Search Text

Click here to search again

Frequently Occurring Concepts within this Search

Viewing papers 1 through 10 of 13

July 2026

Working Paper Number:

CES-26-41

December 2025

Working Paper Number:

CES-25-74

August 2025

Working Paper Number:

CES-25-57

July 2025

Working Paper Number:

CES-25-48

September 2024

Working Paper Number:

CES-24-46

January 2024

Working Paper Number:

CES-24-01

December 2023

Working Paper Number:

CES-23-63R

October 2023

Working Paper Number:

CES-23-49

March 2023

Working Paper Number:

CES-23-09

November 2022

Working Paper Number:

CES-22-48