CREAT - Census Bureau

Papers Containing Keywords(s): 'census bureau'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.

Click here to search again

Filter Working Papers By Year:

Frequently Occurring Concepts within this Search

Viewing papers 1 through 10 of 95

Working Paper

Non-Random Assignment of Individual Identifiers and Selection into Linked Data: Implications for Research

January 2026

Authors: Liana Christin Landivar, Kyle Raze, Nicole Perales

Working Paper Number:

CES-26-06

The U.S. Census Bureau's Person Identification Validation System facilitates anonymous linkages between survey and administrative records by assigning Protected Identification Keys (PIKs) to person records. While PIK assignment is generally accurate, some person records are not successfully assigned a PIK, which can lead to sample selection bias in analyses of linked data. Using the American Community Survey (ACS) and the Current Population Survey Annual Social and Economic Supplement (CPS ASEC) between 2005 and 2022, we corroborate and extend existing findings on the drivers of PIK assignment, showing that the rate of PIK assignment varies widely across socio-demographic subgroups. Using earnings as a test case, we then show that limiting a survey sample of wage earners to person records with PIKs or successful linkages to W-2 wage records tends to overestimate self-reported wage earnings, on average, indicative of linkage-induced selection bias. In a validation exercise, we demonstrate that reweighting methods, such as inverse probability weighting or entropy balancing, can mitigate this bias.
View Full Paper PDF
Working Paper

Integrating Multiple U.S. Census Bureau Data Assets to Create Standardized Profiles of Program Participants

January 2026

Authors: Joyce K. Hahn, Robert Dominy III, Samuel Glick, Katlyn King, MariTere Molinet, JJ Naddeo, Margaret Sabelhous, Aaron Weinstock

Working Paper Number:

CES-26-01

The Foundations for Evidence-Based Policymaking Act of 2018 (Evidence Act) directed federal agencies to systematically use data when making policy decisions. In response, the U.S. Census Bureau established the Evidence Group within its Center for Economic Studies (CES). With an interdisciplinary team of economists, sociologists, and statisticians, the Evidence Group can support the broader federal government in their efforts to use existing data to improve program operations without increasing respondent burden. For federal agencies administering social safety net and business assistance programs in particular, the team provides a no-cost evidence-building service that links program records to Census Bureau data assets and creates a series of standardized tables describing participants, their economic outcomes prior to program entry, and the communities where they live. These tables provide partner agencies with the detailed information they need to better understand their participants and potentially make their programs more accountable and effective in reaching their target populations. In this working paper, we describe the standardized tables themselves as well as the data assets available at the Census Bureau to create these tables, the data files produced by the table production process, and the methodology used to merge and harmonize data on participants and subsequently calculate unbiased and accurate estimates. We conclude with a brief discussion of steps taken to ensure confidentiality and data security. This documentation is intended to facilitate proper use and understanding of the standardized tables by partner agencies as well as researchers who are interested in leveraging these tools to explore characteristics of their samples of interest.
View Full Paper PDF
Working Paper

School-Based Disability Identification Varies by Student Family Income

December 2025

Authors: Quentin Brummet, Andrew Penner, Emily Penner, Leah R. Clark, Michelle Spiegel, Paul Y. Yoo, Paul Hanselman, Nicholas J. Ainsworth, Christopher Cleveland, Jacob Hibel, Andrew Saultz, Juan Camilo Cristancho

Working Paper Number:

CES-25-74

Currently, 18 percent of K-12 students in the United States receive additional supports through the identification of a disability. Socioeconomic status is viewed as central to understanding who gets identified as having a disability, yet limited large-scale evidence examines how disability identification varies for students from different income backgrounds. Using unique data linking information on Oregon students and their family income, we document pronounced income-based differences in how students are categorized for two school-based disability supports: special education services and Section 504 plans. We find that a quarter of students in the lowest income percentile receive supports through special education, compared with less than seven percent of students in the top income percentile. This pattern may partially reflect differences in underlying disability-related needs caused by poverty. However, we find the opposite pattern for 504 plans, where students in the top income percentiles are two times more likely to receive 504 plan supports. We further document substantial variation in these income-based differences by disability category, by race/ethnicity, and by grade level. Together, these patterns suggest that disability-related needs alone cannot account for the income-based differences that we observe and highlight the complex ways that income shapes the school and family processes that lead to variability in disability classification and services.
View Full Paper PDF
Working Paper

Gifted Identification Across the Distribution of Family Income

December 2025

Authors: Quentin Brummet, Andrew Penner, Emily Penner, Leah R. Clark, Michelle Spiegel, Paul Hanselman, Nicholas J. Ainsworth, Aaron J. Ainsworth, Christopher Cleveland, Jacob Hibel, Andrew Saultz

Working Paper Number:

CES-25-73

Currently, 6.1 percent of K-12 students in the United States receive gifted education. Using education and IRS data that provide information on students and their family income, we show pronounced differences in who schools identify as gifted across the distribution of family income. Under 4 percent of students in the lowest income percentile are identified as gifted, compared with 20 percent of those in the top income percentile. Income-based differences persist after accounting for student test scores and exist across students of different sexes and racial/ethnic groups, underscoring the importance of family resources for gifted identification in schools.
View Full Paper PDF
Working Paper

Optimal Stratified Sampling for Probability-Based Online Panels

September 2025

Authors: Jonathan Eggleston

Working Paper Number:

CES-25-69

Online probability-based panels have emerged as a cost-efficient means of conducting surveys in the 21st century. While there have been various recent advancements in sampling techniques for online panels, several critical aspects of sampling theory for online panels are lacking. Much of current sampling theory from the middle of the 20th century, when response rates were high, and online panels did not exist. This paper presents a mathematical model of stratified sampling for online panels that takes into account historical response rates and survey costs. Through some simplifying assumptions, the model shows that the optimal sample allocation for online panels can largely resemble the solution for a cross-sectional survey. To apply the model, I use the Census Household Panel to show how this method could improve the average precision of key estimates. Holding fielding costs constant, the new sample rates improve the average precision of estimates between 1.47 and 17.25 percent, depending on the importance weight given to an overall population mean compared to mean estimates for racial and ethnic subgroups.
View Full Paper PDF
Working Paper

Matching Compustat Data to the Longitudinal Business Database, 1976-2020

September 2025

Authors: Cristina Tello-Trillo, Lawrence Schmidt, Sean Streiff

Working Paper Number:

CES-25-65

This paper details the methodology for creating an updated Compustat-Longitudinal Business Database (LBD) bridge, facilitating linkage between company identifiers in Compustat and firm identifiers in the LBD. In addition to data from Compustat, we incorporate historical data on public companies from various public and private sources, including information on executive names. Our methodology involves a series of stages using fuzzy name and address matching, including EIN, telephone number, and industry code matching. Qualified researchers with approved proposals can access this bridge though the Federal Statistical Research Data Centers. The Compustat-SSL bridge serves as a crucial resource for longitudinal studies on U.S. businesses, corporate governance, and executive compensation.
View Full Paper PDF
Working Paper

Job Tasks, Worker Skills, and Productivity

September 2025

Authors: John Haltiwanger, Lucia Foster, Cheryl Grim, Zoltan Wolf, Cindy Cunningham, Sabrina Wulff Pabilonia, Jay Stewart, Cody Tuttle, G. Jacob Blackwood, Matthew Dey, Rachel Nesbit

Working Paper Number:

CES-25-63

We present new empirical evidence suggesting that we can better understand productivity dispersion across businesses by accounting for differences in how tasks, skills, and occupations are organized. This aligns with growing attention to the task content of production. We link establishment-level data from the Bureau of Labor Statistics Occupational Employment and Wage Statistics survey with productivity data from the Census Bureau's manufacturing surveys. Our analysis reveals strong relationships between establishment productivity and task, skill, and occupation inputs. These relationships are highly nonlinear and vary by industry. When we account for these patterns, we can explain a substantial share of productivity dispersion across establishments.
View Full Paper PDF
Working Paper

Estimating the Graduate Coverage of Post-Secondary Employment Outcomes

September 2025

Authors: Cody Orr

Working Paper Number:

CES-25-61

This paper proposes a new methodology for estimating the coverage rate of the Post-Secondary Employment Outcomes data product (PSEO), both as a share of new graduates and as a share of total working-age degree holders in the United States. This paper also assesses how representative PSEO is of the broader population of college graduates across an array of institutional and individual characteristics.
View Full Paper PDF
Working Paper

A Simulated Reconstruction and Reidentification Attack on the 2010 U.S. Census

August 2025

Authors: Lars Vilhuber, John M. Abowd, Ethan Lewis, Nathan Goldschlag, Michael B. Hawes, Robert Ashmead, Daniel Kifer, Philip Leclerc, Rolando A. Rodríguez, Tamara Adams, David Darais, Sourya Dey, Simson L. Garfinkel, Scott Moore, Ramy N. Tadros

Working Paper Number:

CES-25-57

For the last half-century, it has been a common and accepted practice for statistical agencies, including the United States Census Bureau, to adopt different strategies to protect the confidentiality of aggregate tabular data products from those used to protect the individual records contained in publicly released microdata products. This strategy was premised on the assumption that the aggregation used to generate tabular data products made the resulting statistics inherently less disclosive than the microdata from which they were tabulated. Consistent with this common assumption, the 2010 Census of Population and Housing in the U.S. used different disclosure limitation rules for its tabular and microdata publications. This paper demonstrates that, in the context of disclosure limitation for the 2010 Census, the assumption that tabular data are inherently less disclosive than their underlying microdata is fundamentally flawed. The 2010 Census published more than 150 billion aggregate statistics in 180 table sets. Most of these tables were published at the most detailed geographic level'individual census blocks, which can have populations as small as one person. Using only 34 of the published table sets, we reconstructed microdata records including five variables (census block, sex, age, race, and ethnicity) from the confidential 2010 Census person records. Using only published data, an attacker using our methods can verify that all records in 70% of all census blocks (97 million people) are perfectly reconstructed. We further confirm, through reidentification studies, that an attacker can, within census blocks with perfect reconstruction accuracy, correctly infer the actual census response on race and ethnicity for 3.4 million vulnerable population uniques (persons with race and ethnicity different from the modal person on the census block) with 95% accuracy. Having shown the vulnerabilities inherent to the disclosure limitation methods used for the 2010 Census, we proceed to demonstrate that the more robust disclosure limitation framework used for the 2020 Census publications defends against attacks that are based on reconstruction. Finally, we show that available alternatives to the 2020 Census Disclosure Avoidance System would either fail to protect confidentiality, or would overly degrade the statistics' utility for the primary statutory use case: redrawing the boundaries of all of the nation's legislative and voting districts in compliance with the 1965 Voting Rights Act.
View Full Paper PDF
Working Paper

LODES Design and Methodology Report: Methodology Version 7

August 2025

Authors: Matthew R. Graham, Mark J. Kutzbach, Andrew Foote

Working Paper Number:

CES-25-52

The purpose of this report is to document the important features of Version 7 of the LEHD Origin-Destination Employment Statistics (LODES) processing system. This includes data sources, data processing methodology, confidentiality protection methodology, some quality measures, and a high-level description of the published data. The intended audience for this document includes LODES data users, Local Employment Dynamics (LED) Partnership members, U.S. Census Bureau management, program quality auditors, and current and future research and development staff members.
View Full Paper PDF

1 2 3 4 5 6 7 8 9 10 Next Total Results: 95

Papers Containing Keywords(s): 'census bureau'

See Working Papers by Tag(s), Keywords(s), Author(s), or Search Text

Click here to search again

Frequently Occurring Concepts within this Search

Viewing papers 1 through 10 of 95

January 2026

Working Paper Number:

CES-26-06

January 2026

Working Paper Number:

CES-26-01

December 2025

Working Paper Number:

CES-25-74

December 2025

Working Paper Number:

CES-25-73

September 2025

Working Paper Number:

CES-25-69

September 2025

Working Paper Number:

CES-25-65

September 2025

Working Paper Number:

CES-25-63

September 2025

Working Paper Number:

CES-25-61

August 2025

Working Paper Number:

CES-25-57

August 2025

Working Paper Number:

CES-25-52