CREAT: Census Research Exploration and Analysis Tool

Papers Containing Tag(s): 'Cornell University'


Frequently Occurring Concepts within this Search

National Science Foundation - 53

Longitudinal Employer Household Dynamics - 42

Current Population Survey - 35

Alfred P Sloan Foundation - 34

Bureau of Labor Statistics - 32

Center for Economic Studies - 24

Standard Industrial Classification - 23

American Community Survey - 22

Internal Revenue Service - 22

Social Security Administration - 19

Employer Identification Numbers - 19

Survey of Income and Program Participation - 19

Cornell Institute for Social and Economic Research - 19

National Institute on Aging - 18

Research Data Center - 18

LEHD Program - 18

Longitudinal Business Database - 17

Unemployment Insurance - 17

Quarterly Workforce Indicators - 17

Social Security Number - 16

North American Industry Classification System - 16

Business Register - 16

Census Bureau Disclosure Review Board - 15

Service Annual Survey - 15

Protected Identification Key - 13

Quarterly Census of Employment and Wages - 12

Ordinary Least Squares - 12

Social Security - 12

Economic Census - 12

Decennial Census - 10

Sloan Foundation - 10

Disclosure Review Board - 10

National Bureau of Economic Research - 10

Special Sworn Status - 10

Metropolitan Statistical Area - 9

2010 Census - 9

Federal Statistical Research Data Center - 9

Department of Labor - 9

Census Bureau Business Register - 9

Individual Characteristics File - 9

Chicago Census Research Data Center - 9

Public Use Micro Sample - 8

Annual Survey of Manufactures - 8

Standard Statistical Establishment List - 8

Employment History File - 8

University of Maryland - 7

Bureau of Economic Analysis - 7

Employer Characteristics File - 7

Master Address File - 7

Business Register Bridge - 7

National Longitudinal Survey of Youth - 6

National Center for Health Statistics - 6

Office of Personnel Management - 6

AKM - 6

Federal Reserve Bank - 6

American Housing Survey - 6

American Economic Review - 6

Local Employment Dynamics - 6

International Trade Research Report - 6

CDF - 6

Cumulative Density Function - 6

PSID - 6

Department of Economics - 5

1940 Census - 5

United States Census Bureau - 5

University of California Los Angeles - 5

American Economic Association - 5

Statistics Canada - 5

University of Chicago - 5

University of Michigan - 5

Business Master File - 5

Composite Person Record - 5

Business Employment Dynamics - 5

Successor Predecessor File - 5

County Business Patterns - 5

Detailed Earnings Records - 5

Business Dynamics Statistics - 5

Financial, Insurance and Real Estate Industries - 5

Some Other Race - 4

Department of Health and Human Services - 4

W-2 - 4

Office of Management and Budget - 4

National Academy of Sciences - 4

Person Validation System - 4

MIT Press - 4

Journal of Labor Economics - 4

Securities and Exchange Commission - 4

Census Bureau Longitudinal Business Database - 4

Census of Manufactures - 4

Labor Productivity - 4

Journal of Economic Literature - 4

Core Based Statistical Area - 4

North American Industry Classi - 4

Federal Tax Information - 4

National Institutes of Health - 4

Duke University - 4

Organization for Economic Cooperation and Development - 4

Permanent Plant Number - 4

Agency for Healthcare Research and Quality - 4

MAFID - 3

Census Edited File - 3

Department of Education - 3

Ohio State University - 3

Census Numident - 3

New York University - 3

Census of Manufacturing Firms - 3

Postal Service - 3

Longitudinal Research Database - 3

Health and Retirement Study - 3

Review of Economics and Statistics - 3

Bureau of Labor - 3

Probability Density Function - 3

Georgetown University - 3

National Health Interview Survey - 3

Total Factor Productivity - 3

IZA - 3

Establishment Micro Properties - 3

Russell Sage Foundation - 3

National Research Council - 3

Computer Assisted Personal Interview - 3

Urban Institute - 3

survey - 32

statistical - 31

labor - 26

employed - 26

employ - 25

workforce - 25

census bureau - 24

employee - 23

respondent - 22

earnings - 21

data - 20

population - 19

payroll - 18

agency - 17

research census - 17

employer household - 16

census data - 15

estimating - 15

longitudinal - 15

report - 14

statistician - 14

data census - 14

economist - 13

employee data - 13

econometric - 12

worker - 12

analysis - 11

census research - 11

longitudinal employer - 11

aging - 11

economic census - 10

percentile - 10

researcher - 10

microdata - 10

datasets - 10

salary - 10

estimation - 9

quarterly - 9

hispanic - 8

use census - 8

disclosure - 8

research - 8

census employment - 8

earner - 8

minority - 7

hiring - 7

immigrant - 7

estimator - 7

employment statistics - 7

privacy - 7

employment dynamics - 7

censuses surveys - 7

database - 7

yearly - 7

labor statistics - 7

trend - 7

ssa - 6

information - 6

imputation - 6

workplace - 6

prevalence - 6

endogeneity - 6

average - 6

income data - 6

ethnicity - 5

ethnic - 5

heterogeneity - 5

econometrician - 5

immigration - 5

migrant - 5

residential - 5

linked census - 5

confidentiality - 5

record - 5

statistical disclosure - 5

earn - 5

income year - 5

study - 5

tenure - 5

employing - 5

unemployed - 5

layoff - 5

statistical agencies - 5

income individuals - 5

discrimination - 4

disadvantaged - 4

census responses - 4

census disclosure - 4

labor markets - 4

mexican - 4

economically - 4

medicaid - 4

state - 4

tax - 4

public - 4

employment data - 4

department - 4

occupation - 4

expenditure - 4

aggregate - 4

census survey - 4

assessing - 4

migration - 4

entrepreneurship - 4

decline - 4

native - 4

migrate - 4

assimilation - 4

educated - 4

workforce indicators - 4

job - 4

household income - 4

poverty - 4

employment estimates - 4

census business - 4

bias - 3

assessed - 3

sector - 3

spillover - 3

welfare - 3

immigrated - 3

citizen - 3

2010 census - 3

associate - 3

insurance - 3

corporation - 3

social - 3

work census - 3

clerical - 3

census file - 3

worker demographics - 3

entrepreneur - 3

employment growth - 3

bankruptcy - 3

inference - 3

finance - 3

endogenous - 3

recession - 3

employment earnings - 3

migrating - 3

refugee - 3

segregation - 3

measures employment - 3

regressing - 3

federal - 3

disability - 3

income distributions - 3

household surveys - 3

Viewing papers 1 through 10 of 80


  • Working Paper

    Revisiting the Unintended Consequences of Ban the Box

    August 2025

    Working Paper Number:

    CES-25-58

    Ban-the-Box (BTB) policies intend to help formerly incarcerated individuals find employment by delaying when employers can ask about criminal records. We revisit the finding in Doleac and Hansen (2020) that BTB causes statistical discrimination against minority men. We correct miscoded BTB laws and show that estimates from the Current Population Survey (CPS) remain quantitatively similar, while those from the American Community Survey (ACS) now fail to reject the null hypothesis of no effect of BTB on employment. In contrast to the published estimates, these ACS results are statistically significantly different from the CPS results, indicating a lack of robustness across datasets. We do not find evidence that these differences are due to sample composition or survey weights. There is limited evidence that these divergent results are explained by the different frequencies of these surveys. Differences in sample sizes may also lead to different estimates; the ACS has a much larger sample and more statistical power to detect effects near the corrected CPS estimates.
  • Working Paper

    A Simulated Reconstruction and Reidentification Attack on the 2010 U.S. Census

    August 2025

    Working Paper Number:

    CES-25-57

    For the last half-century, it has been a common and accepted practice for statistical agencies, including the United States Census Bureau, to adopt different strategies to protect the confidentiality of aggregate tabular data products from those used to protect the individual records contained in publicly released microdata products. This strategy was premised on the assumption that the aggregation used to generate tabular data products made the resulting statistics inherently less disclosive than the microdata from which they were tabulated. Consistent with this common assumption, the 2010 Census of Population and Housing in the U.S. used different disclosure limitation rules for its tabular and microdata publications. This paper demonstrates that, in the context of disclosure limitation for the 2010 Census, the assumption that tabular data are inherently less disclosive than their underlying microdata is fundamentally flawed. The 2010 Census published more than 150 billion aggregate statistics in 180 table sets. Most of these tables were published at the most detailed geographic level, individual census blocks, which can have populations as small as one person. Using only 34 of the published table sets, we reconstructed microdata records including five variables (census block, sex, age, race, and ethnicity) from the confidential 2010 Census person records. Using only published data, an attacker using our methods can verify that all records in 70% of all census blocks (97 million people) are perfectly reconstructed. We further confirm, through reidentification studies, that an attacker can, within census blocks with perfect reconstruction accuracy, correctly infer the actual census response on race and ethnicity for 3.4 million vulnerable population uniques (persons with race and ethnicity different from the modal person on the census block) with 95% accuracy.
Having shown the vulnerabilities inherent to the disclosure limitation methods used for the 2010 Census, we proceed to demonstrate that the more robust disclosure limitation framework used for the 2020 Census publications defends against attacks that are based on reconstruction. Finally, we show that available alternatives to the 2020 Census Disclosure Avoidance System would either fail to protect confidentiality, or would overly degrade the statistics' utility for the primary statutory use case: redrawing the boundaries of all of the nation's legislative and voting districts in compliance with the 1965 Voting Rights Act.
  • Working Paper

    The Effect of the Minimum Wage on Childcare Establishments

    August 2025

    Working Paper Number:

    CES-25-53

    Childcare is essential for working families, yet it remains increasingly unaffordable and inaccessible for parents and offers poverty-level wages to many employees. While research suggests minimum wage policies may improve the welfare of low-wage workers, there is also evidence they may increase firm exits, especially among smaller, low-profit firms, which could reduce access and harm consumer well-being. This study is the first to examine these trade-offs in the childcare industry, a labor-intensive, highly regulated sector where capital-labor substitution is limited, and to provide evidence on how minimum wage policies affect a dual-sector labor market in the U.S., where self-employed and waged providers serve overlapping markets. Using variation from state-level minimum wage increases between 1995 and 2019 and unique microdata, I implement a cross-state county border discontinuity design to estimate impacts on the stocks, flows, and composition of childcare establishments. I find that while county-level aggregate establishment stocks and employment remained stable, establishment-level turnover increased, and employment decreased. I reconcile these findings by showing that minimum wage increases prompted reallocation, with larger establishments in the waged-sector more likely to enter and less likely to exit, making this one of the first studies to link null aggregate effects to shifts in establishment composition. Finally, I show that minimum wage increases may negatively affect the self-employed sector, resulting in fewer owners with advanced degrees and more with only high school education. These findings suggest that minimum wage policies reshape who provides care in ways that could affect both quality and access.
  • Working Paper

    Divorce, Family Arrangements, and Children's Adult Outcomes

    May 2025

    Working Paper Number:

    CES-25-28

    Nearly a third of American children experience parental divorce before adulthood. To understand its consequences, we use linked tax and Census records for over 5 million children to examine how divorce affects family arrangements and children's long-term outcomes. Following divorce, parents move apart, household income falls, parents work longer hours, families move more frequently, and households relocate to poorer neighborhoods with less economic opportunity. This bundle of changes in family circumstances suggests multiple channels through which divorce may affect children's development and outcomes. In the years following divorce, we observe sharp increases in teen births and child mortality. To examine long-run effects on children, we compare siblings with different lengths of exposure to the same divorce. We find that parental divorce reduces children's adult earnings and college residence while increasing incarceration, mortality, and teen births. Changes in household income, neighborhood quality, and parent proximity account for 25 to 60 percent of these divorce effects.
  • Working Paper

    Estimating the Potential Impact of Combined Race and Ethnicity Reporting on Long-Term Earnings Statistics

    September 2024

    Working Paper Number:

    CES-24-48

    We use place of birth information from the Social Security Administration linked to earnings data from the Longitudinal Employer-Household Dynamics Program and detailed race and ethnicity data from the 2010 Census to study how long-term earnings differentials vary by place of birth for different self-identified race and ethnicity categories. We focus on foreign-born persons from countries that are heavily Hispanic and from countries in the Middle East and North Africa (MENA). We find substantial heterogeneity of long-term earnings differentials within country of birth, some of which will be difficult to detect when the reporting format changes from the current two-question version to the new single-question version because they depend on self-identifications that place the individual in two distinct categories within the single-question format, specifically, Hispanic and White or Black, and MENA and White or Black. We also study the USA-born children of these same immigrants. Long-term earnings differences for the 2nd generation also vary as a function of self-identified ethnicity and race in ways that changing to the single-question format could affect.
  • Working Paper

    A Simulated Reconstruction and Reidentification Attack on the 2010 U.S. Census: Full Technical Report

    December 2023

    Working Paper Number:

    CES-23-63R

    For the last half-century, it has been a common and accepted practice for statistical agencies, including the United States Census Bureau, to adopt different strategies to protect the confidentiality of aggregate tabular data products from those used to protect the individual records contained in publicly released microdata products. This strategy was premised on the assumption that the aggregation used to generate tabular data products made the resulting statistics inherently less disclosive than the microdata from which they were tabulated. Consistent with this common assumption, the 2010 Census of Population and Housing in the U.S. used different disclosure limitation rules for its tabular and microdata publications. This paper demonstrates that, in the context of disclosure limitation for the 2010 Census, the assumption that tabular data are inherently less disclosive than their underlying microdata is fundamentally flawed. The 2010 Census published more than 150 billion aggregate statistics in 180 table sets. Most of these tables were published at the most detailed geographic level, individual census blocks, which can have populations as small as one person. Using only 34 of the published table sets, we reconstructed microdata records including five variables (census block, sex, age, race, and ethnicity) from the confidential 2010 Census person records. Using only published data, an attacker using our methods can verify that all records in 70% of all census blocks (97 million people) are perfectly reconstructed. We further confirm, through reidentification studies, that an attacker can, within census blocks with perfect reconstruction accuracy, correctly infer the actual census response on race and ethnicity for 3.4 million vulnerable population uniques (persons with race and ethnicity different from the modal person on the census block) with 95% accuracy.
Having shown the vulnerabilities inherent to the disclosure limitation methods used for the 2010 Census, we proceed to demonstrate that the more robust disclosure limitation framework used for the 2020 Census publications defends against attacks that are based on reconstruction. Finally, we show that available alternatives to the 2020 Census Disclosure Avoidance System would either fail to protect confidentiality, or would overly degrade the statistics' utility for the primary statutory use case: redrawing the boundaries of all of the nation's legislative and voting districts in compliance with the 1965 Voting Rights Act. You are reading the full technical report. For the summary paper see https://doi.org/10.1162/99608f92.4a1ebf70.
  • Working Paper

    The 2010 Census Confidentiality Protections Failed, Here's How and Why

    December 2023

    Working Paper Number:

    CES-23-63

    Using only 34 published tables, we reconstruct five variables (census block, sex, age, race, and ethnicity) in the confidential 2010 Census person records. Using the 38-bin age variable tabulated at the census block level, at most 20.1% of reconstructed records can differ from their confidential source on even a single value for these five variables. Using only published data, an attacker can verify that all records in 70% of all census blocks (97 million people) are perfectly reconstructed. The tabular publications in Summary File 1 thus have prohibited disclosure risk similar to the unreleased confidential microdata. Reidentification studies confirm that an attacker can, within blocks with perfect reconstruction accuracy, correctly infer the actual census response on race and ethnicity for 3.4 million vulnerable population uniques (persons with nonmodal characteristics) with 95% accuracy, the same precision as the confidential data achieve and far greater than statistical baselines. The flaw in the 2010 Census framework was the assumption that aggregation prevented accurate microdata reconstruction, justifying weaker disclosure limitation methods than were applied to 2010 Census public microdata. The framework used for 2020 Census publications defends against attacks that are based on reconstruction, as we also demonstrate here. Finally, we show that alternatives to the 2020 Census Disclosure Avoidance System with similar accuracy (enhanced swapping) also fail to protect confidentiality, and those that partially defend against reconstruction attacks (incomplete suppression implementations) destroy the primary statutory use case: data for redistricting all legislatures in the country in compliance with the 1965 Voting Rights Act.
  • Working Paper

    Mixed-Effects Methods For Search and Matching Research

    September 2023

    Working Paper Number:

    CES-23-43

    We study mixed-effects methods for estimating equations containing person and firm effects. In economics such models are usually estimated using fixed-effects methods. Recent enhancements to those fixed-effects methods include corrections to the bias in estimating the covariance matrix of the person and firm effects, which we also consider.
  • Working Paper

    Covering Undocumented Immigrants: The Effects of a Large-Scale Prenatal Care Intervention

    August 2022

    Working Paper Number:

    CES-22-28

    Undocumented immigrants are ineligible for public insurance coverage for prenatal care in most states, despite their children representing a large fraction of births and having U.S. citizenship. In this paper, we examine a policy that expanded Medicaid pregnancy coverage to undocumented immigrants. Using a novel dataset that links California birth records to Census surveys, we identify siblings born to immigrant mothers before and after the policy. Implementing a mothers' fixed effects design, we find that the policy increased coverage for and use of prenatal care among pregnant immigrant women, and increased average gestation length and birth weight among their children.
  • Working Paper

    Shareholder Power and the Decline of Labor

    May 2022

    Working Paper Number:

    CES-22-17

    Shareholder power in the US grew over recent decades due to a steep rise in concentrated institutional ownership. Using establishment-level data from the US Census Bureau's Longitudinal Business Database for 1982-2015, this paper examines the impact of increases in concentrated institutional ownership on employment, wages, shareholder returns, and labor productivity. Consistent with theory of the firm based on conflicts of interests between shareholders and stakeholders, we find that establishments of firms that experience an increase in ownership by larger and more concentrated institutional shareholders have lower employment and wages. This result holds in both panel regressions with establishment fixed effects and a difference-in-differences design that exploits large increases in concentrated institutional ownership, and is robust to controls for industry and local shocks. The result is more pronounced in industries where labor is relatively less unionized, in more monopsonistic local labor markets, and for dedicated and activist institutional shareholders. The labor losses are accompanied by higher shareholder returns but no improvements in labor productivity, suggesting that shareholder power mainly reallocates rents away from workers. Our results imply that the rise in concentrated institutional ownership could explain about a quarter of the secular decline in the aggregate labor share.