CREAT: Census Research Exploration and Analysis Tool

Papers Containing Tag(s): 'Protected Identification Key'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

American Community Survey - 114

Internal Revenue Service - 103

Social Security Number - 92

Census Bureau Disclosure Review Board - 92

Social Security Administration - 83

Current Population Survey - 78

Person Validation System - 71

Social Security - 64

Longitudinal Employer Household Dynamics - 54

Decennial Census - 53

Disclosure Review Board - 53

Person Identification Validation System - 46

2010 Census - 45

Employer Identification Numbers - 43

W-2 - 40

North American Industry Classification System - 35

Survey of Income and Program Participation - 34

Bureau of Labor Statistics - 33

Census Numident - 33

Personally Identifiable Information - 33

Ordinary Least Squares - 32

Master Address File - 31

Center for Economic Studies - 28

Business Register - 28

Department of Housing and Urban Development - 27

National Science Foundation - 26

Federal Statistical Research Data Center - 25

Housing and Urban Development - 23

Adjusted Gross Income - 22

Office of Management and Budget - 22

Longitudinal Business Database - 22

SSA Numident - 21

Quarterly Census of Employment and Wages - 21

Center for Administrative Records Research and Applications - 21

Individual Taxpayer Identification Numbers - 20

Unemployment Insurance - 19

Supplemental Nutrition Assistance Program - 19

Temporary Assistance for Needy Families - 19

Service Annual Survey - 18

Earned Income Tax Credit - 18

Indian Health Service - 18

Social and Economic Supplement - 16

Quarterly Workforce Indicators - 16

Census Household Composition Key - 16

Census Bureau Business Register - 15

Administrative Records - 15

Some Other Race - 15

Employment History File - 14

Medicaid Services - 14

1940 Census - 14

PIKed - 13

National Center for Health Statistics - 13

PSID - 13

Cornell University - 13

Census Bureau Person Identification Validation System - 13

Employer Characteristics File - 13

Data Management System - 13

Computer Assisted Personal Interview - 13

Metropolitan Statistical Area - 13

MAFID - 12

Individual Characteristics File - 12

ASEC - 12

National Bureau of Economic Research - 12

University of Chicago - 12

Indian Housing Information Center - 12

Census 2000 - 12

MAF-ARF - 11

Federal Reserve Bank - 11

Detailed Earnings Records - 11

Standard Industrial Classification - 11

Composite Person Record - 11

Alfred P Sloan Foundation - 11

COVID-19 - 10

CPS ASEC - 10

Department of Education - 10

Office of Personnel Management - 10

NUMIDENT - 10

Centers for Medicare - 10

Census Bureau Master Address File - 10

American Housing Survey - 10

Research Data Center - 10

National Opinion Research Center - 10

DOB - 10

General Accounting Office - 9

Department of Health and Human Services - 9

Center for Administrative Records Research - 9

Disability Insurance - 9

Core Based Statistical Area - 9

Census Edited File - 9

Chicago Census Research Data Center - 9

Postal Service - 9

Harvard University - 8

Local Employment Dynamics - 8

Department of Homeland Security - 8

Business Dynamics Statistics - 8

Department of Labor - 8

National Longitudinal Survey of Youth - 8

Master Beneficiary Record - 8

Social Science Research Institute - 8

Current Population Survey Annual Social and Economic Supplement - 7

Computer Assisted Telephone Interviews and Computer Assisted Personal Interviews - 7

CATI - 7

CDF - 7

Cumulative Density Function - 7

American Economic Association - 7

Cornell Institute for Social and Economic Research - 7

LEHD Program - 7

National Academy of Sciences - 6

MTO - 6

Opportunity Atlas - 6

Stanford University - 6

Citizenship and Immigration Services - 6

Integrated Longitudinal Business Database - 6

Legal Form of Organization - 6

Integrated Public Use Microdata Series - 6

University of Maryland - 6

Federal Insurance Contribution Act - 6

Federal Reserve System - 6

National Institutes of Health - 6

University of Michigan - 6

Department of Agriculture - 6

Economic Census - 6

Journal of Economic Literature - 6

National Institute on Aging - 6

Department of Commerce - 6

Standard Statistical Establishment List - 6

Patent and Trademark Office - 6

Department of Defense - 6

HHS - 6

Federal Tax Information - 6

Successor Predecessor File - 6

Michigan Institute for Teaching and Research in Economics - 5

United States Census Bureau - 5

County Business Patterns - 5

Environmental Protection Agency - 5

Pew Research Center - 5

Russell Sage Foundation - 5

Bureau of Economic Analysis - 5

New York University - 5

Centers for Disease Control and Prevention - 5

University of Minnesota - 5

American Economic Review - 5

Business Master File - 5

Business Employment Dynamics - 5

Business Register Bridge - 5

Establishment Micro Properties - 4

Annual Survey of Entrepreneurs - 4

National Employer Survey - 4

Nonemployer Statistics - 4

Yale University - 4

Department of Economics - 4

Health and Retirement Study - 4

Ohio State University - 4

Survey of Consumer Finances - 4

Educational Services - 4

Health Care and Social Assistance - 4

Federal Poverty Level - 4

Retail Trade - 4

Technical Services - 4

Accommodation and Food Services - 4

Agriculture, Forestry - 4

Economic Research Service - 4

Survey of Business Owners - 4

National Ambient Air Quality Standards - 4

Review of Economics and Statistics - 4

Sloan Foundation - 4

Society of Labor Economists - 4

Organization for Economic Cooperation and Development - 4

Supreme Court - 4

Department of Justice - 4

Journal of Labor Economics - 4

Robert Wood Johnson Foundation - 4

North American Industry Classi - 4

Probability Density Function - 4

MIT Press - 4

Securities and Exchange Commission - 3

Occupational Employment Statistics - 3

Consumer Expenditure Survey - 3

Arts, Entertainment - 3

Federal Register - 3

NBER Summer Institute - 3

UC Berkeley - 3

COVID - 3

Herfindahl Hirschman Index - 3

New York Times - 3

National Income and Product Accounts - 3

General Education Development - 3

Master Earnings File - 3

Customs and Border Protection - 3

Quarterly Journal of Economics - 3

2SLS - 3

International Trade Research Report - 3

Census Bureau Longitudinal Business Database - 3

Employer-Household Dynamics - 3

Urban Institute - 3

Journal of Human Resources - 3

Minnesota Population Center - 3

American Immigration Council - 3

population - 53

survey - 52

respondent - 43

ethnicity - 41

employed - 41

hispanic - 37

census data - 36

census bureau - 34

workforce - 31

socioeconomic - 30

poverty - 30

employ - 30

disadvantaged - 28

disparity - 28

minority - 27

recession - 27

immigrant - 27

earnings - 26

irs - 26

ethnic - 26

labor - 26

race - 25

racial - 24

payroll - 24

employee - 23

record - 23

family - 22

data - 22

enrollment - 22

estimating - 20

data census - 20

1040 - 20

intergenerational - 20

resident - 20

welfare - 18

tax - 18

citizen - 18

earner - 17

segregation - 17

economist - 17

residence - 17

housing - 16

residential - 16

agency - 16

taxpayer - 16

matching - 15

medicaid - 15

immigration - 15

census responses - 15

datasets - 14

heterogeneity - 14

census employment - 13

worker - 13

unemployed - 13

black - 13

statistical - 12

use census - 12

ssa - 12

filing - 12

records census - 12

job - 12

white - 12

neighborhood - 11

federal - 11

census survey - 11

state - 11

salary - 11

survey income - 11

parent - 11

assessed - 11

migrant - 11

imputation - 11

discrimination - 11

census use - 11

sampling - 10

percentile - 10

eligibility - 10

enrolled - 10

2010 census - 10

child - 10

migration - 10

native - 10

employer household - 10

occupation - 10

eligible - 9

department - 9

graduate - 9

entrepreneur - 9

generation - 9

hiring - 9

dependent - 9

income data - 9

income households - 9

mexican - 9

race census - 9

census records - 9

econometric - 9

bias - 8

linkage - 8

impact - 8

database - 8

identifier - 8

education - 8

schooling - 8

latino - 8

parental - 8

longitudinal - 8

employment data - 8

employment statistics - 8

longitudinal employer - 8

microdata - 8

census household - 8

employing - 8

employee data - 8

ancestry - 8

census research - 8

census file - 8

household surveys - 7

mobility - 7

poorer - 7

enterprise - 7

coverage - 7

residing - 7

proprietorship - 7

entrepreneurship - 7

retirement - 7

educated - 7

mortality - 7

wealth - 7

work census - 7

workplace - 7

revenue - 7

expenditure - 7

income survey - 7

citizenship - 7

associate - 7

clerical - 7

matched - 7

relocation - 6

proprietor - 6

postsecondary - 6

adulthood - 6

mortgage - 6

population survey - 6

student - 6

employment estimates - 6

income children - 6

survey households - 6

migrate - 6

migrating - 6

report - 6

linked census - 6

environmental - 6

census linked - 6

pollution - 6

pollutant - 6

surveys censuses - 6

research census - 6

tenure - 6

medicare - 6

segregated - 6

poor - 6

birth - 6

metropolitan - 6

outcome - 5

researcher - 5

prevalence - 5

census disclosure - 5

disclosure - 5

incorporated - 5

degree - 5

nonemployer businesses - 5

home - 5

renter - 5

asian - 5

indian - 5

propensity - 5

recessionary - 5

loan - 5

earn - 5

worker demographics - 5

school - 5

funding - 5

household income - 5

immigrated - 5

provided census - 5

incentive - 5

saving - 5

venture - 5

emission - 5

income white - 5

adoption - 5

interracial - 5

discrepancy - 5

reside - 5

grandparent - 5

endogenous - 5

maternal - 5

census 2020 - 5

unobserved - 5

pollution exposure - 5

labor statistics - 5

censuses surveys - 5

corporation - 4

insurance - 4

macroeconomic - 4

sector - 4

country - 4

survey data - 4

finance - 4

debt - 4

creditor - 4

effects employment - 4

rent - 4

trend - 4

employment dynamics - 4

economically - 4

quarterly - 4

family income - 4

exemption - 4

financial - 4

endogeneity - 4

assessing - 4

innovation - 4

patent - 4

patenting - 4

bankruptcy - 4

exposure - 4

fertility - 4

invention - 4

estimates employment - 4

assimilation - 4

rural - 4

wage data - 4

enrollee - 4

analysis - 3

study - 3

statistician - 3

paper census - 3

individuals census - 3

disability - 3

information census - 3

company - 3

college - 3

university - 3

career - 3

entrepreneurial - 3

cohort - 3

homeowner - 3

estimation - 3

estimator - 3

sample - 3

borrower - 3

lending - 3

lender - 3

credit - 3

employment earnings - 3

crime - 3

establishment - 3

employment trends - 3

subsidy - 3

income individuals - 3

taxable - 3

pandemic - 3

parents income - 3

aging - 3

gdp - 3

woman - 3

mother - 3

recession exposure - 3

pregnancy - 3

concentration - 3

employment count - 3

workforce indicators - 3

inventory - 3

employment measures - 3

industrial - 3

unemployment rates - 3

demography - 3

model - 3

Viewing papers 1 through 10 of 172


  • Working Paper

    Non-Random Assignment of Individual Identifiers and Selection into Linked Data: Implications for Research

    January 2026

    Working Paper Number:

    CES-26-06

    The U.S. Census Bureau's Person Identification Validation System facilitates anonymous linkages between survey and administrative records by assigning Protected Identification Keys (PIKs) to person records. While PIK assignment is generally accurate, some person records are not successfully assigned a PIK, which can lead to sample selection bias in analyses of linked data. Using the American Community Survey (ACS) and the Current Population Survey Annual Social and Economic Supplement (CPS ASEC) between 2005 and 2022, we corroborate and extend existing findings on the drivers of PIK assignment, showing that the rate of PIK assignment varies widely across socio-demographic subgroups. Using earnings as a test case, we then show that limiting a survey sample of wage earners to person records with PIKs or successful linkages to W-2 wage records tends to overestimate self-reported wage earnings, on average, indicative of linkage-induced selection bias. In a validation exercise, we demonstrate that reweighting methods, such as inverse probability weighting or entropy balancing, can mitigate this bias.
    View Full Paper PDF
  • Working Paper

    Creating High-Opportunity Neighborhoods: Evidence from the HOPE VI Program

    January 2026

    Working Paper Number:

    CES-26-02

    We study whether low-economic-mobility neighborhoods can be transformed into high-mobility areas by analyzing the HOPE VI program, which invested $17 billion to revitalize 262 distressed public housing developments. We estimate the program's impacts using a matched difference-in-differences design, comparing outcomes in revitalized developments to observably similar control developments using anonymized tax records. HOPE VI reduced neighborhood poverty rates by attracting higher-income families to revitalized neighborhoods, but had no causal impact on the earnings of adults living in public housing units. Children raised in revitalized public housing units earn more, are more likely to attend college, and are less likely to be incarcerated. Using a movers exposure design and sibling comparisons, we show that these improvements were driven by changes in neighborhoods' causal effects on children's outcomes. The improvements in neighborhood causal effects were driven in large part by changes in social interaction: HOPE VI increased interaction between public housing residents and peers in surrounding neighborhoods and increased earnings more for subgroups with higher-income peers. Many low-income families in the U.S. currently live in neighborhoods that are as socially isolated as the HOPE VI developments were prior to revitalization. We conclude that it is feasible to create high-opportunity neighborhoods and that connecting socially isolated areas to surrounding communities is a cost-effective approach to doing so.
    View Full Paper PDF
  • Working Paper

    Integrating Multiple U.S. Census Bureau Data Assets to Create Standardized Profiles of Program Participants

    January 2026

    Working Paper Number:

    CES-26-01

    The Foundations for Evidence-Based Policymaking Act of 2018 (Evidence Act) directed federal agencies to systematically use data when making policy decisions. In response, the U.S. Census Bureau established the Evidence Group within its Center for Economic Studies (CES). With an interdisciplinary team of economists, sociologists, and statisticians, the Evidence Group can support the broader federal government in their efforts to use existing data to improve program operations without increasing respondent burden. For federal agencies administering social safety net and business assistance programs in particular, the team provides a no-cost evidence-building service that links program records to Census Bureau data assets and creates a series of standardized tables describing participants, their economic outcomes prior to program entry, and the communities where they live. These tables provide partner agencies with the detailed information they need to better understand their participants and potentially make their programs more accountable and effective in reaching their target populations. In this working paper, we describe the standardized tables themselves as well as the data assets available at the Census Bureau to create these tables, the data files produced by the table production process, and the methodology used to merge and harmonize data on participants and subsequently calculate unbiased and accurate estimates. We conclude with a brief discussion of steps taken to ensure confidentiality and data security. This documentation is intended to facilitate proper use and understanding of the standardized tables by partner agencies as well as researchers who are interested in leveraging these tools to explore characteristics of their samples of interest.
    View Full Paper PDF
  • Working Paper

    School-Based Disability Identification Varies by Student Family Income

    December 2025

    Working Paper Number:

    CES-25-74

    Currently, 18 percent of K-12 students in the United States receive additional supports through the identification of a disability. Socioeconomic status is viewed as central to understanding who gets identified as having a disability, yet limited large-scale evidence examines how disability identification varies for students from different income backgrounds. Using unique data linking information on Oregon students and their family income, we document pronounced income-based differences in how students are categorized for two school-based disability supports: special education services and Section 504 plans. We find that a quarter of students in the lowest income percentile receive supports through special education, compared with less than seven percent of students in the top income percentile. This pattern may partially reflect differences in underlying disability-related needs caused by poverty. However, we find the opposite pattern for 504 plans, where students in the top income percentiles are two times more likely to receive 504 plan supports. We further document substantial variation in these income-based differences by disability category, by race/ethnicity, and by grade level. Together, these patterns suggest that disability-related needs alone cannot account for the income-based differences that we observe and highlight the complex ways that income shapes the school and family processes that lead to variability in disability classification and services.
    View Full Paper PDF
  • Working Paper

    Matching Compustat Data to the Longitudinal Business Database, 1976-2020

    September 2025

    Working Paper Number:

    CES-25-65

    This paper details the methodology for creating an updated Compustat-Longitudinal Business Database (LBD) bridge, facilitating linkage between company identifiers in Compustat and firm identifiers in the LBD. In addition to data from Compustat, we incorporate historical data on public companies from various public and private sources, including information on executive names. Our methodology involves a series of stages using fuzzy name and address matching, including EIN, telephone number, and industry code matching. Qualified researchers with approved proposals can access this bridge though the Federal Statistical Research Data Centers. The Compustat-SSL bridge serves as a crucial resource for longitudinal studies on U.S. businesses, corporate governance, and executive compensation.
    View Full Paper PDF
  • Working Paper

    Estimating the Graduate Coverage of Post-Secondary Employment Outcomes

    September 2025

    Authors: Cody Orr

    Working Paper Number:

    CES-25-61

    This paper proposes a new methodology for estimating the coverage rate of the Post-Secondary Employment Outcomes data product (PSEO), both as a share of new graduates and as a share of total working-age degree holders in the United States. This paper also assesses how representative PSEO is of the broader population of college graduates across an array of institutional and individual characteristics.
    View Full Paper PDF
  • Working Paper

    Business Owners and the Self-Employed: 33 Million (and Counting!)

    September 2025

    Working Paper Number:

    CES-25-60

    Entrepreneurs are known to be key drivers of economic growth, and the rise of online platforms and the broader 'gig economy' has led self-employment to surge in recent decades. Yet the young and small businesses associated with this activity are often absent from economic data. In this paper, we explore a novel longitudinal dataset that covers the owners of tens of millions of the smallest businesses: those without employees. We produce three new sets of statistics on the rapidly growing set of nonemployer businesses. First, we measure transitions between self-employment and wage and salary jobs. Second, we describe nonemployer business entry and exit, as well as transitions between legal form (e.g., sole proprietorship to S corporation). Finally, we link owners to their nonemployer businesses and examine the dynamics of business ownership.
    View Full Paper PDF
  • Working Paper

    Education and Mortality: Evidence for the Silent Generation from Linked Census and Administrative Data

    August 2025

    Working Paper Number:

    CES-25-56

    We quantify the effect of education on mortality using a linkage of the full count 1940, 2000, and 2010 US census files and the Numident death records file. Our sample is composed of children aged 0-18 in 1940, observed living with at least one parent, for whom we can construct a rich set of parental and neighborhood characteristics. We estimate effects of educational attainment in 1940 on survival to 2000, as well as the effects of completed education, observed in 2000, on 10-year survival to 2010. The educational gradients in longevity that we estimate are robust to the inclusion of detailed individual, parental, household, neighborhood and county covariates. Given our full population census sample, we also explore rich patterns of heterogeneity and examine the effect of mediators of the education-mortality relationship. The mediators we consider in this study explain more than half of the relationship between education and mortality. We further show that the mechanisms underlying the education-mortality gradient might be different at different margins of educational attainment.
    View Full Paper PDF
  • Working Paper

    Housing Capital and Intergenerational Mobility in the United States

    August 2025

    Working Paper Number:

    CES-25-55

    Housing represents the most important capital asset for most U.S. families. Despite substantial analysis of the intergenerational mobility of income, large gaps in our knowledge of the distribution of housing assets and their transmission over time remain, as housing is generally not reflected by income flows. Using novel linked data that combines survey responses with administrative tax data and information on ownership and valuation from property tax records for over 3.4 million families, we provide new evidence on the intergenerational transmission of housing capital. We find that housing capital is more persistent across generations than labor income. We document important disparities between average housing outcomes for White and Black children. These difference persist even conditional on parent rank in the distribution of housing assets, with the gap growing throughout the parental housing capital distribution. A decomposition shows that average differences in children's labor market outcomes associated with parental assets explain about half of the observed intergenerational persistence (a 'labor income channel'), and that there is also a substantial 'direct channel' ' conditional on children having the same earnings, children of parents with more housing assets have more assets themselves on average. The direct channel is also important for explaining the intergenerational gap in outcomes of Black and White children. Finally, we present quasi-experimental evidence that local housing supply constraints help explain spatial differences in intergenerational persistence across US counties. Our results establish the importance of housing markets, both independently from and jointly with labor markets, in shaping the intergenerational persistence of economic resources.
    View Full Paper PDF
  • Working Paper

    Differences in Disability Insurance Allowance Rates

    August 2025

    Working Paper Number:

    CES-25-54

    Allowance rates for disability insurance applications vary by race and ethnicity, but it is unclear to what extent these differences are artifacts of other differing socio-economic and health characteristics, or selection issues in SSA's race and ethnicity data. This paper uses the 2015 American Community Survey linked to 2015-2019 SSA administrative data to investigate DI application allowance rates among non-Hispanic White, non-Hispanic Black, non-Hispanic Asian, non-Hispanic American Indian/Alaska Native, and Hispanic applicants aged 25-65. The analysis uses regression, propensity score matching, and inverse probability weighting to estimate differences in allowance rates among applicants who are similar on observable characteristics. Relative to raw comparisons, differences by race and ethnicity in multivariate analyses are substantially smaller in magnitude and are generally not statistically significant.
    View Full Paper PDF