CREAT: Census Research Exploration and Analysis Tool

Papers Containing Tag(s): 'Person Validation System'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

Protected Identification Key - 71

Internal Revenue Service - 51

Social Security Number - 50

American Community Survey - 48

Social Security Administration - 45

Census Bureau Disclosure Review Board - 43

Person Identification Validation System - 42

Current Population Survey - 39

Social Security - 34

Personally Identifiable Information - 24

2010 Census - 23

Disclosure Review Board - 21

Decennial Census - 20

W-2 - 19

Center for Administrative Records Research and Applications - 19

Longitudinal Employer Household Dynamics - 18

Master Address File - 18

Department of Housing and Urban Development - 18

Census Numident - 18

Housing and Urban Development - 16

Supplemental Nutrition Assistance Program - 15

Individual Taxpayer Identification Numbers - 14

Employer Identification Numbers - 14

Survey of Income and Program Participation - 14

Office of Management and Budget - 14

Some Other Race - 13

Federal Statistical Research Data Center - 12

Social and Economic Supplement - 11

SSA Numident - 11

Census Household Composition Key - 11

Medicaid Services - 11

Census Bureau Person Identification Validation System - 11

National Bureau of Economic Research - 11

Computer Assisted Personal Interview - 11

Bureau of Labor Statistics - 10

North American Industry Classification System - 10

1940 Census - 10

Administrative Records - 10

Temporary Assistance for Needy Families - 10

Indian Health Service - 10

Ordinary Least Squares - 10

MAFID - 9

MAF-ARF - 9

Service Annual Survey - 9

Business Register - 9

Centers for Medicare - 9

Earned Income Tax Credit - 9

Indian Housing Information Center - 9

Census Edited File - 9

National Science Foundation - 9

National Opinion Research Center - 9

Center for Economic Studies - 8

Adjusted Gross Income - 8

Master Beneficiary Record - 8

Disability Insurance - 8

Social Science Research Institute - 8

Detailed Earnings Records - 8

PIKed - 7

Longitudinal Business Database - 7

ASEC - 7

Center for Administrative Records Research - 7

Census Bureau Master Address File - 7

Data Management System - 7

DOB - 7

Department of Homeland Security - 6

National Center for Health Statistics - 6

National Institute on Aging - 6

University of Chicago - 6

Census 2000 - 6

National Academy of Sciences - 5

COVID-19 - 5

CPS ASEC - 5

Citizenship and Immigration Services - 5

Federal Poverty Level - 5

Current Population Survey Annual Social and Economic Supplement - 5

Census Bureau Business Register - 5

American Housing Survey - 5

New York University - 5

Journal of Economic Literature - 5

PSID - 5

Chicago Census Research Data Center - 5

Postal Service - 5

Department of Health and Human Services - 5

Unemployment Insurance - 4

Employment History File - 4

Quarterly Census of Employment and Wages - 4

County Business Patterns - 4

Metropolitan Statistical Area - 4

Computer Assisted Telephone Interviews and Computer Assisted Personal Interviews - 4

CATI - 4

Pew Research Center - 4

Alfred P Sloan Foundation - 4

Centers for Disease Control and Prevention - 4

Master Earnings File - 4

Cornell University - 4

Research Data Center - 4

Department of Labor - 4

Cornell Institute for Social and Economic Research - 4

Michigan Institute for Teaching and Research in Economics - 3

Department of Education - 3

Individual Characteristics File - 3

United States Census Bureau - 3

Employer Characteristics File - 3

Federal Reserve Bank - 3

NUMIDENT - 3

Customs and Border Protection - 3

Patent and Trademark Office - 3

Ohio State University - 3

Harvard University - 3

Organization for Economic Cooperation and Development - 3

Department of Justice - 3

Office of Personnel Management - 3

HHS - 3

Stanford University - 3

Quarterly Workforce Indicators - 3

Department of Commerce - 3

population - 30

survey - 29

respondent - 28

census bureau - 23

census data - 21

ethnicity - 18

irs - 16

data - 16

disadvantaged - 16

hispanic - 16

record - 15

minority - 15

socioeconomic - 14

data census - 13

1040 - 13

immigrant - 13

race - 12

enrollment - 12

poverty - 12

tax - 12

ethnic - 12

earnings - 11

matching - 11

family - 11

medicaid - 11

datasets - 11

taxpayer - 11

recession - 11

resident - 11

citizen - 11

records census - 11

census responses - 11

assessed - 10

disparity - 10

racial - 10

employed - 10

agency - 9

federal - 9

intergenerational - 9

ssa - 9

labor - 9

workforce - 9

use census - 9

sampling - 8

census survey - 8

database - 8

employ - 8

imputation - 8

estimating - 7

earner - 7

statistical - 7

identifier - 7

coverage - 7

parent - 7

eligibility - 7

income data - 7

immigration - 7

unemployed - 7

residence - 7

survey income - 7

percentile - 7

welfare - 7

census records - 7

race census - 7

linkage - 6

filing - 6

dependent - 6

income households - 6

microdata - 6

native - 6

black - 6

bias - 5

household surveys - 5

economist - 5

department - 5

residing - 5

housing - 5

survey households - 5

eligible - 5

enrolled - 5

migration - 5

surveys censuses - 5

medicare - 5

2010 census - 5

citizenship - 5

census use - 5

payroll - 5

heterogeneity - 5

white - 5

associate - 5

latino - 5

census file - 5

employee - 4

parental - 4

population survey - 4

child - 4

provided census - 4

residential - 4

census linked - 4

income survey - 4

exemption - 4

census household - 4

migrant - 4

retirement - 4

poor - 4

birth - 4

assessing - 4

mexican - 4

segregation - 4

salary - 4

occupation - 4

ancestry - 4

census research - 4

matched - 4

researcher - 3

statistician - 3

census employment - 3

individuals census - 3

disclosure - 3

graduate - 3

career - 3

postsecondary - 3

renter - 3

prevalence - 3

subsidy - 3

income individuals - 3

household income - 3

taxable - 3

linked census - 3

impact - 3

environmental - 3

pandemic - 3

propensity - 3

expenditure - 3

adoption - 3

mobility - 3

insurance - 3

wealth - 3

reside - 3

recessionary - 3

schooling - 3

recession exposure - 3

maternal - 3

estimator - 3

state - 3

census 2020 - 3

patent - 3

patenting - 3

mortality - 3

worker demographics - 3

employee data - 3

econometric - 3

hiring - 3

industrial - 3

enrollee - 3

Viewing papers 1 through 10 of 77


  • Working Paper

    Non-Random Assignment of Individual Identifiers and Selection into Linked Data: Implications for Research

    January 2026

    Working Paper Number:

    CES-26-06

    The U.S. Census Bureau's Person Identification Validation System facilitates anonymous linkages between survey and administrative records by assigning Protected Identification Keys (PIKs) to person records. While PIK assignment is generally accurate, some person records are not successfully assigned a PIK, which can lead to sample selection bias in analyses of linked data. Using the American Community Survey (ACS) and the Current Population Survey Annual Social and Economic Supplement (CPS ASEC) between 2005 and 2022, we corroborate and extend existing findings on the drivers of PIK assignment, showing that the rate of PIK assignment varies widely across socio-demographic subgroups. Using earnings as a test case, we then show that limiting a survey sample of wage earners to person records with PIKs or successful linkages to W-2 wage records tends to overestimate self-reported wage earnings, on average, indicative of linkage-induced selection bias. In a validation exercise, we demonstrate that reweighting methods, such as inverse probability weighting or entropy balancing, can mitigate this bias.
    View Full Paper PDF
  • Working Paper

    Integrating Multiple U.S. Census Bureau Data Assets to Create Standardized Profiles of Program Participants

    January 2026

    Working Paper Number:

    CES-26-01

    The Foundations for Evidence-Based Policymaking Act of 2018 (Evidence Act) directed federal agencies to systematically use data when making policy decisions. In response, the U.S. Census Bureau established the Evidence Group within its Center for Economic Studies (CES). With an interdisciplinary team of economists, sociologists, and statisticians, the Evidence Group can support the broader federal government in their efforts to use existing data to improve program operations without increasing respondent burden. For federal agencies administering social safety net and business assistance programs in particular, the team provides a no-cost evidence-building service that links program records to Census Bureau data assets and creates a series of standardized tables describing participants, their economic outcomes prior to program entry, and the communities where they live. These tables provide partner agencies with the detailed information they need to better understand their participants and potentially make their programs more accountable and effective in reaching their target populations. In this working paper, we describe the standardized tables themselves as well as the data assets available at the Census Bureau to create these tables, the data files produced by the table production process, and the methodology used to merge and harmonize data on participants and subsequently calculate unbiased and accurate estimates. We conclude with a brief discussion of steps taken to ensure confidentiality and data security. This documentation is intended to facilitate proper use and understanding of the standardized tables by partner agencies as well as researchers who are interested in leveraging these tools to explore characteristics of their samples of interest.
    View Full Paper PDF
  • Working Paper

    Matching Compustat Data to the Longitudinal Business Database, 1976-2020

    September 2025

    Working Paper Number:

    CES-25-65

    This paper details the methodology for creating an updated Compustat-Longitudinal Business Database (LBD) bridge, facilitating linkage between company identifiers in Compustat and firm identifiers in the LBD. In addition to data from Compustat, we incorporate historical data on public companies from various public and private sources, including information on executive names. Our methodology involves a series of stages using fuzzy name and address matching, including EIN, telephone number, and industry code matching. Qualified researchers with approved proposals can access this bridge though the Federal Statistical Research Data Centers. The Compustat-SSL bridge serves as a crucial resource for longitudinal studies on U.S. businesses, corporate governance, and executive compensation.
    View Full Paper PDF
  • Working Paper

    Estimating the Graduate Coverage of Post-Secondary Employment Outcomes

    September 2025

    Authors: Cody Orr

    Working Paper Number:

    CES-25-61

    This paper proposes a new methodology for estimating the coverage rate of the Post-Secondary Employment Outcomes data product (PSEO), both as a share of new graduates and as a share of total working-age degree holders in the United States. This paper also assesses how representative PSEO is of the broader population of college graduates across an array of institutional and individual characteristics.
    View Full Paper PDF
  • Working Paper

    Consequences of Eviction for Parenting and Non-parenting College Students

    June 2025

    Working Paper Number:

    CES-25-35

    Amidst rising and increasingly unaffordable rents, 7.6 million people are threatened with eviction each year across the United States'and eviction rates are twice as high for renters with children. One important and neglected population who may experience unique levels of housing insecurity is college students, especially given that one in five college students are parents. In this study, we link 11.9 million student records to eviction filings from housing courts, demographic characteristics reported in decennial census and survey data, incomes reported on tax returns by students and their parents, and dates of birth and death from the Social Security Administration. Parenting students are more likely than non-parenting students to identify as female (62.81% vs. 55.94%) and Black (19.66% vs. 14.30%), be over 30 years old (42.73% vs. 20.25%), and have parents with lower household incomes ($100,000 vs. $140,000). Parenting students threatened with eviction (i.e., had an eviction filed against them) are much more likely than non-threatened parenting students to identify as female (81.18% vs. 62.81%) and Black (56.84% vs. 19.66%). In models adjusted for individual and institutional characteristics, we find that being threatened with an eviction was significantly associated with reduced likelihood of degree completion, reduced post-enrollment income, reduced likelihood of being married post-enrollment, and increased post-enrollment mortality. Among parenting students, 38.38% (95% confidence interval (CI): 32.50-44.26%) of non-threatened students completed a bachelor's degree compared to just 15.36% (CI: 11.61-19.11%) of students threatened with eviction. Our findings highlight the long-term economic and health impacts of housing insecurity during college, especially for parenting students. Housing stability for parenting students may have substantial multigenerational benefits for economic mobility and population health.
    View Full Paper PDF
  • Working Paper

    The Design of Sampling Strata for the National Household Food Acquisition and Purchase Survey

    February 2025

    Working Paper Number:

    CES-25-13

    The National Household Food Acquisition and Purchase Survey (FoodAPS), sponsored by the United States Department of Agriculture's (USDA) Economic Research Service (ERS) and Food and Nutrition Service (FNS), examines the food purchasing behavior of various subgroups of the U.S. population. These subgroups include participants in the Supplemental Nutrition Assistance Program (SNAP) and the Special Supplemental Nutrition Program for Women, Infants, and Children (WIC), as well as households who are eligible for but don't participate in these programs. Participants in these social protection programs constitute small proportions of the U.S. population; obtaining an adequate number of such participants in a survey would be challenging absent stratified sampling to target SNAP and WIC participating households. This document describes how the U.S. Census Bureau (which is planning to conduct future versions of the FoodAPS survey on behalf of USDA) created sampling strata to flag the FoodAPS targeted subpopulations using machine learning applications in linked survey and administrative data. We describe the data, modeling techniques, and how well the sampling flags target low-income households and households receiving WIC and SNAP benefits. We additionally situate these efforts in the nascent literature on the use of big data and machine learning for the improvement of survey efficiency.
    View Full Paper PDF
  • Working Paper

    Potential Bias When Using Administrative Data to Measure the Family Income of School-Aged Children

    January 2025

    Working Paper Number:

    CES-25-03

    Researchers and practitioners increasingly rely on administrative data sources to measure family income. However, administrative data sources are often incomplete in their coverage of the population, giving rise to potential bias in family income measures, particularly if coverage deficiencies are not well understood. We focus on the school-aged child population, due to its particular import to research and policy, and because of the unique challenges of linking children to family income information. We find that two of the most significant administrative sources of family income information that permit linking of children and parents'IRS Form 1040 and SNAP participation records'usefully complement each other, potentially reducing coverage bias when used together. In a case study considering how best to measure economic disadvantage rates in the public school student population, we demonstrate the sensitivity of family income statistics to assumptions about individuals who do not appear in administrative data sources.
    View Full Paper PDF
  • Working Paper

    Places versus People: The Ins and Outs of Labor Market Adjustment to Globalization

    December 2024

    Working Paper Number:

    CES-24-78

    We analyze the distinct adjustment paths of U.S. labor markets (places) and U.S. workers (people) to increased Chinese import competition during the 2000s. Using comprehensive register data for 2000'2019, we document that employment levels more than fully rebound in trade-exposed places after 2010, while employment-to-population ratios remain depressed and manufacturing employment further atrophies. The adjustment of places to trade shocks is generational: affected areas recover primarily by adding workers to non-manufacturing who were below working age when the shock occurred. Entrants are disproportionately native-born Hispanics, foreign-born immigrants, women, and the college-educated, who find employment in relatively low-wage service sectors like medical services, education, retail, and hospitality. Using the panel structure of the employer-employee data, we decompose changes in the employment composition of places into trade-induced shifts in the gross flows of people across sectors, locations, and non-employment status. Contrary to standard models, trade shocks reduce geographic mobility, with both in- and out-migration remaining depressed through 2019. The employment recovery instead stems almost entirely from young adults and foreign-born immigrants taking their first U.S. jobs in affected areas, with minimal contributions from cross-sector transitions of former manufacturing workers. Although worker inflows into non-manufacturing more than fully offset manufacturing employment losses in trade-exposed locations after 2010, incumbent workers neither fully recover earnings losses nor predominately exit the labor market, but rather age in place as communities undergo rapid demographic and industrial transitions.
    View Full Paper PDF
  • Working Paper

    CTC and ACTC Participation Results and IRS-Census Match Methodology, Tax Year 2020

    December 2024

    Working Paper Number:

    CES-24-76

    The Child Tax Credit (CTC) and Additional Child Tax Credit (ACTC) offer assistance to help ease the financial burden of families with children. This paper provides taxpayer and dollar participation estimates for the CTC and ACTC covering tax year 2020. The estimates derive from an approach that relies on linking the 2021 Current Population Survey Annual Social and Economic Supplement (CPS ASEC) to IRS administrative data. This approach, called the Exact Match, uses survey data to identify CTC/ACTC eligible taxpayers and IRS administrative data to indicate which eligible taxpayers claimed and received the credit. Overall in tax year 2020, eligible taxpayers participated in the CTC and ACTC program at a rate of 93 percent while dollar participation was 91 percent.
    View Full Paper PDF
  • Working Paper

    EITC Participation Results and IRS-Census Match Methodology, Tax Year 2021

    December 2024

    Working Paper Number:

    CES-24-75

    The Earned Income Tax Credit (EITC), enacted in 1975, offers a refundable tax credit to low income working families. This paper provides taxpayer and dollar participation estimates for the EITC covering tax year 2021. The estimates derive from an approach that relies on linking the 2022 Current Population Survey Annual Social and Economic Supplement (CPS ASEC) to IRS administrative data. This approach, called the Exact Match, uses survey data to identify EITC eligible taxpayers and IRS administrative data to indicate which eligible taxpayers claimed and received the credit. Overall in tax year 2021 eligible taxpayers participated in the EITC program at a rate of 78 percent while dollar participation was 81 percent.
    View Full Paper PDF