CREAT: Census Research Exploration and Analysis Tool

Papers Containing Tag(s): 'Person Validation System'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

Protected Identification Key - 69

Internal Revenue Service - 50

Social Security Number - 48

American Community Survey - 46

Social Security Administration - 43

Census Bureau Disclosure Review Board - 42

Person Identification Validation System - 40

Current Population Survey - 37

Social Security - 33

Personally Identifiable Information - 24

2010 Census - 23

Disclosure Review Board - 20

Decennial Census - 19

Center for Administrative Records Research and Applications - 19

Department of Housing and Urban Development - 18

Census Numident - 18

W-2 - 18

Longitudinal Employer Household Dynamics - 17

Master Address File - 17

Housing and Urban Development - 16

Supplemental Nutrition Assistance Program - 15

Office of Management and Budget - 14

Employer Identification Numbers - 13

Survey of Income and Program Participation - 13

Individual Taxpayer Identification Numbers - 13

Some Other Race - 13

Federal Statistical Research Data Center - 12

Census Household Composition Key - 11

Medicaid Services - 11

Census Bureau Person Identification Validation System - 11

National Bureau of Economic Research - 11

Computer Assisted Personal Interview - 11

North American Industry Classification System - 10

1940 Census - 10

Administrative Records - 10

Temporary Assistance for Needy Families - 10

SSA Numident - 10

Indian Health Service - 10

Ordinary Least Squares - 10

Service Annual Survey - 9

Business Register - 9

Social and Economic Supplement - 9

Centers for Medicare - 9

Earned Income Tax Credit - 9

Indian Housing Information Center - 9

Census Edited File - 9

National Science Foundation - 9

National Opinion Research Center - 9

MAFID - 8

Bureau of Labor Statistics - 8

Adjusted Gross Income - 8

Master Beneficiary Record - 8

Disability Insurance - 8

Social Science Research Institute - 8

MAF-ARF - 8

Detailed Earnings Records - 8

Longitudinal Business Database - 7

Center for Economic Studies - 7

ASEC - 7

Center for Administrative Records Research - 7

Census Bureau Master Address File - 7

Data Management System - 7

Department of Homeland Security - 6

National Center for Health Statistics - 6

PIKed - 6

National Institute on Aging - 6

University of Chicago - 6

Census 2000 - 6

Citizenship and Immigration Services - 5

Federal Poverty Level - 5

Current Population Survey Annual Social and Economic Supplement - 5

Census Bureau Business Register - 5

American Housing Survey - 5

New York University - 5

Journal of Economic Literature - 5

PSID - 5

Chicago Census Research Data Center - 5

Postal Service - 5

Department of Health and Human Services - 5

National Academy of Sciences - 4

County Business Patterns - 4

CPS ASEC - 4

Metropolitan Statistical Area - 4

Computer Assisted Telephone Interviews and Computer Assisted Personal Interviews - 4

CATI - 4

Pew Research Center - 4

COVID-19 - 4

Alfred P Sloan Foundation - 4

Centers for Disease Control and Prevention - 4

Master Earnings File - 4

Cornell University - 4

Research Data Center - 4

Department of Labor - 4

Cornell Institute for Social and Economic Research - 4

Michigan Institute for Teaching and Research in Economics - 3

Department of Education - 3

Individual Characteristics File - 3

United States Census Bureau - 3

Employment History File - 3

Employer Characteristics File - 3

Federal Reserve Bank - 3

Quarterly Census of Employment and Wages - 3

NUMIDENT - 3

Customs and Border Protection - 3

Patent and Trademark Office - 3

Ohio State University - 3

Harvard University - 3

Organization for Economic Cooperation and Development - 3

Unemployment Insurance - 3

Department of Justice - 3

Office of Personnel Management - 3

HHS - 3

Stanford University - 3

Quarterly Workforce Indicators - 3

Department of Commerce - 3

population - 29

survey - 27

respondent - 26

census bureau - 21

census data - 20

ethnicity - 18

data - 16

disadvantaged - 16

hispanic - 16

record - 15

irs - 15

minority - 15

1040 - 13

immigrant - 13

socioeconomic - 13

enrollment - 12

data census - 12

poverty - 12

tax - 12

ethnic - 12

family - 11

medicaid - 11

datasets - 11

taxpayer - 11

recession - 11

resident - 11

citizen - 11

records census - 11

race - 11

census responses - 11

earnings - 10

assessed - 10

disparity - 10

matching - 10

racial - 10

employed - 10

intergenerational - 9

ssa - 9

labor - 9

workforce - 9

use census - 9

database - 8

federal - 8

agency - 8

employ - 8

imputation - 8

identifier - 7

coverage - 7

parent - 7

sampling - 7

census survey - 7

eligibility - 7

income data - 7

immigration - 7

unemployed - 7

residence - 7

survey income - 7

percentile - 7

welfare - 7

census records - 7

race census - 7

filing - 6

dependent - 6

income households - 6

microdata - 6

estimating - 6

statistical - 6

native - 6

black - 6

earner - 6

department - 5

residing - 5

housing - 5

survey households - 5

eligible - 5

enrolled - 5

migration - 5

surveys censuses - 5

medicare - 5

linkage - 5

2010 census - 5

citizenship - 5

census use - 5

payroll - 5

heterogeneity - 5

white - 5

associate - 5

latino - 5

census file - 5

employee - 4

parental - 4

household surveys - 4

population survey - 4

child - 4

provided census - 4

residential - 4

census linked - 4

bias - 4

income survey - 4

exemption - 4

census household - 4

migrant - 4

retirement - 4

poor - 4

birth - 4

assessing - 4

economist - 4

mexican - 4

segregation - 4

salary - 4

occupation - 4

ancestry - 4

census research - 4

matched - 4

disclosure - 3

graduate - 3

career - 3

renter - 3

prevalence - 3

subsidy - 3

income individuals - 3

household income - 3

linked census - 3

impact - 3

environmental - 3

pandemic - 3

propensity - 3

expenditure - 3

adoption - 3

mobility - 3

insurance - 3

wealth - 3

reside - 3

recessionary - 3

schooling - 3

recession exposure - 3

maternal - 3

estimator - 3

state - 3

census 2020 - 3

patent - 3

patenting - 3

mortality - 3

worker demographics - 3

employee data - 3

econometric - 3

hiring - 3

industrial - 3

enrollee - 3

Viewing papers 51 through 60 of 75


  • Working Paper

    Fathers, Children, and the Intergenerational Transmission of Employers

    March 2018

    Working Paper Number:

    CES-18-12

    We document the tendency of fathers in the U.S. to share employers with their sons and daughters. We show that the rate of job sharing is much higher than can be explained by the fact that fathers and sons tend to live near each other. Younger children are much more likely to share their father's employer, as are children of high-earning fathers. We find that sons' earnings at shared jobs tend to be higher than at unshared jobs but see no statistically signi?cant di'erence for daughters. Much of the earnings differential is associated with jobs at shared employers being in higher-paying industries. When we control for employer characteristics, we see a much smaller son earnings premium for working together with his father. We also investigate the impact of sharing an employer on intergenerational mobility and demonstrate that for sons, sharing an employer at some point before age 30 is associated with a higher rank in the earnings distribution as an adult but that this association is independent of the father's rank in the earnings distribution.
    View Full Paper PDF
  • Working Paper

    The Distributional Effects of Minimum Wages: Evidence from Linked Survey and Administrative Data

    March 2018

    Working Paper Number:

    carra-2018-02

    States and localities are increasingly experimenting with higher minimum wages in response to rising income inequality and stagnant economic mobility, but commonly used public datasets offer limited opportunities to evaluate the extent to which such changes affect earnings growth. We use administrative earnings data from the Social Security Administration linked to the Current Population Survey to overcome important limitations of public data and estimate effects of the minimum wage on growth incidence curves and income mobility profiles, providing insight into how cross-sectional effects of the minimum wage on earnings persist over time. Under both approaches, we find that raising the minimum wage increases earnings growth at the bottom of the distribution, and those effects persist and indeed grow in magnitude over several years. This finding is robust to a variety of specifications, including alternatives commonly used in the literature on employment effects of the minimum wage. Instrumental variables and subsample analyses indicate that geographic mobility likely contributes to the effects we identify. Extrapolating from our estimates suggests that a minimum wage increase comparable in magnitude to the increase experienced in Seattle between 2013 and 2016 would have blunted some, but not nearly all, of the worst income losses suffered at the bottom of the income distribution during the Great Recession.
    View Full Paper PDF
  • Working Paper

    Investigating the Use of Administrative Records in the Consumer Expenditure Survey

    March 2018

    Working Paper Number:

    carra-2018-01

    In this paper, we investigate the potential of applying administrative records income data to the Consumer Expenditure (CE) survey to inform measurement error properties of CE estimates, supplement respondent-collected data, and estimate the representativeness of the CE survey by income level. We match individual responses to Consumer Expenditure Quarterly Interview Survey data collected from July 2013 through December 2014 to IRS administrative data in order to analyze CE questions on wages, social security payroll deductions, self-employment income receipt and retirement income. We find that while wage amounts are largely in alignment between the CE and administrative records in the middle of the wage distribution, there is evidence that wages are over-reported to the CE at the bottom of the wage distribution and under-reported at the top of the wage distribution. We find mixed evidence for alignment between the CE and administrative records on questions covering payroll deductions and self-employment income receipt, but find substantial divergence between CE responses and administrative records when examining retirement income. In addition to the analysis using person-based linkages, we also match responding and non-responding CE sample units to the universe of IRS 1040 tax returns by address to examine non-response bias. We find that non-responding households are substantially richer than responding households, and that very high income households are less likely to respond to the CE.
    View Full Paper PDF
  • Working Paper

    Capturing More Than Poverty: School Free and Reduced-Price Lunch Data and Household Income

    December 2017

    Working Paper Number:

    carra-2017-09

    Educational researchers often use National School Lunch Program (NSLP) data as a proxy for student poverty. Under NSLP policy, students whose household income is less than 130 percent of the poverty line qualify for free lunch and students whose household income is between 130 percent and 185 percent of the poverty line qualify for reduced-price lunch. Linking school administrative records for all 8th graders in a California public school district to household-level IRS income tax data, we examine how well NSLP data capture student disadvantage. We find both that there is substantial disadvantage in household income not captured by NSLP category data, and that NSLP categories capture disadvantage on test scores above and beyond household income.
    View Full Paper PDF
  • Working Paper

    Labor Market Effects of the Affordable Care Act: Evidence from a Tax Notch

    July 2017

    Working Paper Number:

    carra-2017-07

    States that declined to raise their Medicaid income eligibility cutoffs to 138 percent of the federal poverty level (FPL) under the Affordable Care Act (ACA) created a "coverage gap'' between their existing, often much lower Medicaid eligibility cutoffs and the FPL, the lowest level of income at which the ACA provides refundable, advanceable "premium tax credits'' to subsidize the purchase of private insurance. Lacking access to any form of subsidized health insurance, residents of those states with income in that range face a strong incentive, in the form of a large, discrete increase in post-tax income (i.e. an upward notch) at the FPL, to increase their earnings and obtain the premium tax credit. We investigate the extent to which they respond to that incentive. Using the universe of tax returns, we document excess mass, or bunching, in the income distribution surrounding this notch. Consistent with Saez (2010), we find that bunching occurs only among filers with self-employment income. Specifically, filers without children and married filers with three or fewer children exhibit significant bunching. Analysis of tax data linked to labor supply measures from the American Community Survey, however, suggests that this bunching likely reflects a change in reported income rather than a change in true labor supply. We find no evidence that wage and salary workers adjust their labor supply in response to increased availability of directly purchased health insurance.
    View Full Paper PDF
  • Working Paper

    Air Quality, Human Capital Formation and the Long-term Effects of Environmental Inequality at Birth

    May 2017

    Authors: John Voorheis

    Working Paper Number:

    carra-2017-05

    A growing body of literature suggests that pollution exposure early in life can have substantial long term effects on an individual's economic well-being as an adult, however the mechanisms for these effects remain unclear. I contribute to this literature by examining the effect of pollution exposure on several intermediate determinants of adult wages using a unique linked dataset for a large sample of individuals from two cohorts: an older cohort born around the 1970, and a younger cohort born around 1990. This dataset links responses to the American Community Survey to SSA administrative data, the universe of IRS Form 1040 tax returns, pollution concentration data from EPA air quality monitors and satellite remote sensing observations. In both OLS and IV specifications, I find that pollution exposure at birth has a large and economically significant effect on college attendance among 19-22 year olds. Using conventional estimates of the college wage premium, these effects imply that a 10 'g/m3 decrease in particulate matter exposure at birth is associated with a $190 per year increase in annual wages. This effect is smaller than the wage effects in the previous literature, which suggests that human capital acquisition associated with cognitive skills cannot fully explain the long term wage effects of pollution exposure. Indeed, I find evidence for an additional channel working through non-cognitive skill -pollution exposure at birth increases high school non-completion and incarceration among 16-24 year olds, and that these effects are concentrated within disadvantaged communities, with larger effects for non-whites and children of poor parents. I also find that pollution exposure during adolescence has statistically significant effects on high school non-completion and incarceration, but no effect on college attendance. These results suggest that the long term effects of pollution exposure on economic well-being may run through multiple channels, of which both non-cognitive skills and cognitive skills may play a role.
    View Full Paper PDF
  • Working Paper

    Longitudinal Environmental Inequality and Environmental Gentrification: Who Gains From Cleaner Air?

    May 2017

    Authors: John Voorheis

    Working Paper Number:

    carra-2017-04

    A vast empirical literature has convincingly shown that there is pervasive cross-sectional inequality in exposure to environmental hazards. However, less is known about how these inequalities have been evolving over time. I fill this gap by creating a new dataset, which combines satellite data on ground-level concentrations of fine particulate matter with linked administrative and survey data. This linked dataset allows me to measure individual pollution exposure for over 100 million individuals in each year between 2000 and 2014, a period of time has seen substantial improvements in average air quality. This rich dataset can then be used to analyze longitudinal dimensions of environmental inequality by examining the distribution of changes in individual pollution exposure that underlie these aggregate improvements. I confirm previous findings that cross-sectional environmental inequality has been on the decline, but I argue that this may miss longitudinal patterns in exposure that are consistent with environmental gentrification. I find that advantaged individuals at the beginning of the sample experience larger pollution exposure reductions than do initially disadvantaged individuals.
    View Full Paper PDF
  • Working Paper

    When Race and Hispanic Origin Reporting are Discrepant Across Administrative Records and Third Party Sources: Exploring Methods to Assign Responses

    December 2015

    Working Paper Number:

    carra-2015-08

    The U.S. Census Bureau is researching uses of administrative records and third party data in survey and decennial census operations. One potential use of administrative records is to utilize these data when race and Hispanic origin responses are missing. When federal and third party administrative records are compiled, race and Hispanic origin responses are not always the same for an individual across sources. We explore different methods to assign one race and one Hispanic response when these responses are discrepant. We also describe the characteristics of individuals with matching, non-matching, and missing race and Hispanic origin data by demographic, household, and contextual variables. We find that minorities, especially Hispanics, are more likely to have non-matching Hispanic origin and race responses in administrative records and third party data compared to the 2010 Census. Minority groups and individuals ages 0-17 are more likely to have missing race or Hispanic origin data in administrative records and third party data. Larger households tend to have more missing race data in administrative records and third party data than smaller households.
    View Full Paper PDF
  • Working Paper

    Using Linked Survey and Administrative Data to Better Measure Income: Implications for Poverty, Program Effectiveness and Holes in the Safety Net

    October 2015

    Working Paper Number:

    CES-15-35

    We examine the consequences of underreporting of transfer programs in household survey data for several prototypical analyses of low-income populations. We focus on the Current Population Survey (CPS), the source of official poverty and inequality statistics, but provide evidence that our qualitative conclusions are likely to apply to other surveys. We link administrative data for food stamps, TANF, General Assistance, and subsidized housing from New York State to the CPS at the individual level. Program receipt in the CPS is missed for over one-third of housing assistance recipients, 40 percent of food stamp recipients and 60 percent of TANF and General Assistance recipients. Dollars of benefits are also undercounted for reporting recipients, particularly for TANF, General Assistance and housing assistance. We find that the survey data sharply understate the income of poor households, as conjectured in past work by one of the authors. Underreporting in the survey data also greatly understates the effects of anti-poverty programs and changes our understanding of program targeting, often making it seem that welfare programs are less targeted to both the very poorest and middle income households than they are. Using the combined data rather than survey data alone, the poverty reducing effect of all programs together is nearly doubled while the effect of housing assistance is tripled. We also re-examine the coverage of the safety net, specifically the share of people without work or program receipt. Using the administrative measures of program receipt rather than the survey ones often reduces the share of single mothers falling through the safety net by one-half or more.
    View Full Paper PDF
  • Working Paper

    Assessing Coverage and Quality of the 2007 Prototype Census Kidlink Database

    September 2015

    Working Paper Number:

    carra-2015-07

    The Census Bureau is conducting research to expand the use of administrative records data in censuses and surveys to decrease respondent burden and reduce costs while improving data quality. Much of this research (e.g., Rastogi and O''Hara (2012), Luque and Bhaskar (2014)) hinges on the ability to integrate multiple data sources by linking individuals across files. One of the Census Bureau's record linkage methodologies for data integration is the Person Identification Validation System or PVS. PVS assigns anonymous and unique IDs (Protected Identification Keys or PIKs) that serve as linkage keys across files. Prior research showed that integrating 'known associates' information into PVS's reference files could potentially enhance PVS's PIK assignment rates. The term 'known associates' refers to people that are likely to be associated with each other because of a known common link (such as family relationships or people sharing a common address), and thus, to be observed together in different files. One of the results from this prior research was the creation of the 2007 Census Kidlink file, a child-level file linking a child's Social Security Number (SSN) record to the SSN of those identified as the child's parents. In this paper, we examine to what extent the 2007 Census Kidlink methodology was able to link parents SSNs to children SSN records, and also evaluate the quality of those links. We find that in approximately 80 percent of cases, at least one parent was linked to the child's record. Younger children and noncitizens have a higher percentage of cases where neither parent could be linked to the child. Using 2007 tax data as a benchmark, our quality evaluation results indicate that in at least 90 percent of the cases, the parent-child link agreed with those found in the tax data. Based on our findings, we propose improvements to the 2007 Kidlink methodology to increase child-parent links, and discuss how the creation of the file could be operationalized moving forward.
    View Full Paper PDF