CREAT: Census Research Exploration and Analysis Tool

Papers Containing Tag(s): 'Person Validation System'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

Protected Identification Key - 69

Internal Revenue Service - 50

Social Security Number - 48

American Community Survey - 46

Social Security Administration - 43

Census Bureau Disclosure Review Board - 42

Person Identification Validation System - 40

Current Population Survey - 37

Social Security - 33

Personally Identifiable Information - 24

2010 Census - 23

Disclosure Review Board - 20

Decennial Census - 19

Center for Administrative Records Research and Applications - 19

Department of Housing and Urban Development - 18

Census Numident - 18

W-2 - 18

Longitudinal Employer Household Dynamics - 17

Master Address File - 17

Housing and Urban Development - 16

Supplemental Nutrition Assistance Program - 15

Office of Management and Budget - 14

Employer Identification Numbers - 13

Survey of Income and Program Participation - 13

Individual Taxpayer Identification Numbers - 13

Some Other Race - 13

Federal Statistical Research Data Center - 12

Census Household Composition Key - 11

Medicaid Services - 11

Census Bureau Person Identification Validation System - 11

National Bureau of Economic Research - 11

Computer Assisted Personal Interview - 11

North American Industry Classification System - 10

1940 Census - 10

Administrative Records - 10

Temporary Assistance for Needy Families - 10

SSA Numident - 10

Indian Health Service - 10

Ordinary Least Squares - 10

Service Annual Survey - 9

Business Register - 9

Social and Economic Supplement - 9

Centers for Medicare - 9

Earned Income Tax Credit - 9

Indian Housing Information Center - 9

Census Edited File - 9

National Science Foundation - 9

National Opinion Research Center - 9

MAFID - 8

Bureau of Labor Statistics - 8

Adjusted Gross Income - 8

Master Beneficiary Record - 8

Disability Insurance - 8

Social Science Research Institute - 8

MAF-ARF - 8

Detailed Earnings Records - 8

Longitudinal Business Database - 7

Center for Economic Studies - 7

ASEC - 7

Center for Administrative Records Research - 7

Census Bureau Master Address File - 7

Data Management System - 7

Department of Homeland Security - 6

National Center for Health Statistics - 6

PIKed - 6

National Institute on Aging - 6

University of Chicago - 6

Census 2000 - 6

Citizenship and Immigration Services - 5

Federal Poverty Level - 5

Current Population Survey Annual Social and Economic Supplement - 5

Census Bureau Business Register - 5

American Housing Survey - 5

New York University - 5

Journal of Economic Literature - 5

PSID - 5

Chicago Census Research Data Center - 5

Postal Service - 5

Department of Health and Human Services - 5

National Academy of Sciences - 4

County Business Patterns - 4

CPS ASEC - 4

Metropolitan Statistical Area - 4

Computer Assisted Telephone Interviews and Computer Assisted Personal Interviews - 4

CATI - 4

Pew Research Center - 4

COVID-19 - 4

Alfred P Sloan Foundation - 4

Centers for Disease Control and Prevention - 4

Master Earnings File - 4

Cornell University - 4

Research Data Center - 4

Department of Labor - 4

Cornell Institute for Social and Economic Research - 4

Michigan Institute for Teaching and Research in Economics - 3

Department of Education - 3

Individual Characteristics File - 3

United States Census Bureau - 3

Employment History File - 3

Employer Characteristics File - 3

Federal Reserve Bank - 3

Quarterly Census of Employment and Wages - 3

NUMIDENT - 3

Customs and Border Protection - 3

Patent and Trademark Office - 3

Ohio State University - 3

Harvard University - 3

Organization for Economic Cooperation and Development - 3

Unemployment Insurance - 3

Department of Justice - 3

Office of Personnel Management - 3

HHS - 3

Stanford University - 3

Quarterly Workforce Indicators - 3

Department of Commerce - 3

population - 29

survey - 27

respondent - 26

census bureau - 21

census data - 20

ethnicity - 18

data - 16

disadvantaged - 16

hispanic - 16

record - 15

irs - 15

minority - 15

1040 - 13

immigrant - 13

socioeconomic - 13

enrollment - 12

data census - 12

poverty - 12

tax - 12

ethnic - 12

family - 11

medicaid - 11

datasets - 11

taxpayer - 11

recession - 11

resident - 11

citizen - 11

records census - 11

race - 11

census responses - 11

earnings - 10

assessed - 10

disparity - 10

matching - 10

racial - 10

employed - 10

intergenerational - 9

ssa - 9

labor - 9

workforce - 9

use census - 9

database - 8

federal - 8

agency - 8

employ - 8

imputation - 8

identifier - 7

coverage - 7

parent - 7

sampling - 7

census survey - 7

eligibility - 7

income data - 7

immigration - 7

unemployed - 7

residence - 7

survey income - 7

percentile - 7

welfare - 7

census records - 7

race census - 7

filing - 6

dependent - 6

income households - 6

microdata - 6

estimating - 6

statistical - 6

native - 6

black - 6

earner - 6

department - 5

residing - 5

housing - 5

survey households - 5

eligible - 5

enrolled - 5

migration - 5

surveys censuses - 5

medicare - 5

linkage - 5

2010 census - 5

citizenship - 5

census use - 5

payroll - 5

heterogeneity - 5

white - 5

associate - 5

latino - 5

census file - 5

employee - 4

parental - 4

household surveys - 4

population survey - 4

child - 4

provided census - 4

residential - 4

census linked - 4

bias - 4

income survey - 4

exemption - 4

census household - 4

migrant - 4

retirement - 4

poor - 4

birth - 4

assessing - 4

economist - 4

mexican - 4

segregation - 4

salary - 4

occupation - 4

ancestry - 4

census research - 4

matched - 4

disclosure - 3

graduate - 3

career - 3

renter - 3

prevalence - 3

subsidy - 3

income individuals - 3

household income - 3

linked census - 3

impact - 3

environmental - 3

pandemic - 3

propensity - 3

expenditure - 3

adoption - 3

mobility - 3

insurance - 3

wealth - 3

reside - 3

recessionary - 3

schooling - 3

recession exposure - 3

maternal - 3

estimator - 3

state - 3

census 2020 - 3

patent - 3

patenting - 3

mortality - 3

worker demographics - 3

employee data - 3

econometric - 3

hiring - 3

industrial - 3

enrollee - 3

Viewing papers 71 through 75 of 75


  • Working Paper

    2010 American Community Survey Match Study

    July 2014

    Working Paper Number:

    carra-2014-03

    Using administrative records data from federal government agencies and commercial sources, the 2010 ACS Match Study measures administrative records coverage of 2010 ACS addresses, persons, and persons at addresses at different levels of geography as well as by demographic characteristics and response mode. The 2010 ACS Match Study represents a continuation of the research undertaken in the 2010 Census Match Study, the first national-level evaluation of administrative records data coverage. Preliminary results indicate that administrative records provide substantial coverage for addresses and persons in the 2010 ACS (92.7 and 92.1 percent respectively), and less extensive though substantial coverage, for person-address pairs (74.3 percent). In addition, some variation in address, person and/or person-address coverage is found across demographic and response mode groups. This research informs future uses of administrative records in survey and decennial census operations to address the increasing costs of data collection and declining response rates.
    View Full Paper PDF
  • Working Paper

    Estimating Record Linkage False Match Rate for the Person Identification Validation System

    July 2014

    Working Paper Number:

    carra-2014-02

    The Census Bureau Person Identification Validation System (PVS) assigns unique person identifiers to federal, commercial, census, and survey data to facilitate linkages across files. PVS uses probabilistic matching to assign a unique Census Bureau identifier for each person. This paper presents a method to measure the false match rate in PVS following the approach of Belin and Rubin (1995). The Belin and Rubin methodology requires truth data to estimate a mixture model. The parameters from the mixture model are used to obtain point estimates of the false match rate for each of the PVS search modules. The truth data requirement is satisfied by the unique access the Census Bureau has to high quality name, date of birth, address and Social Security (SSN) data. Truth data are quickly created for the Belin and Rubin model and do not involve a clerical review process. These truth data are used to create estimates for the Belin and Rubin parameters, making the approach more feasible. Both observed and modeled false match rates are computed for all search modules in federal administrative records data and commercial data.
    View Full Paper PDF
  • Working Paper

    The Person Identification Validation System (PVS): Applying the Center for Administrative Records Research and Applications' (CARRA) Record Linkage Software

    July 2014

    Working Paper Number:

    carra-2014-01

    The Census Bureau's Person Identification Validation System (PVS) assigns unique person identifiers to federal, commercial, census, and survey data to facilitate linkages across and within files. PVS uses probabilistic matching to assign a unique Census Bureau identifier for each person. The PVS matches incoming files to reference files created with data from the Social Security Administration (SSA) Numerical Identification file, and SSA data with addresses obtained from federal files. This paper describes the PVS methodology from editing input data to creating the final file.
    View Full Paper PDF
  • Working Paper

    The Nature of the Bias When Studying Only Linkable Person Records: Evidence from the American Community Survey

    April 2014

    Working Paper Number:

    carra-2014-08

    Record linkage across survey and administrative records sources can greatly enrich data and improve their quality. The linkage can reduce respondent burden and nonresponse follow-up costs. This is particularly important in an era of declining survey response rates and tight budgets. Record linkage also creates statistical bias, however. The U.S. Census Bureau links person records through its Person Identification Validation System (PVS), assigning each record a Protected Identification Key (PIK). It is not possible to reliably assign a PIK to every record, either due to insufficient identifying information or because the information does not uniquely match any of the administrative records used in the person validation process. Non-random ability to assign a PIK can potentially inject bias into statistics using linked data. This paper studies the nature of this bias using the 2009 and 2010 American Community Survey (ACS). The ACS is well-suited for this analysis, as it contains a rich set of person characteristics that can describe the bias. We estimate probit models for whether a record is assigned a PIK. The results suggest that young children, minorities, residents of group quarters, immigrants, recent movers, low-income individuals, and non-employed individuals are less likely to receive a PIK using 2009 ACS. Changes to the PVS process in 2010 significantly addressed the young children deficit, attenuated the other biases, and increased the validated records share from 88.1 to 92.6 percent (person-weighted).
    View Full Paper PDF
  • Working Paper

    Estimating Measurement Error in SIPP Annual Job Earnings: A Comparison of Census Bureau Survey and SSA Administrative Data

    July 2011

    Working Paper Number:

    CES-11-20

    We quantify sources of variation in annual job earnings data collected by the Survey of Income and Program Participation (SIPP) to determine how much of the variation is the result of measurement error. Jobs reported in the SIPP are linked to jobs reported in an administrative database, the Detailed Earnings Records (DER) drawn from the Social Security Administration's Master Earnings File, a universe file of all earnings reported on W-2 tax forms. As a result of the match, each job potentially has two earnings observations per year: survey and administrative. Unlike previous validation studies, both of these earnings measures are viewed as noisy measures of some underlying true amount of annual earnings. While the existence of survey error resulting from respondent mistakes or misinterpretation is widely accepted, the idea that administrative data are also error-prone is new. Possible sources of employer reporting error, employee under-reporting of compensation such as tips, and general differences between how earnings may be reported on tax forms and in surveys, necessitates the discarding of the assumption that administrative data are a true measure of the quantity that the survey was designed to collect. In addition, errors in matching SIPP and DER jobs, a necessary task in any use of administrative data, also contribute to measurement error in both earnings variables. We begin by comparing SIPP and DER earnings for different demographic and education groups of SIPP respondents. We also calculate different measures of changes in earnings for individuals switching jobs. We estimate a standard earnings equation model using SIPP and DER earnings and compare the resulting coefficients. Finally exploiting the presence of individuals with multiple jobs and shared employers over time, we estimate an econometric model that includes random person and firm effects, a common error component shared by SIPP and DER earnings, and two independent error components that represent the variation unique to each earnings measure. We compare the variance components from this model and consider how the DER and SIPP differ across unobservable components.
    View Full Paper PDF