CREAT: Census Research Exploration and Analysis Tool

Papers Containing Tag(s): 'Person Validation System'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

Protected Identification Key - 69

Internal Revenue Service - 50

Social Security Number - 48

American Community Survey - 46

Social Security Administration - 43

Census Bureau Disclosure Review Board - 42

Person Identification Validation System - 40

Current Population Survey - 37

Social Security - 33

Personally Identifiable Information - 24

2010 Census - 23

Disclosure Review Board - 20

Decennial Census - 19

Center for Administrative Records Research and Applications - 19

Department of Housing and Urban Development - 18

Census Numident - 18

W-2 - 18

Longitudinal Employer Household Dynamics - 17

Master Address File - 17

Housing and Urban Development - 16

Supplemental Nutrition Assistance Program - 15

Office of Management and Budget - 14

Employer Identification Numbers - 13

Survey of Income and Program Participation - 13

Individual Taxpayer Identification Numbers - 13

Some Other Race - 13

Federal Statistical Research Data Center - 12

Census Household Composition Key - 11

Medicaid Services - 11

Census Bureau Person Identification Validation System - 11

National Bureau of Economic Research - 11

Computer Assisted Personal Interview - 11

North American Industry Classification System - 10

1940 Census - 10

Administrative Records - 10

Temporary Assistance for Needy Families - 10

SSA Numident - 10

Indian Health Service - 10

Ordinary Least Squares - 10

Service Annual Survey - 9

Business Register - 9

Social and Economic Supplement - 9

Centers for Medicare - 9

Earned Income Tax Credit - 9

Indian Housing Information Center - 9

Census Edited File - 9

National Science Foundation - 9

National Opinion Research Center - 9

MAFID - 8

Bureau of Labor Statistics - 8

Adjusted Gross Income - 8

Master Beneficiary Record - 8

Disability Insurance - 8

Social Science Research Institute - 8

MAF-ARF - 8

Detailed Earnings Records - 8

Longitudinal Business Database - 7

Center for Economic Studies - 7

ASEC - 7

Center for Administrative Records Research - 7

Census Bureau Master Address File - 7

Data Management System - 7

Department of Homeland Security - 6

National Center for Health Statistics - 6

PIKed - 6

National Institute on Aging - 6

University of Chicago - 6

Census 2000 - 6

Citizenship and Immigration Services - 5

Federal Poverty Level - 5

Current Population Survey Annual Social and Economic Supplement - 5

Census Bureau Business Register - 5

American Housing Survey - 5

New York University - 5

Journal of Economic Literature - 5

PSID - 5

Chicago Census Research Data Center - 5

Postal Service - 5

Department of Health and Human Services - 5

National Academy of Sciences - 4

County Business Patterns - 4

CPS ASEC - 4

Metropolitan Statistical Area - 4

Computer Assisted Telephone Interviews and Computer Assisted Personal Interviews - 4

CATI - 4

Pew Research Center - 4

COVID-19 - 4

Alfred P Sloan Foundation - 4

Centers for Disease Control and Prevention - 4

Master Earnings File - 4

Cornell University - 4

Research Data Center - 4

Department of Labor - 4

Cornell Institute for Social and Economic Research - 4

Michigan Institute for Teaching and Research in Economics - 3

Department of Education - 3

Individual Characteristics File - 3

United States Census Bureau - 3

Employment History File - 3

Employer Characteristics File - 3

Federal Reserve Bank - 3

Quarterly Census of Employment and Wages - 3

NUMIDENT - 3

Customs and Border Protection - 3

Patent and Trademark Office - 3

Ohio State University - 3

Harvard University - 3

Organization for Economic Cooperation and Development - 3

Unemployment Insurance - 3

Department of Justice - 3

Office of Personnel Management - 3

HHS - 3

Stanford University - 3

Quarterly Workforce Indicators - 3

Department of Commerce - 3

population - 29

survey - 27

respondent - 26

census bureau - 21

census data - 20

ethnicity - 18

data - 16

disadvantaged - 16

hispanic - 16

record - 15

irs - 15

minority - 15

1040 - 13

immigrant - 13

socioeconomic - 13

enrollment - 12

data census - 12

poverty - 12

tax - 12

ethnic - 12

family - 11

medicaid - 11

datasets - 11

taxpayer - 11

recession - 11

resident - 11

citizen - 11

records census - 11

race - 11

census responses - 11

earnings - 10

assessed - 10

disparity - 10

matching - 10

racial - 10

employed - 10

intergenerational - 9

ssa - 9

labor - 9

workforce - 9

use census - 9

database - 8

federal - 8

agency - 8

employ - 8

imputation - 8

identifier - 7

coverage - 7

parent - 7

sampling - 7

census survey - 7

eligibility - 7

income data - 7

immigration - 7

unemployed - 7

residence - 7

survey income - 7

percentile - 7

welfare - 7

census records - 7

race census - 7

filing - 6

dependent - 6

income households - 6

microdata - 6

estimating - 6

statistical - 6

native - 6

black - 6

earner - 6

department - 5

residing - 5

housing - 5

survey households - 5

eligible - 5

enrolled - 5

migration - 5

surveys censuses - 5

medicare - 5

linkage - 5

2010 census - 5

citizenship - 5

census use - 5

payroll - 5

heterogeneity - 5

white - 5

associate - 5

latino - 5

census file - 5

employee - 4

parental - 4

household surveys - 4

population survey - 4

child - 4

provided census - 4

residential - 4

census linked - 4

bias - 4

income survey - 4

exemption - 4

census household - 4

migrant - 4

retirement - 4

poor - 4

birth - 4

assessing - 4

economist - 4

mexican - 4

segregation - 4

salary - 4

occupation - 4

ancestry - 4

census research - 4

matched - 4

disclosure - 3

graduate - 3

career - 3

renter - 3

prevalence - 3

subsidy - 3

income individuals - 3

household income - 3

linked census - 3

impact - 3

environmental - 3

pandemic - 3

propensity - 3

expenditure - 3

adoption - 3

mobility - 3

insurance - 3

wealth - 3

reside - 3

recessionary - 3

schooling - 3

recession exposure - 3

maternal - 3

estimator - 3

state - 3

census 2020 - 3

patent - 3

patenting - 3

mortality - 3

worker demographics - 3

employee data - 3

econometric - 3

hiring - 3

industrial - 3

enrollee - 3

Viewing papers 61 through 70 of 75


  • Working Paper

    An outside view: What do observers say about others' races and Hispanic origins?

    August 2015

    Working Paper Number:

    carra-2015-05

    Outsiders' views of a person's race or Hispanic origin can impact how she sees herself, how she reports her race and Hispanic origins, and her social and economic experiences. The way outsiders describe non-strangers in terms of their race and Hispanic origin may reveal popular assumptions about which race/Hispanic categories are salient for Americans, which kinds of people are seen as multiracial, and the types of cues people use when identifying another person's race. We study patterns of observer identification using a unique, large, linked data source with two measures of a person's race and Hispanic origin. One measure (from Census 2000 or the 2010 Census) was provided by a household respondent and the other (from the other census year) was provided by a census proxy reporter (e.g., a neighbor) who responded on behalf of a non-responsive household. We ask: Does an outsider's report of a person's race and Hispanic origin match a household report? We find that in about 90% of our 3.7 million (nonrepresentative) cases, proxy reports of a person's race and Hispanic origin match responses given by the household in a different census year. Match rates are high for the largest groups: non-Hispanic whites, blacks, and Asians and for Hispanics, though proxies are not very able to replicate the race responses of Hispanics. Matches are much less common for people in smaller groups (American Indian/Alaska Native, Pacific Islander, Some Other Race, and multiracial). We also ask: What predicts a matched response and what predicts a particular unmatched response? We find evidence of the persistence of hypodescent for blacks and hyperdescent for American Indians. Biracial Asian-whites and Pacific Islander-whites are more often seen by others as non-Hispanic white than as people of color. Proxy reporters tend to identify children as multiple race and elders as single race, whether they are or not. The race/Hispanic composition of the tract is more powerfully predictive of a particular unmatched response than are tract-level measures of socioeconomic status; unmatched responses are often consistent with the race/Hispanic characteristics of the neighborhood.
    View Full Paper PDF
  • Working Paper

    Business Dynamics of Innovating Firms: Linking U.S. Patents with Administrative Data on Workers and Firms

    July 2015

    Working Paper Number:

    CES-15-19

    This paper discusses the construction of a new longitudinal database tracking inventors and patent-owning firms over time. We match granted patents between 2000 and 2011 to administrative databases of firms and workers housed at the U.S. Census Bureau. We use inventor information in addition to the patent assignee firm name to and improve on previous efforts linking patents to firms. The triangulated database allows us to maximize match rates and provide validation for a large fraction of matches. In this paper, we describe the construction of the database and explore basic features of the data. We find patenting firms, particularly young patenting firms, disproportionally contribute jobs to the U.S. economy. We find patenting is a relatively rare event among small firms but that most patenting firms are nevertheless small, and that patenting is not as rare an event for the youngest firms compared to the oldest firms. While manufacturing firms are more likely to patent than firms in other sectors, we find most patenting firms are in the services and wholesale sectors. These new data are a product of collaboration within the U.S. Department of Commerce, between the U.S. Census Bureau and the U.S. Patent and Trademark Office.
    View Full Paper PDF
  • Working Paper

    Assimilation and Coverage of the Foreign-Born Population in Administrative Records

    April 2015

    Working Paper Number:

    carra-2015-02

    The U.S. Census Bureau is researching ways to incorporate administrative data in decennial census and survey operations. Critical to this work is an understanding of the coverage of the population by administrative records. Using federal and third party administrative data linked to the American Community Survey (ACS), we evaluate the extent to which administrative records provide data on foreign-born individuals in the ACS and employ multinomial logistic regression techniques to evaluate characteristics of those who are in administrative records relative to those who are not. We find that overall, administrative records provide high coverage of foreign-born individuals in our sample for whom a match can be determined. The odds of being in administrative records are found to be tied to the processes of immigrant assimilation - naturalization, higher English proficiency, educational attainment, and full-time employment are associated with greater odds of being in administrative records. These findings suggest that as immigrants adapt and integrate into U.S. society, they are more likely to be involved in government and commercial processes and programs for which we are including data. We further explore administrative records coverage for the two largest race/ethnic groups in our sample - Hispanic and non-Hispanic single-race Asian foreign born, finding again that characteristics related to assimilation are associated with administrative records coverage for both groups. However, we observe that neighborhood context impacts Hispanics and Asians differently.
    View Full Paper PDF
  • Working Paper

    Evaluating Race and Hispanic Origin Responses of Medicaid Participants Using Census Data

    April 2015

    Working Paper Number:

    carra-2015-01

    Health and health care disparities associated with race or Hispanic origin are complex and continue to challenge researchers and policy makers. With the intention of improving the measurement and monitoring of these disparities, provisions of the Patient Protection and Affordable Care Act (ACA) of 2010 require states to collect, report and analyze data on demographic characteristics of applicants and participants in Medicaid and other federally supported programs. By linking Medicaid records to 2010 Census, American Community Survey, and Census 2000, this new large-scale study examines and documents the extent to which pre-ACA Medicaid administrative records match self-reported race and Hispanic origin in Census data. Linked records allow comparisons between individuals with matching and non-matching race and Hispanic origin data across several demographic, socioeconomic and neighborhood characteristics, such as age, gender, language proficiency, education and Census tract variables. Identification of the groups most likely to have non-matching and missing race and Hispanic origin data in Medicaid relative to Census data can inform strategies to improve the quality of demographic data collected from Medicaid populations.
    View Full Paper PDF
  • Working Paper

    The EITC over the business cycle: Who benefits?

    December 2014

    Authors: Maggie R. Jones

    Working Paper Number:

    carra-2014-15

    In this paper, I examine the impact of the Great Recession on Earned Income Tax Credit (EITC) eligibility. Because the EITC is structurally tied to earnings, the direction of this impact is not immediately obvious. Families who experience complete job loss for an entire tax year lose eligibility, while those experiencing underemployment (part-year employment, a reduction in hours, or spousal unemployment in married households) may become eligible. Determining the direction and magnitude of the impact is important for a number of reasons. The EITC has become the largest cash-transfer program in the U.S., and many low-earning families rely on it as a means of support in tough times. The program has largely been viewed as a replacement for welfare, enticing former welfare recipients into the labor force. However, the effectiveness of the EITC during a period of very high unemployment has not been assessed. To answer these questions, I first use the Current Population Survey (CPS) matched to Internal Revenue Service data from tax years 2005 to 2010 to assess patterns of employment and eligibility over the Great Recession for different labor-force groups. Results indicate that overall, EITC eligibility increased over the recession, but only among groups that were cushioned from total household earnings loss by marriage. I also use the 2006 CPS matched to tax data from 2005 through 2011 to examine changes in eligibility experienced by individuals over time. In assessing three competing causes of eligibility loss, I find that less-educated, unmarried women experienced a greater hazard of eligibility loss due a yearlong lack of earnings compared with other labor-market groups. I discuss the implications of these findings on the view of the EITC as a safety-net program.
    View Full Paper PDF
  • Working Paper

    Coverage and Agreement of Administrative Records and 2010 American Community Survey Demographic Data

    November 2014

    Working Paper Number:

    carra-2014-14

    The U.S. Census Bureau is researching possible uses of administrative records in decennial census and survey operations. The 2010 Census Match Study and American Community Survey (ACS) Match Study represent recent efforts by the Census Bureau to evaluate the extent to which administrative records provide data on persons and addresses in the 2010 Census and 2010 ACS. The 2010 Census Match Study also examines demographic response data collected in administrative records. Building on this analysis, we match data from the 2010 ACS to federal administrative records and third party data as well as to previous census data and examine administrative records coverage and agreement of ACS age, sex, race, and Hispanic origin responses. We find high levels of coverage and agreement for sex and age responses and variable coverage and agreement across race and Hispanic origin groups. These results are similar to findings from the 2010 Census Match Study.
    View Full Paper PDF
  • Working Paper

    Do Doubled-up Families Minimize Household-level Tax Burden?

    September 2014

    Working Paper Number:

    carra-2014-13

    This paper examines a method of tax avoidance not previously studied: the sorting of dependent children among related filers who have 'doubled up' in a household for economic reasons. Using the Current Population Survey Annual Social and Economic Supplement (CPS ASEC) linked with 1040 data from the Internal Revenue Service (IRS), we examine households with children and at least two adult tax filers to determine whether the household minimizes income tax burden, and thus maximizes refunds, by optimally claiming dependents. We examine specifically the relationship between the Earned Income Tax Credit (EITC) and the sorting of dependent children among filers in households. We find the following: The propensity to sort increases as the number of filers who are potentially eligible for the EITC increases; sorting probability increases as the optimal household EITC amount increases; and among households with at least one EITC-eligible filer, the propensity to sort increases as the difference between modeled household EITC amount and the optimal amount increases. We also exploit the 2009 change in EITC benefit for families with three or more children, finding that the propensity to sort to exactly three children increased among EITC-eligible filers after the rule change. The results of this analysis improve our understanding of filing behavior, particularly how households form filing units and pool resources, and have implications for poverty measurement in complex households This presentation was given at the CARRA Seminar, July 16, 2014
    View Full Paper PDF
  • Working Paper

    Creating Linked Historical Data: An Assessment of the Census Bureau's Ability to Assign Protected Identification Keys to the 1960 Census

    September 2014

    Working Paper Number:

    carra-2014-12

    In order to study social phenomena over the course of the 20th century, the Census Bureau is investigating the feasibility of digitizing historical census records and linking them to contemporary data. However, historical censuses have limited personally identifiable information available to match on. In this paper, I discuss the problems associated with matching older censuses to contemporary data files, and I describe the matching process used to match a small sample of the 1960 census to the Social Security Administration Numeric Identification System.
    View Full Paper PDF
  • Working Paper

    Person Matching in Historical Files using the Census Bureau's Person Validation System

    September 2014

    Working Paper Number:

    carra-2014-11

    The recent release of the 1940 Census manuscripts enables the creation of longitudinal data spanning the whole of the twentieth century. Linked historical and contemporary data would allow unprecedented analyses of the causes and consequences of health, demographic, and economic change. The Census Bureau is uniquely equipped to provide high quality linkages of person records across datasets. This paper summarizes the linkage techniques employed by the Census Bureau and discusses utilization of these techniques to append protected identification keys to the 1940 Census.
    View Full Paper PDF
  • Working Paper

    America's Churning Races: Race and Ethnic Response Changes between Census 2000 and the 2010 Census

    August 2014

    Working Paper Number:

    carra-2014-09

    Race and ethnicity responses can change over time and across contexts - a component of population change not usually taken into account. To what extent do race and/or Hispanic origin responses change? Is change more common to/from some race/ethnic groups than others? Does the propensity to change responses vary by characteristics of the individual? To what extent do these changes affect researchers? We use internal Census Bureau data from the 2000 and 2010 censuses in which individuals' responses have been linked across years. Approximately 9.8 million people (about 6 percent) in our large, non-representative linked data have a different race and/or Hispanic origin response in 2010 than they did in 2000. Several groups experienced considerable fluidity in racial identification: American Indians and Alaska Natives, Native Hawaiians and Other Pacific Islanders, and multiple-race response groups, as well as Hispanics when reporting a race. In contrast, race and ethnic responses for single-race non-Hispanic whites, blacks, and Asians were relatively consistent over the decade, as were ethnicity responses by Hispanics. People who change their race and/or Hispanic origin response(s) are doing so in a wide variety of ways, as anticipated by previous research. For example, people's responses change from multiple races to a single race, from a single race to multiple races, from one single race to another, and some people add or drop a Hispanic response. The inflow of people to each race/Hispanic group is in many cases similar in size to the outflow from the same group, such that cross-sectional data would show a small net change. We find response changes across ages, sexes, regions, and response modes, with variation across groups. Researchers should consider the implications of changing race and Hispanic origin responses when conducting analyses and interpreting results.
    View Full Paper PDF