CREAT: Census Research Exploration and Analysis Tool

Papers Containing Keywords(s): 'census file'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

Social Security Administration - 9

Social Security Number - 8

Protected Identification Key - 8

Internal Revenue Service - 6

Service Annual Survey - 6

Current Population Survey - 5

Research Data Center - 5

SSA Numident - 5

Person Validation System - 5

Standard Statistical Establishment List - 4

Standard Industrial Classification - 4

Bureau of Labor Statistics - 4

Center for Economic Studies - 4

Employer Identification Numbers - 4

Business Master File - 4

American Community Survey - 4

Master Address File - 4

2010 Census - 4

Person Identification Validation System - 4

Center for Administrative Records Research and Applications - 4

Personally Identifiable Information - 4

Decennial Census - 3

Cornell University - 3

North American Industry Classification System - 3

Alfred P Sloan Foundation - 3

Longitudinal Employer Household Dynamics - 3

Business Register - 3

Employment History File - 3

Employer Characteristics File - 3

Individual Characteristics File - 3

American Housing Survey - 3

Quarterly Workforce Indicators - 3

Core Based Statistical Area - 3

Quarterly Census of Employment and Wages - 3

Business Employment Dynamics - 3

Business Register Bridge - 3

Disclosure Review Board - 3

National Opinion Research Center - 3

Census Numident - 3

Metropolitan Statistical Area - 3

Longitudinal Business Database - 2

Social Security - 2

Composite Person Record - 2

Local Employment Dynamics - 2

North American Industry Classi - 2

Department of Housing and Urban Development - 2

Individual Taxpayer Identification Numbers - 2

MAFID - 2

Administrative Records - 2

Housing and Urban Development - 2

Indian Health Service - 2

Indian Housing Information Center - 2

1940 Census - 2

Minnesota Population Center - 2

Census Bureau Person Identification Validation System - 2

Annual Survey of Manufactures - 2

Permanent Plant Number - 2

Establishment Micro Properties - 2

Survey of Income and Program Participation - 2

Unemployment Insurance - 2

CDF - 2

Viewing papers 1 through 9 of 9


  • Working Paper

    LEHD Infrastructure S2014 files in the FSRDC

    September 2018

    Authors: Lars Vilhuber

    Working Paper Number:

    CES-18-27R

    The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau, with the support of several national research agencies, maintains a set of infrastructure files using administrative data provided by state agencies, enhanced with information from other administrative data sources, demographic and economic (business) surveys and censuses. The LEHD Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their interaction in the U.S. economy. This document describes the structure and content of the 2014 Snapshot of the LEHD Infrastructure files as they are made available in the Census Bureau's secure and restricted-access Research Data Center network. The document attempts to provide a comprehensive description of all researcher-accessible files, of their creation, and of any modifications made to the files to facilitate researcher access.
    View Full Paper PDF
  • Working Paper

    Assessing Coverage and Quality of the 2007 Prototype Census Kidlink Database

    September 2015

    Working Paper Number:

    carra-2015-07

    The Census Bureau is conducting research to expand the use of administrative records data in censuses and surveys to decrease respondent burden and reduce costs while improving data quality. Much of this research (e.g., Rastogi and O''Hara (2012), Luque and Bhaskar (2014)) hinges on the ability to integrate multiple data sources by linking individuals across files. One of the Census Bureau's record linkage methodologies for data integration is the Person Identification Validation System or PVS. PVS assigns anonymous and unique IDs (Protected Identification Keys or PIKs) that serve as linkage keys across files. Prior research showed that integrating 'known associates' information into PVS's reference files could potentially enhance PVS's PIK assignment rates. The term 'known associates' refers to people that are likely to be associated with each other because of a known common link (such as family relationships or people sharing a common address), and thus, to be observed together in different files. One of the results from this prior research was the creation of the 2007 Census Kidlink file, a child-level file linking a child's Social Security Number (SSN) record to the SSN of those identified as the child's parents. In this paper, we examine to what extent the 2007 Census Kidlink methodology was able to link parents SSNs to children SSN records, and also evaluate the quality of those links. We find that in approximately 80 percent of cases, at least one parent was linked to the child's record. Younger children and noncitizens have a higher percentage of cases where neither parent could be linked to the child. Using 2007 tax data as a benchmark, our quality evaluation results indicate that in at least 90 percent of the cases, the parent-child link agreed with those found in the tax data. Based on our findings, we propose improvements to the 2007 Kidlink methodology to increase child-parent links, and discuss how the creation of the file could be operationalized moving forward.
    View Full Paper PDF
  • Working Paper

    Coverage and Agreement of Administrative Records and 2010 American Community Survey Demographic Data

    November 2014

    Working Paper Number:

    carra-2014-14

    The U.S. Census Bureau is researching possible uses of administrative records in decennial census and survey operations. The 2010 Census Match Study and American Community Survey (ACS) Match Study represent recent efforts by the Census Bureau to evaluate the extent to which administrative records provide data on persons and addresses in the 2010 Census and 2010 ACS. The 2010 Census Match Study also examines demographic response data collected in administrative records. Building on this analysis, we match data from the 2010 ACS to federal administrative records and third party data as well as to previous census data and examine administrative records coverage and agreement of ACS age, sex, race, and Hispanic origin responses. We find high levels of coverage and agreement for sex and age responses and variable coverage and agreement across race and Hispanic origin groups. These results are similar to findings from the 2010 Census Match Study.
    View Full Paper PDF
  • Working Paper

    Creating Linked Historical Data: An Assessment of the Census Bureau's Ability to Assign Protected Identification Keys to the 1960 Census

    September 2014

    Working Paper Number:

    carra-2014-12

    In order to study social phenomena over the course of the 20th century, the Census Bureau is investigating the feasibility of digitizing historical census records and linking them to contemporary data. However, historical censuses have limited personally identifiable information available to match on. In this paper, I discuss the problems associated with matching older censuses to contemporary data files, and I describe the matching process used to match a small sample of the 1960 census to the Social Security Administration Numeric Identification System.
    View Full Paper PDF
  • Working Paper

    Person Matching in Historical Files using the Census Bureau's Person Validation System

    September 2014

    Working Paper Number:

    carra-2014-11

    The recent release of the 1940 Census manuscripts enables the creation of longitudinal data spanning the whole of the twentieth century. Linked historical and contemporary data would allow unprecedented analyses of the causes and consequences of health, demographic, and economic change. The Census Bureau is uniquely equipped to provide high quality linkages of person records across datasets. This paper summarizes the linkage techniques employed by the Census Bureau and discusses utilization of these techniques to append protected identification keys to the 1940 Census.
    View Full Paper PDF
  • Working Paper

    The Person Identification Validation System (PVS): Applying the Center for Administrative Records Research and Applications' (CARRA) Record Linkage Software

    July 2014

    Working Paper Number:

    carra-2014-01

    The Census Bureau's Person Identification Validation System (PVS) assigns unique person identifiers to federal, commercial, census, and survey data to facilitate linkages across and within files. PVS uses probabilistic matching to assign a unique Census Bureau identifier for each person. The PVS matches incoming files to reference files created with data from the Social Security Administration (SSA) Numerical Identification file, and SSA data with addresses obtained from federal files. This paper describes the PVS methodology from editing input data to creating the final file.
    View Full Paper PDF
  • Working Paper

    LEHD Data Documentation LEHD-OVERVIEW-S2008-rev1

    December 2011

    Working Paper Number:

    CES-11-43

    View Full Paper PDF
  • Working Paper

    LEHD Infrastructure Files in the Census RDC: Overview of S2004 Snapshot

    April 2011

    Working Paper Number:

    CES-11-13

    The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau, with the support of several national research agencies, has built a set of infrastructure files using administrative data provided by state agencies, enhanced with information from other administrative data sources, demographic and economic (business) surveys and censuses. The LEHD Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their interaction in the U.S. economy. This document describes the structure and content of the 2004 Snapshot of the LEHD Infrastructure files as they are made available in the Census Bureau's Research Data Center network.
    View Full Paper PDF
  • Working Paper

    NEW DATA FOR DYNAMIC ANALYSIS: THE LONGITUDINAL ESTABLISHMENT AND ENTERPRISE MICRODATA (LEEM) FILE

    December 1999

    Authors: Alicia Robb

    Working Paper Number:

    CES-99-18

    Until now, research on U.S. business activities over time has been hindered by the lack of accurate and comprehensive longitudinal data. The new Longitudinal Establishment and Enterprise Microdata (LEEM) are tremendously rich data that open up numerous possibilities for dynamic analyses of businesses in the U.S. economy. It is the first nationwide high-quality longitudinal database that covers the majority of employer businesses from all sectors of the economy. Due to the confidential nature of these data, the file is located at the Center for Economic Studies in the U.S. Bureau of the Census. To access the data, researchers must submit an acceptable proposal to CES and become sworn Census researchers. This paper describes the LEEM file, the variables contained on the file, and current uses of the data.
    View Full Paper PDF