CREAT: Census Research Exploration and Analysis Tool

Papers Containing Keywords(s): 'record'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

Internal Revenue Service - 24

Social Security Administration - 22

Protected Identification Key - 22

American Community Survey - 21

Center for Economic Studies - 18

Service Annual Survey - 17

Social Security Number - 15

Person Validation System - 14

National Science Foundation - 13

2010 Census - 13

Census Bureau Disclosure Review Board - 12

North American Industry Classification System - 12

Personally Identifiable Information - 11

Research Data Center - 11

Person Identification Validation System - 10

Social Security - 10

Longitudinal Business Database - 10

Indian Health Service - 9

Master Address File - 9

Standard Industrial Classification - 9

Administrative Records - 9

Center for Administrative Records Research and Applications - 9

Longitudinal Employer Household Dynamics - 8

Current Population Survey - 8

County Business Patterns - 8

Employer Identification Numbers - 8

Business Register - 8

Decennial Census - 7

Department of Housing and Urban Development - 7

Indian Housing Information Center - 7

Housing and Urban Development - 7

Some Other Race - 7

Bureau of Labor Statistics - 7

Economic Census - 7

Federal Statistical Research Data Center - 7

Disclosure Review Board - 6

Quarterly Workforce Indicators - 6

Individual Taxpayer Identification Numbers - 6

Survey of Income and Program Participation - 6

Business Dynamics Statistics - 6

SSA Numident - 6

National Opinion Research Center - 6

Computer Assisted Telephone Interviews and Computer Assisted Personal Interviews - 5

Computer Assisted Personal Interview - 5

CATI - 5

Quarterly Census of Employment and Wages - 5

Census Bureau Person Identification Validation System - 5

Census Numident - 5

Census Bureau Business Register - 5

Annual Survey of Manufactures - 5

MAFID - 5

Cornell University - 5

Medicaid Services - 5

Postal Service - 4

Census Bureau Master Address File - 4

Standard Statistical Establishment List - 4

Company Organization Survey - 4

Centers for Medicare - 4

Chicago Census Research Data Center - 4

Unemployment Insurance - 4

Center for Administrative Records Research - 4

Census of Manufactures - 4

PIKed - 4

Sloan Foundation - 3

Supplemental Nutrition Assistance Program - 3

Census Edited File - 3

Census Household Composition Key - 3

University of Chicago - 3

National Center for Health Statistics - 3

Office of Management and Budget - 3

1940 Census - 3

Department of Economics - 3

Alfred P Sloan Foundation - 3

University of Michigan - 3

COVID-19 - 3

Metropolitan Statistical Area - 3

Longitudinal Research Database - 3

Minnesota Population Center - 3

Local Employment Dynamics - 3

Duke University - 3

data - 36

survey - 26

datasets - 24

respondent - 21

microdata - 20

census bureau - 20

census data - 17

matching - 17

data census - 16

agency - 15

database - 15

report - 13

population - 12

statistical - 12

imputation - 11

records census - 10

irs - 10

census records - 10

linkage - 10

matched - 10

identifier - 9

disclosure - 8

federal - 8

use census - 8

census use - 8

census research - 8

ethnicity - 7

hispanic - 7

estimating - 7

coverage - 7

ssa - 7

department - 7

quarterly - 7

information - 7

confidentiality - 6

privacy - 6

publicly - 6

employed - 6

census survey - 6

citizen - 6

business data - 6

aggregate - 6

sector - 6

firms census - 6

census file - 6

1040 - 5

enrollment - 5

employee - 5

filing - 5

census employment - 5

census linked - 5

residence - 5

payroll - 5

enterprise - 5

longitudinal - 5

analysis - 5

associate - 5

survey data - 5

public - 4

minority - 4

ethnic - 4

job - 4

incorporated - 4

employment data - 4

workforce - 4

race - 4

sampling - 4

discrepancy - 4

race census - 4

linked census - 4

resident - 4

census responses - 4

census 2020 - 4

sample - 4

reporting - 4

employ - 4

employment statistics - 4

researcher - 4

research - 4

2010 census - 4

statistical disclosure - 4

model - 4

industrial - 4

statistical agencies - 4

surveys censuses - 3

tenure - 3

assessed - 3

native - 3

state - 3

migration - 3

migrant - 3

medicare - 3

medicaid - 3

recession - 3

establishments data - 3

manufacturing - 3

statistician - 3

financial - 3

demography - 3

inference - 3

ancestry - 3

econometric - 3

earnings - 3

Viewing papers 11 through 20 of 51


  • Working Paper

    Redesigning the Longitudinal Business Database

    May 2021

    Working Paper Number:

    CES-21-08

    In this paper we describe the U.S. Census Bureau's redesign and production implementation of the Longitudinal Business Database (LBD) first introduced by Jarmin and Miranda (2002). The LBD is used to create the Business Dynamics Statistics (BDS), tabulations describing the entry, exit, expansion, and contraction of businesses. The new LBD and BDS also incorporate information formerly provided by the Statistics of U.S. Businesses program, which produced similar year-to-year measures of employment and establishment flows. We describe in detail how the LBD is created from curation of the input administrative data, longitudinal matching, retiming of economic census-year births and deaths, creation of vintage consistent industry codes and noise factors, and the creation and cleaning of each year of LBD data. This documentation is intended to facilitate the proper use and understanding of the data by both researchers with approved projects accessing the LBD microdata and those using the BDS tabulations.
    View Full Paper PDF
  • Working Paper

    The Impact of 2010 Decennial Census Hiring on the Unemployment Rate

    June 2020

    Working Paper Number:

    CES-20-19

    The decennial census is the largest peacetime operation of the U.S. federal government. The Census Bureau hires hundreds of thousands of temporary workers to conduct the decennial census. The magnitude of this temporary workforce influences the national employment situation when enumeration efforts ramp up and when they recede. The impact of decennial census hiring on the headline number of payroll jobs added each month is well established, but previous work has not established how decennial census hiring affects the headline unemployment rate. We link the 2010 Decennial Applicant Personnel and Payroll System data to the 2010 American Community Survey to answer this question. We find that the large hiring surge in May 2010 came mostly from people already employed (40 percent) or from people who were unemployed (33 percent). We estimate that the workers hired for Census 2010 lowered the May 2010 unemployment rate by one-tenth of a percentage point relative to the counterfactual. This one-tenth of a percentage point is within the standard error for the official unemployment rate, and BLS press releases would denote a change in the unemployment rate of 0.1% or less as 'unchanged.' We also estimate that relative to the counterfactual, the more gradual changes in decennial census employment influenced the unemployment rate by less than one-tenth of a percentage point in every other month during 2010.
    View Full Paper PDF
  • Working Paper

    Matching State Business Registration Records to Census Business Data

    January 2020

    Working Paper Number:

    CES-20-03

    We describe our methodology and results from matching state Business Registration Records (BRR) to Census business data. We use data from Massachusetts and California to develop methods and preliminary results that could be used to guide matching data for additional states. We obtain matches to Census business records for 45% of the Massachusetts BRR records and 40% of the California BRR records. We find higher match rates for incorporated businesses and businesses with higher startup-quality scores as assigned in Guzman and Stern (2018). Clerical reviews show that using relatively strict matching on address is important for match accuracy, while results are less sensitive to name matching strictness. Among matched BRR records, the modal timing of the first match to the BR is in the year in which the BRR record was filed. We use two sets of software to identify matches: SAS DQ Match and a machine-learning algorithm described in Cuffe and Goldschlag (2018). We find preliminary evidence that while the ML-based method yields more match results, SAS DQ tends to result in higher accuracy rates. To conclude, we provide suggestions on how to proceed with matching other states' data in light of our findings using these two states.
    View Full Paper PDF
  • Working Paper

    The Management and Organizational Practices Survey (MOPS): Collection and Processing

    December 2018

    Working Paper Number:

    CES-18-51

    The U.S. Census Bureau partnered with a team of external researchers to conduct the first-ever large-scale survey of management practices in the United States, the Management and Organizational Practices Survey (MOPS), for reference year 2010. With the help of the research team, the Census Bureau expanded and improved the survey for a second wave for reference year 2015. The MOPS is a supplement to the Annual Survey of Manufacturing (ASM), and so the collection and processing strategy for the MOPS built on the methodology for the ASM, while differing on key dimensions to address the unique nature of management relative to other business data. This paper provides detail on the mail strategy pursued for the MOPS, the collection methods for paper and electronic responses, the processing and estimation procedures, and the official Census Bureau data releases. This detail is useful for all those who have interest in using the MOPS for research purposes, those wishing to understand the MOPS data more deeply, and those with an interest in survey methodology.
    View Full Paper PDF
  • Working Paper

    Disclosure Avoidance Techniques Used for the 1970 through 2010 Decennial Censuses of Population and Housing

    November 2018

    Authors: Laura McKenna

    Working Paper Number:

    CES-18-47

    The U.S. Census Bureau conducts the decennial censuses under Title 13 of the U. S. Code with the Section 9 mandate to not 'use the information furnished under the provisions of this title for any purpose other than the statistical purposes for which it is supplied; or make any publication whereby the data furnished by any particular establishment or individual under this title can be identified; or permit anyone other than the sworn officers and employees of the Department or bureau or agency thereof to examine the individual reports (13 U.S.C. ' 9 (2007)).' The Census Bureau applies disclosure avoidance techniques to its publicly released statistical products in order to protect the confidentiality of its respondents and their data.
    View Full Paper PDF
  • Working Paper

    Squeezing More Out of Your Data: Business Record Linkage with Python

    November 2018

    Working Paper Number:

    CES-18-46

    Integrating data from different sources has become a fundamental component of modern data analytics. Record linkage methods represent an important class of tools for accomplishing such integration. In the absence of common disambiguated identifiers, researchers often must resort to ''fuzzy" matching, which allows imprecision in the characteristics used to identify common entities across dfferent datasets. While the record linkage literature has identified numerous individually useful fuzzy matching techniques, there is little consensus on a way to integrate those techniques within a single framework. To this end, we introduce the Multiple Algorithm Matching for Better Analytics (MAMBA), an easy-to-use, flexible, scalable, and transparent software platform for business record linkage applications using Census microdata. MAMBA leverages multiple string comparators to assess the similarity of records using a machine learning algorithm to disambiguate matches. This software represents a transparent tool for researchers seeking to link external business data to the Census Business Register files.
    View Full Paper PDF
  • Working Paper

    Foreign-Born and Native-Born Migration in the U.S.: Evidence from IRS Administrative and Census Survey Records

    July 2018

    Working Paper Number:

    carra-2018-07

    This paper details efforts to link administrative records from the Internal Revenue Service (IRS) to American Community Survey (ACS) and 2010 Census microdata for the study of migration among foreign-born and native-born populations in the United States. Specifically, we (1) document our linkage strategy and methodology for inferring migration in IRS records; (2) model selection into and survival across IRS records to determine suitability for research applications; and (3) gauge the efficacy of the IRS records by demonstrating how they can be used to validate and potentially improve migration responses for native-born and foreign-born respondents in ACS microdata. Our results show little evidence of selection or survival bias in the IRS records, suggesting broad generalizability to the nation as a whole. Moreover, we find that the combined IRS 1040, 1099, and W2 records may provide important information on populations, such as the foreign-born, that may be difficult to reach with traditional Census Bureau surveys. Finally, while preliminary, the results of our comparison of IRS and ACS migration responses shows that IRS records may be useful in improving ACS migration measurement for respondents whose migration response is proxy, allocated, or imputed. Taking these results together, we discuss the potential application of our longitudinal IRS dataset to innovations in migration research on both the native-born and foreign-born populations of the United States.
    View Full Paper PDF
  • Working Paper

    The Opportunities and Challenges of Linked IRS Administrative and Census Survey Records in the Study of Migration

    July 2018

    Working Paper Number:

    carra-2018-06

    This paper details efforts to link administrative records from the Internal Revenue Service (IRS) to American Community Survey (ACS) and 2010 Census microdata for the study of migration in the United States. Specifically, we (1) document our linkage strategy and methodology for inferring migration in IRS records; (2) model selection into and survival across IRS records to determine suitability for research applications; and (3) gauge the efficacy of the IRS records by demonstrating how they can be used to validate and potentially improve migration responses in ACS microdata. Our results show little evidence of selection or survival bias in the IRS records, suggesting broad generalizability to the nation as a whole. Moreover, we find that the combined IRS 1040, 1099, and W2 records may provide important information on populations that are hard to reach with traditional Census surveys. Finally, while preliminary, the results of our comparison of IRS and ACS migration responses shows that IRS records may be useful in improving ACS migration measurement for respondents whose migration response is proxy, allocated, or imputed. Taking these results together, we discuss the potential applications of our longitudinal IRS dataset to innovations in migration research in the United States.
    View Full Paper PDF
  • Working Paper

    The Use of Administrative Records and the American Community Survey to Study the Characteristics of Undercounted Young Children in the 2010 Census

    May 2018

    Working Paper Number:

    carra-2018-05

    Children under age five are historically one of the most difficult segments of the population to enumerate in the U.S. decennial census. The persistent undercount of young children is highest among Hispanics and racial minorities. In this study, we link 2010 Census data to administrative records from government and third party data sources, such as Medicaid enrollment data and tenant rental assistance program records from the Department of Housing and Urban Development, to identify differences between children reported and not reported in the 2010 Census. In addition, we link children in administrative records to the American Community Survey to identify various characteristics of households with children under age five who may have been missed in the last census. This research contributes to what is known about the demographic, socioeconomic, and household characteristics of young children undercounted by the census. Our research also informs the potential benefits of using administrative records and surveys to supplement the U.S. Census Bureau child population enumeration efforts in future decennial censuses.
    View Full Paper PDF
  • Working Paper

    Disclosure Limitation and Confidentiality Protection in Linked Data

    January 2018

    Working Paper Number:

    CES-18-07

    Confidentiality protection for linked administrative data is a combination of access modalities and statistical disclosure limitation. We review traditional statistical disclosure limitation methods and newer methods based on synthetic data, input noise infusion and formal privacy. We discuss how these methods are integrated with access modalities by providing three detailed examples. The first example is the linkages in the Health and Retirement Study to Social Security Administration data. The second example is the linkage of the Survey of Income and Program Participation to administrative data from the Internal Revenue Service and the Social Security Administration. The third example is the Longitudinal Employer-Household Dynamics data, which links state unemployment insurance records for workers and firms to a wide variety of censuses and surveys at the U.S. Census Bureau. For examples, we discuss access modalities, disclosure limitation methods, the effectiveness of those methods, and the resulting analytical validity. The final sections discuss recent advances in access modalities for linked administrative data.
    View Full Paper PDF