CREAT: Census Research Exploration and Analysis Tool

Papers Containing Keywords(s): 'data census'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

Internal Revenue Service - 30

American Community Survey - 27

Social Security Administration - 25

Current Population Survey - 24

Center for Economic Studies - 22

National Science Foundation - 20

Protected Identification Key - 19

Social Security Number - 18

Bureau of Labor Statistics - 18

Service Annual Survey - 18

Longitudinal Employer Household Dynamics - 18

Research Data Center - 17

Master Address File - 16

Census Bureau Disclosure Review Board - 16

Decennial Census - 16

Business Register - 16

North American Industry Classification System - 16

Disclosure Review Board - 15

Employer Identification Numbers - 15

Survey of Income and Program Participation - 14

Standard Industrial Classification - 14

Cornell University - 14

Social Security - 13

Census Bureau Business Register - 13

Longitudinal Business Database - 13

2010 Census - 12

Person Validation System - 12

Federal Statistical Research Data Center - 12

Annual Survey of Manufactures - 12

Standard Statistical Establishment List - 11

Economic Census - 11

Housing and Urban Development - 10

Quarterly Workforce Indicators - 10

Alfred P Sloan Foundation - 9

Department of Housing and Urban Development - 8

Metropolitan Statistical Area - 8

Quarterly Census of Employment and Wages - 8

American Housing Survey - 8

Supplemental Nutrition Assistance Program - 7

Census Numident - 7

Person Identification Validation System - 7

Computer Assisted Personal Interview - 7

Business Dynamics Statistics - 7

National Opinion Research Center - 7

Individual Taxpayer Identification Numbers - 6

Administrative Records - 6

Longitudinal Research Database - 6

Indian Health Service - 6

Unemployment Insurance - 6

Local Employment Dynamics - 6

Census 2000 - 6

Center for Administrative Records Research and Applications - 6

Medicaid Services - 5

MAFID - 5

Census Bureau Person Identification Validation System - 5

SSA Numident - 5

Geographic Information Systems - 5

Business Employment Dynamics - 5

Federal Reserve Bank - 5

Federal Tax Information - 5

Census of Manufactures - 5

American Statistical Association - 5

Bureau of Economic Analysis - 5

Permanent Plant Number - 5

Agency for Healthcare Research and Quality - 5

Federal Reserve System - 4

National Institute on Aging - 4

County Business Patterns - 4

Company Organization Survey - 4

Cornell Institute for Social and Economic Research - 4

PIKed - 4

Indian Housing Information Center - 4

Personally Identifiable Information - 4

National Bureau of Economic Research - 4

University of Chicago - 4

Postal Service - 4

Probability Density Function - 4

American Economic Association - 4

Business Master File - 4

Employment History File - 4

Employer Characteristics File - 4

Individual Characteristics File - 4

Core Based Statistical Area - 4

Business Register Bridge - 4

Successor Predecessor File - 4

Chicago Census Research Data Center - 4

Census Bureau Longitudinal Business Database - 4

Computer Assisted Telephone Interviews and Computer Assisted Personal Interviews - 4

CATI - 4

Some Other Race - 4

Establishment Micro Properties - 4

Department of Agriculture - 3

Centers for Medicare - 3

1940 Census - 3

Census Bureau Master Address File - 3

W-2 - 3

Temporary Assistance for Needy Families - 3

Accommodation and Food Services - 3

Social Science Research Institute - 3

MAF-ARF - 3

Ordinary Least Squares - 3

Health and Retirement Study - 3

Characteristics of Business Owners - 3

Retail Trade - 3

Small Business Administration - 3

Department of Homeland Security - 3

Special Sworn Status - 3

Sloan Foundation - 3

Wholesale Trade - 3

University of Maryland - 3

Bureau of Labor - 3

Journal of Labor Economics - 3

Composite Person Record - 3

North American Industry Classi - 3

Duke University - 3

Total Factor Productivity - 3

Office of Management and Budget - 3

CDF - 3

Cumulative Density Function - 3

Medical Expenditure Panel Survey - 3

Financial, Insurance and Real Estate Industries - 3

Department of Labor - 3

census bureau - 34

survey - 33

census data - 32

data - 31

respondent - 30

population - 25

agency - 20

microdata - 19

report - 18

statistical - 18

datasets - 16

record - 16

use census - 15

census survey - 14

estimating - 13

census research - 13

research census - 13

employed - 11

resident - 10

economic census - 10

employee - 9

workforce - 9

database - 9

statistician - 9

aggregate - 9

coverage - 8

census employment - 8

censuses surveys - 8

payroll - 8

researcher - 8

disclosure - 8

assessed - 7

information census - 7

recession - 7

quarterly - 7

longitudinal - 7

sector - 7

employ - 7

study - 7

labor - 7

linked census - 6

census years - 6

residential - 6

provided census - 6

estimation - 6

confidentiality - 6

2010 census - 6

econometric - 6

work census - 6

yearly - 6

ethnicity - 6

census file - 6

race census - 6

matching - 6

sampling - 5

household surveys - 5

disparity - 5

minority - 5

citizen - 5

survey data - 5

assessing - 5

privacy - 5

neighborhood - 5

census records - 5

imputation - 5

census business - 5

metropolitan - 5

employment data - 5

business data - 5

records census - 5

employment statistics - 5

research - 5

hispanic - 5

race - 5

expenditure - 5

ssa - 4

prevalence - 4

population survey - 4

estimator - 4

housing - 4

linkage - 4

enterprise - 4

census use - 4

macroeconomic - 4

earnings - 4

geography - 4

geographic - 4

surveys censuses - 4

reporting - 4

information - 4

publicly - 4

department - 4

worker - 4

employer household - 4

employee data - 4

ethnic - 4

census responses - 4

analysis - 4

aggregation - 4

identifier - 4

federal - 4

aging - 4

economist - 4

trend - 4

sample - 3

survey households - 3

medicaid - 3

impact - 3

amenity - 3

census linked - 3

survey income - 3

incorporated - 3

businesses census - 3

salary - 3

workforce indicators - 3

geographically - 3

establishment - 3

public - 3

workplace - 3

employment dynamics - 3

clerical - 3

worker demographics - 3

longitudinal employer - 3

white - 3

racial - 3

irs - 3

bias - 3

enrollment - 3

job - 3

average - 3

labor statistics - 3

Viewing papers 21 through 30 of 56


  • Working Paper

    Evaluating the Use of Commercial Data to Improve Survey Estimates of Property Taxes

    August 2016

    Working Paper Number:

    carra-2016-06

    While commercial data sources offer promise to statistical agencies for use in production of official statistics, challenges can arise as the data are not collected for statistical purposes. This paper evaluates the use of 2008-2010 property tax data from CoreLogic, Inc. (CoreLogic), aggregated from county and township governments from around the country, to improve 2010 American Community Survey (ACS) estimates of property tax amounts for single-family homes. Particularly, the research evaluates the potential to use CoreLogic to reduce respondent burden, to study survey response error and to improve adjustments for survey nonresponse. The research found that the coverage of the CoreLogic data varies between counties as does the correspondence between ACS and CoreLogic property taxes. This geographic variation implies that different approaches toward using CoreLogic are needed in different areas of the country. Further, large differences between CoreLogic and ACS property taxes in certain counties seem to be due to conceptual differences between what is collected in the two data sources. The research examines three counties, Clark County, NV, Philadelphia County, PA and St. Louis County, MO, and compares how estimates would change with different approaches using the CoreLogic data. Mean county property tax estimates are highly sensitive to whether ACS or CoreLogic data are used to construct estimates. Using CoreLogic data in imputation modeling for nonresponse adjustment of ACS estimates modestly improves the predictive power of imputation models, although estimates of county property taxes and property taxes by mortgage status are not very sensitive to the imputation method.
    View Full Paper PDF
  • Working Paper

    Using Partially Synthetic Microdata to Protect Sensitive Cells in Business Statistics

    February 2016

    Working Paper Number:

    CES-16-10

    We describe and analyze a method that blends records from both observed and synthetic microdata into public-use tabulations on establishment statistics. The resulting tables use synthetic data only in potentially sensitive cells. We describe different algorithms, and present preliminary results when applied to the Census Bureau's Business Dynamics Statistics and Synthetic Longitudinal Business Database, highlighting accuracy and protection afforded by the method when compared to existing public-use tabulations (with suppressions).
    View Full Paper PDF
  • Working Paper

    Measuring Cross-Country Differences in Misallocation

    January 2016

    Working Paper Number:

    CES-16-50R

    We describe differences between the commonly used version of the U.S. Census of Manufactures available at the RDCs and what establishments themselves report. The originally reported data has substantially more dispersion in measured establishment productivity. Measured allocative efficiency is substantially higher in the cleaned data than the raw data: 4x higher in 2002, 20x in 2007, and 80x in 2012. Many of the important editing strategies at the Census, including industry analysts' manual edits and edits using tax records, are infeasible in non-U.S. datasets. We describe a new Bayesian approach for editing and imputation that can be used across contexts.
    View Full Paper PDF
  • Working Paper

    When Race and Hispanic Origin Reporting are Discrepant Across Administrative Records and Third Party Sources: Exploring Methods to Assign Responses

    December 2015

    Working Paper Number:

    carra-2015-08

    The U.S. Census Bureau is researching uses of administrative records and third party data in survey and decennial census operations. One potential use of administrative records is to utilize these data when race and Hispanic origin responses are missing. When federal and third party administrative records are compiled, race and Hispanic origin responses are not always the same for an individual across sources. We explore different methods to assign one race and one Hispanic response when these responses are discrepant. We also describe the characteristics of individuals with matching, non-matching, and missing race and Hispanic origin data by demographic, household, and contextual variables. We find that minorities, especially Hispanics, are more likely to have non-matching Hispanic origin and race responses in administrative records and third party data compared to the 2010 Census. Minority groups and individuals ages 0-17 are more likely to have missing race or Hispanic origin data in administrative records and third party data. Larger households tend to have more missing race data in administrative records and third party data than smaller households.
    View Full Paper PDF
  • Working Paper

    Assessing Coverage and Quality of the 2007 Prototype Census Kidlink Database

    September 2015

    Working Paper Number:

    carra-2015-07

    The Census Bureau is conducting research to expand the use of administrative records data in censuses and surveys to decrease respondent burden and reduce costs while improving data quality. Much of this research (e.g., Rastogi and O''Hara (2012), Luque and Bhaskar (2014)) hinges on the ability to integrate multiple data sources by linking individuals across files. One of the Census Bureau's record linkage methodologies for data integration is the Person Identification Validation System or PVS. PVS assigns anonymous and unique IDs (Protected Identification Keys or PIKs) that serve as linkage keys across files. Prior research showed that integrating 'known associates' information into PVS's reference files could potentially enhance PVS's PIK assignment rates. The term 'known associates' refers to people that are likely to be associated with each other because of a known common link (such as family relationships or people sharing a common address), and thus, to be observed together in different files. One of the results from this prior research was the creation of the 2007 Census Kidlink file, a child-level file linking a child's Social Security Number (SSN) record to the SSN of those identified as the child's parents. In this paper, we examine to what extent the 2007 Census Kidlink methodology was able to link parents SSNs to children SSN records, and also evaluate the quality of those links. We find that in approximately 80 percent of cases, at least one parent was linked to the child's record. Younger children and noncitizens have a higher percentage of cases where neither parent could be linked to the child. Using 2007 tax data as a benchmark, our quality evaluation results indicate that in at least 90 percent of the cases, the parent-child link agreed with those found in the tax data. Based on our findings, we propose improvements to the 2007 Kidlink methodology to increase child-parent links, and discuss how the creation of the file could be operationalized moving forward.
    View Full Paper PDF
  • Working Paper

    Estimation and Inference in Regression Discontinuity Designs with Clustered Sampling

    August 2015

    Working Paper Number:

    carra-2015-06

    Regression Discontinuity (RD) designs have become popular in empirical studies due to their attractive properties for estimating causal effects under transparent assumptions. Nonetheless, most popular procedures assume i.i.d. data, which is not reasonable in many common applications. To relax this assumption, we derive the properties of traditional non-parametric estimators in a setting that incorporates potential clustering at the level of the running variable, and propose an accompanying optimal-MSE bandwidth selection rule. Simulation results demonstrate that falsely assuming data are i.i.d. when selecting the bandwidth may lead to the choice of bandwidths that are too small relative to the optimal-MSE bandwidth. Last, we apply our procedure using person-level microdata that exhibits clustering at the census tract level to analyze the impact of the Low-Income Housing Tax Credit program on neighborhood characteristics and low-income housing supply.
    View Full Paper PDF
  • Working Paper

    Exploring Administrative Records Use for Race and Hispanic Origin Item Non-Response

    December 2014

    Working Paper Number:

    carra-2014-16

    Race and Hispanic origin data are required to produce official statistics in the United States. Data collected through the American Community Survey and decennial census address missing data through traditional imputation methods, often relying on information from neighbors. These methods work well if neighbors share similar characteristics, however, the shape and patterns of neighborhoods in the United States are changing. Administrative records may provide more accurate data compared to traditional imputation methods for missing race and Hispanic origin responses. This paper first describes the characteristics of persons with missing demographic data, then assesses the coverage of administrative records data for respondents who do not answer race and Hispanic origin questions in Census data. The paper also discusses the distributional impact of using administrative records race and Hispanic origin data to complete missing responses in a decennial census or survey context.
    View Full Paper PDF
  • Working Paper

    Coverage and Agreement of Administrative Records and 2010 American Community Survey Demographic Data

    November 2014

    Working Paper Number:

    carra-2014-14

    The U.S. Census Bureau is researching possible uses of administrative records in decennial census and survey operations. The 2010 Census Match Study and American Community Survey (ACS) Match Study represent recent efforts by the Census Bureau to evaluate the extent to which administrative records provide data on persons and addresses in the 2010 Census and 2010 ACS. The 2010 Census Match Study also examines demographic response data collected in administrative records. Building on this analysis, we match data from the 2010 ACS to federal administrative records and third party data as well as to previous census data and examine administrative records coverage and agreement of ACS age, sex, race, and Hispanic origin responses. We find high levels of coverage and agreement for sex and age responses and variable coverage and agreement across race and Hispanic origin groups. These results are similar to findings from the 2010 Census Match Study.
    View Full Paper PDF
  • Working Paper

    JOB-TO-JOB (J2J) Flows: New Labor Market Statistics From Linked Employer-Employee Data

    September 2014

    Working Paper Number:

    CES-14-34

    Flows of workers across jobs are a principal mechanism by which labor markets allocate workers to optimize productivity. While these job flows are both large and economically important, they represent a significant gap in available economic statistics. A soon to be released data product from the U.S. Census Bureau will fill this gap. The Job-to-Job (J2J) flow statistics provide estimates of worker flows across jobs, across different geographic labor markets, by worker and firm characteristics, including direct job-to-job flows as well as job changes with intervening nonemployment. In this paper, we describe the creation of the public-use data product on job-to-job flows. The data underlying the statistics are the matched employer-employee data from the U.S. Census Bureau's Longitudinal Employer-Household Dynamics program. We describe definitional issues and the identification strategy for tracing worker movements between employers in administrative data. We then compare our data with related series and discuss similarities and differences. Lastly, we describe disclosure avoidance techniques for the public use file, and our methodology for estimating national statistics when there is partially missing geography.
    View Full Paper PDF
  • Working Paper

    2010 American Community Survey Match Study

    July 2014

    Working Paper Number:

    carra-2014-03

    Using administrative records data from federal government agencies and commercial sources, the 2010 ACS Match Study measures administrative records coverage of 2010 ACS addresses, persons, and persons at addresses at different levels of geography as well as by demographic characteristics and response mode. The 2010 ACS Match Study represents a continuation of the research undertaken in the 2010 Census Match Study, the first national-level evaluation of administrative records data coverage. Preliminary results indicate that administrative records provide substantial coverage for addresses and persons in the 2010 ACS (92.7 and 92.1 percent respectively), and less extensive though substantial coverage, for person-address pairs (74.3 percent). In addition, some variation in address, person and/or person-address coverage is found across demographic and response mode groups. This research informs future uses of administrative records in survey and decennial census operations to address the increasing costs of data collection and declining response rates.
    View Full Paper PDF