CREAT: Census Research Exploration and Analysis Tool

Papers Containing Keywords(s): 'census bureau'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

American Community Survey - 42

Current Population Survey - 40

Internal Revenue Service - 37

Protected Identification Key - 32

Census Bureau Disclosure Review Board - 30

Longitudinal Employer Household Dynamics - 29

Social Security Administration - 27

National Science Foundation - 26

Center for Economic Studies - 25

Bureau of Labor Statistics - 25

Cornell University - 24

Social Security Number - 23

Employer Identification Numbers - 21

Person Validation System - 21

North American Industry Classification System - 19

Decennial Census - 19

Business Register - 18

2010 Census - 18

Service Annual Survey - 16

Economic Census - 16

Disclosure Review Board - 15

Social Security - 15

Research Data Center - 15

Quarterly Census of Employment and Wages - 14

Master Address File - 14

Survey of Income and Program Participation - 14

Alfred P Sloan Foundation - 14

Standard Industrial Classification - 14

Longitudinal Business Database - 13

Federal Statistical Research Data Center - 13

Census Bureau Business Register - 13

Personally Identifiable Information - 13

Person Identification Validation System - 12

Quarterly Workforce Indicators - 12

Office of Management and Budget - 11

Annual Survey of Manufactures - 10

Standard Statistical Establishment List - 10

Unemployment Insurance - 10

1940 Census - 9

Local Employment Dynamics - 9

Center for Administrative Records Research and Applications - 9

Administrative Records - 9

Department of Housing and Urban Development - 9

Computer Assisted Personal Interview - 9

MAFID - 8

LEHD Program - 8

Ordinary Least Squares - 8

National Institute on Aging - 8

Cornell Institute for Social and Economic Research - 8

National Opinion Research Center - 7

Department of Labor - 7

National Center for Health Statistics - 7

Business Dynamics Statistics - 7

Individual Taxpayer Identification Numbers - 7

SSA Numident - 7

Metropolitan Statistical Area - 7

Bureau of Economic Analysis - 7

Computer Assisted Telephone Interviews and Computer Assisted Personal Interviews - 6

Social and Economic Supplement - 6

Some Other Race - 6

National Longitudinal Survey of Youth - 6

Postal Service - 6

Supplemental Nutrition Assistance Program - 6

Housing and Urban Development - 6

DOB - 6

Indian Health Service - 6

Adjusted Gross Income - 5

Company Organization Survey - 5

Individual Characteristics File - 5

Census Edited File - 5

County Business Patterns - 5

Composite Person Record - 5

Financial, Insurance and Real Estate Industries - 5

Federal Tax Information - 5

National Bureau of Economic Research - 5

W-2 - 5

Center for Administrative Records Research - 5

Census Numident - 5

Census Bureau Person Identification Validation System - 5

CATI - 5

Public Use Micro Sample - 5

Department of Health and Human Services - 5

Special Sworn Status - 5

Business Master File - 5

Census 2000 - 5

Medicaid Services - 5

Department of Education - 4

National Academy of Sciences - 4

Health and Retirement Study - 4

Census of Manufactures - 4

Annual Business Survey - 4

Department of Homeland Security - 4

United States Census Bureau - 4

Employment History File - 4

Employer Characteristics File - 4

CDF - 4

Office of Personnel Management - 4

Cumulative Density Function - 4

Earned Income Tax Credit - 4

Data Management System - 4

Census Bureau Master Address File - 4

Temporary Assistance for Needy Families - 4

Census Household Composition Key - 4

Statistics Canada - 4

Bureau of Labor - 4

American Housing Survey - 4

Core Based Statistical Area - 4

Business Register Bridge - 4

North American Industry Classi - 4

Chicago Census Research Data Center - 4

PIKed - 4

Agency for Healthcare Research and Quality - 4

Stanford University - 3

Securities and Exchange Commission - 3

Occupational Employment Statistics - 3

MAF-ARF - 3

Accommodation and Food Services - 3

COVID - 3

Sloan Foundation - 3

Social Science Research Institute - 3

Indian Housing Information Center - 3

American Economic Association - 3

Federal Reserve System - 3

Retail Trade - 3

Survey of Business Owners - 3

University of Maryland - 3

American Economic Review - 3

Journal of Labor Economics - 3

Business Employment Dynamics - 3

Probability Density Function - 3

Successor Predecessor File - 3

Centers for Medicare - 3

Federal Reserve Bank - 3

General Accounting Office - 3

PSID - 3

Establishment Micro Properties - 3

Urban Institute - 3

Permanent Plant Number - 3

Wholesale Trade - 3

AKM - 3

survey - 60

population - 50

respondent - 44

census data - 38

data census - 36

data - 36

statistical - 28

agency - 24

report - 22

workforce - 22

record - 21

employed - 21

research census - 21

use census - 20

estimating - 20

labor - 19

economic census - 19

2010 census - 18

census research - 18

census survey - 17

hispanic - 16

employ - 16

microdata - 16

ethnicity - 15

employee - 15

census employment - 14

resident - 14

datasets - 14

minority - 13

longitudinal - 13

payroll - 12

coverage - 12

prevalence - 11

household surveys - 11

employer household - 11

aging - 11

ethnic - 10

percentile - 10

paper census - 10

enrollment - 10

trend - 10

disclosure - 10

earnings - 10

census use - 10

information census - 9

department - 9

expenditure - 9

work census - 9

censuses surveys - 9

provided census - 9

population survey - 9

records census - 9

recession - 9

worker - 9

analysis - 8

poverty - 8

assessed - 8

estimation - 8

immigrant - 8

residential - 8

census records - 8

family - 8

federal - 8

census years - 8

citizen - 8

surveys censuses - 8

matching - 8

race - 8

race census - 8

longitudinal employer - 8

study - 7

disparity - 7

1040 - 7

census disclosure - 7

irs - 7

average - 7

sampling - 7

identifier - 7

census responses - 7

employment statistics - 7

employee data - 7

census 2020 - 7

salary - 7

sector - 7

residence - 7

linked census - 7

linkage - 7

imputation - 7

metropolitan - 7

employment dynamics - 7

state - 6

database - 6

occupation - 6

labor statistics - 6

employment data - 6

aggregate - 6

information - 6

disadvantaged - 6

privacy - 6

public - 6

census linked - 6

medicaid - 6

racial - 6

census business - 6

econometric - 6

research - 6

hiring - 6

workplace - 6

census file - 6

worker demographics - 6

job - 6

socioeconomic - 5

assessing - 5

survey data - 5

earner - 5

survey income - 5

taxpayer - 5

child - 5

unemployed - 5

confidentiality - 5

publicly - 5

neighborhood - 5

quarterly - 5

economist - 5

yearly - 5

researcher - 5

clerical - 5

ancestry - 5

individuals census - 5

statistician - 5

eligibility - 4

enrolled - 4

enrollee - 4

income data - 4

incorporated - 4

ssa - 4

migration - 4

tax - 4

census household - 4

health - 4

policymakers - 4

insurance - 4

gdp - 4

healthcare - 4

immigration - 4

market - 4

establishment - 4

associate - 4

tenure - 4

matched - 4

eligible - 3

sample - 3

company - 3

revenue - 3

residing - 3

decade - 3

estimator - 3

migrant - 3

income individuals - 3

parent - 3

dependent - 3

household income - 3

income households - 3

environmental - 3

geographic - 3

impact - 3

survey households - 3

pandemic - 3

parental - 3

adoption - 3

bias - 3

black - 3

econometrician - 3

finance - 3

firms census - 3

statistical disclosure - 3

layoff - 3

welfare - 3

discrimination - 3

businesses census - 3

empirical - 3

Viewing papers 21 through 30 of 93


  • Working Paper

    The Changing Nature of Pollution, Income, and Environmental Inequality in the United States

    January 2024

    Working Paper Number:

    CES-24-04

    This paper uses administrative tax records linked to Census demographic data and high-resolution measures of fine small particulate (PM2.5) exposure to study the evolution of the Black-White pollution exposure gap over the past 40 years. In doing so, we focus on the various ways in which income may have contributed to these changes using a statistical decomposition. We decompose the overall change in the Black-White PM2.5 exposure gap into (1) components that stem from rank-preserving compression in the overall pollution distribution and (2) changes that stem from a reordering of Black and White households within the pollution distribution. We find a significant narrowing of the Black-White PM2.5 exposure gap over this time period that is overwhelmingly driven by rank-preserving changes rather than positional changes. However, the relative positions of Black and White households at the upper end of the pollution distribution have meaningfully shifted in the most recent years.
    View Full Paper PDF
  • Working Paper

    Incorporating Administrative Data in Survey Weights for the Basic Monthly Current Population Survey

    January 2024

    Working Paper Number:

    CES-24-02

    Response rates to the Current Population Survey (CPS) have declined over time, raising the potential for nonresponse bias in key population statistics. A potential solution is to leverage administrative data from government agencies and third-party data providers when constructing survey weights. In this paper, we take two approaches. First, we use administrative data to build a non-parametric nonresponse adjustment step while leaving the calibration to population estimates unchanged. Second, we use administratively linked data in the calibration process, matching income data from the Internal Return Service and state agencies, demographic data from the Social Security Administration and the decennial census, and industry data from the Census Bureau's Business Register to both responding and nonresponding households. We use the matched data in the household nonresponse adjustment of the CPS weighting algorithm, which changes the weights of respondents to account for differential nonresponse rates among subpopulations. After running the experimental weighting algorithm, we compare estimates of the unemployment rate and labor force participation rate between the experimental weights and the production weights. Before March 2020, estimates of the labor force participation rates using the experimental weights are 0.2 percentage points higher than the original estimates, with minimal effect on unemployment rate. After March 2020, the new labor force participation rates are similar, but the unemployment rate is about 0.2 percentage points higher in some months during the height of COVID-related interviewing restrictions. These results are suggestive that if there is any nonresponse bias present in the CPS, the magnitude is comparable to the typical margin of error of the unemployment rate estimate. Additionally, the results are overall similar across demographic groups and states, as well as using alternative weighting methodology. Finally, we discuss how our estimates compare to those from earlier papers that calculate estimates of bias in key CPS labor force statistics. This paper is for research purposes only. No changes to production are being implemented at this time.
    View Full Paper PDF
  • Working Paper

    A Simulated Reconstruction and Reidentification Attack on the 2010 U.S. Census: Full Technical Report

    December 2023

    Working Paper Number:

    CES-23-63R

    For the last half-century, it has been a common and accepted practice for statistical agencies, including the United States Census Bureau, to adopt different strategies to protect the confidentiality of aggregate tabular data products from those used to protect the individual records contained in publicly released microdata products. This strategy was premised on the assumption that the aggregation used to generate tabular data products made the resulting statistics inherently less disclosive than the microdata from which they were tabulated. Consistent with this common assumption, the 2010 Census of Population and Housing in the U.S. used different disclosure limitation rules for its tabular and microdata publications. This paper demonstrates that, in the context of disclosure limitation for the 2010 Census, the assumption that tabular data are inherently less disclosive than their underlying microdata is fundamentally flawed. The 2010 Census published more than 150 billion aggregate statistics in 180 table sets. Most of these tables were published at the most detailed geographic level'individual census blocks, which can have populations as small as one person. Using only 34 of the published table sets, we reconstructed microdata records including five variables (census block, sex, age, race, and ethnicity) from the confidential 2010 Census person records. Using only published data, an attacker using our methods can verify that all records in 70% of all census blocks (97 million people) are perfectly reconstructed. We further confirm, through reidentification studies, that an attacker can, within census blocks with perfect reconstruction accuracy, correctly infer the actual census response on race and ethnicity for 3.4 million vulnerable population uniques (persons with race and ethnicity different from the modal person on the census block) with 95% accuracy. Having shown the vulnerabilities inherent to the disclosure limitation methods used for the 2010 Census, we proceed to demonstrate that the more robust disclosure limitation framework used for the 2020 Census publications defends against attacks that are based on reconstruction. Finally, we show that available alternatives to the 2020 Census Disclosure Avoidance System would either fail to protect confidentiality, or would overly degrade the statistics' utility for the primary statutory use case: redrawing the boundaries of all of the nation's legislative and voting districts in compliance with the 1965 Voting Rights Act. You are reading the full technical report. For the summary paper see https://doi.org/10.1162/99608f92.4a1ebf70.
    View Full Paper PDF
  • Working Paper

    The 2010 Census Confidentiality Protections Failed, Here's How and Why

    December 2023

    Working Paper Number:

    CES-23-63

    Using only 34 published tables, we reconstruct five variables (census block, sex, age, race, and ethnicity) in the confidential 2010 Census person records. Using the 38-bin age variable tabulated at the census block level, at most 20.1% of reconstructed records can differ from their confidential source on even a single value for these five variables. Using only published data, an attacker can verify that all records in 70% of all census blocks (97 million people) are perfectly reconstructed. The tabular publications in Summary File 1 thus have prohibited disclosure risk similar to the unreleased confidential microdata. Reidentification studies confirm that an attacker can, within blocks with perfect reconstruction accuracy, correctly infer the actual census response on race and ethnicity for 3.4 million vulnerable population uniques (persons with nonmodal characteristics) with 95% accuracy, the same precision as the confidential data achieve and far greater than statistical baselines. The flaw in the 2010 Census framework was the assumption that aggregation prevented accurate microdata reconstruction, justifying weaker disclosure limitation methods than were applied to 2010 Census public microdata. The framework used for 2020 Census publications defends against attacks that are based on reconstruction, as we also demonstrate here. Finally, we show that alternatives to the 2020 Census Disclosure Avoidance System with similar accuracy (enhanced swapping) also fail to protect confidentiality, and those that partially defend against reconstruction attacks (incomplete suppression implementations) destroy the primary statutory use case: data for redistricting all legislatures in the country in compliance with the 1965 Voting Rights Act.
    View Full Paper PDF
  • Working Paper

    Coverage of Children in the American Community Survey Based on California Birth Records

    September 2023

    Authors: Gloria G. Aldana

    Working Paper Number:

    CES-23-46

    The U.S. Census Bureau's American Community Survey (ACS) collects information on individuals and households. The ACS provides survey-based estimates of children drawn from a sample of the U.S. population. However, survey responses may not match administrative records, such as birth records. Birth records should provide a complete account of all births, along with child-parent relationships and demographic characteristics. California is a state that has both a large population of children and a high undercount for young children. This paper uses California as a case study to examine differences between reported versus unreported children in the ACS based on state birth records. Child reporting rates were lower for more recent data years, younger children, for Black and Hispanic mothers, and for more complex households. Child reporting rates were higher for more educated mothers and for households above the poverty line. Using mother's race and Hispanic ethnicity from the birth records combined with poverty indices from the ACS, this analysis also finds that child reporting does not uniformly vary with poverty status across all race and ethnicity groups. This research builds support for the utility of state birth records in analyzing the undercount of children.
    View Full Paper PDF
  • Working Paper

    Using Small-Area Estimation (SAE) to Estimate Prevalence of Child Health Outcomes at the Census Regional-, State-, and County-Levels

    November 2022

    Working Paper Number:

    CES-22-48

    In this study, we implement small-area estimation to assess the prevalence of child health outcomes at the county, state, and regional levels, using national survey data.
    View Full Paper PDF
  • Working Paper

    The Impact of Household Surveys on 2020 Census Self-Response

    July 2022

    Working Paper Number:

    CES-22-24

    Households who were sampled in 2019 for the American Community Survey (ACS) had lower self-response rates to the 2020 Census. The magnitude varied from -1.5 percentage point for household sampled in January 2019 to -15.1 percent point for households sampled in December 2019. Similar effects are found for the Current Population Survey (CPS) as well.
    View Full Paper PDF
  • Working Paper

    Finding Needles in Haystacks: Multiple-Imputation Record Linkage Using Machine Learning

    November 2021

    Working Paper Number:

    CES-21-35

    This paper considers the problem of record linkage between a household-level survey and an establishment-level frame in the absence of unique identifiers. Linkage between frames in this setting is challenging because the distribution of employment across establishments is highly skewed. To address these difficulties, this paper develops a probabilistic record linkage methodology that combines machine learning (ML) with multiple imputation (MI). This ML-MI methodology is applied to link survey respondents in the Health and Retirement Study to their workplaces in the Census Business Register. The linked data reveal new evidence that non-sampling errors in household survey data are correlated with respondents' workplace characteristics.
    View Full Paper PDF
  • Working Paper

    Small Business Pulse Survey Estimates by Owner Characteristics and Rural/Urban Designation

    September 2021

    Working Paper Number:

    CES-21-24

    In response to requests from policymakers for additional context for Small Business Pulse Survey (SBPS) measures of the impact of COVID-19 on small businesses, we researched developing estimates by owner characteristics and rural/urban locations. Leveraging geographic coding on the Business Register, we create estimates of the effect of the pandemic on small businesses by urban and rural designations. A more challenging exercise entails linking micro-level data from the SBPS with ownership data from the Annual Business Survey (ABS) to create estimates of the effect of the pandemic on small businesses by owner race, sex, ethnicity, and veteran status. Given important differences in survey design and concerns about nonresponse bias, we face significant challenges in producing estimates for owner demographics. We discuss our attempts to meet these challenges and provide discussion about caution that must be used in interpreting the results. The estimates produced for this paper are available for download. Reflecting the Census Bureau's commitment to scientific inquiry and transparency, the micro data from the SBPS will be available to qualified researchers on approved projects in the Federal Statistical Research Data Center network.
    View Full Paper PDF
  • Working Paper

    Measuring the Impact of COVID-19 on Businesses and People: Lessons from the Census Bureau's Experience

    January 2021

    Working Paper Number:

    CES-21-02

    We provide an overview of Census Bureau activities to enhance the consistency, timeliness, and relevance of our data products in response to the COVID-19 pandemic. We highlight new data products designed to provide timely and granular information on the pandemic's impact: the Small Business Pulse Survey, weekly Business Formation Statistics, the Household Pulse Survey, and Community Resilience Estimates. We describe pandemic-related content introduced to existing surveys such as the Annual Business Survey and the Current Population Survey. We discuss adaptations to ensure the continuity and consistency of existing data products such as principal economic indicators and the American Community Survey.
    View Full Paper PDF