CREAT: Census Research Exploration and Analysis Tool

Papers Containing Keywords(s): 'censusbureau'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

No authors occur more than twice in this search.

Viewing papers 1 through 10 of 11


  • Working Paper

    A Simulated Reconstruction and Reidentification Attack on the 2010 U.S. Census: Full Technical Report

    December 2023

    Working Paper Number:

    CES-23-63R

    For the last half-century, it has been a common and accepted practice for statistical agencies, including the United States Census Bureau, to adopt different strategies to protect the confidentiality of aggregate tabular data products from those used to protect the individual records contained in publicly released microdata products. This strategy was premised on the assumption that the aggregation used to generate tabular data products made the resulting statistics inherently less disclosive than the microdata from which they were tabulated. Consistent with this common assumption, the 2010 Census of Population and Housing in the U.S. used different disclosure limitation rules for its tabular and microdata publications. This paper demonstrates that, in the context of disclosure limitation for the 2010 Census, the assumption that tabular data are inherently less disclosive than their underlying microdata is fundamentally flawed. The 2010 Census published more than 150 billion aggregate statistics in 180 table sets. Most of these tables were published at the most detailed geographic level'individual census blocks, which can have populations as small as one person. Using only 34 of the published table sets, we reconstructed microdata records including five variables (census block, sex, age, race, and ethnicity) from the confidential 2010 Census person records. Using only published data, an attacker using our methods can verify that all records in 70% of all census blocks (97 million people) are perfectly reconstructed. We further confirm, through reidentification studies, that an attacker can, within census blocks with perfect reconstruction accuracy, correctly infer the actual census response on race and ethnicity for 3.4 million vulnerable population uniques (persons with race and ethnicity different from the modal person on the census block) with 95% accuracy. Having shown the vulnerabilities inherent to the disclosure limitation methods used for the 2010 Census, we proceed to demonstrate that the more robust disclosure limitation framework used for the 2020 Census publications defends against attacks that are based on reconstruction. Finally, we show that available alternatives to the 2020 Census Disclosure Avoidance System would either fail to protect confidentiality, or would overly degrade the statistics' utility for the primary statutory use case: redrawing the boundaries of all of the nation's legislative and voting districts in compliance with the 1965 Voting Rights Act. You are reading the full technical report. For the summary paper see https://doi.org/10.1162/99608f92.4a1ebf70.
    View Full Paper PDF
  • Working Paper

    LEHD Snapshot Documentation, Release S2021_R2022Q4

    November 2022

    Working Paper Number:

    CES-22-51

    The Longitudinal Employer-Household Dynamics (LEHD) data at the U.S. Census Bureau is a quarterly database of linked employer-employee data covering over 95% of employment in the United States. These data are used to produce a number of public-use tabulations and tools, including the Quarterly Workforce Indicators (QWI), LEHD Origin-Destination Employment Statistics (LODES), Job-to-Job Flows (J2J), and Post-Secondary Employment Outcomes (PSEO) data products. Researchers on approved projects may also access the underlying LEHD microdata directly, in the form of the LEHD Snapshot restricted-use data product. This document provides a detailed overview of the LEHD Snapshot as of release S2021_R2022Q4, including user guidance, variable codebooks, and an overview of the approvals needed to obtain access. Updates to the documentation for this and future snapshot releases will be made available in HTML format on the LEHD website.
    View Full Paper PDF
  • Working Paper

    Job-to-Job Flows and Earnings Growth*

    January 2017

    Working Paper Number:

    CES-17-08

    The U.S. workforce has had little change in real wages, income, or earnings since the year 2000. However, even when there is little change in the average rate at which workers are compensated, individual workers experienced a distribution of wage and earnings changes. In this paper, we demonstrate how earnings evolve in the U.S. economy in the years 2001-2014 on a forthcoming dataset on earnings for stayers and transitioners from the U.S. Census Bureau's Job-to-Job Flows data product to account for the role of on-the-job earnings growth, job-to-job flows, and nonemployment in the growth of U.S. earnings.
    View Full Paper PDF
  • Working Paper

    Migration Decisions in Arctic Alaska: Empirical Evidence of the Stepping Stones Hypothesis

    December 2010

    Working Paper Number:

    CES-10-41

    This paper explores hypotheses of hierarchical migration using data from the Alaskan Arctic. We focus on migration of I'upiat people, who are indigenous to the region, and explore the role of income, harvests of subsistence resources, and other place characteristics in migration decisions. To test related hypotheses we use confidential micro-data from the US Census Bureau's 2000 Decennial Census of Population and Income. Using predicted earnings and subsistence along with place invariant characteristics we generate migration probabilities using a mixed multinomial and conditional logit model. Our results support stepwise migration patterns, both up and down an urban and rural hierarchy. At the same time, we also identify differences between men and women, and we find mixed effects of place amenities and predicted earnings.
    View Full Paper PDF
  • Working Paper

    Professional Employer Organizations: What Are They, Who Uses Them and Why Should We Care?

    September 2010

    Working Paper Number:

    CES-10-22

    More and more U.S. workers are counted as employees of firms that they do not actually work for. Among such workers are those who staffed by temporary help service (THS) agencies and leased employees who are on the payroll of professional employment organizations (PEOs) but work for PEOs' client firms. While several papers study firms' use of THS services, few examine firms' use of PEO services. In this article, we summarize PEOs' business practices and examine how the intensity of their use varies across industries, geographic areas, and establishment characteristics using both public and confidential data.
    View Full Paper PDF
  • Working Paper

    Resolving the Tension Between Access and Confidentiality: Past Experience and Future Plans at the U.S. Census Bureau

    September 2009

    Working Paper Number:

    CES-09-33

    This paper provides an historical context for access to U.S. Federal statistical data with a primary focus on the U.S. Census Bureau. We review the various modes used by the Census Bureau to make data available to users, and highlight the costs and benefits associated with each. We highlight some of the specific improvements underway or under consideration at the Census Bureau to better serve its data users, as well as discuss the broad strategies employed by statistical agencies to respond to the challenges of data access.
    View Full Paper PDF
  • Working Paper

    Measuring Inequality Using Censored Data: A Multiple Imputation Approach

    April 2009

    Working Paper Number:

    CES-09-05

    To measure income inequality with right censored (topcoded) data, we propose multiple imputation for censored observations using draws from Generalized Beta of the Second Kind distributions to provide partially synthetic datasets analyzed using complete data methods. Estimation and inference uses Reiter's (Survey Methodology 2003) formulae. Using Current Population Survey (CPS) internal data, we find few statistically significant differences in income inequality for pairs of years between 1995 and 2004. We also show that using CPS public use data with cell mean imputations may lead to incorrect inferences about inequality differences. Multiply-imputed public use data provide an intermediate solution.
    View Full Paper PDF
  • Working Paper

    Spatial Influences on the Employment of U.S. Hispanics: Spatial Mismatch, Discrimination, or Immigrant Networks?

    January 2009

    Working Paper Number:

    CES-09-03

    Employment rates of Hispanic males in the United States are considerably lower than employment rates of whites. In the data used in this paper, the Hispanic male employment rate is 61 percent, compared with 83 percent for white men.1 The question of the employment disadvantage of Hispanic men likely has many parallels to the question of the employment disadvantage of black men, where factors including spatial mismatch, discrimination, and labor market networks have all received attention as contributing factors. However, the Hispanic disadvantage has been much less studied, and the goal of this paper is to bridge that gap. To that end, we present evidence that tries to assess which of the three factors listed above appears to contribute to the lower employment rate of Hispanic males. We focus in particular on immigrant Hispanics and Hispanics who do not speak English well.
    View Full Paper PDF
  • Working Paper

    Neighbors and Co-Workers: The Importance of Residential Labor Market Networks

    January 2009

    Working Paper Number:

    CES-09-01

    We specify and implement a test for the importance of network effects in determining the establishments at which people work, using recently-constructed matched employer-employee data at the establishment level. We explicitly measure the importance of network effects for groups broken out by race, ethnicity, and various measures of skill, for networks generated by residential proximity. The evidence indicates that labor market networks play an important role in hiring, more so for minorities and the less-skilled, especially among Hispanics, and that labor market networks appear to be race-based.
    View Full Paper PDF
  • Working Paper

    Lessons for Targeted Program Evaluation: A Personal and Professional History of the Survey of Program Dynamics

    August 2007

    Authors: Daniel Weinberg

    Working Paper Number:

    CES-07-24

    The Survey of Program Dynamics (SPD) was created by the 1996 welfare reform legislation to facilitate its evaluation. This paper describes the evolution of that survey, discusses its implementation, and draws lessons for future evaluation. Large-scale surveys can be an important part of a portfolio of evaluation methods, but sufficient time must be given to data collection agencies if a high-quality longitudinal survey is expected. Such a survey must have both internal (agency) and external (policy analyst) buy-in. Investments in data analysis by agency staff, downplayed in favor of larger sample sizes given a fixed budget, could have contributed to more external acceptance. More attention up-front to reducing the potentially deleterious effects of attrition in longitudinal surveys, such as through the use of monetary incentives, might have been worthwhile. Given the problems encountered by the Census Bureau in producing the SPD, I argue that ongoing multi-purpose longitudinal surveys like the Survey of Income and Program Participation are potentially more valuable than episodic special-purpose surveys.
    View Full Paper PDF