CREAT: Census Research Exploration and Analysis Tool

Papers written by Author(s): 'Matthew R. Graham'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

Viewing papers 1 through 6 of 6


  • Working Paper

    LEHD Snapshot Documentation, Release S2021_R2022Q4

    November 2022

    Working Paper Number:

    CES-22-51

    The Longitudinal Employer-Household Dynamics (LEHD) data at the U.S. Census Bureau is a quarterly database of linked employer-employee data covering over 95% of employment in the United States. These data are used to produce a number of public-use tabulations and tools, including the Quarterly Workforce Indicators (QWI), LEHD Origin-Destination Employment Statistics (LODES), Job-to-Job Flows (J2J), and Post-Secondary Employment Outcomes (PSEO) data products. Researchers on approved projects may also access the underlying LEHD microdata directly, in the form of the LEHD Snapshot restricted-use data product. This document provides a detailed overview of the LEHD Snapshot as of release S2021_R2022Q4, including user guidance, variable codebooks, and an overview of the approvals needed to obtain access. Updates to the documentation for this and future snapshot releases will be made available in HTML format on the LEHD website.
    View Full Paper PDF
  • Working Paper

    Developing a Residence Candidate File for Use With Employer-Employee Matched Data

    January 2017

    Working Paper Number:

    CES-17-40

    This paper describes the Longitudinal Employer-Household Dynamics (LEHD) program's ongoing efforts to use administrative records in a predictive model that describes residence locations for workers. This project was motivated by the discontinuation of a residence file produced elsewhere at the U.S. Census Bureau. The goal of the Residence Candidate File (RCF) process is to provide the LEHD Infrastructure Files with residence information that maintains currency with the changing state of administrative sources and represents uncertainty in location as a probability distribution. The discontinued file provided only a single residence per person/year, even when contributing administrative data may have contained multiple residences. This paper describes the motivation for the project, our methodology, the administrative data sources, the model estimation and validation results, and the file specifications. We find that the best prediction of the person-place model provides similar, but superior, accuracy compared with previous methods and performs well for workers in the LEHD jobs frame. We outline possibilities for further improvement in sources and modeling as well as recommendations on how to use the preference weights in downstream processing.
    View Full Paper PDF
  • Working Paper

    Design Comparison of LODES and ACS Commuting Data Products

    October 2014

    Working Paper Number:

    CES-14-38

    The Census Bureau produces two complementary data products, the American Community Survey (ACS) commuting and workplace data and the Longitudinal Employer-Household Dynamics (LEHD) Origin-Destination Employment Statistics (LODES), which can be used to answer questions about spatial, economic, and demographic questions relating to workplaces and home-to-work flows. The products are complementary in the sense that they measure similar activities but each has important unique characteristics that provide information that the other measure cannot. As a result of questions from data users, the Census Bureau has created this document to highlight the major design differences between these two data products. This report guides users on the relative advantages of each data product for various analyses and helps explain differences that may arise when using the products.2,3 As an overview, these two data products are sourced from different inputs, cover different populations and time periods, are subject to different sets of edits and imputations, are released under different confidentiality protection mechanisms, and are tabulated at different geographic and characteristic levels. As a general rule, the two data products should not be expected to match exactly for arbitrary queries and may differ substantially for some queries. Within this document, we compare the two data products by the design elements that were deemed most likely to contribute to differences in tabulated data. These elements are: Collection, Coverage, Geographic and Longitudinal Scope, Job Definition and Reference Period, Job and Worker Characteristics, Location Definitions (Workplace and Residence), Completeness of Geographic Information and Edits/Imputations, Geographic Tabulation Levels, Control Totals, Confidentiality Protection and Suppression, and Related Public-Use Data Products. An in-depth data analysis'in aggregate or with the microdata'between the two data products will be the subject of a future technical report. The Census Bureau has begun a pilot project to integrate ACS microdata with LEHD administrative data to develop an enhanced frame of employment status, place of work, and commuting. The Census Bureau will publish quality metrics for person match rates, residence and workplace match rates, and commute distance comparisons.
    View Full Paper PDF
  • Working Paper

    Social, Economic, Spatial, and Commuting Patterns of Self-Employed Jobholders

    April 2007

    Working Paper Number:

    tp-2007-03

    A significant number of employees within the United States identify themselves as selfemployed, and they are distinct from the larger group identified as private jobholders. While socioeconomic and spatial information on these individuals is readily available in standard datasets, such as the 2000 Decennial Census Long Form, it is possible to gain further information on their wage earnings by using data from administrative wage records. This study takes advantage of firm-based data from Unemployment Insurance administrative wage records linked with the Census Bureau's household-based data in order to examine self-employed jobholders - both as a whole and as subgroups defined according to their earned wage status - by their demographic characteristics as well as their economic, commuting, and spatial location outcomes. Additionally, this report evaluates whether self-employed jobholders and the defined subgroups should be included explicitly in future labor-workforce analyses and transportation modeling. The analyses in this report use the sample of self-employed workers who lived in Los Angeles County, California.
    View Full Paper PDF
  • Working Paper

    Social, Economic, Spatial, and Commuting Patterns of Informal Jobholders

    April 2007

    Working Paper Number:

    tp-2007-02

    A significant number of employees within the United States can be considered "informal" or "off-the-books" workers. These workers, who by definition do not appear in administrative wage records, are distinct from the larger group of private jobholders who do appear in administrative records. However, while socioeconomic and spatial information on these individuals is readily available in standard datasets, such as the 2000 Decennial Census Long Form, it is not possible to identify the informal workers by only using such data because of the lack of accurate, formal wage records. This study takes advantage of firm-based data that originates in Unemployment Insurance administrative wage records linked with the Census Bureau's household-based data in order to examine informal jobholders by their demographic characteristics as well as their economic, commuting, and spatial location outcomes. In addition this report evaluates whether informal jobholders should be included explicitly in future labor-workforce analyses and transportation modeling. The analyses in this report use the sample of workers who lived in Los Angeles County, California.
    View Full Paper PDF
  • Working Paper

    Social, Economic, Spatial, and Commuting Patterns of Dual Jobholders

    April 2007

    Working Paper Number:

    tp-2007-01

    Individuals who hold multiple jobs have complex working lives and complex commuting patterns. Economic and spatial information on these individuals is not readily available in standard datasets, such as the 2000 Decennial Census Long Form, because the survey questions were not designed to collect details on multiple jobs. This study takes advantage of firm-based data from the Unemployment Insurance administrative wage records, linked with the Census Bureau's household-based data, to examine multiple jobholders - and specifically a sentinel group of dual jobholders. The study uses a sample from Los Angeles County, California and examines the dual jobholders by their demographic characteristics as well as their economic, commuting, and spatial location outcomes. In addition this report evaluates whether multiple jobholders should be included explicitly in future labor-workforce analyses and transportation modeling.
    View Full Paper PDF