CREAT: Census Research Exploration and Analysis Tool

Papers Containing Tag(s): 'Geographic Information Systems'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

No authors occur more than twice in this search.

Viewing papers 1 through 10 of 22


  • Working Paper

    Household Wealth and Entrepreneurial Career Choices: Evidence from Climate Disasters

    July 2024

    Authors: Xiao Cen

    Working Paper Number:

    CES-24-39

    This study investigates how household wealth affects the human capital of startups, based on U.S. Census individual-level employment data, deed records, and geographic information system (GIS) data. Using floods as a wealth shock, a regression discontinuity analysis shows inundated residents are 7% less likely to work in startups relative to their neighbors outside the flood boundary, within a 0.1-mile-wide band. The effect is more pronounced for homeowners, consistent with the wealth effect. The career distortion leads to a significant long-run income loss, highlighting the importance of self-insurance for human capital allocation.
    View Full Paper PDF
  • Working Paper

    The U.S. Manufacturing Sector's Response to Higher Electricity Prices: Evidence from State-Level Renewable Portfolio Standards

    October 2022

    Working Paper Number:

    CES-22-47

    While several papers examine the effects of renewable portfolio standards (RPS) on electricity prices, they mainly rely on state-level data and there has been little research on how RPS policies affect manufacturing activity via their effect on electricity prices. Using plant-level data for the entire U.S. manufacturing sector and all electric utilities from 1992 ' 2015, we jointly estimate the effect of RPS adoption and stringency on plant-level electricity prices and production decisions. To ensure that our results are not sensitive to possible pre-existing differences across manufacturing plants in RPS and non-RPS states, we implement coarsened exact covariate matching. Our results suggest that electricity prices for plants in RPS states averaged about 2% higher than in non-RPS states, notably lower than prior estimates based on state-level data. In response to these higher electricity prices, we estimate that plant electricity usage declined by 1.2% for all plants and 1.8% for energy-intensive plants, broadly consistent with published estimates of the elasticity of electricity demand for industrial users. We find smaller declines in output, employment, and hours worked (relative to the decline in electricity use). Finally, several key RPS policy design features that vary substantially from state-to-state produce heterogeneous effects on plant-level electricity prices.
    View Full Paper PDF
  • Working Paper

    Improving Estimates of Neighborhood Change with Constant Tract Boundaries

    May 2022

    Working Paper Number:

    CES-22-16

    Social scientists routinely rely on methods of interpolation to adjust available data to their research needs. This study calls attention to the potential for substantial error in efforts to harmonize data to constant boundaries using standard approaches to areal and population interpolation. We compare estimates from a standard source (the Longitudinal Tract Data Base) to true values calculated by re-aggregating original 2000 census microdata to 2010 tract areas. We then demonstrate an alternative approach that allows the re-aggregated values to be publicly disclosed, using 'differential privacy' (DP) methods to inject random noise to protect confidentiality of the raw data. The DP estimates are considerably more accurate than the interpolated estimates. We also examine conditions under which interpolation is more susceptible to error. This study reveals cause for greater caution in the use of interpolated estimates from any source. Until and unless DP estimates can be publicly disclosed for a wide range of variables and years, research on neighborhood change should routinely examine data for signs of estimation error that may be substantial in a large share of tracts that experienced complex boundary changes.
    View Full Paper PDF
  • Working Paper

    Validating Abstract Representations of Spatial Population Data while considering Disclosure Avoidance

    February 2020

    Authors: James Gaboardi

    Working Paper Number:

    CES-20-05

    This paper furthers a research agenda for modeling populations along spatial networks and expands upon an empirical analysis to a full U.S. county (Gaboardi, 2019, Ch. 1,2). Specific foci are the necessity of, and methods for, validating and benchmarking spatial data when conducting social science research with aggregated and ambiguous population representations. In order to promote the validation of publicly-available data, access to highly-restricted census microdata was requested, and granted, in order to determine the levels of accuracy and error associated with a network-based population modeling framework. Primary findings reinforce the utility of a novel network allocation method'populated polygons to networks (pp2n) in terms of accuracy, computational complexity, and real runtime (Gaboardi, 2019, Ch. 2). Also, a pseudo-benchmark dataset's performance against the true census microdata shows promise in modeling populations along networks.
    View Full Paper PDF
  • Working Paper

    Reservation Nonemployer and Employer Establishments: Data from U.S. Census Longitudinal Business Databases

    December 2018

    Working Paper Number:

    CES-18-50

    The presence of businesses on American Indian reservations has been difficult to analyze due to limited data. Akee, Mykerezi, and Todd (AMT; 2017) geocoded confidential data from the U.S. Census Longitudinal Business Database to identify whether employer establishments were located on or off American Indian reservations and then compared federally recognized reservations and nearby county areas with respect to their per capita number of employers and jobs. We use their methods and the U.S. Census Integrated Longitudinal Business Database to develop parallel results for nonemployer establishments and for the combination of employer and nonemployer establishments. Similar to AMT's findings, we find that reservations and nearby county areas have a similar sectoral distribution of nonemployer and nonemployer-plus-employer establishments, but reservations have significantly fewer of them in nearly all sectors, especially when the area population is below 15,000. By contrast to AMT, the average size of reservation nonemployer establishments, as measured by revenue (instead of the jobs measure AMT used for employers), is smaller than the size of nonemployers in nearby county areas, and this is true in most industries as well. The most significant exception is in the retail sector. Geographic and demographic factors, such as population density and per capita income, statistically account for only a small portion of these differences. However, when we assume that nonemployer establishments create the equivalent of one job and use combined employer-plus-nonemployer jobs to measure establishment size, the employer job numbers dominate and we parallel AMT's finding that, due to large job counts in the Arts/Entertainment/Recreation and Public Administration sectors, reservations on average have slightly more jobs per resident than nearby county areas.
    View Full Paper PDF
  • Working Paper

    Who are the people in my neighborhood? The 'contextual fallacy' of measuring individual context with census geographies

    February 2018

    Working Paper Number:

    CES-18-11

    Scholars deploy census-based measures of neighborhood context throughout the social sciences and epidemiology. Decades of research confirm that variation in how individuals are aggregated into geographic units to create variables that control for social, economic or political contexts can dramatically alter analyses. While most researchers are aware of the problem, they have lacked the tools to determine its magnitude in the literature and in their own projects. By using confidential access to the complete 2010 U.S. Decennial Census, we are able to construct'for all persons in the US'individual-specific contexts, which we group according to the Census-assigned block, block group, and tract. We compare these individual-specific measures to the published statistics at each scale, and we then determine the magnitude of variation in context for an individual with respect to the published measures using a simple statistic, the standard deviation of individual context (SDIC). For three key measures (percent Black, percent Hispanic, and Entropy'a measure of ethno-racial diversity), we find that block-level Census statistics frequently do not capture the actual context of individuals within them. More problematic, we uncover systematic spatial patterns in the contextual variables at all three scales. Finally, we show that within-unit variation is greater in some parts of the country than in others. We publish county-level estimates of the SDIC statistics that enable scholars to assess whether mis-specification in context variables is likely to alter analytic findings when measured at any of the three common Census units.
    View Full Paper PDF
  • Working Paper

    Reservation Employer Establishments: Data from the U.S. Census Longitudinal Business Database

    January 2017

    Working Paper Number:

    CES-17-57

    The presence of employers and jobs on American Indian reservations has been difficult to analyze due to limited data. We are the first to geocode confidential data on employer establishments from the U.S. Census Longitudinal Business Database to identify location on or off American Indian reservations. We identify the per capita establishment count and jobs in reservation-based employer establishments for most federally recognized reservations. Comparisons to nearby non-reservation areas in the lower 48 states across 18 industries reveal that reservations have a similar sectoral distribution of employer establishments but have significantly fewer of them in nearly all sectors, especially when the area population is below 15,000 (as it is on the vast majority of reservations and for the majority of the reservation population). By contrast, the total number of jobs provided by reservation establishments is, on average, at par with or somewhat higher than in nearby county areas but is concentrated among casino-related and government employers. An implication is that average job numbers per establishment are higher in these sectors on reservations, including those with populations below 15,000, while the remaining industries are typically sparser within reservations (in firm count and jobs per capita). Geographic and demographic factors, such as population density and per capita income, statistically account for some but not all of these differences.
    View Full Paper PDF
  • Working Paper

    The Impact of College Education on Old-Age Mortality: A Study of Marginal Treatment Effects

    January 2017

    Authors: Evan Taylor

    Working Paper Number:

    CES-17-30

    Using a newly constructed dataset that links 2000 U.S. Census long-form records to Social Security Administration data files, I evaluate the effect of college education on mortality. In an OLS regression, women and men who have at least some college education have 20% lower mortality rates than those with a high school degree or less. I proceed with an empirical design intended to illuminate the extent to which this relationship is causal, estimating marginal treatment effects (MTEs) using the proximity of the nearest college to individuals' birthplace as an instrument. Results indicate positive selection into college education (in terms of longevity) for both women and men. Selection drives almost all of the mortality gap for women. For men, longevity gains from college attendance are concentrated among individuals with unobserved variables that make them unlikely attend college. This suggests that men who would benefit most from receiving college education in terms of mortality reductions are those who are not attending.
    View Full Paper PDF
  • Working Paper

    Evaluating the Use of Commercial Data to Improve Survey Estimates of Property Taxes

    August 2016

    Working Paper Number:

    carra-2016-06

    While commercial data sources offer promise to statistical agencies for use in production of official statistics, challenges can arise as the data are not collected for statistical purposes. This paper evaluates the use of 2008-2010 property tax data from CoreLogic, Inc. (CoreLogic), aggregated from county and township governments from around the country, to improve 2010 American Community Survey (ACS) estimates of property tax amounts for single-family homes. Particularly, the research evaluates the potential to use CoreLogic to reduce respondent burden, to study survey response error and to improve adjustments for survey nonresponse. The research found that the coverage of the CoreLogic data varies between counties as does the correspondence between ACS and CoreLogic property taxes. This geographic variation implies that different approaches toward using CoreLogic are needed in different areas of the country. Further, large differences between CoreLogic and ACS property taxes in certain counties seem to be due to conceptual differences between what is collected in the two data sources. The research examines three counties, Clark County, NV, Philadelphia County, PA and St. Louis County, MO, and compares how estimates would change with different approaches using the CoreLogic data. Mean county property tax estimates are highly sensitive to whether ACS or CoreLogic data are used to construct estimates. Using CoreLogic data in imputation modeling for nonresponse adjustment of ACS estimates modestly improves the predictive power of imputation models, although estimates of county property taxes and property taxes by mortgage status are not very sensitive to the imputation method.
    View Full Paper PDF
  • Working Paper

    Storms and Jobs: The Effect of Hurricanes on Individuals' Employment and Earnings over the Long Term*

    January 2015

    Working Paper Number:

    CES-15-21R

    Hurricanes Katrina and Rita devastated the U.S. Gulf Coast in 2005, destroying homes and businesses and causing mass evacuations. The economic effects of disasters are often studied at a regional level, but little is known about the responsiveness of individuals' employment and earnings to the damages, disruption, and rebuilding'particularly in the longer run. Our analysis is based on data that tracks workers over nine years, including seven years after the storms. We estimate models that compare the evolution of earnings for workers who resided in a storm-affected area with those who resided in a suitable control counties. We find that, on average, the storms reduced the earnings of affected individuals during the first year after the storm. These losses reflect various aspects of the short-run disruption caused by the hurricanes, including job separations, migration to other areas, and business contractions. Starting in the third year after the storms, however, we find that the earnings of affected individuals outpaced the earnings of individuals in the control sample. We provide evidence that the long-term earnings gains were the result of wage growth in the affected areas relative to the control areas, due to reduced labor supply and increased labor demand, especially in sectors related to rebuilding. Despite the short-term earnings losses, we find a net increase in average quarterly earnings among affected individuals over the entire post-storm period. However, those who worked in sectors closely tied to tourism or the size of the local population experienced net earnings losses.
    View Full Paper PDF