- 
        
        Locating Hispanic Americans, 1900-2020
        
 July 2025
             
                Working Paper Number:CES-25-50
            
            This study examines Hispanic Americans' residential settlement patterns nationwide in the last 120 years. Drawing on newly available neighborhood data for the whole country as early as 1900, it documents the direction and timing of changes in two aspects of their location. First, it charts Hispanics' transition from a predominantly rural population to majority metropolitan by 1930 and also their growing presence in all regions of the U.S. while still maintaining a predominance in the West and Texas. Second, it provides the first evidence of the long-term trajectory of their segregation from whites in the metropolitan areas where they were settling. As shown by studies of more recent decades, Hispanics were never as segregated as African Americans. Nonetheless, similar to African Americans, their segregation from whites increased to high levels through the middle of the century, followed by slow decline. For both groups metropolitan segregation was driven mainly by segregation among central city neighborhoods prior to the 1940s. But new forms of segregation ' a growing city/suburb divide and increasing segregation among suburban places ' have become the largest contributors to segregation today.
         View Full
            Paper PDF
 
- 
        
        Household Wealth and Entrepreneurial Career Choices: Evidence from Climate Disasters
        
 July 2024
             
                Working Paper Number:CES-24-39
            
            This study investigates how household wealth affects the human capital of startups, based on U.S. Census individual-level employment data, deed records, and geographic information system (GIS) data. Using floods as a wealth shock, a regression discontinuity analysis shows inundated residents are 7% less likely to work in startups relative to their neighbors outside the flood boundary, within a 0.1-mile-wide band. The effect is more pronounced for homeowners, consistent with the wealth effect. The career distortion leads to a significant long-run income loss, highlighting the importance of self-insurance for human capital allocation.
         View Full
            Paper PDF
 
- 
        
        The U.S. Manufacturing Sector's Response to Higher Electricity Prices:  Evidence from State-Level Renewable Portfolio Standards
        
 October 2022
             
                Working Paper Number:CES-22-47
            
            While several papers examine the effects of renewable portfolio standards (RPS) on electricity prices, they mainly rely on state-level data and there has been little research on how RPS policies affect manufacturing activity via their effect on electricity prices. Using plant-level data for the entire U.S. manufacturing sector and all electric utilities from 1992 ' 2015, we jointly estimate the effect of RPS adoption and stringency on plant-level electricity prices and production decisions. To ensure that our results are not sensitive to possible pre-existing differences across manufacturing plants in RPS and non-RPS states, we implement coarsened exact covariate matching. Our results suggest that electricity prices for plants in RPS states averaged about 2% higher than in non-RPS states, notably lower than prior estimates based on state-level data. In response to these higher electricity prices, we estimate that plant electricity usage declined by 1.2% for all plants and 1.8% for energy-intensive plants, broadly consistent with published estimates of the elasticity of electricity demand for industrial users. We find smaller declines in output, employment, and hours worked (relative to the decline in electricity use). Finally, several key RPS policy design features that vary substantially from state-to-state produce heterogeneous effects on plant-level electricity prices.
         View Full
            Paper PDF
 
- 
        
        Improving Estimates of Neighborhood Change with Constant Tract Boundaries
        
 May 2022
             
                Working Paper Number:CES-22-16
            
            Social scientists routinely rely on methods of interpolation to adjust available data to their  research needs. This study calls attention to the potential for substantial error in efforts to harmonize data to constant boundaries using standard approaches to areal and population interpolation. We compare estimates from a standard source (the Longitudinal Tract Data Base) to true values calculated by re-aggregating original 2000 census microdata to 2010 tract areas.  We then demonstrate an alternative approach that allows the re-aggregated values to be publicly  disclosed, using 'differential privacy' (DP) methods to inject random noise to protect confidentiality of the raw data. The DP estimates are considerably more accurate than the  interpolated estimates. We also examine conditions under which interpolation is more  susceptible to error. This study reveals cause for greater caution in the use of interpolated  estimates from any source. Until and unless DP estimates can be publicly disclosed for a wide  range of variables and years, research on neighborhood change should routinely examine data for  signs of estimation error that may be substantial in a large share of tracts that experienced  complex boundary changes.
         View Full
            Paper PDF
 
- 
        
        Validating Abstract Representations of Spatial Population Data while considering Disclosure Avoidance
        
 February 2020
             
                Working Paper Number:CES-20-05
            
            This paper furthers a research agenda for modeling populations along spatial networks and expands upon an empirical analysis to a full U.S. county (Gaboardi, 2019, Ch. 1,2). Specific foci are the necessity of, and methods for, validating and benchmarking spatial data when conducting social science research with aggregated and ambiguous population representations. In order to promote the validation of publicly-available data, access to highly-restricted census microdata was requested, and granted, in order to determine the levels of accuracy and error associated with a network-based population modeling framework. Primary findings reinforce the utility of a novel network allocation method'populated polygons to networks (pp2n) in terms of accuracy, computational complexity, and real runtime (Gaboardi, 2019, Ch. 2). Also, a pseudo-benchmark dataset's performance against the true census microdata shows promise in modeling populations along networks.
         View Full
            Paper PDF
 
- 
        
        Reservation Nonemployer and Employer Establishments: Data from U.S. Census Longitudinal Business Databases
        
 December 2018
             
                Working Paper Number:CES-18-50
            
            The presence of businesses on American Indian reservations has been difficult to analyze due to limited data. Akee, Mykerezi, and Todd (AMT; 2017) geocoded confidential data from the U.S. Census Longitudinal Business Database to identify whether employer establishments were located on or off American Indian reservations and then compared federally recognized reservations and nearby county areas with respect to their per capita number of employers and jobs. We use their methods and the U.S. Census Integrated Longitudinal Business Database to develop parallel results for nonemployer establishments and for the combination of employer and nonemployer establishments. Similar to AMT's findings, we find that reservations and nearby county areas have a similar sectoral distribution of nonemployer and nonemployer-plus-employer establishments, but reservations have significantly fewer of them in nearly all sectors, especially when the area population is below 15,000. By contrast to AMT, the average size of reservation nonemployer establishments, as measured by revenue (instead of the jobs measure AMT used for employers), is smaller than the size of nonemployers in nearby county areas, and this is true in most industries as well. The most significant exception is in the retail sector. Geographic and demographic factors, such as population density and per capita income, statistically account for only a small portion of these differences. However, when we assume that nonemployer establishments create the equivalent of one job and use combined employer-plus-nonemployer jobs to measure establishment size, the employer job numbers dominate and we parallel AMT's finding that, due to large job counts in the Arts/Entertainment/Recreation and Public Administration sectors, reservations on average have slightly more jobs per resident than nearby county areas.
         View Full
            Paper PDF
 
- 
        
        Who are the people in my neighborhood? The 'contextual fallacy' of measuring individual context with census geographies
        
 February 2018
             
                Working Paper Number:CES-18-11
            
            Scholars deploy census-based measures of neighborhood context throughout the social sciences and epidemiology. Decades of research confirm that variation in how individuals are aggregated into geographic units to create variables that control for social, economic or political contexts can dramatically alter analyses. While most researchers are aware of the problem, they have lacked the tools to determine its magnitude in the literature and in their own projects. By using confidential access to the complete 2010 U.S. Decennial Census, we are able to construct'for all persons in the US'individual-specific contexts, which we group according to the Census-assigned block, block group, and tract. We compare these individual-specific measures to the published statistics at each scale, and we then determine the magnitude of variation in context for an individual with respect to the published measures using a simple statistic, the standard deviation of individual context (SDIC). For three key measures (percent Black, percent Hispanic, and Entropy'a measure of ethno-racial diversity), we find that block-level Census statistics frequently do not capture the actual context of individuals within them. More problematic, we uncover systematic spatial patterns in the contextual variables at all three scales. Finally, we show that within-unit variation is greater in some parts of the country than in others. We publish county-level estimates of the SDIC statistics that enable scholars to assess whether mis-specification in context variables is likely to alter analytic findings when measured at any of the three common Census units.
         View Full
            Paper PDF
 
- 
        
        Reservation Employer Establishments: Data from the U.S. Census Longitudinal Business Database
        
 January 2017
             
                Working Paper Number:CES-17-57
            
            The presence of employers and jobs on American Indian reservations has been difficult to analyze due to limited data. We are the first to geocode confidential data on employer establishments from the U.S. Census Longitudinal Business Database to identify location on or off American Indian reservations. We identify the per capita establishment count and jobs in reservation-based employer establishments for most federally recognized reservations. Comparisons to nearby non-reservation areas in the lower 48 states across 18 industries reveal that reservations have a similar sectoral distribution of employer establishments but have significantly fewer of them in nearly all sectors, especially when the area population is below 15,000 (as it is on the vast majority of reservations and for the majority of the reservation population). By contrast, the total number of jobs provided by reservation establishments is, on average, at par with or somewhat higher than in nearby county areas but is concentrated among casino-related and government employers. An implication is that average job numbers per establishment are higher in these sectors on reservations, including those with populations below 15,000, while the remaining industries are typically sparser within reservations (in firm count and jobs per capita). Geographic and demographic factors, such as population density and per capita income, statistically account for some but not all of these differences.
         View Full
            Paper PDF
 
- 
        
        The Impact of College Education on Old-Age Mortality: A Study of Marginal Treatment Effects
        
 January 2017
             
                Working Paper Number:CES-17-30
            
            Using a newly constructed dataset that links 2000 U.S. Census long-form records to Social Security Administration data files, I evaluate the effect of college education on mortality. In an OLS regression, women and men who have at least some college education have 20% lower mortality rates than those with a high school degree or less. I proceed with an empirical design intended to illuminate the extent to which this relationship is causal, estimating marginal treatment effects (MTEs) using the proximity of the nearest college to individuals' birthplace as an instrument. Results indicate positive selection into college education (in terms of longevity) for both women and men. Selection drives almost all of the mortality gap for women. For men, longevity gains from college attendance are concentrated among individuals with unobserved variables that make them unlikely attend college. This suggests that men who would benefit most from receiving college education in terms of mortality reductions are those who are not attending.
         View Full
            Paper PDF
 
- 
        
        Evaluating the Use of Commercial Data to Improve Survey Estimates of Property Taxes
        
 August 2016
             
                Working Paper Number:carra-2016-06
            
            While commercial data sources offer promise to statistical agencies for use in production of official statistics, challenges can arise as the data are not collected for statistical purposes. This paper evaluates the use of 2008-2010 property tax data from CoreLogic, Inc. (CoreLogic), aggregated from county and township governments from around the country, to improve 2010 American Community Survey (ACS) estimates of property tax amounts for single-family homes. Particularly, the research evaluates the potential to use CoreLogic to reduce respondent burden, to study survey response error and to improve adjustments for survey nonresponse. The research found that the coverage of the CoreLogic data varies between counties as does the correspondence between ACS and CoreLogic property taxes. This geographic variation implies that different approaches toward using CoreLogic are needed in different areas of the country. Further, large differences between CoreLogic and ACS property taxes in certain counties seem to be due to conceptual differences between what is collected in the two data sources. The research examines three counties, Clark County, NV, Philadelphia County, PA and St. Louis County, MO, and compares how estimates would change with different approaches using the CoreLogic data. Mean county property tax estimates are highly sensitive to whether ACS or CoreLogic data are used to construct estimates. Using CoreLogic data in imputation modeling for nonresponse adjustment of ACS estimates modestly improves the predictive power of imputation models, although estimates of county property taxes and property taxes by mortgage status are not very sensitive to the imputation method.
         View Full
            Paper PDF