-
Expanding the Frontier of Economic Statistics Using Big Data: A Case Study of Regional Employment
July 2024
Working Paper Number:
CES-24-37
Big data offers potentially enormous benefits for improving economic measurement, but it also presents challenges (e.g., lack of representativeness and instability), implying that their value is not always clear. We propose a framework for quantifying the usefulness of these data sources for specific applications, relative to existing official sources. We specifically weigh the potential benefits of additional granularity and timeliness, while examining the accuracy associated with any new or improved estimates, relative to comparable accuracy produced in existing official statistics. We apply the methodology to employment estimates using data from a payroll processor, considering both the improvement of existing state-level estimates, but also the production of new, more timely, county-level estimates. We find that incorporating payroll data can improve existing state-level estimates by 11% based on out-of-sample mean absolute error, although the improvement is considerably higher for smaller state-industry cells. We also produce new county-level estimates that could provide more timely granular estimates than previously available. We develop a novel test to determine if these new county-level estimates have errors consistent with official series. Given the level of granularity, we cannot reject the hypothesis that the new county estimates have an accuracy in line with official measures, implying an expansion of the existing frontier. We demonstrate the practical importance of these experimental estimates by investigating a hypothetical application during the COVID-19 pandemic, a period in which more timely and granular information could have assisted in implementing effective policies. Relative to existing estimates, we find that the alternative payroll data series could help identify areas of the country where employment was lagging. Moreover, we also demonstrate the value of a more timely series.
View Full
Paper PDF
-
Contrasting the Local and National Demographic Incidence of Local Labor Demand Shocks
July 2024
Working Paper Number:
CES-24-36
This paper examines how spatial frictions that differ among heterogeneous workers and establishments shape the geographic and demographic incidence of alternative local labor demand shocks, with implications for the appropriate level of government at which to fund local economic initiatives. LEHD data featuring millions of job transitions facilitate estimation of a rich two-sided labor market assignment model. The model generates simulated forecasts of many alternative local demand shocks featuring different establishment compositions and local areas. Workers within 10 miles receive only 11.2% (6.6%) of nationwide welfare (employment) short-run gains, with at least 35.9% (62.0%) accruing to out-of-state workers, despite much larger per-worker impacts for the closest workers. Local incidence by demographic category is very sensitive to shock composition, but different shocks produce similar demographic incidence farther from the shock. Furthermore, the remaining heterogeneity in incidence at the state or national level can reverse patterns of heterogeneous demographic impacts at the local level. Overall, the results suggest that reduced-form approaches using distant locations as controls can produce accurate estimates of local shock impacts on local workers, but that the distribution of local impacts badly approximates shocks' statewide or national incidence.
View Full
Paper PDF
-
Producing U.S. Population Statistics Using Multiple Administrative Sources
November 2023
Working Paper Number:
CES-23-58
We identify several challenges encountered when constructing U.S. administrative record-based (AR-based) population estimates for 2020. Though the AR estimates are higher than the 2020 Census at the national level, they are over 15 percent lower in 5 percent of counties, suggesting that locational accuracy can be improved. Other challenges include how to achieve comprehensive coverage, maintain consistent coverage across time, filter out nonresidents and people not alive on the reference date, uncover missing links across person and address records, and predict demographic characteristics when multiple ones are reported or when they are missing. We discuss several ways of addressing these issues, e.g., building in redundancy with more sources, linking children to their parents' addresses, and conducting additional record linkage for people without Social Security Numbers and for addresses not initially linked to the Census Bureau's Master Address File. We discuss modeling to predict lower levels of geography for people lacking those geocodes, the probability that a person is a U.S. resident on the reference date, the probability that an address is the person's residence on the reference date, and the probability a person is in each demographic characteristic category. Regression results illustrate how many of these challenges and solutions affect the AR county population estimates.
View Full
Paper PDF
-
Output Market Power and Spatial Misallocation
November 2023
Working Paper Number:
CES-23-57
Most product industries are local. In the U.S., firms selling goods and services to local consumers account for half of total sales and generate more than sixty percent of the nation's jobs. Competition in these industries occurs in local product markets: cities. I propose a theory of such competition in which firms have output market power. Spatial differences in local competition arise endogenously due to the spatial sorting of heterogeneous firms. The ability to charge higher markups induces more productive firms to overvalue locating in larger cities, leading to a misallocation of firms across space. The optimal policy incen tivizes productive firms to relocate to smaller cities, providing a rationale for commonly used place-based policies. I use U.S. Census establishment-level data to estimate markups and to structurally estimate the model. I document a significant heterogeneity in markups for local industries across U.S. cities. Cities in the top decile of the city-size distribution have a fifty percent lower markup than cities in the bottom decile. I use the estimated model to quantify the general equilibrium effects of place-based policies. Policies that remove markups and relocate firms to smaller cities yield sizable aggregate welfare gains.
View Full
Paper PDF
-
Coverage of Children in the American Community Survey Based on California Birth Records
September 2023
Working Paper Number:
CES-23-46
The U.S. Census Bureau's American Community Survey (ACS) collects information on individuals and households. The ACS provides survey-based estimates of children drawn from a sample of the U.S. population. However, survey responses may not match administrative records, such as birth records. Birth records should provide a complete account of all births, along with child-parent relationships and demographic characteristics. California is a state that has both a large population of children and a high undercount for young children. This paper uses California as a case study to examine differences between reported versus unreported children in the ACS based on state birth records. Child reporting rates were lower for more recent data years, younger children, for Black and Hispanic mothers, and for more complex households. Child reporting rates were higher for more educated mothers and for households above the poverty line. Using mother's race and Hispanic ethnicity from the birth records combined with poverty indices from the ACS, this analysis also finds that child reporting does not uniformly vary with poverty status across all race and ethnicity groups. This research builds support for the utility of state birth records in analyzing the undercount of children.
View Full
Paper PDF
-
Noncitizen Coverage and Its Effects on U.S. Population Statistics
August 2023
Working Paper Number:
CES-23-42
We produce population estimates with the same reference date, April 1, 2020, as the 2020 Census of Population and Housing by combining 31 types of administrative record (AR) and third-party sources, including several new to the Census Bureau with a focus on noncitizens. Our AR census national population estimate is higher than other Census Bureau official estimates: 1.8% greater than the 2020 Demographic Analysis high estimate, 3.0% more than the 2020 Census count, and 3.6% higher than the vintage-2020 Population Estimates Program estimate. Our analysis suggests that inclusion of more noncitizens, especially those with unknown legal status, explains the higher AR census estimate. About 19.8% of AR census noncitizens have addresses that cannot be linked to an address in the 2020 Census collection universe, compared to 5.7% of citizens, raising the possibility that the 2020 Census did not collect data for a significant fraction of noncitizens residing in the United States under the residency criteria used for the census. We show differences in estimates by age, sex, Hispanic origin, geography, and socioeconomic characteristics symptomatic of the differences in noncitizen coverage.
View Full
Paper PDF
-
Re-examining Regional Income Convergence: A Distributional Approach
February 2023
Working Paper Number:
CES-23-05
We re-examine recent trends in regional income convergence, considering the full distribution of income rather than focusing on the mean. Measuring similarity by comparing each percentile of state
distributions to the corresponding percentile of the national distribution, we find that state incomes have become less similar (i.e. they have diverged) within the top 20 percent of the income distribution since 1969. The top percentile alone accounts for more than half of aggregate divergence across states over this period by our measure, and the top five percentiles combine to account for 93 percent. Divergence in top incomes across states appears to be driven largely by changes in top incomes among White people, while top incomes among Black people have experienced relatively little divergence.
View Full
Paper PDF
-
Comparing the 2019 American Housing Survey to Contemporary Sources of Property Tax Records: Implications for Survey Efficiency and Quality
June 2022
Working Paper Number:
CES-22-22
Given rising nonresponse rates and concerns about respondent burden, government statistical agencies have been exploring ways to supplement household survey data collection with administrative records and other sources of third-party data. This paper evaluates the potential of property tax assessment records to improve housing surveys by comparing these records to responses from the 2019 American Housing Survey. Leveraging the U.S. Census Bureau's linkage infrastructure, we compute the fraction of AHS housing units that could be matched to a unique property parcel (coverage rate), as well as the extent to which survey and property tax data contain the same information (agreement rate). We analyze heterogeneity in coverage and agreement across states, housing characteristics, and 11 AHS items of interest to housing researchers. Our results suggest that partial replacement of AHS data with property data, targeted toward certain survey items or single-family detached homes, could reduce respondent burden without altering data quality. Further research into partial-replacement designs is needed and should proceed on an item-by-item basis. Our work can guide this research as well as those who wish to conduct independent research with property tax records that is representative of the U.S. housing stock.
View Full
Paper PDF
-
Immigration and the Demand for Urban Housing
August 2021
Working Paper Number:
CES-21-23
The immigrant population has grown dramatically in the US in the last fifty years. This study estimates housing demand among immigrants and discusses how immigration may be altering the structure of US urban areas. Immigrants are found to consume less housing per capita than native born US residents.
View Full
Paper PDF
-
The Shifting of the Property Tax on Urban Renters: Evidence from New York State's Homestead Tax Option
December 2020
Working Paper Number:
CES-20-43
In 1981, New York State enabled their cities to adopt the Homestead Tax Option (HTO), which created a multi-tiered property tax system for rental properties in New York City, Buffalo, and Rochester. The HTO enabled these municipalities to impose a higher property tax rate on rental units in buildings with four or more units, compared to rental units in buildings with three or fewer units. Using restricted-use American Housing Survey data and historical property tax rates from each of these cities, we exploit within-unit across-time variation in property tax rates and rents to estimate the degree to which property taxes are shifted onto renters in the form of higher rents. We find that property owners shift approximately 14 percent of an increase in taxes onto renters. This study is the first to use within-unit across time variation in property taxes and rents to identify this shifting effect. Our estimated effect is measurably smaller than most previous studies, which often found shifting effects of over 60 percent.
View Full
Paper PDF