    Expanding the Frontier of Economic Statistics Using Big Data: A Case Study of Regional Employment

    July 2024

    Big data offer potentially enormous benefits for improving economic measurement, but they also present challenges (e.g., lack of representativeness and instability), so their value is not always clear. We propose a framework for quantifying the usefulness of these data sources for specific applications, relative to existing official sources. We specifically weigh the potential benefits of additional granularity and timeliness, while examining the accuracy of any new or improved estimates relative to the accuracy of existing official statistics. We apply the methodology to employment estimates using data from a payroll processor, considering both the improvement of existing state-level estimates and the production of new, more timely, county-level estimates. We find that incorporating payroll data can improve existing state-level estimates by 11% based on out-of-sample mean absolute error, although the improvement is considerably higher for smaller state-industry cells. We also produce new county-level estimates that could provide more timely granular estimates than previously available. We develop a novel test to determine if these new county-level estimates have errors consistent with official series. Given the level of granularity, we cannot reject the hypothesis that the new county estimates have an accuracy in line with official measures, implying an expansion of the existing frontier. We demonstrate the practical importance of these experimental estimates by investigating a hypothetical application during the COVID-19 pandemic, a period in which more timely and granular information could have assisted in implementing effective policies. Relative to existing estimates, we find that the alternative payroll data series could help identify areas of the country where employment was lagging. Moreover, we also demonstrate the value of a more timely series.
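    The state-level comparison above is stated in terms of out-of-sample mean absolute error. A minimal sketch of how a relative improvement of that kind could be computed follows; the function names and all numbers are hypothetical illustrations, not values or code from the paper.

```python
def mean_absolute_error(actual, predicted):
    """Average absolute gap between benchmark values and estimates."""
    if not actual or len(actual) != len(predicted):
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def relative_improvement(actual, baseline, candidate):
    """Share by which the candidate reduces the baseline's MAE."""
    base_mae = mean_absolute_error(actual, baseline)
    cand_mae = mean_absolute_error(actual, candidate)
    return (base_mae - cand_mae) / base_mae


# Hypothetical held-out benchmark employment values (thousands).
benchmark = [100.0, 102.0, 98.0, 101.0]
# Hypothetical estimates without and with the payroll series.
official_only = [104.0, 99.0, 101.0, 97.0]   # absolute errors 4, 3, 3, 4
with_payroll = [103.0, 100.0, 100.0, 98.0]   # absolute errors 3, 2, 2, 3

improvement = relative_improvement(benchmark, official_only, with_payroll)
print(f"MAE reduction: {improvement:.1%}")
```

    A positive value means the candidate series has the lower out-of-sample error; the same calculation could be repeated cell by cell to compare large and small state-industry cells.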
    Agglomeration Spillovers and Persistence: New Evidence from Large Plant Openings

    June 2022

    We use confidential Census microdata to compare outcomes for plants in counties that 'win' a new plant to plants in similar counties that did not receive the new plant, providing empirical evidence on the economic theories used to justify local industrial policies. We find little evidence that the average highly incentivized large plant generates significant productivity spillovers. Our semiparametric estimates of the overall local agglomeration function indicate that residual TFP is linear for the range of 'agglomeration' densities most frequently observed, suggesting local economic shocks do not push local economies to a new higher equilibrium. Examining changes twenty years after the new plant entrant, we find some evidence of persistent, positive increases in winning county-manufacturing shares that are not driven by establishment births.
    Estimating Market Power: Evidence from the US Brewing Industry

    January 2017

    While inferring markups from demand data is common practice, estimation relies on difficult-to-test assumptions, including a specific model of how firms compete. Alternatively, markups can be inferred from production data, again relying on a set of difficult-to-test assumptions, but a wholly different set, including the assumption that firms minimize costs using a variable input. Relying on data from the US brewing industry, we directly compare markup estimates from the two approaches. After implementing each approach for a broad set of assumptions and specifications, we find that both approaches provide similar and plausible markup estimates in most cases. The results illustrate how using the two strategies together can allow researchers to evaluate structural models and identify problematic assumptions.
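    For the production-data approach, a standard result under cost minimization is that the markup equals the output elasticity of a variable input divided by that input's share of revenue. The abstract does not state this formula, so the sketch below is an assumption about the general technique, with hypothetical function and parameter names and made-up numbers.

```python
def production_markup(output_elasticity, input_expenditure, revenue):
    """Markup implied by cost minimization on one variable input.

    output_elasticity: elasticity of output with respect to the input
        (in practice estimated from a production function).
    input_expenditure, revenue: in the same currency units.
    """
    if input_expenditure <= 0 or revenue <= 0:
        raise ValueError("expenditure and revenue must be positive")
    revenue_share = input_expenditure / revenue
    return output_elasticity / revenue_share


# Hypothetical plant: materials elasticity 0.6, materials cost 40, revenue 100.
markup = production_markup(0.6, 40.0, 100.0)
print(f"implied price-cost ratio: {markup:.2f}")
```

    Comparing a figure like this with the markup implied by an estimated demand system is the kind of cross-check the abstract describes.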
    Dutch Disease or Agglomeration? The Local Economic Effects of Natural Resource Booms in Modern America

    November 2015

    Do natural resources benefit producer economies, or is there a "Natural Resource Curse," perhaps as Dutch Disease crowds out manufacturing? We combine new data on oil and gas abundance with Census of Manufactures microdata to estimate how oil and gas booms have affected local economies in the United States. Migration does not fully offset labor demand growth, so local wages rise. Notwithstanding, manufacturing is actually pro-cyclical with resource booms, driven by growth in upstream and locally traded sectors. The results highlight the importance of highly local demand for many manufacturers and underscore how natural resource linkages can drive manufacturing growth.
    Linking Investment Spikes and Productivity Growth: U.S. Food Manufacturing Industry

    October 2008

    We investigate the relationship between productivity growth and investment spikes using the Census Bureau's plant-level data set for the U.S. food manufacturing industry. We find that productivity growth increases after investment spikes, suggesting an efficiency gain or a learning effect at plants. However, the efficiency gain and the learning period associated with investment spikes differ across plants' productivity quartile ranks, implying differences in plants' investment types, such as expansionary, replacement, or retooling. We find evidence of both convex and non-convex types of adjustment costs: lumpy plant-level investments suggest the possibility of non-convex adjustment costs, while hazard estimation results suggest the possibility of convex adjustment costs. The downward-sloping hazard can be due to unobserved heterogeneity across plants, such as plants' idiosyncratic obsolescence caused by different R&D capabilities, and implies the existence of convex adjustment costs. Food plants frequently invest during their first few years of operation, and high-productivity plants postpone investing due to high fixed costs.
    Pollution Abatement Expenditures and Plant-Level Productivity: A Production Function Approach

    August 2003

    In this paper, we investigate the impact of environmental regulation on productivity using a Cobb-Douglas production function framework. Estimating the effects of regulation on productivity can be done with a top-down approach using data for broad sectors of the economy, or a more disaggregated bottom-up approach. Our study follows a bottom-up approach using data from the U.S. paper, steel, and oil industries. We measure environmental regulation using plant-level information on pollution abatement expenditures, which allows us to distinguish between productive and abatement expenditures on each input. We use annual Census Bureau information (1979-1990) on output, labor, capital, and material inputs, and pollution abatement operating costs and capital expenditures for 68 pulp and paper mills, 55 oil refineries, and 27 steel mills. We find that pollution abatement inputs generally contribute little or nothing to output, especially when compared to their 'productive' equivalents. Adding an aggregate pollution abatement cost measure to a Cobb-Douglas production function, we find that a $1 increase in pollution abatement costs leads to an estimated productivity decline of $3.11, $1.80, and $5.98 in the paper, oil, and steel industries respectively. These findings imply substantial differences across industries in their sensitivity to pollution abatement costs, arguing for a bottom-up approach that can capture these differences. Further differentiating plants by their production technology, we find substantial differences in the impact of pollution abatement costs even within industries, with higher marginal costs at plants with more polluting technologies. Finally, in all three industries, plants concentrating on change-in-production-process abatement techniques have higher productivity than plants doing predominantly end-of-line abatement, but also seem to be more affected by pollution abatement operating costs. 
Overall, our results point to the importance of using detailed, disaggregated analyses, even below the industry level, when trying to model the costs of forcing plants to reduce their emissions.
    ARE FIXED EFFECTS FIXED? Persistence in Plant Level Productivity

    May 1996

    Authors: Douglas W Dwyer

    Estimates of production functions suffer from an omitted variable problem; plant quality is an omitted variable that is likely to be correlated with variable inputs. One approach is to capture differences in plant qualities through plant-specific intercepts, i.e., to estimate a fixed effects model. For this technique to work, it is necessary that differences in plant quality are more or less fixed; if the "fixed effects" erode over time, such a procedure becomes problematic, especially when working with long panels. In this paper, a standard fixed effects model, extended to allow for serial correlation in the error term, is applied to a 16-year panel of textile plants. This parametric approach strongly accepts the hypothesis of fixed effects. They account for about one-third of the variation in productivity. A simple non-parametric approach, however, concludes that differences in plant qualities erode over time, that is, plant qualities φ-mix. Monte Carlo results demonstrate that this discrepancy comes from the parametric approach imposing an overly restrictive functional form on the data; if there were fixed effects of the magnitude measured, one would reject the hypothesis of φ-mixing. For textiles, at least, the functional form of a fixed effects model appears to generate misleading conclusions. A more flexible functional form is estimated. The "fixed" effects actually have a half-life of approximately 10 to 20 years, and they account for about one-half the variation in productivity.
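    To give a sense of scale for a half-life of 10 to 20 years, the sketch below converts a half-life into the implied year-to-year persistence. It assumes simple geometric (AR(1)-style) decay, which is an illustrative assumption and not necessarily the functional form estimated in the paper; the function names are hypothetical.

```python
import math


def annual_persistence(half_life_years):
    """Yearly retention rate rho such that rho ** half_life_years == 0.5."""
    if half_life_years <= 0:
        raise ValueError("half-life must be positive")
    return 0.5 ** (1.0 / half_life_years)


def half_life(persistence):
    """Years until a plant effect decays to half its size, for 0 < rho < 1."""
    if not 0.0 < persistence < 1.0:
        raise ValueError("persistence must lie strictly between 0 and 1")
    return math.log(0.5) / math.log(persistence)


for years in (10, 20):
    rho = annual_persistence(years)
    print(f"half-life {years} years -> annual persistence {rho:.3f}")
```

    Under this assumption, the reported range corresponds to an annual persistence of roughly 0.93 to 0.97, high enough to look fixed in a short panel yet far from permanent over several decades.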
    The Long-Run Demand for Labor: Estimates From Census Establishment Data

    September 1993

    This paper estimates long-run demand functions for production workers, production worker hours, and nonproduction workers using micro data from U.S. establishment surveys. The paper focuses on estimation of the wage and output elasticities of labor demand using data on over 41,000 U.S. manufacturing plants in 1975 and more than 30,000 plants in 1981. Particular attention is focused on the problems of unobserved producer heterogeneity and measurement errors in output that can affect labor demand estimates based on establishment survey data. The empirical results reveal that OLS estimates of both the own-price elasticity and the output elasticity of labor demand are biased downward as a result of unobserved heterogeneity. Differencing the data as a solution to this problem greatly exaggerates measurement error in the output coefficients. The use of capital stocks as instrumental variables to correct for measurement error in output significantly alters output elasticities in the expected direction but has no systematic effect on own-price elasticities. All of these patterns are found in estimates that pool establishment data across industries and in industry-specific regressions for the vast majority of industries. Estimates of the output elasticity of labor demand indicate that there are slight increasing returns for production workers and production hours, with a pooled data estimate of .92. The estimate for nonproduction workers is .98. The variation in the output elasticities across industries is fairly small. Estimates of the own-price elasticity vary more substantially with the year, type of differencing used, and industry. They average -.50 for production hours, -.41 for production workers, and -.44 for nonproduction workers. The price elasticities vary widely across manufacturing industries: the interquartile range for the industry estimates is approximately .40.
    Estimating Capital Efficiency Schedules Within Production Functions

    May 1992

    Authors: Mark E Doms

    The appropriate method for aggregating capital goods across vintages to produce a single capital stock measure has long been a contentious issue, and the literature covering this topic is quite extensive. This paper presents a methodology that estimates efficiency schedules within a production function, allowing the data to reveal how the efficiency of capital goods evolves as they age. Specifically, we insert a parameterized investment stream into the position of a capital variable in a production function, and then estimate the parameters of the production function simultaneously with the parameters of the investment stream. Plant-level panel data for a select group of steel plants employing a common technology are used to estimate the model. Our primary finding is that when using a simple Cobb-Douglas production function, the estimated efficiency schedules appear to follow a geometric pattern, which is consistent with the estimates of economic depreciation of Hulten and Wykoff (1981). Results from more flexible functional forms were much less precise and less reliable.
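    The idea of a parameterized investment stream standing in for the capital variable can be illustrated with a geometric efficiency schedule, the pattern the abstract reports. The sketch below is an assumption-laden illustration: the function names, the single decay parameter, and the investment figures are all hypothetical, and the paper's actual parameterization may differ.

```python
def geometric_efficiency(age, decay_rate):
    """Relative efficiency of a capital good that is `age` years old."""
    if not 0.0 <= decay_rate < 1.0:
        raise ValueError("decay_rate must be in [0, 1)")
    return (1.0 - decay_rate) ** age


def capital_stock(investments, decay_rate):
    """Efficiency-weighted sum of past investments.

    `investments` is ordered oldest to newest; the newest vintage has age 0.
    """
    newest = len(investments) - 1
    return sum(
        amount * geometric_efficiency(newest - index, decay_rate)
        for index, amount in enumerate(investments)
    )


# Hypothetical plant with three equal yearly investments and 50% decay.
stock = capital_stock([100.0, 100.0, 100.0], decay_rate=0.5)
print(stock)  # 100 * 0.25 + 100 * 0.5 + 100 * 1.0 = 175.0
```

    In an estimation setting, the decay parameter would be chosen jointly with the production function coefficients instead of being fixed in advance.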
    The Dynamics Of Productivity In The Telecommunications Equipment Industry

    February 1992

    Technological change and deregulation have caused a major restructuring of the telecommunications equipment industry over the last two decades. We estimate the parameters of a production function for the equipment industry and then use those estimates to analyze the evolution of plant-level productivity over this period. The restructuring involved significant entry and exit and large changes in the sizes of incumbents. Since firms' choices on whether to liquidate, and on the quantities of inputs demanded should they continue, depend on their productivity, we develop an estimation algorithm that takes into account the relationship between productivity on the one hand, and both input demand and survival on the other. The algorithm is guided by a dynamic equilibrium model that generates the exit and input demand equations needed to correct for the simultaneity and selection problems. A fully parametric estimation algorithm based on these decision rules would be both computationally burdensome and require a host of auxiliary assumptions. So we develop a semiparametric technique which is both consistent with a quite general version of the theoretical framework and easy to use. The algorithm produces markedly different estimates of both production function parameters and of productivity movements than traditional estimation procedures. We find an increase in the rate of industry productivity growth after deregulation. This occurred in spite of the fact that there was no increase in the average of the plants' rates of productivity growth, and there was actually a fall in our index of the efficiency of the allocation of variable factors conditional on the existing distribution of fixed factors.
Deregulation was, however, followed by a reallocation of capital towards more productive establishments (by a downsizing, often shutdown, of unproductive plants and by a disproportionate growth of productive establishments) which more than offset the other factors' negative impacts on aggregate productivity.
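    One way to see how aggregate productivity can rise without a rise in average plant productivity is to split share-weighted productivity into an unweighted mean and a term capturing how shares line up with productivity. The sketch below illustrates that accounting identity; it is an assumption about how such a reallocation effect might be measured, not the specific index used in the paper, and the names and numbers are hypothetical.

```python
def decompose_aggregate_productivity(shares, productivity):
    """Split share-weighted productivity into mean and reallocation terms.

    Returns (aggregate, unweighted_mean, reallocation_term), where
    aggregate == unweighted_mean + reallocation_term when shares sum to 1.
    """
    if len(shares) != len(productivity) or not shares:
        raise ValueError("inputs must be non-empty and of equal length")
    if abs(sum(shares) - 1.0) > 1e-9:
        raise ValueError("shares must sum to 1")
    n = len(shares)
    mean_share = sum(shares) / n
    mean_prod = sum(productivity) / n
    reallocation = sum(
        (s - mean_share) * (p - mean_prod)
        for s, p in zip(shares, productivity)
    )
    aggregate = sum(s * p for s, p in zip(shares, productivity))
    return aggregate, mean_prod, reallocation


# Hypothetical industry: the most productive plant holds the largest share.
aggregate, mean_prod, reallocation = decompose_aggregate_productivity(
    [0.5, 0.3, 0.2], [1.2, 1.0, 0.8]
)
print(aggregate, mean_prod, reallocation)
```

    Holding plant productivities fixed, shifting shares toward the more productive plants raises the reallocation term and therefore the aggregate.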
