This paper presents an analysis of the dynamics of total factor productivity measures for large plants in SICs 35, 36, and 38. Several TFP measures, derived from production functions and Solow type residuals, are computed and their behavior over time is compared, using various non-parametric tools. Aggregate TFP, which has grown substantially over the time period, is compared with average plant level TFP, which has declined or remained flat. Using transition matrices, the persistence of plant productivity is examined, and it is shown how the transition probabilities vary by industry, plant age, and other characteristics.
-
A FIRST STEP TOWARDS A GERMAN SYNLBD: CONSTRUCTING A GERMAN LONGITUDINAL BUSINESS DATABASE
February 2014
Working Paper Number:
CES-14-13
One major criticism against the use of synthetic data has been that the efforts necessary to generate useful synthetic data are so in- tense that many statistical agencies cannot afford them. We argue many lessons in this evolving field have been learned in the early years of synthetic data generation, and can be used in the development of new synthetic data products, considerably reducing the required in- vestments. The final goal of the project described in this paper will be to evaluate whether synthetic data algorithms developed in the U.S. to generate a synthetic version of the Longitudinal Business Database (LBD) can easily be transferred to generate a similar data product for other countries. We construct a German data product with infor- mation comparable to the LBD - the German Longitudinal Business Database (GLBD) - that is generated from different administrative sources at the Institute for Employment Research, Germany. In a fu- ture step, the algorithms developed for the synthesis of the LBD will be applied to the GLBD. Extensive evaluations will illustrate whether the algorithms provide useful synthetic data without further adjustment. The ultimate goal of the project is to provide access to multiple synthetic datasets similar to the SynLBD at Cornell to enable comparative studies between countries. The Synthetic GLBD is a first step towards that goal.
View Full
Paper PDF
-
Compositional Nature of Firm Growth and Aggregate Fluctuations
March 2020
Working Paper Number:
CES-20-09
This paper studies firm dynamics over the business cycle. I present evidence from the United Kingdom that more rapidly growing firms are born in expansions than in recessions. Using administrative records from Census data, I find that this observation also holds for the last four recessions in the United States. I also present suggestive evidence that financial frictions play an important role in determining the types of firms that are born at different stages of the business cycle. I then develop a general equilibrium model in which firms choose their managers' span of control at birth. Firms that choose larger spans of control grow faster and eventually get to be larger, and in this sense have a larger target size. Financial frictions in the form of collateral constraints slow the rate at which firms reach their target size. It takes firms longer to get up to scale when collateral constraints tighten; therefore, businesses with the largest target size are affected disproportionately more. Thus, fewer entrepreneurs find it profitable to choose larger projects when financial conditions deteriorate. Using Bayesian methods, I estimate the model using micro and aggregate data from the United Kingdom. I find that financial shocks account for over 80% of fluctuations in the formation of businesses with a large target size, and TFP and labor wedge shocks account for the remaining 20%. An independently estimated version of the model with no choice over the span of control needs larger aggregate shocks in order to account for the same data series, suggesting that the intensive margin of business formation is important at business cycle frequencies. The model with the choice over the span of control generates an empirically relevant and non-targeted collapse in the right tail of the cumulative growth distribution among firms started in recessions, while the model without such a choice does not. The paper also discusses implications for micro-targeted government stimulus policies.
View Full
Paper PDF
-
Validating Abstract Representations of Spatial Population Data while considering Disclosure Avoidance
February 2020
Working Paper Number:
CES-20-05
This paper furthers a research agenda for modeling populations along spatial networks and expands upon an empirical analysis to a full U.S. county (Gaboardi, 2019, Ch. 1,2). Specific foci are the necessity of, and methods for, validating and benchmarking spatial data when conducting social science research with aggregated and ambiguous population representations. In order to promote the validation of publicly-available data, access to highly-restricted census microdata was requested, and granted, in order to determine the levels of accuracy and error associated with a network-based population modeling framework. Primary findings reinforce the utility of a novel network allocation method'populated polygons to networks (pp2n) in terms of accuracy, computational complexity, and real runtime (Gaboardi, 2019, Ch. 2). Also, a pseudo-benchmark dataset's performance against the true census microdata shows promise in modeling populations along networks.
View Full
Paper PDF
-
An Economic Analysis of Privacy Protection and Statistical Accuracy as Social Choices
August 2018
Working Paper Number:
CES-18-35
Statistical agencies face a dual mandate to publish accurate statistics while protecting respondent privacy. Increasing privacy protection requires decreased accuracy. Recognizing this as a resource allocation problem, we propose an economic solution: operate where the marginal cost of increasing privacy equals the marginal benefit. Our model of production, from computer science, assumes data are published using an efficient differentially private algorithm. Optimal choice weighs the demand for accurate statistics against the demand for privacy. Examples from U.S. statistical programs show how our framework can guide decision-making. Further progress requires a better understanding of willingness-to-pay for privacy and statistical accuracy.
View Full
Paper PDF
-
The Emergence of Wage Discrimination in U.S. Manufacturing
June 2011
Working Paper Number:
CES-11-18
This paper examines the hypothesis that wage discrimination emerged at the beginning of the twentieth century. I test for wage discrimination by estimating the female-male productivity ratio from samples of manufacturing firms in the northeast, and then comparing the estimated productivity ratio to the wage ratio. I find that women did not face wage discrimination in manufacturing during the nineteenth century. In 1900 there was wage discrimination against women in white-collar jobs, but not in blue-collar jobs. Wage discrimination persisted, and in 2002 the female-male wage ratio was less than the productivity ratio.
View Full
Paper PDF
-
Why Some Firms Export
June 2001
Working Paper Number:
CES-01-05
This paper presents a dynamic model of the export decision by a profit-maximizing firm. Using a panelofU.S.manufacturing plants, we test for the role of plant characteristics, spillovers from neighboring exporters, entry costs and government export promotion expenditures. Entry and exit in the export market by U.S. plants is substantial, past exporters are apt to reenter, and plants are likely to export in consecutive years. However, we find that entry costs are significant and spillovers from the export activity of other plants negligible. State export promotion expenditures have no significant effect on the probability of exporting. Plant characteristics, especially those indicative of past success, strongly increase the probability of exporting as do favorable exchange rate shocks.
View Full
Paper PDF
-
An In-Depth Examination of Requirements for Disclosure Risk Assessment
October 2023
Authors:
Ron Jarmin,
John M. Abowd,
Ian M. Schmutte,
Jerome P. Reiter,
Nathan Goldschlag,
Victoria A. Velkoff,
Michael B. Hawes,
Robert Ashmead,
Ryan Cumings-Menon,
Sallie Ann Keller,
Daniel Kifer,
Philip Leclerc,
Rolando A. RodrÃguez,
Pavel Zhuravlev
Working Paper Number:
CES-23-49
The use of formal privacy to protect the confidentiality of responses in the 2020 Decennial Census of Population and Housing has triggered renewed interest and debate over how to measure the disclosure risks and societal benefits of the published data products. Following long-established precedent in economics and statistics, we argue that any proposal for quantifying disclosure risk should be based on pre-specified, objective criteria. Such criteria should be used to compare methodologies to identify those with the most desirable properties. We illustrate this approach, using simple desiderata, to evaluate the absolute disclosure risk framework, the counterfactual framework underlying differential privacy, and prior-to-posterior comparisons. We conclude that satisfying all the desiderata is impossible, but counterfactual comparisons satisfy the most while absolute disclosure risk satisfies the fewest. Furthermore, we explain that many of the criticisms levied against differential privacy would be levied against any technology that is not equivalent to direct, unrestricted access to confidential data. Thus, more research is needed, but in the near-term, the counterfactual approach appears best-suited for privacy-utility analysis.
View Full
Paper PDF
-
LOOKING BACK ON THREE YEARS OF USING THE SYNTHETIC LBD BETA
February 2014
Working Paper Number:
CES-14-11
Distributions of business data are typically much more skewed than those for household or individual data and public knowledge of the underlying units is greater. As a results, national statistical offices (NSOs) rarely release establishment or firm-level business microdata due to the risk to respondent confidentiality. One potential approach for overcoming these risks is to release synthetic data where the establishment data are simulated from statistical models designed to mimic the distributions of the real underlying microdata. The US Census Bureau's Center for Economic Studies in collaboration with Duke University, the National Institute of Statistical Sciences, and Cornell University made available a synthetic public use file for the Longitudinal Business Database (LBD) comprising more than 20 million records for all business establishment with paid employees dating back to 1976. The resulting product, dubbed the SynLBD, was released in 2010 and is the first-ever comprehensive business microdata set publicly released in the United States including data on establishments employment and payroll, birth and death years, and industrial classification. This pa- per documents the scope of projects that have requested and used the SynLBD.
View Full
Paper PDF
-
Estimating Measurement Error in SIPP Annual Job Earnings: A Comparison of Census Survey and SSA Administrative Data
September 2002
Working Paper Number:
tp-2002-24
The third chapter investigates measurement error in SIPP annual job
earnings data linked to SSA administrative earnings data. The multiple
earnings measures provided by the survey and administrative data enable
the identification of components of true variation and variation due to
measurement error. We find that 18% of the variation in SIPP annual job
earnings can be attributed to measurement error. We also find that in
both the SIPP and the DER, measurement error is persistent over time.
A lower level of auto-correlation in the SIPP measurement error than in
the economic error component leads to a lower reliability ratio of .62 for
first-differenced earnings.
View Full
Paper PDF
-
Production Function and Wage Equation Estimation with Heterogenous Labor: Evidence from a New Matched Employer-Employee Dataset
April 2004
Working Paper Number:
CES-04-05
In this paper, we first describe the 1990 DEED, the most recently constructed matched employeremployee data set for the United States that contains detailed demographic information on workers (most notably, information on education). We then use the data from manufacturing establishments in the 1990 DEED to update and expand on previous findings, using a more limited data set, regarding the measurement of the labor input and theories of wage determination (Hellerstein, et al., 1999). We find that the productivity of women is less than that of men, but not by enough to fully explain the gap in wages, a result that is consistent with wage discrimination against women. In contrast, we find no evidence of wage discrimination against blacks. We estimate that both the wage and productivity profiles are rising but concave to the origin (consistent with profiles quadratic in age), but the estimated relative wage profile is steeper than the relative productivity profile, consistent with models of deferred wages. We find a productivity premium for marriage equal to that of the wage premium, and a productivity premium for education that somewhat exceeds the wage premium. Exploring the sensitivity of these results, we also find that different specifications of production functions do not have any qualitative effects on the these results. Finally, the results indicate that the returns to productive inputs (capital, materials, labor quality) as well as the residual variance are virtually unaffected by the choice of the construction of the labor quality input.
View Full
Paper PDF