-
The 2010 Census Confidentiality Protections Failed, Here's How and Why
December 2023
Authors:
Lars Vilhuber,
John M. Abowd,
Ethan Lewis,
Nathan Goldschlag,
Robert Ashmead,
Daniel Kifer,
Philip Leclerc,
Rolando A. Rodr�guez,
Tamara Adams,
David Darais,
Sourya Dey,
Simson L. Garfinkel,
Scott Moore,
Ramy N. Tadros
Working Paper Number:
CES-23-63
Using only 34 published tables, we reconstruct five variables (census block, sex, age, race, and ethnicity) in the confidential 2010 Census person records. Using the 38-bin age variable tabulated at the census block level, at most 20.1% of reconstructed records can differ from their confidential source on even a single value for these five variables. Using only published data, an attacker can verify that all records in 70% of all census blocks (97 million people) are perfectly reconstructed. The tabular publications in Summary File 1 thus have prohibited disclosure risk similar to the unreleased confidential microdata. Reidentification studies confirm that an attacker can, within blocks with perfect reconstruction accuracy, correctly infer the actual census response on race and ethnicity for 3.4 million vulnerable population uniques (persons with nonmodal characteristics) with 95% accuracy, the same precision as the confidential data achieve and far greater than statistical baselines. The flaw in the 2010 Census framework was the assumption that aggregation prevented accurate microdata reconstruction, justifying weaker disclosure limitation methods than were applied to 2010 Census public microdata. The framework used for 2020 Census publications defends against attacks that are based on reconstruction, as we also demonstrate here. Finally, we show that alternatives to the 2020 Census Disclosure Avoidance System with similar accuracy (enhanced swapping) also fail to protect confidentiality, and those that partially defend against reconstruction attacks (incomplete suppression implementations) destroy the primary statutory use case: data for redistricting all legislatures in the country in compliance with the 1965 Voting Rights Act.
View Full
Paper PDF
-
An In-Depth Examination of Requirements for Disclosure Risk Assessment
October 2023
Authors:
Ron Jarmin,
John M. Abowd,
Ian M. Schmutte,
Jerome P. Reiter,
Nathan Goldschlag,
Michael B. Hawes,
Robert Ashmead,
Ryan Cumings-Menon,
Sallie Ann Keller,
Daniel Kifer,
Philip Leclerc,
Rolando A. Rodr�guez,
Victoria A. Velkoff,
Pavel Zhuravlev
Working Paper Number:
CES-23-49
The use of formal privacy to protect the confidentiality of responses in the 2020 Decennial Census of Population and Housing has triggered renewed interest and debate over how to measure the disclosure risks and societal benefits of the published data products. Following long-established precedent in economics and statistics, we argue that any proposal for quantifying disclosure risk should be based on pre-specified, objective criteria. Such criteria should be used to compare methodologies to identify those with the most desirable properties. We illustrate this approach, using simple desiderata, to evaluate the absolute disclosure risk framework, the counterfactual framework underlying differential privacy, and prior-to-posterior comparisons. We conclude that satisfying all the desiderata is impossible, but counterfactual comparisons satisfy the most while absolute disclosure risk satisfies the fewest. Furthermore, we explain that many of the criticisms levied against differential privacy would be levied against any technology that is not equivalent to direct, unrestricted access to confidential data. Thus, more research is needed, but in the near-term, the counterfactual approach appears best-suited for privacy-utility analysis.
View Full
Paper PDF
-
Advanced Technologies Adoption and Use by U.S. Firms: Evidence from the Annual Business Survey
December 2020
Working Paper Number:
CES-20-40
We introduce a new survey module intended to complement and expand research on the causes and consequences of advanced technology adoption. The 2018 Annual Business Survey (ABS), conducted by the Census Bureau in partnership with the National Center for Science and Engineering Statistics (NCSES), provides comprehensive and timely information on the diffusion among U.S. firms of advanced technologies including artificial intelligence (AI), cloud computing, robotics, and the digitization of business information. The 2018 ABS is a large, nationally representative sample of over 850,000 firms covering all private, nonfarm sectors of the economy. We describe the motivation for and development of the technology module in the ABS, as well as provide a first look at technology adoption and use patterns across firms and sectors. We find that digitization is quite widespread, as is some use of cloud computing. In contrast, advanced technology adoption is rare and generally skewed towards larger and older firms. Adoption patterns are consistent with a hierarchy of increasing technological sophistication, in which most firms that adopt AI or other advanced business technologies also use the other, more widely diffused technologies. Finally, while few firms are at the technology frontier, they tend to be large so technology exposure of the average worker is significantly higher. This new data will be available to qualified researchers on approved projects in the Federal Statistical Research Data Center network.
View Full
Paper PDF
-
Identifying U.S. Merchandise Traders: Integrating Customs Transactions with Business Administrative Data
September 2020
Working Paper Number:
CES-20-28
This paper describes the construction of the Longitudinal Firm Trade Transactions Database (LFTTD) enabling the identification of merchandise traders - exporters and importers - in the U.S. Census Bureau's Business Register (BR). The LFTTD links merchandise export and import transactions from customs declaration forms to the BR beginning in 1992 through the present. We employ a combination of deterministic and probabilistic matching algorithms to assign a unique firm identifier in the BR to a merchandise export or import transaction record. On average, we match 89 percent of export and import values to a firm identifier. In 1992, we match 79 (88) percent of export (import) value; in 2017, we match 92 (96) percent of export (import) value. Trade transactions in year t are matched to years between 1976 and t+1 of the BR. On average, 94 percent of the trade value matches to a firm in year t of the BR. The LFTTD provides the most comprehensive identification of and the foundation for the analysis of goods trading firms in the U.S. economy.
View Full
Paper PDF
-
Disclosure Limitation and Confidentiality Protection in Linked Data
January 2018
Working Paper Number:
CES-18-07
Confidentiality protection for linked administrative data is a combination of access modalities and statistical disclosure limitation. We review traditional statistical disclosure limitation methods and newer methods based on synthetic data, input noise infusion and formal privacy. We discuss how these methods are integrated with access modalities by providing three detailed examples. The first example is the linkages in the Health and Retirement Study to Social Security Administration data. The second example is the linkage of the Survey of Income and Program Participation to administrative data from the Internal Revenue Service and the Social Security Administration. The third example is the Longitudinal Employer-Household Dynamics data, which links state unemployment insurance records for workers and firms to a wide variety of censuses and surveys at the U.S. Census Bureau. For examples, we discuss access modalities, disclosure limitation methods, the effectiveness of those methods, and the resulting analytical validity. The final sections discuss recent advances in access modalities for linked administrative data.
View Full
Paper PDF
-
Are firm-level idiosyncratic shocks important for U.S. aggregate volatility?
January 2017
Working Paper Number:
CES-17-23
This paper quantitatively assesses whether firm-specific shocks can drive the U.S. business cycle. Firm-specific shocks to the largest firms can directly contribute to aggregate fluctuations whenever the firm size distribution is fat-tailed giving rise to the granular hypothesis. I use a novel, comprehensive data set compiled from administrative sources that contains the universe of firms and trade transactions, and find that the granular hypothesis accounts at most for 16 percent of the variation in aggregate sales growth. This is about half of that found by previous studies that imposed Gibrat's law where all firms are equally volatile regardless of their size. Using the full distribution of growth rates among U.S. firms, I find robust evidence of a negative relationship between firm-level volatility and size, i.e. the size-variance relationship. The largest firms (whose shocks drive granularity) are the least volatile under the size-variance relationship, thus their influence on aggregates is mitigated. I show that by taking this relationship into account the effect of firm-specific shocks on observed macroeconomic volatility is substantially reduced. I then investigate several plausible mechanisms that could explain the negative sizevariance relationship. After empirically ruling out some of them, I suggest a 'market power' channel in which large firms face smaller price elasticities and therefore respond less to a givensized productivity shock than small firms do. I provide direct evidence for this mechanism by estimating demand elasticities among U.S. manufactures. Lastly, I construct an analytically tractable framework that is consistent with several empirical regularities related to firm size.
View Full
Paper PDF
-
Accounting for the New Gains from Trade Liberalization
March 2016
Working Paper Number:
CES-16-14
We measure the "new" gains from trade reaped by Canada as a result of the Canada-US Free Trade Agreement (CUSFTA). We think of the "new" gains from trade of a country as all welfare effects pertaining to changes in the set of firms serving that country as emphasized in the so-called "new" trade literature. To this end, we first develop an exact decomposition of the gains from trade which separates "traditional" and "new" gains. We then apply this decomposition using Canadian and US micro data and find that the "new" welfare effects of CUSFTA on Canada were negative.
View Full
Paper PDF
-
Are firm-level idiosyncratic shocks important for U.S. aggregate volatility?
January 2016
Working Paper Number:
CES-16-47
This paper assesses the quantitative impact of firm-level idiosyncratic shocks on aggregate volatility in the U.S. economy and provides a microfoundation for the negative relationship between firm-level volatility and size. I argue that the role of firm-specific shocks through the granular channel plays a fairly limited role in the U.S. economy. Using a novel, comprehensive data set compiled from several sources of the U.S. Census Bureau, I find that the granular com-ponent accounts at most for 15.5% of the variation in aggregate sales growth which is about half found by previous studies. To bridge the gap between previous findings and mine, I show that my quantitative results require deviations from Gibrat's law in which firm-level volatility and size are negatively related. I find that firm-level volatility declines at a substantially higher rate in size than previously found. Hence, the largest firms in the economy cannot be driving a sub-stantial fraction of macroeconomic volatility. I show that the explanatory power of granularity gets cut by at least half whenever the size-variance relationship, as estimated in the micro-level data, is taken into account. To uncover the economic mechanism behind this phenomenon, I construct an analytically tractable framework featuring random growth and a Kimball aggrega-tor. Under this setup, larger firms respond less to productivity shocks as the elasticity of demand is decreasing in size. Additionally, the model predicts a positive (negative) relationship between firm-level mark-ups (growth) and size. I confirm the predictions of the model by estimating size-varying price elasticities on unique product-level data from the Census of Manufactures (CM) and structurally estimating mark-ups using plant-level information from the Annual Survey of Manufactures (ASM).
View Full
Paper PDF
-
The Management and Organizational Practices Survey (MOPS): An Overview*
January 2016
Working Paper Number:
CES-16-28
Understanding productivity and business dynamics requires measuring production outputs and inputs. Through its surveys and use of administrative data, the Census Bureau collects information on production outputs and inputs including labor, capital, energy, and materials. With the introduction of the Management and Organizational Practices Survey (MOPS), the Census Bureau added information on another component of production: management. It has long been hypothesized that management is an important component of firm success, but until recently the study of management was confined to hypotheses, anecdotes, and case studies. Building upon the work of Bloom and Van Reenen (2007), the first-ever large scale survey of management practices in the United States, the MOPS, was conducted by the Census Bureau for 2010. A second, enhanced version of the MOPS is being conducted for 2015. The enhancement includes two new topics related to management: data and decision making (DDD) and uncertainty. As information technology has expanded plants are increasingly able to utilize data in their decision making. Structured management practices have been found to be complementary to DDD in earlier studies. Uncertainty has policy implications because uncertainty is found to be associated with reduced investment and employment. Uncertainty also plays a role in the targeting component of management.
View Full
Paper PDF
-
Plant Exit and U.S. Imports from Low-Wage Countries
January 2016
Working Paper Number:
CES-16-02
Over the past twenty years, imports to the U.S. from low-wage countries have increased dramatically. In this paper we examine how low-wage country import competition in the U.S. influences the probability of manufacturing establishment closure. Confidential data from the U.S. Bureau of the Census are used to track all manufacturing establishments between 1992 and 2007. These data are linked to measures of import competition built from individual trade transactions. Controlling for a variety of plant and firm covariates, we show that low-wage import competition has played a significant role in manufacturing plant exit. Analysis employs fixed effects panel models running across three periods: the first plant-level panels examining trade and exit for the U.S. economy. Our results appear robust to concerns regarding endogeneity.
View Full
Paper PDF