CREAT - Census Bureau

Resolving the Tension Between Access and Confidentiality: Past Experience and Future Plans at the U.S. Census Bureau

September 2009

Written by: Ron Jarmin, Lucia Foster, Lynn Riggs

Working Paper Number:

CES-09-33

Abstract

This paper provides an historical context for access to U.S. Federal statistical data with a primary focus on the U.S. Census Bureau. We review the various modes used by the Census Bureau to make data available to users, and highlight the costs and benefits associated with each. We highlight some of the specific improvements underway or under consideration at the Census Bureau to better serve its data users, as well as discuss the broad strategies employed by statistical agencies to respond to the challenges of data access.

Document Tags and Keywords

Keywords:

economist, data, statistical, data census, report, census data, survey, agency, respondent, statistician, state, trend, economic census, policymakers, federal, population, citizen, census bureau, use census, census survey, census records, individuals census

Tags:

Internal Revenue Service, Social Security Administration, National Science Foundation, Center for Economic Studies, Securities and Exchange Commission, Bureau of Economic Analysis, Census Bureau Longitudinal Business Database, Foreign Direct Investment, Organization for Economic Cooperation and Development, Statistics Canada, Longitudinal Business Database, Bureau of Labor, Decennial Census, National Research Council, Cornell University, Unemployment Insurance, Research Data Center, American Community Survey, Longitudinal Employer Household Dynamics, Agency for Healthcare Research and Quality, Census Bureau Business Register, Business Register, National Opinion Research Center, National Center for Health Statistics, Public Use Micro Sample, Quarterly Workforce Indicators, Special Sworn Status, European Union, Local Employment Dynamics, Census Bureau Disclosure Review Board, Business Dynamics Statistics, Census Bureau Business Dynamics Statistics, Stanford University, Research and Development

Similar Working Papers

The 10 most similar working papers to the working paper 'Resolving the Tension Between Access and Confidentiality: Past Experience and Future Plans at the U.S. Census Bureau' are listed below in order of similarity.

Working Paper

Access Methods for United States Microdata

August 2007

Authors: John M. Abowd, Daniel Weinberg, Sandra Rowland, Philip Steel, Laura Zayatz

Working Paper Number:

CES-07-25

Beyond the traditional methods of tabulations and public-use microdata samples, statistical agencies have developed four key alternatives for providing non-government researchers with access to confidential microdata to improve statistical modeling. The first, licensing, allows qualified researchers access to confidential microdata at their own facilities, provided certain security requirements are met. The second, statistical data enclaves, offer qualified researchers restricted access to confidential economic and demographic data at specific agency-controlled locations. Third, statistical agencies can offer remote access, through a computer interface, to the confidential data under automated or manual controls. Fourth, synthetic data developed from the original data but retaining the correlations in the original data have the potential for allowing a wide range of analyses.
Working Paper

Disclosure Limitation and Confidentiality Protection in Linked Data

January 2018

Authors: Lars Vilhuber, John M. Abowd, Ian M. Schmutte

Working Paper Number:

CES-18-07

Confidentiality protection for linked administrative data is a combination of access modalities and statistical disclosure limitation. We review traditional statistical disclosure limitation methods and newer methods based on synthetic data, input noise infusion and formal privacy. We discuss how these methods are integrated with access modalities by providing three detailed examples. The first example is the linkages in the Health and Retirement Study to Social Security Administration data. The second example is the linkage of the Survey of Income and Program Participation to administrative data from the Internal Revenue Service and the Social Security Administration. The third example is the Longitudinal Employer-Household Dynamics data, which links state unemployment insurance records for workers and firms to a wide variety of censuses and surveys at the U.S. Census Bureau. For each example, we discuss access modalities, disclosure limitation methods, the effectiveness of those methods, and the resulting analytical validity. The final sections discuss recent advances in access modalities for linked administrative data.
Working Paper

New Approaches to Confidentiality Protection: Synthetic Data, Remote Access and Research Data Centers

June 2004

Authors: Julia I. Lane, John M. Abowd

Working Paper Number:

tp-2004-03

Working Paper

Synthetic Data and Confidentiality Protection

September 2003

Authors: Julia I. Lane, John M. Abowd

Working Paper Number:

tp-2003-10

Working Paper

Expanding the Role of Synthetic Data at the U.S. Census Bureau

February 2014

Authors: Ron Jarmin, Javier Miranda, Thomas A. Louis

Working Paper Number:

CES-14-10

National statistical offices (NSOs) create official statistics from data collected from survey respondents, government administrative records and other sources. The raw source data is usually considered to be confidential. In the case of the U.S. Census Bureau, confidentiality of survey and administrative records microdata is mandated by statute, and this mandate to protect confidentiality is often at odds with the needs of users to extract as much information from the data as possible. Traditional disclosure protection techniques result in official data products that do not fully utilize the information content of the underlying microdata. Typically, these products take the form of simple aggregate tabulations. In a few cases anonymized public-use micro samples are made available, but these face a growing risk of re-identification by the increasing amounts of information about individuals and firms available in the public domain. One approach for overcoming these risks is to release products based on synthetic data where values are simulated from statistical models designed to mimic the (joint) distributions of the underlying microdata. We discuss recent Census Bureau work to develop and deploy such products. We discuss the benefits and challenges involved with extending the scope of synthetic data products in official statistics.
Working Paper

Looking Back on Three Years of Using the Synthetic LBD Beta

February 2014

Authors: Lars Vilhuber, Javier Miranda

Working Paper Number:

CES-14-11

Distributions of business data are typically much more skewed than those for household or individual data and public knowledge of the underlying units is greater. As a result, national statistical offices (NSOs) rarely release establishment or firm-level business microdata due to the risk to respondent confidentiality. One potential approach for overcoming these risks is to release synthetic data where the establishment data are simulated from statistical models designed to mimic the distributions of the real underlying microdata. The US Census Bureau's Center for Economic Studies in collaboration with Duke University, the National Institute of Statistical Sciences, and Cornell University made available a synthetic public use file for the Longitudinal Business Database (LBD) comprising more than 20 million records for all business establishments with paid employees dating back to 1976. The resulting product, dubbed the SynLBD, was released in 2010 and is the first-ever comprehensive business microdata set publicly released in the United States, including data on establishments' employment and payroll, birth and death years, and industrial classification. This paper documents the scope of projects that have requested and used the SynLBD.
Working Paper

An In-Depth Examination of Requirements for Disclosure Risk Assessment

October 2023

Authors: Ron Jarmin, John M. Abowd, Ian M. Schmutte, Jerome P. Reiter, Nathan Goldschlag, Victoria A. Velkoff, Michael B. Hawes, Robert Ashmead, Ryan Cumings-Menon, Sallie Ann Keller, Daniel Kifer, Philip Leclerc, Rolando A. Rodríguez, Pavel Zhuravlev

Working Paper Number:

CES-23-49

The use of formal privacy to protect the confidentiality of responses in the 2020 Decennial Census of Population and Housing has triggered renewed interest and debate over how to measure the disclosure risks and societal benefits of the published data products. Following long-established precedent in economics and statistics, we argue that any proposal for quantifying disclosure risk should be based on pre-specified, objective criteria. Such criteria should be used to compare methodologies to identify those with the most desirable properties. We illustrate this approach, using simple desiderata, to evaluate the absolute disclosure risk framework, the counterfactual framework underlying differential privacy, and prior-to-posterior comparisons. We conclude that satisfying all the desiderata is impossible, but counterfactual comparisons satisfy the most while absolute disclosure risk satisfies the fewest. Furthermore, we explain that many of the criticisms levied against differential privacy would be levied against any technology that is not equivalent to direct, unrestricted access to confidential data. Thus, more research is needed, but in the near-term, the counterfactual approach appears best-suited for privacy-utility analysis.
Working Paper

Integrated Longitudinal Employee-Employer Data for the United States

May 2004

Authors: John Haltiwanger, Julia I. Lane, John M. Abowd

Working Paper Number:

tp-2004-02

Working Paper

Effects of a Government-Academic Partnership: Has the NSF-Census Bureau Research Network Helped Improve the U.S. Statistical System?

January 2017

Authors: Lars Vilhuber, John M. Abowd, Daniel Weinberg, Jerome P. Reiter, Matthew D. Shapiro, Robert F. Belli, Noel Cressie, David C. Folch, Scott H. Holan, Margaret C. Levenstein, Kristen M. Olson, Jolene Smyth, Leen-Kiat Soh, Bruce D. Spencer, Seth E. Spielman, Christopher K. Wikle

Working Paper Number:

CES-17-59R

The National Science Foundation-Census Bureau Research Network (NCRN) was established in 2011 to create interdisciplinary research nodes on methodological questions of interest and significance to the broader research community and to the Federal Statistical System (FSS), particularly the Census Bureau. The activities to date have covered both fundamental and applied statistical research and have focused at least in part on the training of current and future generations of researchers in skills of relevance to surveys and alternative measurement of economic units, households, and persons. This paper discusses some of the key research findings of the eight nodes, organized into six topics: (1) Improving census and survey data collection methods; (2) Using alternative sources of data; (3) Protecting privacy and confidentiality by improving disclosure avoidance; (4) Using spatial and spatio-temporal statistical modeling to improve estimates; (5) Assessing data cost and quality tradeoffs; and (6) Combining information from multiple sources. It also reports on collaborations across nodes and with federal agencies, new software developed, and educational activities and outcomes. The paper concludes with an evaluation of the ability of the FSS to apply the NCRN's research outcomes and suggests some next steps, as well as the implications of this research-network model for future federal government renewal initiatives.
Working Paper

The Annual Survey of Entrepreneurs: An Introduction

November 2015

Authors: Lucia Foster, Patrice Norman

Working Paper Number:

CES-15-40R

The Census Bureau continually seeks to improve its measures of the U.S. economy as part of its mission. In some cases this means expanding or updating the content of its existing surveys, expanding the use of administrative data, and/or exploring the use of privately collected data. When these options cannot provide the needed data, the Census Bureau may consider fielding a new survey to fill the gap. This paper describes one such new survey, the Annual Survey of Entrepreneurs (ASE). Innovations in content, format, and process are designed to provide high-quality, timely, frequent information on the activities of one of the important drivers of economic growth: entrepreneurship. The ASE is collected through a partnership of the Census Bureau with the Kauffman Foundation and the Minority Business Development Agency. The first wave of the ASE collection started in fall of 2015 (for reference period 2014) and results will be released in summer 2016. Qualified researchers on approved projects will be able to access micro data from the ASE through the Federal Statistical Research Data Center (FSRDC) network.
