Papers Containing Tag(s): 'Census Bureau Disclosure Review Board'
The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
See Working Papers by Tag(s), Keyword(s), Author(s), or Search Text
Frequently Occurring Concepts within this Search
John Voorheis - 21
Lucia Foster - 20
John Haltiwanger - 16
John M. Abowd - 14
Nathan Goldschlag - 12
Viewing papers 281 through 290 of 292
-
Working Paper: Disclosure Limitation and Confidentiality Protection in Linked Data
January 2018
Working Paper Number:
CES-18-07
Confidentiality protection for linked administrative data is a combination of access modalities and statistical disclosure limitation. We review traditional statistical disclosure limitation methods and newer methods based on synthetic data, input noise infusion and formal privacy. We discuss how these methods are integrated with access modalities by providing three detailed examples. The first example is the linkages in the Health and Retirement Study to Social Security Administration data. The second example is the linkage of the Survey of Income and Program Participation to administrative data from the Internal Revenue Service and the Social Security Administration. The third example is the Longitudinal Employer-Household Dynamics data, which links state unemployment insurance records for workers and firms to a wide variety of censuses and surveys at the U.S. Census Bureau. For each example, we discuss access modalities, disclosure limitation methods, the effectiveness of those methods, and the resulting analytical validity. The final sections discuss recent advances in access modalities for linked administrative data.
-
Working Paper: Upstream, Downstream: Diffusion and Impacts of the Universal Product Code
January 2017
Working Paper Number:
CES-17-66R
We study the adoption, diffusion, and impacts of the Universal Product Code (UPC) between 1975 and 1992, during the initial years of the barcode system. We find evidence of network effects in the diffusion process. Matched-sample difference-in-difference estimates show that firm size and trademark registrations increase following UPC adoption by manufacturers. Industry-level import penetration also increases with domestic UPC adoption. Our findings suggest that barcodes, scanning, and related technologies helped stimulate variety-enhancing product innovation and encourage the growth of international retail supply chains.
-
Working Paper: Measuring Cross-Country Differences in Misallocation
January 2016
Working Paper Number:
CES-16-50R
We describe differences between the commonly used version of the U.S. Census of Manufactures available at the RDCs and what establishments themselves report. The originally reported data has substantially more dispersion in measured establishment productivity. Measured allocative efficiency is substantially higher in the cleaned data than the raw data: 4x higher in 2002, 20x in 2007, and 80x in 2012. Many of the important editing strategies at the Census, including industry analysts' manual edits and edits using tax records, are infeasible in non-U.S. datasets. We describe a new Bayesian approach for editing and imputation that can be used across contexts.
-
Working Paper: IMPROVING THE SYNTHETIC LONGITUDINAL BUSINESS DATABASE
February 2014
Working Paper Number:
CES-14-12
In most countries, national statistical agencies do not release establishment-level business microdata, because doing so represents too large a risk to establishments' confidentiality. Agencies potentially can manage these risks by releasing synthetic microdata, i.e., individual establishment records simulated from statistical models designed to mimic the joint distribution of the underlying observed data. Previously, we used this approach to generate a public-use version (now available for public use) of the U.S. Census Bureau's Longitudinal Business Database (LBD), a longitudinal census of establishments dating back to 1976. While the synthetic LBD has proven to be a useful product, we now seek to improve and expand it by using new synthesis models and adding features. This article describes our efforts to create the second generation of the SynLBD, including synthesis procedures that we believe could be replicated in other contexts.
-
Working Paper: LOOKING BACK ON THREE YEARS OF USING THE SYNTHETIC LBD BETA
February 2014
Working Paper Number:
CES-14-11
Distributions of business data are typically much more skewed than those for household or individual data, and public knowledge of the underlying units is greater. As a result, national statistical offices (NSOs) rarely release establishment- or firm-level business microdata due to the risk to respondent confidentiality. One potential approach for overcoming these risks is to release synthetic data where the establishment data are simulated from statistical models designed to mimic the distributions of the real underlying microdata. The US Census Bureau's Center for Economic Studies, in collaboration with Duke University, the National Institute of Statistical Sciences, and Cornell University, made available a synthetic public use file for the Longitudinal Business Database (LBD) comprising more than 20 million records for all business establishments with paid employees dating back to 1976. The resulting product, dubbed the SynLBD, was released in 2010 and is the first-ever comprehensive business microdata set publicly released in the United States, including data on establishments' employment and payroll, birth and death years, and industrial classification. This paper documents the scope of projects that have requested and used the SynLBD.
-
Working Paper: WHY IMMIGRANTS LEAVE NEW DESTINATIONS AND WHERE DO THEY GO?
June 2013
Working Paper Number:
CES-13-32
Immigrants have a markedly higher likelihood of migrating internally if they live in new destinations. This paper looks at why that pattern occurs and at how immigrants' out-migration to new versus traditional destinations responds to their labor market's economic and industrial structure, nativity origins and concentration, geographic region, and 1995 labor market type. Confidential data from the 1990 and 2000 decennial censuses are used for the analysis. Metropolitan and non-metropolitan areas are categorized into 741 local labor markets and classified as new or traditional based on their nativity concentrations of immigrants from the largest Asian, Caribbean and Latin American origins. The analysis showed that immigrants were less likely to migrate to new destinations if they lived in areas of higher nativity concentration, foreign-born population growth, and wages, but more likely to make that move if they were professionals, agricultural or blue-collar workers, highly educated, fluent in English, and lived in other new destinations. While most immigrants are more likely to migrate to new rather than traditional destinations, that outcome differs sharply for immigrants from different origins, and for some immigrants, particularly those from the Caribbean, the dispersal process to new destinations has barely started.
-
Working Paper: Dynamically Consistent Noise Infusion and Partially Synthetic Data as Confidentiality Protection Measures for Related Time Series
July 2012
Working Paper Number:
CES-12-13
The Census Bureau's Quarterly Workforce Indicators (QWI) provide detailed quarterly statistics on employment measures such as worker and job flows, tabulated by worker characteristics in various combinations. The data are released for several levels of NAICS industries and geography, the lowest aggregation of the latter being counties. Disclosure avoidance methods are required to protect the information about individuals and businesses that contribute to the underlying data. The QWI disclosure avoidance mechanism we describe here relies heavily on the use of noise infusion through a permanent multiplicative noise distortion factor, used for magnitudes, counts, differences and ratios. There is minimal suppression and no complementary suppressions. To our knowledge, the release in 2003 of the QWI was the first large-scale use of noise infusion in any official statistical product. We show that the released statistics are analytically valid along several critical dimensions: measures are unbiased and time series properties are preserved. We provide an analysis of the degree to which confidentiality is protected. Furthermore, we show how the judicious use of synthetic data, injected into the tabulation process, can completely eliminate suppressions, maintain analytical validity, and increase the protection of the underlying confidential data.
-
Working Paper: Resolving the Tension Between Access and Confidentiality: Past Experience and Future Plans at the U.S. Census Bureau
September 2009
Working Paper Number:
CES-09-33
This paper provides an historical context for access to U.S. Federal statistical data with a primary focus on the U.S. Census Bureau. We review the various modes used by the Census Bureau to make data available to users, and highlight the costs and benefits associated with each. We highlight some of the specific improvements underway or under consideration at the Census Bureau to better serve its data users, as well as discuss the broad strategies employed by statistical agencies to respond to the challenges of data access.
-
Working Paper: Discretionary Disclosure in Financial Reporting: An Examination Comparing Internal Firm Data to Externally Reported Segment Data
September 2009
Working Paper Number:
CES-09-28
We use confidential, U.S. Census Bureau, plant-level data to investigate aggregation in external reporting. We compare firms' plant-level data to their published segment reports, conducting our tests by grouping a firm's plants that share the same four-digit SIC code into a 'pseudo-segment.' We then determine whether that pseudo-segment is disclosed as an external segment, or whether it is subsumed into a different business unit for external reporting purposes. We find pseudo-segments are more likely to be aggregated within a line-of-business segment when the agency and proprietary costs of separately reporting the pseudo-segment are higher and when firm and pseudo-segment characteristics allow for more discretion in the application of segment reporting rules. For firms reporting multiple external segments, aggregation of pseudo-segments is driven by both agency and proprietary costs. However, for firms reporting a single external segment, we find no evidence of an agency cost motive for aggregation.
-
Working Paper: Consistent Cell Means for Topcoded Incomes in the Public Use March CPS (1976-2007)
March 2008
Working Paper Number:
CES-08-06
Using the internal March CPS, we create and in this paper distribute to the larger research community a cell mean series that provides the mean of all income values above the topcode for any income source of any individual in the public use March CPS that has been topcoded since 1976. We also describe our construction of this series. When we use this series together with the public use March CPS, we closely match the yearly mean income levels and income inequalities of the U.S. population found using the internal March CPS data.