-
The Annual Survey of Entrepreneurs: An Introduction
November 2015
Working Paper Number:
CES-15-40R
The Census Bureau continually seeks to improve its measures of the U.S. economy as part of its mission. In some cases this means expanding or updating the content of its existing surveys, expanding the use of administrative data, and/or exploring the use of privately collected data. When these options cannot provide the needed data, the Census Bureau may consider fielding a new survey to fill the gap. This paper describes one such new survey, the Annual Survey of Entrepreneurs (ASE). Innovations in content, format, and process are designed to provide high-quality, timely, frequent information on the activities of one of the important drivers of economic growth: entrepreneurship. The ASE is collected through a partnership of the Census Bureau with the Kauffman Foundation and the Minority Business Development Agency. The first wave of the ASE collection started in fall of 2015 (for reference period 2014) and results will be released in summer 2016. Qualified researchers on approved projects will be able to access micro data from the ASE through the Federal Statistical Research Data Center (FSRDC) network.
View Full
Paper PDF
-
The Promise and Potential of Linked Employer-Employee Data for Entrepreneurship Research
September 2015
Working Paper Number:
CES-15-29
In this paper, we highlight the potential for linked employer-employee data to be used in entrepreneurship research, describing new data on business start-ups, their founders and early employees, and providing examples of how they can be used in entrepreneurship research. Linked employer-employee data provides a unique perspective on new business creation by combining information on the business, workforce, and individual. By combining data on both workers and firms, linked data can investigate many questions that owner-level or firm-level data cannot easily answer alone - such as composition of the workforce at start-ups and their role in explaining business dynamics, the flow of workers across new and established firms, and the employment paths of the business owners themselves.
View Full
Paper PDF
-
Business Dynamics of Innovating Firms: Linking U.S. Patents with Administrative Data on Workers and Firms
July 2015
Working Paper Number:
CES-15-19
This paper discusses the construction of a new longitudinal database tracking inventors and patent-owning firms over time. We match granted patents between 2000 and 2011 to administrative databases of firms and workers housed at the U.S. Census Bureau. We use inventor information in addition to the patent assignee firm name to and improve on previous efforts linking patents to firms. The triangulated database allows us to maximize match rates and provide validation for a large fraction of matches. In this paper, we describe the construction of the database and explore basic features of the data. We find patenting firms, particularly young patenting firms, disproportionally contribute jobs to the U.S. economy. We find patenting is a relatively rare event among small firms but that most patenting firms are nevertheless small, and that patenting is not as rare an event for the youngest firms compared to the oldest firms. While manufacturing firms are more likely to patent than firms in other sectors, we find most patenting firms are in the services and wholesale sectors. These new data are a product of collaboration within the U.S. Department of Commerce, between the U.S. Census Bureau and the U.S. Patent and Trademark Office.
View Full
Paper PDF
-
Identifying Foreign Suppliers in U.S. Merchandise Import Transactions
April 2015
Working Paper Number:
CES-15-11
The availability of international trade transactions data capturing individual relationships between buyers and suppliers permits the answering of numerous new questions governing the economic activity of traders. In this paper, we explore the reliability of two-sided firm trade transactions data sourced from the United States by comparing the number of foreign suppliers from U.S. merchandise import transaction data to origin-country data. We find that the statistic derived from the origin-country data, on average, tends to be 20 percent lower than using the raw U.S. data. Guided by this finding, we propose and implement a set of methods that are capable of aligning the counts more closely from these two different data sources. Overall, our analysis presents broad support for the use of U.S. merchandise import transactions data to study buyer-supplier relationships in international trade.
View Full
Paper PDF
-
Customer-Employee Substitution: Evidence from Gasoline Stations*
January 2015
Working Paper Number:
CES-15-45R
We document the adoption of self-service pumps in U.S. gasoline stations from 1977 to 1992. Using establishment-level data from the Census of Retail Trade over this period, we show that self-service stations employ approximately one quarter fewer attendants per pump, all else equal. The work done by these attendants has shifted to customers, biasing upwards conventional measures of productivity growth.
View Full
Paper PDF
-
A FIRST STEP TOWARDS A GERMAN SYNLBD: CONSTRUCTING A GERMAN LONGITUDINAL BUSINESS DATABASE
February 2014
Working Paper Number:
CES-14-13
One major criticism against the use of synthetic data has been that the efforts necessary to generate useful synthetic data are so in- tense that many statistical agencies cannot afford them. We argue many lessons in this evolving field have been learned in the early years of synthetic data generation, and can be used in the development of new synthetic data products, considerably reducing the required in- vestments. The final goal of the project described in this paper will be to evaluate whether synthetic data algorithms developed in the U.S. to generate a synthetic version of the Longitudinal Business Database (LBD) can easily be transferred to generate a similar data product for other countries. We construct a German data product with infor- mation comparable to the LBD - the German Longitudinal Business Database (GLBD) - that is generated from different administrative sources at the Institute for Employment Research, Germany. In a fu- ture step, the algorithms developed for the synthesis of the LBD will be applied to the GLBD. Extensive evaluations will illustrate whether the algorithms provide useful synthetic data without further adjustment. The ultimate goal of the project is to provide access to multiple synthetic datasets similar to the SynLBD at Cornell to enable comparative studies between countries. The Synthetic GLBD is a first step towards that goal.
View Full
Paper PDF
-
LOOKING BACK ON THREE YEARS OF USING THE SYNTHETIC LBD BETA
February 2014
Working Paper Number:
CES-14-11
Distributions of business data are typically much more skewed than those for household or individual data and public knowledge of the underlying units is greater. As a results, national statistical offices (NSOs) rarely release establishment or firm-level business microdata due to the risk to respondent confidentiality. One potential approach for overcoming these risks is to release synthetic data where the establishment data are simulated from statistical models designed to mimic the distributions of the real underlying microdata. The US Census Bureau's Center for Economic Studies in collaboration with Duke University, the National Institute of Statistical Sciences, and Cornell University made available a synthetic public use file for the Longitudinal Business Database (LBD) comprising more than 20 million records for all business establishment with paid employees dating back to 1976. The resulting product, dubbed the SynLBD, was released in 2010 and is the first-ever comprehensive business microdata set publicly released in the United States including data on establishments employment and payroll, birth and death years, and industrial classification. This pa- per documents the scope of projects that have requested and used the SynLBD.
View Full
Paper PDF
-
The Dynamics of House Price Capitalization and Locational Sorting: Evidence from Air Quality Changes
September 2012
Working Paper Number:
CES-12-22
Despite extensive use of housing data to reveal valuation of non-market goods, the process of house price capitalization remains vague. Using the restricted access American Housing Survey, a high-frequency panel of prices, turnover, and occupant characteristics, this paper examines the time path of capitalization and preference-based sorting in response to air quality changes caused by differential regulatory pressure from the 1990 Clean Air Act Amendments. The results demonstrate that owner-occupied units capitalize changes immediately, whereas rent capitalization lags. The delayed but sharp rent capitalization temporally coincides with evidence of sorting, suggesting a strong link between location choices and price dynamics.
View Full
Paper PDF
-
Dynamically Consistent Noise Infusion and Partially Synthetic Data as Confidentiality Protection Measures for Related Time Series
July 2012
Working Paper Number:
CES-12-13
The Census Bureau's Quarterly Workforce Indicators (QWI) provide detailed quarterly statistics on employment measures such as worker and job flows, tabulated by worker characteristics in various combinations. The data are released for several levels of NAICS industries and geography, the lowest aggregation of the latter being counties. Disclosure avoidance methods are required to protect the information about individuals and businesses that contribute to the underlying data. The QWI disclosure avoidance mechanism we describe here relies heavily on the use of noise infusion through a permanent multiplicative noise distortion factor, used for magnitudes, counts, differences and ratios. There is minimal suppression and no complementary suppressions. To our knowledge, the release in 2003 of the QWI was the first large-scale use of noise infusion in any official statistical product. We show that the released statistics are analytically valid along several critical dimensions { measures are unbiased and time series properties are preserved. We provide an analysis of the degree to which confidentiality is protected. Furthermore, we show how the judicious use of synthetic data, injected into the tabulation process, can completely eliminate suppressions, maintain analytical validity, and increase the protection of the underlying confidential data.
View Full
Paper PDF
-
University Innovation, Local Economic Growth, and Entrepreneurship
June 2012
Working Paper Number:
CES-12-10
Universities, often situated at the center of innovative clusters, are believed to be important drivers of local economic growth. This paper identifies the extent to which U.S. universities stimulate nearby economic activity using the interaction of a national shock to the spread of innovation from universities - the Bayh-Dole Act of 1980 - with pre-determined variation both within a university in academic strengths and across universities in federal research funding. Using longitudinal establishment-level data from the Census, I find that longrun employment and payroll per worker around universities rise particularly rapidly after Bayh-Dole in industries more closely related to local university innovative strengths. The impact of
university innovation increases with geographic proximity to the university. Counties surrounding universities that received more pre-Bayh-Dole federal funding - particularly from the Department of Defense and the National Institutes of Health - experienced faster employment growth after the law. Entering establishments - in particular multi-unit firm expansions - over the period from 1977 to 1997 were especially important in generating long-run employment growth, while incumbents experienced modest declines, consistent with creative destruction. Suggestive of their complementarities with universities, large establishments contributed more substantially to the total 20-year growth effect than did small establishments.
View Full
Paper PDF