This paper details the construction of a firm-year panel dataset combining the NBER Patent Dataset with the Industry R&D Survey conducted by the Census Bureau and National Science Foundation. The developed platform offers an unprecedented view of the R&D-to-patenting innovation process and a close analysis of the strengths and limitations of the Industry R&D Survey. The files are linked through a name-matching algorithm customized to link the firm names to which patents are assigned with the firm names in the Census Bureau's SSEL business registry. Through the Census Bureau's file structure, this R&D platform can be linked to the operating performance of each firm's establishments, further facilitating innovation-to-productivity studies.
-
NBER Patent Data-BR Bridge: User Guide and Technical Documentation
October 2010
Working Paper Number:
CES-10-36
This note provides details about the construction of the NBER Patent Data-BR concordance, and is intended for researchers planning to use this concordance. In addition to describing the matching process used to construct the concordance, this note provides a discussion of the benefits and limitations of this concordance.
-
Improving Patent Assignee-Firm Bridge with Web Search Results
August 2022
Working Paper Number:
CES-22-31
This paper constructs a longitudinal bridge between U.S. patent assignees and firms using firm-level administrative data from the U.S. Census Bureau. We match granted patents applied for between 1976 and 2016 to the U.S. firms recorded in the Census Bureau's Longitudinal Business Database (LBD). Building on existing algorithms in the literature, we first use the assignee name, address (state and city), and year information to link the two datasets. We then introduce a novel search-aided algorithm that improves the matching results by 7% and 2.9% at the patent and the assignee level, respectively. Overall, we are able to match 88.2% of all U.S. patents and 80.1% of all assignees. We contribute to the existing literature by 1) improving the match rates and quality with the web search-aided algorithm, and 2) providing the longest longitudinally consistent crosswalk between patent assignees and LBD firms.
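The first linking step described above (assignee name, state, city, and year) can be illustrated with a minimal Python sketch. This is not the paper's algorithm: the suffix list, the record fields (`assignee_id`, `firm_id`, `name`, `state`, `city`, `year`), and the exact-key matching rule are all assumptions made for illustration, and the web search-aided step is not covered.

```python
import re
from collections import defaultdict

# Hypothetical list of legal-form suffixes to strip; not taken from the paper.
SUFFIXES = {"inc", "corp", "corporation", "co", "company", "llc", "ltd", "lp"}


def normalize_name(name):
    """Lowercase, drop punctuation, and remove common legal-form suffixes."""
    tokens = re.sub(r"[^a-z0-9 ]", " ", name.lower()).split()
    return " ".join(t for t in tokens if t not in SUFFIXES)


def match_assignees(assignees, firms):
    """Map each assignee id to the firm ids sharing its name, state, city, and year."""
    index = defaultdict(list)
    for firm in firms:
        key = (normalize_name(firm["name"]), firm["state"],
               firm["city"].lower(), firm["year"])
        index[key].append(firm["firm_id"])

    matches = {}
    for a in assignees:
        key = (normalize_name(a["name"]), a["state"],
               a["city"].lower(), a["year"])
        # .get avoids adding empty entries to the index for unmatched keys.
        matches[a["assignee_id"]] = index.get(key, [])
    return matches
```

An assignee that maps to an empty list would be a candidate for a later, looser pass; one that maps to several firm ids would need disambiguation.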
-
Documenting the Business Register and Related Economic Business Data
March 2016
Working Paper Number:
CES-16-17
The Business Register (BR) is a comprehensive database of business establishments in the United States and provides resources for the U.S. Census Bureau's economic programs for sample selection, research, and survey operations. It is maintained using information from several federal agencies including the Census Bureau, Internal Revenue Service, Bureau of Labor Statistics, and the Social Security Administration. This paper provides a detailed description of the sources and functions of the BR. An overview of the BR as a linking tool and bridge to other Census Bureau data for additional business characteristics is also given.
-
Business Dynamics of Innovating Firms: Linking U.S. Patents with Administrative Data on Workers and Firms
July 2015
Working Paper Number:
CES-15-19
This paper discusses the construction of a new longitudinal database tracking inventors and patent-owning firms over time. We match granted patents between 2000 and 2011 to administrative databases of firms and workers housed at the U.S. Census Bureau. We use inventor information in addition to the patent assignee firm name to improve on previous efforts linking patents to firms. The triangulated database allows us to maximize match rates and provide validation for a large fraction of matches. In this paper, we describe the construction of the database and explore basic features of the data. We find patenting firms, particularly young patenting firms, disproportionately contribute jobs to the U.S. economy. We find patenting is a relatively rare event among small firms but that most patenting firms are nevertheless small, and that patenting is not as rare an event for the youngest firms compared to the oldest firms. While manufacturing firms are more likely to patent than firms in other sectors, we find most patenting firms are in the services and wholesale sectors. These new data are a product of collaboration within the U.S. Department of Commerce, between the U.S. Census Bureau and the U.S. Patent and Trademark Office.
-
Characteristics of the Top R&D Performing Firms in the U.S.: Evidence from the Survey of Industrial R&D
September 2010
Working Paper Number:
CES-10-33
Innovation drives economic growth and productivity growth, and as such, indicators of innovative activity such as research and development (R&D) expenditures are of paramount importance. We combine Census confidential microdata from two sources in order to examine the characteristics of the top R&D performing firms in the U.S. economy. We use the Survey of Industrial Research and Development (SIRD) to identify the top 200 R&D performing firms in 2003 and, to the extent possible, to trace the evolution of these firms from 1957 to 2007. The Longitudinal Business Database (LBD) further extends our knowledge about these firms and enables us to make comparisons to the U.S. economy. By linking the SIRD and the LBD we are able to create a detailed portrait of the evolution of the top R&D performing firms in the U.S.
-
Newly Recovered Microdata on U.S. Manufacturing Plants from the 1950s and 1960s: Some Early Glimpses
September 2011
Working Paper Number:
CES-11-29
Longitudinally-linked microdata on U.S. manufacturing plants are currently available to researchers for 1963, 1967, and 1972-2009. In this paper, we provide a first look at recently recovered manufacturing microdata files from the 1950s and 1960s. We describe their origins and background, discuss their contents, and begin to explore their sample coverage. We also begin to examine whether the available establishment identifier(s) allow record linking. Our preliminary analyses suggest that longitudinally-linked Annual Survey of Manufactures microdata from the mid-1950s through the present (containing 16 years of additional data) appear possible, though challenging. While a great deal of work remains, we see tremendous value in extending the manufacturing microdata series back in time. With these data, new lines of research become possible and many others can be revisited.
-
Growth Through Heterogeneous Innovations
June 2012
Working Paper Number:
CES-12-08
We study how exploration versus exploitation innovations impact economic growth through a tractable endogenous growth framework that contains multiple innovation sizes, multiproduct firms, and entry/exit. Firms invest in exploration R&D to acquire new product lines and exploitation R&D to improve their existing product lines. We model and show empirically that exploration R&D does not scale as strongly with firm size as exploitation R&D. The resulting framework conforms to many regularities regarding innovation and growth differences across the firm size distribution. We also incorporate patent citations into our theoretical framework. The framework generates a simple test using patent citations that indicates that entrants and small firms have relatively higher growth spillover effects.
-
An 'Algorithmic Links with Probabilities' Crosswalk for USPC and CPC Patent Classifications with an Application Towards Industrial Technology Composition
March 2016
Working Paper Number:
CES-16-15
Patents are a useful proxy for innovation, technological change, and diffusion. However, fully exploiting patent data for economic analyses requires patents be tied to measures of economic activity, which has proven to be difficult. Recently, Lybbert and Zolas (2014) have constructed an International Patent Classification (IPC) to industry classification crosswalk using an 'Algorithmic Links with Probabilities' approach. In this paper, we utilize a similar approach and apply it to new patent classification schemes, the U.S. Patent Classification (USPC) system and Cooperative Patent Classification (CPC) system. The resulting USPC-Industry and CPC-Industry concordances link both U.S. and global patents to multiple vintages of the North American Industrial Classification System (NAICS), International Standard Industrial Classification (ISIC), Harmonized System (HS) and Standard International Trade Classification (SITC). We then use the crosswalk to highlight changes to industrial technology composition over time. We find suggestive evidence of strong persistence in the association between technologies and industries over time.
-
A Guide To R&D Data At The Center For Economic Studies U.S. Bureau Of The Census
August 1994
Working Paper Number:
CES-94-09
The National Science Foundation R&D Survey is an annual survey of firms' research and development expenditures. The survey covers 3000 firms reporting positive R&D. This paper provides a description of the R&D data available at the Center for Economic Studies (CES). The most basic data series available contains the original survey R&D data. It covers the years 1972-92. The remaining two series, although derived from the original files, specialize in particular items. The Mandatory Series contains required survey items for the years 1973-88. Items reported at firms' discretion are in the Voluntary Series, which covers the years 1974-89. Both of the derived series incorporate flags that track quality of the data. Both also include corrections to the data based on original hard copy survey evidence stored at CES. In addition to describing each dataset, we offer suggestions to researchers wishing to use the R&D data in exploring various economic issues. We report selected response rates, discuss the survey design, and provide hints on how to use the data.
-
Identifying U.S. Merchandise Traders: Integrating Customs Transactions with Business Administrative Data
September 2020
Working Paper Number:
CES-20-28
This paper describes the construction of the Longitudinal Firm Trade Transactions Database (LFTTD) enabling the identification of merchandise traders - exporters and importers - in the U.S. Census Bureau's Business Register (BR). The LFTTD links merchandise export and import transactions from customs declaration forms to the BR beginning in 1992 through the present. We employ a combination of deterministic and probabilistic matching algorithms to assign a unique firm identifier in the BR to a merchandise export or import transaction record. On average, we match 89 percent of export and import values to a firm identifier. In 1992, we match 79 (88) percent of export (import) value; in 2017, we match 92 (96) percent of export (import) value. Trade transactions in year t are matched to years between 1976 and t+1 of the BR. On average, 94 percent of the trade value matches to a firm in year t of the BR. The LFTTD provides the most comprehensive identification of and the foundation for the analysis of goods trading firms in the U.S. economy.
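The idea of combining a deterministic pass with a looser fallback pass can be sketched in Python as follows. This is an illustration only, not the LFTTD procedure: the `ein` and `name` fields, the 0.9 threshold, and the use of `difflib.SequenceMatcher` string similarity in place of a true probabilistic record-linkage model are all assumptions.

```python
from difflib import SequenceMatcher


def link_transaction(txn, register, threshold=0.9):
    """Return (firm_id, method) for a transaction, or (None, 'unmatched')."""
    # Deterministic pass: exact match on a shared identifier
    # (the 'ein' field is hypothetical).
    for firm in register:
        if txn.get("ein") and txn["ein"] == firm.get("ein"):
            return firm["firm_id"], "deterministic"

    # Fallback pass: keep the firm whose name is most similar to the
    # transaction's name, accepting it only above the threshold.
    best_id, best_score = None, 0.0
    for firm in register:
        score = SequenceMatcher(
            None, txn["name"].lower(), firm["name"].lower()
        ).ratio()
        if score > best_score:
            best_id, best_score = firm["firm_id"], score

    if best_score >= threshold:
        return best_id, "fuzzy"
    return None, "unmatched"
```

Returning the method alongside the identifier makes it possible to report match rates separately for each pass, in the spirit of the value-weighted match rates quoted above.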