International technological diffusion is a key determinant of cross-country differences in economic performance. While patents can be a useful proxy for innovation and technological change and diffusion, fully exploiting patent data for such economic analyses requires patents to be tied to measures of economic activity. In this paper, we describe and explore a new algorithmic approach to constructing concordances between the International Patent Classification (IPC) system that organizes patents by technical features and industry classification systems that organize economic data, such as the Standard International Trade Classification (SITC), the International Standard Industrial Classification (ISIC) and the Harmonized System (HS). This 'Algorithmic Links with Probabilities' (ALP) approach incorporates text analysis software and keyword extraction programs and applies them to a comprehensive patent dataset. We compare the results of several ALP concordances to existing technology concordances. Based on these comparisons, we select a preferred ALP approach and discuss advantages of this approach relative to conventional approaches. We conclude with a discussion on some of the possible applications of the concordance and provide a sample analysis that uses our preferred ALP concordance to analyze international patent flows based on trade patterns.
-
AN 'ALGORITHMIC LINKS WITH PROBABILITIES' CONCORDANCE FOR TRADEMARKS: FOR DISAGGREGATED ANALYSIS OF TRADEMARK & ECONOMIC DATA
September 2013
Working Paper Number:
CES-13-49
Trademarks (TMs) shape the competitive landscape of markets for goods and services in all countries through branding and conveying information and quality inherent in products. Yet, researchers are largely unable to conduct rigorous empirical analysis of TMs in the modern economy because TM data and economic activity data are organized differently and cannot be analyzed jointly at the industry or sectoral level. We propose an 'Algorithmic Links with Probabilities' (ALP) approach to match TM data to economic data and enable these data to speak to each other. Specifically, we construct a NICE Class Level concordance that maps TM data into trade and industry categories forward and backward. This concordance allows researchers to analyze differences in TM usage across both economic and TM sectors. In this paper, we apply this ALP concordance for TMs to characterize patterns in TM applications across countries, industries, income levels and more. We also use the concordance to investigate some of the key determinants of international technology transfer by comparing bilateral TM applications and bilateral patent applications. We conclude with a discussion of possible extensions of this work, including deeper indicator-level concordances and further analyses that are possible once TM data are linked with economic activity data.
-
An 'Algorithmic Links with Probabilities' Crosswalk for USPC and CPC Patent Classifications with an Application Towards Industrial Technology Composition
March 2016
Working Paper Number:
CES-16-15
Patents are a useful proxy for innovation, technological change, and diffusion. However, fully exploiting patent data for economic analyses requires patents be tied to measures of economic activity, which has proven to be difficult. Recently, Lybbert and Zolas (2014) have constructed an International Patent Classification (IPC) to industry classification crosswalk using an 'Algorithmic Links with Probabilities' approach. In this paper, we utilize a similar approach and apply it to new patent classification schemes, the U.S. Patent Classification (USPC) system and Cooperative Patent Classification (CPC) system. The resulting USPC-Industry and CPC-Industry concordances link both U.S. and global patents to multiple vintages of the North American Industrial Classification System (NAICS), International Standard Industrial Classification (ISIC), Harmonized System (HS) and Standard International Trade Classification (SITC). We then use the crosswalk to highlight changes to industrial technology composition over time. We find suggestive evidence of strong persistence in the association between technologies and industries over time.
-
Business Dynamics of Innovating Firms: Linking U.S. Patents with Administrative Data on Workers and Firms
July 2015
Working Paper Number:
CES-15-19
This paper discusses the construction of a new longitudinal database tracking inventors and patent-owning firms over time. We match granted patents between 2000 and 2011 to administrative databases of firms and workers housed at the U.S. Census Bureau. We use inventor information in addition to the patent assignee firm name to improve on previous efforts linking patents to firms. The triangulated database allows us to maximize match rates and provide validation for a large fraction of matches. In this paper, we describe the construction of the database and explore basic features of the data. We find patenting firms, particularly young patenting firms, disproportionately contribute jobs to the U.S. economy. We find patenting is a relatively rare event among small firms but that most patenting firms are nevertheless small, and that patenting is not as rare an event for the youngest firms compared to the oldest firms. While manufacturing firms are more likely to patent than firms in other sectors, we find most patenting firms are in the services and wholesale sectors. These new data are a product of collaboration within the U.S. Department of Commerce, between the U.S. Census Bureau and the U.S. Patent and Trademark Office.
-
Clusters, Convergence, and Economic Performance
October 2010
Working Paper Number:
CES-10-34
This paper evaluates the role of regional cluster composition in the economic performance of industries, clusters and regions. On the one hand, diminishing returns to specialization in a location can result in a convergence effect: the growth rate of an industry within a region may be declining in the level of activity of that industry. At the same time, positive spillovers across complementary economic activities provide an impetus for agglomeration: the growth rate of an industry within a region may be increasing in the size and strength (i.e., relative presence) of related economic sectors. Building on Porter (1998, 2003), we develop a systematic empirical framework to identify the role of regional clusters (groups of closely related and complementary industries operating within a particular region) in regional economic performance. We exploit newly available data from the US Cluster Mapping Project to disentangle the impact of convergence at the region-industry level from agglomeration within clusters. We find that, after controlling for the impact of convergence at the narrowest unit of analysis, there is significant evidence for cluster-driven agglomeration. Industries participating in a strong cluster register higher employment growth as well as higher growth of wages, number of establishments, and patenting. Industry and cluster level growth also increases with the strength of related clusters in the region and with the strength of similar clusters in adjacent regions. Importantly, we find evidence that new industries emerge where there is a strong cluster environment. Our analysis also suggests that the presence of strong clusters in a region enhances growth opportunities in other industries and clusters. Overall, these findings highlight the important role of cluster-based agglomeration in regional economic performance.
-
U.S. Trade in Toxics: The Case of Chlorodifluoromethane (HCFC-22)
September 2009
Working Paper Number:
CES-09-29
This paper explores whether environmental regulation affects where pollution-intensive goods are produced. Here we examine chlorodifluoromethane (HCFC-22), a chemical designated as toxic in 1994 by the U.S. Environmental Protection Agency's Toxics Release Inventory (TRI). Trends show a decline in the number of domestic producers of this chemical, a decline in the number of manufacturing facilities using it, and an increase in the number (and share) of facilities claiming to import it. Transaction-level trade data show an increase in HCFC-22 imports since its TRI listing, an increase that is faster than that of all non-TRI listed chemicals. This is suggestive of a pollution haven effect. Meanwhile, we find that the vast majority of U.S. imports of HCFC-22 come from OECD countries. However, an increase in the share of imports from non-OECD countries since the chemical's listing suggests a shift of production to countries with more lax environmental standards. While the findings here are suggestive of regulatory effects, more rigorous analyses are needed to rule out other possible explanations.
-
Identifying U.S. Merchandise Traders: Integrating Customs Transactions with Business Administrative Data
September 2020
Working Paper Number:
CES-20-28
This paper describes the construction of the Longitudinal Firm Trade Transactions Database (LFTTD) enabling the identification of merchandise traders - exporters and importers - in the U.S. Census Bureau's Business Register (BR). The LFTTD links merchandise export and import transactions from customs declaration forms to the BR beginning in 1992 through the present. We employ a combination of deterministic and probabilistic matching algorithms to assign a unique firm identifier in the BR to a merchandise export or import transaction record. On average, we match 89 percent of export and import values to a firm identifier. In 1992, we match 79 (88) percent of export (import) value; in 2017, we match 92 (96) percent of export (import) value. Trade transactions in year t are matched to years between 1976 and t+1 of the BR. On average, 94 percent of the trade value matches to a firm in year t of the BR. The LFTTD provides the most comprehensive identification of and the foundation for the analysis of goods trading firms in the U.S. economy.
-
Business Dynamics Statistics of High Tech Industries
January 2016
Working Paper Number:
CES-16-55
Modern market economies are characterized by the reallocation of resources from less productive, less valuable activities to more productive, more valuable ones. Businesses in the High Technology sector play a particularly important role in this reallocation by introducing new products and services that impact the entire economy. Tracking the performance of this sector is therefore of primary importance, especially in light of recent evidence that suggests a slowdown in business dynamism in High Tech industries. The Census Bureau produces the Business Dynamics Statistics (BDS), a suite of data products that track job creation, job destruction, startups, and exits by firm and establishment characteristics including sector, firm age, and firm size. In this paper we describe the methodologies used to produce a new extension to the BDS focused on businesses in High Technology industries.
-
Innovation and Appropriability: Revisiting the Role of Intellectual Property
March 2022
Working Paper Number:
CES-22-09
It is more than 25 years since the authors of the Yale and Carnegie surveys studied how firms seek to protect the rents from innovation. In this paper, we revisit that question using a nationally representative sample of firms over the period 2008-2015, with the goal of updating and extending a set of stylized facts that has been influential for our understanding of the economics of innovation. There are five main findings. First, while patenting firms are relatively uncommon in the economy, they account for an overwhelming share of R&D spending. Second, utility patents are considered less important than other forms of IP protection, like trade secrets, trademarks, and copyrights. Third, industry differences explain a great deal of the level of firms' engagement with IP, with high-tech firms on average being more active on all forms of IP. Fourth, we do not find any significant difference in the use of IP strategies across firms at different points of their life cycle. Lastly, unlike age, firms of different size appear to manage IP significantly differently. On average, larger firms tend to engage much more extensively in the protection of IP, and this pattern cannot be easily explained by differences in the type of R&D or innovation produced by a firm. We also discuss the implications of these findings for innovation research and policy.
-
Improving Patent Assignee-Firm Bridge with Web Search Results
August 2022
Working Paper Number:
CES-22-31
This paper constructs a patent assignee-firm longitudinal bridge between U.S. patent assignees and firms using firm-level administrative data from the U.S. Census Bureau. We match granted patents applied for between 1976 and 2016 to the U.S. firms recorded in the Longitudinal Business Database (LBD) at the Census Bureau. Building on existing algorithms in the literature, we first use the assignee name, address (state and city), and year information to link the two datasets. We then introduce a novel search-aided algorithm that significantly improves the matching results by 7% and 2.9% at the patent and the assignee level, respectively. Overall, we are able to match 88.2% and 80.1% of all U.S. patents and assignees, respectively. We contribute to the existing literature by 1) improving the match rates and quality with the web search-aided algorithm, and 2) providing the longest and longitudinally consistent crosswalk between patent assignees and LBD firms.
-
NBER Patent Data-BR Bridge: User Guide and Technical Documentation
October 2010
Working Paper Number:
CES-10-36
This note provides details about the construction of the NBER Patent Data-BR concordance, and is intended for researchers planning to use this concordance. In addition to describing the matching process used to construct the concordance, this note provides a discussion of the benefits and limitations of this concordance.