Patents are a useful proxy for innovation, technological change, and diffusion. However, fully exploiting patent data for economic analyses requires patents be tied to measures of economic activity, which has proven to be difficult. Recently, Lybbert and Zolas (2014) have constructed an International Patent Classification (IPC) to industry classification crosswalk using an 'Algorithmic Links with Probabilities' approach. In this paper, we utilize a similar approach and apply it to new patent classification schemes, the U.S. Patent Classification (USPC) system and Cooperative Patent Classification (CPC) system. The resulting USPC-Industry and CPC-Industry concordances link both U.S. and global patents to multiple vintages of the North American Industrial Classification System (NAICS), International Standard Industrial Classification (ISIC), Harmonized System (HS) and Standard International Trade Classification (SITC). We then use the crosswalk to highlight changes to industrial technology composition over time. We find suggestive evidence of strong persistence in the association between technologies and industries over time.
-
Getting Patents and Economic Data to Speak to Each Other: An 'Algorithmic Links with Probabilities' Approach for Joint Analyses of Patenting and Economic Activity
September 2012
Working Paper Number:
CES-12-16
International technological diffusion is a key determinant of cross-country differences in economic performance. While patents can be a useful proxy for innovation and technological change and diffusion, fully exploiting patent data for such economic analyses requires patents to be tied to measures of economic activity. In this paper, we describe and explore a new algorithmic approach to constructing concordances between the International Patent Classification (IPC) system that organizes patents by technical features and industry classification systems that organize economic data, such as the Standard International Trade Classification (SITC), the International Standard Industrial Classification (ISIC) and the Harmonized System (HS). This 'Algorithmic Links with Probabilities' (ALP) approach incorporates text analysis software and keyword extraction programs and applies them to a comprehensive patent dataset. We compare the results of several ALP concordances to existing technology concordances. Based on these comparisons, we select a preferred ALP approach and discuss advantages of this approach relative to conventional approaches. We conclude with a discussion on some of the possible applications of the concordance and provide a sample analysis that uses our preferred ALP concordance to analyze international patent flows based on trade patterns.
View Full
Paper PDF
-
Business Dynamics of Innovating Firms: Linking U.S. Patents with Administrative Data on Workers and Firms
July 2015
Working Paper Number:
CES-15-19
This paper discusses the construction of a new longitudinal database tracking inventors and patent-owning firms over time. We match granted patents between 2000 and 2011 to administrative databases of firms and workers housed at the U.S. Census Bureau. We use inventor information in addition to the patent assignee firm name to and improve on previous efforts linking patents to firms. The triangulated database allows us to maximize match rates and provide validation for a large fraction of matches. In this paper, we describe the construction of the database and explore basic features of the data. We find patenting firms, particularly young patenting firms, disproportionally contribute jobs to the U.S. economy. We find patenting is a relatively rare event among small firms but that most patenting firms are nevertheless small, and that patenting is not as rare an event for the youngest firms compared to the oldest firms. While manufacturing firms are more likely to patent than firms in other sectors, we find most patenting firms are in the services and wholesale sectors. These new data are a product of collaboration within the U.S. Department of Commerce, between the U.S. Census Bureau and the U.S. Patent and Trademark Office.
View Full
Paper PDF
-
Improving Patent Assignee-Firm Bridge with Web Search Results
August 2022
Working Paper Number:
CES-22-31
This paper constructs a patent assignee-firm longitudinal bridge between U.S. patent assignees and firms using firm-level administrative data from the U.S. Census Bureau. We match granted patents applied between 1976 and 2016 to the U.S. firms recorded in the Longitudinal Business Database (LBD) in the Census Bureau. Building on existing algorithms in the literature, we first use the assignee name, address (state and city), and year information to link the two datasets. We then introduce a novel search-aided algorithm that significantly improves the matching results by 7% and 2.9% at the patent and the assignee level, respectively. Overall, we are able to match 88.2% and 80.1% of all U.S. patents and assignees respectively. We contribute to the existing literature by 1) improving the match rates and quality with the web search-aided algorithm, and 2) providing the longest and longitudinally consistent crosswalk between patent assignees and LBD firms.
View Full
Paper PDF
-
AN 'ALGORITHMIC LINKS WITH PROBABILITIES' CONCORDANCE FOR TRADEMARKS: FOR DISAGGREGATED ANALYSIS OF TRADEMARK & ECONOMIC DATA
September 2013
Working Paper Number:
CES-13-49
Trademarks (TMs) shape the competitive landscape of markets for goods and services in all countries through branding and conveying information and quality inherent in products. Yet, researchers are largely unable to conduct rigorous empirical analysis of TMs in the modern economy because TM data and economic activity data are organized differently and cannot be analyzed jointly at the industry or sectoral level. We propose an 'Algorithmic Links with Probabilities' (ALP) approach to match TM data to economic data and enable these data to speak to each other. Specifically, we construct a NICE Class Level concordance that maps TM data into trade and industry categories forward and backward. This concordance allows researchers to analyze differences in TM usage across both economic and TM sectors. In this paper, we apply this ALP concordance for TMs to characterize patterns in TM applications across countries, industries, income levels and more. We also use the concordance to investigate some of the key determinants of international technology transfer by comparing bilateral TM applications and bilateral patent applications. We conclude with a discussion of possible extensions of this work, including deeper indicator-level concordances and further analyses that are possible once TM data are linked with economic activity data.
View Full
Paper PDF
-
Business Dynamics Statistics of High Tech Industries
January 2016
Working Paper Number:
CES-16-55
Modern market economies are characterized by the reallocation of resources from less productive, less valuable activities to more productive, more valuable ones. Businesses in the High Technology sector play a particularly important role in this reallocation by introducing new products and services that impact the entire economy. Tracking the performance of this sector is therefore of primary importance, especially in light of recent evidence that suggests a slowdown in business dynamism in High Tech industries. The Census Bureau produces the Business Dynamics Statistics (BDS), a suite of data products that track job creation, job destruction, startups, and exits by firm and establishment characteristics including sector, firm age, and firm size. In this paper we describe the methodologies used to produce a new extension to the BDS focused on businesses in High Technology industries.
View Full
Paper PDF
-
Business Dynamic Statistics of Innovative Firms
January 2017
Working Paper Number:
CES-17-72
A key driver of economic growth is the reallocation of resources from low to high productivity activities. Innovation plays an important role in this regard by introducing new products, services, and business methods that ultimately lead to increased productivity and rising living standards. Traditional measures of innovation, particularly those based on aggregate inputs, are increasingly unable to capture the breadth and depth of innovation in modern economies. In this paper, we describe an effort at the
US Census Bureau, the Business Dynamics Statistics of Innovative Firms (BDS-IF) project, which aims to address these challenges by extending the Business Dynamics Statistics data to include new measures of innovative activity. The BDS-IF project will produce measures of firm, establishment, and employment flows by firm age, firm size, and industry for the subset of firms engaged in activities related to innovation. These activities include patenting and trademarking, the employment of STEM workers, and R&D expenditures. The exibility of the underlying data infrastructure allows this measurement agenda to be extended to include copyright activity, management practices, and high growth firms.
View Full
Paper PDF
-
A Portrait of Firms that Invest in R&D
January 2016
Working Paper Number:
CES-16-41
We focus on the evolution and behavior of firms that invest in research and development (R&D). We build upon the cross-sectional analysis in Foster and Grim (2010) that identified the characteristics of top R&D spending firms and follow up by charting the behavior of these firms over time. Our focus is dynamic in nature as we merge micro-level cross-sectional data from the Survey of Industrial Research and Development (SIRD) and the Business Research & Development and Innovation Survey (BRDIS) with the Longitudinal Business Database (LBD). The result is a panel firm-level data set from 1992 to 2011 that tracks firms' performances as they enter and exit the R&D surveys. Using R&D expenditures to proxy R&D performance, we find the top R&D performing firms in the U.S. across all years to be large, old, multinational enterprises. However, we also find that the composition of R&D performing firms is gradually shifting more towards smaller domestic firms with expenditures being less sensitive to scale effects. We find a high degree of persistence for these firms over time. We chart the history of R&D performing firms and compare them to all firms in the economy and find substantial differences in terms of age, size, firm structure and international activity; these differences persist when looking at future firm outcomes.
View Full
Paper PDF
-
Measuring The Trade Balance In Advanced Technology Products
January 1989
Working Paper Number:
CES-89-01
Because of the dramatic decline in the United States Trade Balance since the early 1970's, many economists and policy makers have become increasingly concerned about the ability of U.S. manufacturers to compete with foreign producers. Initially concern was limited to a few basic industries such as shoes, clothing, and steel; but more recently foreign producers have been effectively competing with U.S. manufacturers in automobiles, electronics, and other consumer products. It now seems that foreign producers are even challenging the dominance of America in high technology industries. The most recent publication from the International Trade Administration shows that the U.S. Trade Balance in high technology industries fell from a $24 billion surplus in 1982, to a $2.6 billion deficit in 1986, before rebounding to a $591 million surplus in 1987. As part of the efforts of the U.S. Census Bureau to provide policy makers and other interested parties with the most complete and accurate information possible, we recently completed a review of the methodology and data used to construct trade statistics in the area of high technology trade. Our findings suggest that the statistics presented by the International Trade Administration, although technically correct, do not provide an accurate picture of international trade in high or advanced technology products because of the level of aggregation used in their construction. The ITA statistics are based on the Department of Commerce's DOC3 definition of high technology industries. The DOC3 definition requires that each product classified in a high tech industry be designated high tech. As a result, many products which would not individually be considered high tech are included in the statistics. After developing a disaggregate, product- based measure of international trade in Advanced Technology Products (ATP), we find that although the trade balance in these products did decline over the 1982-1987 period, the decline is much smaller (about $5 billion) than reported by ITA (approximately $24 billion). This paper discusses the methodology used to define the ATP measure, contrasts it to the DOC3 measure, and provides a comparison of the resulting statistics. After discussing alternative approaches to identifying advanced technology products, Section 2 describes the advanced technologies in the classification. (Appendix A, provides definitions and examples of the products which embody these technologies. In addition, Appendix B, available on request, provides a comprehensive list of Advanced Technology Products by technology grouping.) Having described the ATPs, Section 3 examines annual trade statistics for ATP products, in 1982, 1986, and 1987, and compares these statistics with equivalent ones based on the DOC3 measure. The differences between the two measures over the 1982- 87 period stem from changes in the balance of trade of items included in the DOC3 measure but excluded by the Census ATP measure; i.e. the differences are due to changes in the trade balance of "low tech" products which are produced in "high tech" industries. This finding corroborates a principal argument for construction of the ATP measure, that the weakness of the DOC3 measure of high technology trade is the level of aggregation used in its construction. It also suggests that at the level of individual products the high technology sectors of the economy continue to enjoy a strong comparative advantage and are surprisingly healthy. Nonetheless, some areas of weakness are identified, such as low tech products in high tech industries. (Appendix C, supplements this material by providing a detailed listing of traded products included and excluded from the Advanced Technology definition for each DOC3 high tech commodity grouping. These Tables enable the reader to directly assess the Census classification.)
View Full
Paper PDF
-
Characteristics of the Top R&D Performing Firms in the U.S.: Evidence from the Survey of Industrial R&D
September 2010
Working Paper Number:
CES-10-33
Innovation drives economic growth and productivity growth, and as such, indicators of innovative activity such as research and development (R&D) expenditures are of paramount importance. We combine Census confidential microdata from two sources in order to examine the characteristics of the top R&D performing firms in the U.S. economy. We use the Survey of Industrial Research and Development (SIRD) to identify the top 200 R&D performing firms in 2003 and, to the extent possible, to trace the evolution of these firms from 1957 to 2007. The Longitudinal Business Database (LBD) further extends our knowledge about these firms and enables us to make comparisons to the U.S. economy. By linking the SIRD and the LBD we are able to create a detailed portrait of the evolution of the top R&D performing firms in the U.S.
View Full
Paper PDF
-
Occupational Classifications: A Machine Learning Approach
August 2018
Working Paper Number:
CES-18-37
Characterizing the work that people do on their jobs is a longstanding and core issue in labor economics. Traditionally, classification has been done manually. If it were possible to combine new computational tools and administrative wage records to generate an automated crosswalk between job titles and occupations, millions of dollars could be saved in labor costs, data processing could be sped up, data could become more consistent, and it might be possible to generate, without a lag, current information about the changing occupational composition of the labor market. This paper examines the potential to assign occupations to job titles contained in administrative data using automated, machine-learning approaches. We use a new extraordinarily rich and detailed set of data on transactional HR records of large firms (universities) in a relatively narrowly defined industry (public institutions of higher education) to identify the potential for machine-learning approaches to classify occupations.
View Full
Paper PDF