CREAT - Census Bureau

AN 'ALGORITHMIC LINKS WITH PROBABILITIES' CONCORDANCE FOR TRADEMARKS: FOR DISAGGREGATED ANALYSIS OF TRADEMARK & ECONOMIC DATA

September 2013

Written by: Travis J. Lybbert, Nikolas Zolas, Prantik Bhattacharyya

Working Paper Number:

CES-13-49

Abstract

Trademarks (TMs) shape the competitive landscape of markets for goods and services in all countries through branding and conveying information and quality inherent in products. Yet, researchers are largely unable to conduct rigorous empirical analysis of TMs in the modern economy because TM data and economic activity data are organized differently and cannot be analyzed jointly at the industry or sectoral level. We propose an 'Algorithmic Links with Probabilities' (ALP) approach to match TM data to economic data and enable these data to speak to each other. Specifically, we construct a NICE Class Level concordance that maps TM data into trade and industry categories forward and backward. This concordance allows researchers to analyze differences in TM usage across both economic and TM sectors. In this paper, we apply this ALP concordance for TMs to characterize patterns in TM applications across countries, industries, income levels and more. We also use the concordance to investigate some of the key determinants of international technology transfer by comparing bilateral TM applications and bilateral patent applications. We conclude with a discussion of possible extensions of this work, including deeper indicator-level concordances and further analyses that are possible once TM data are linked with economic activity data.

Document Tags and Keywords

Keywords:

economist, manufacturing, industrial, technology, export, good, sector, classification, specialization, patent, sectoral, patenting, trademark, gdp, globalization

Tags:

Annual Survey of Manufactures, Foreign Direct Investment, Organization for Economic Cooperation and Development, Schools Under Registration Review, Insurance Information Institute, World Bank, Patent and Trademark Office, Master Address File, Business Register Bridge, International Standard Industrial Classification, Value Added

Similar Working Papers

The 10 most similar working papers to the working paper 'AN 'ALGORITHMIC LINKS WITH PROBABILITIES' CONCORDANCE FOR TRADEMARKS: FOR DISAGGREGATED ANALYSIS OF TRADEMARK & ECONOMIC DATA' are listed below in order of similarity.

Working Paper
🔥

Getting Patents and Economic Data to Speak to Each Other: An 'Algorithmic Links with Probabilities' Approach for Joint Analyses of Patenting and Economic Activity

September 2012

Authors: Travis J. Lybbert, Nikolas Zolas

Working Paper Number:

CES-12-16

International technological diffusion is a key determinant of cross-country differences in economic performance. While patents can be a useful proxy for innovation and technological change and diffusion, fully exploiting patent data for such economic analyses requires patents to be tied to measures of economic activity. In this paper, we describe and explore a new algorithmic approach to constructing concordances between the International Patent Classification (IPC) system that organizes patents by technical features and industry classification systems that organize economic data, such as the Standard International Trade Classification (SITC), the International Standard Industrial Classification (ISIC) and the Harmonized System (HS). This 'Algorithmic Links with Probabilities' (ALP) approach incorporates text analysis software and keyword extraction programs and applies them to a comprehensive patent dataset. We compare the results of several ALP concordances to existing technology concordances. Based on these comparisons, we select a preferred ALP approach and discuss advantages of this approach relative to conventional approaches. We conclude with a discussion on some of the possible applications of the concordance and provide a sample analysis that uses our preferred ALP concordance to analyze international patent flows based on trade patterns.
View Full Paper PDF
Working Paper

An 'Algorithmic Links with Probabilities' Crosswalk for USPC and CPC Patent Classifications with an Application Towards Industrial Technology Composition

March 2016

Authors: Travis J. Lybbert, Nikolas Zolas, Nathan Goldschlag

Working Paper Number:

CES-16-15

Patents are a useful proxy for innovation, technological change, and diffusion. However, fully exploiting patent data for economic analyses requires patents be tied to measures of economic activity, which has proven to be difficult. Recently, Lybbert and Zolas (2014) have constructed an International Patent Classification (IPC) to industry classification crosswalk using an 'Algorithmic Links with Probabilities' approach. In this paper, we utilize a similar approach and apply it to new patent classification schemes, the U.S. Patent Classification (USPC) system and Cooperative Patent Classification (CPC) system. The resulting USPC-Industry and CPC-Industry concordances link both U.S. and global patents to multiple vintages of the North American Industrial Classification System (NAICS), International Standard Industrial Classification (ISIC), Harmonized System (HS) and Standard International Trade Classification (SITC). We then use the crosswalk to highlight changes to industrial technology composition over time. We find suggestive evidence of strong persistence in the association between technologies and industries over time.
View Full Paper PDF
Working Paper

Are Customs Records Consistent Across Countries? Evidence from the U.S. and Colombia

March 2020

Authors: C.J. Krizan, James Tybout, Zi Wang, Yingyan Zhao

Working Paper Number:

CES-20-11

In many countries, official customs records include identifying information on the exporting and importing firms involved in each shipment. This information allows researchers to study international business networks, offshoring patterns, and the micro-foundations of aggregate trade flows. It also provides the government with a basis for tariff assessments at the border. However, there are no mechanisms in place to ensure that the shipment-level information recorded by the exporting country is consistent with the shipment-level information recorded by the importing country. And to the extent that there are discrepancies, it is not clear how prevalent they are or what form they take. In this paper we explore these issues, both to enhance our understanding of the limitations of customs records, and to inform future discussions of possible revisions in the way they are collected. Specifically, we match U.S.-bound export shipments that appear in Colombian Customs records (DIAN) with their counterparts in the US Customs records (LFTTD): U.S. import shipments from Colombia. Several patterns emerge. First, differences in the coverage of the two countries customs records lead to significant discrepancies in the official bilateral trade flow statistics of these two countries: the DIAN database records 8 percent fewer transactions than the LFTTD database over the sample period, and the average export shipment size in the DIAN is roughly 4 percent smaller than the corresponding import shipment size in the LFTTD. These discrepancies are not due to difference in minimum shipment sizes and they are not particular to a few sectors, though they are more common among small shipments and they evolve over time. Second, if we rely exclusively on firms' names and addresses, ignoring other shipment characteristics (value, product code, etc.), we are able to match 85 percent of the value of U.S. imports from Colombia in our LFTTD sample with particular Colombian suppliers in the DIAN. Further, fully 97 percent of the value of Colombian exports to the U.S. can be mapped onto particular importers in the U.S. LFTTD. Third, however, match rates at the shipment level within buyer-seller pairs are low. That is, while buyers and sellers can be paired up fairly accurately, only 25-30 percent of the individual transactions in the customs records of the two countries can be matched using fuzzy algorithms at reasonable tolerance levels. Fourth, the manufacturer ID (MANUF_ID) that appears in the LFTTD implies there are roughly twice as many Colombian exporters as actually appear in the DIAN. And similar comments apply to an analogous MANUF_ID variable constructed from importer name and address information in the DIAN. Hence studies that treat each MANUF_ID value as a distinct firm are almost surely overstating the number of foreign firms that engage in trade with the U.S. by a substantial amount. Finally, we conclude that if countries were to require that exporters report standardized shipment identifiers'either invoice numbers or bill of lading/air waybill numbers'it would be far easier to track individual transactions and to identify international discrepancies in reporting.
View Full Paper PDF
Working Paper

Firms in International Trade

April 2007

Authors: J. Bradford Jensen, Andrew Bernard, Peter Schott, Stephen Redding

Working Paper Number:

CES-07-14

Standard models of international trade devote little attention to firms. Yet of the 5.5 million firms operating in the United States in 2000, just 4 percent engaged in exporting, and the top 10 percent of these exporting firms accounted for 96 percent of U.S. exports. Since the mid 1990s, a large number of empirical studies have provided a wealth of information about the important role that firms play in mediating countries' imports and exports. This research, based on micro datasets that track countries' production and trade at the firm level, demonstrates that trading firms differ substantially from firms that solely serve the domestic market. Across a wide range of countries and industries, exporters have been shown to be larger, more productive, more skill- and capital-intensive, and to pay higher wages than non-trading firms.2 Furthermore, these differences exist even before exporting begins. The ex ante 'superiority' of exporters suggests self-selection: exporters are more productive, not as a result of exporting, but because only the most productive firms are able to overcome the costs of entering export markets. It is precisely this sort of microeconomic heterogeneity that grants firms the ability to influence macroeconomic outcomes. When trade policy barriers fall or transportation costs decline, high-productivity exporting firms survive and grow while lower-productivity non-exporting firms are more likely to fail. This reallocation of economic activity across firms raises aggregate productivity and provides a new source of welfare gains from trade. Confronting the challenges posed by the analysis of micro data has shifted the focus of the international trade field from countries and industries towards firms and products. We highlight these challenges with a detailed analysis of how trading firms differ from non-trading firms in the United States. We show how these differences serve as the foundation of a series of recent heterogeneous-firm models that offer new insights into the causes and consequences of international trade. We then introduce a new set of stylized facts that emerge from analysis of recently available U.S. customs data. These transaction-level trade data track all of the products imported and exported by the U.S. firms to all of its trading partners from 1992 to 2000. They show that the extensive margins of trade ' that is, the number of products firms trade as well as the number of countries they trade with ' are central to understanding the well-known role of distance in dampening aggregate trade flows. We conclude with suggestions for further theoretical and empirical research.
View Full Paper PDF
Working Paper

An Anatomy of U.S. Firms Seeking Trademark Registration

April 2018

Authors: Emin Dinlersoz, Nikolas Zolas, Nathan Goldschlag, Amanda Myers

Working Paper Number:

CES-18-22

This paper reports on the construction of a new dataset that combines data on trademark applications and registrations from the U.S. Patent and Trademark Office with data on firms from the U.S. Census Bureau. The resulting dataset allows tracking of various activity related to trademark use and protection over the life-cycle of firms, such as the first application for a trademark registration, the first use of a trademark, and the renewal, assignment, and cancellation of trademark registrations. Facts about firm-level trademark activity are documented, including the incidence and timing of trademark registration filings over the firm life-cycle and the connection between firm characteristics and trademark applications. We also explore the relation of trademark application filing to firm employment and revenue growth, and to firm innovative activity as measured by R&D and patents.
View Full Paper PDF
Working Paper

Identifying U.S. Merchandise Traders: Integrating Customs Transactions with Business Administrative Data

September 2020

Authors: Fariha Kamal, Wei Ouyang

Working Paper Number:

CES-20-28

This paper describes the construction of the Longitudinal Firm Trade Transactions Database (LFTTD) enabling the identification of merchandise traders - exporters and importers - in the U.S. Census Bureau's Business Register (BR). The LFTTD links merchandise export and import transactions from customs declaration forms to the BR beginning in 1992 through the present. We employ a combination of deterministic and probabilistic matching algorithms to assign a unique firm identifier in the BR to a merchandise export or import transaction record. On average, we match 89 percent of export and import values to a firm identifier. In 1992, we match 79 (88) percent of export (import) value; in 2017, we match 92 (96) percent of export (import) value. Trade transactions in year t are matched to years between 1976 and t+1 of the BR. On average, 94 percent of the trade value matches to a firm in year t of the BR. The LFTTD provides the most comprehensive identification of and the foundation for the analysis of goods trading firms in the U.S. economy.
View Full Paper PDF
Working Paper

Innovation and Appropriability: Revisiting the Role of Intellectual Property

March 2022

Authors: Timothy Simcoe, Filippo Mezzanotti

Working Paper Number:

CES-22-09

It is more than 25 years since the authors of the Yale and Carnegie surveys studied how firms seek to protect the rents from innovation. In this paper, we revisit that question using a nationally representative sample of firms over the period 2008-2015, with the goal of updating and extending a set of stylized facts that has been influential for our understanding of the economics of innovation. There are five main findings. First, while patenting firms are relatively uncommon in the economy, they account for an overwhelming share of R&D spending. Second, utility patents are considered less important than other forms of IP protection, like trade secrets, trademarks, and copyrights. Third, industry differences explain a great deal of the level of firms' engagement with IP, with high-tech firms on average being more active on all forms of IP. Fourth, we do not find any significant difference in the use of IP strategies across firms at different points of their life cycle. Lastly, unlike age, firms of different size appear to manage IP significantly differently. On average, larger firms tend to engage much more extensively in the protection of IP, and this pattern cannot be easily explained by differences in the type of R&D or innovation produced by a firm. We also discuss the implications of these findings for innovation research and policy.
View Full Paper PDF
Working Paper

A Portrait of Firms that Invest in R&D

January 2016

Authors: Lucia Foster, Cheryl Grim, Nikolas Zolas

Working Paper Number:

CES-16-41

We focus on the evolution and behavior of firms that invest in research and development (R&D). We build upon the cross-sectional analysis in Foster and Grim (2010) that identified the characteristics of top R&D spending firms and follow up by charting the behavior of these firms over time. Our focus is dynamic in nature as we merge micro-level cross-sectional data from the Survey of Industrial Research and Development (SIRD) and the Business Research & Development and Innovation Survey (BRDIS) with the Longitudinal Business Database (LBD). The result is a panel firm-level data set from 1992 to 2011 that tracks firms' performances as they enter and exit the R&D surveys. Using R&D expenditures to proxy R&D performance, we find the top R&D performing firms in the U.S. across all years to be large, old, multinational enterprises. However, we also find that the composition of R&D performing firms is gradually shifting more towards smaller domestic firms with expenditures being less sensitive to scale effects. We find a high degree of persistence for these firms over time. We chart the history of R&D performing firms and compare them to all firms in the economy and find substantial differences in terms of age, size, firm structure and international activity; these differences persist when looking at future firm outcomes.
View Full Paper PDF
Working Paper

INTERNATIONAL PATENTING STRATEGIES WITH HETEROGENEOUS FIRMS

September 2014

Authors: Nikolas Zolas

Working Paper Number:

CES-14-28

This paper analyzes how firms decide where to patent in a heterogeneous firm model of trade with endogenous rival entry. In the model, innovating firms compete with rival firms on price, where rivals force the innovating firm to reduce markups and lower the innovating firm's probability of obtaining monopolistic profits. Patenting allows the innovating firm to reduce the number of rival rms by increasing their fixed overhead costs, thereby providing higher expected profits and increased markups from reduced competition. Countries with higher states of technology, more competition and better patent protection have a greater proportion of entrants who patent. Industries tend to follow a U-shaped pattern of patenting where industries with high heterogeneity in production and low substitution, along with industries with low heterogeneity in production and high substitution patent more frequently. Using a generalized framework of the model, I estimate market-based measures of country-level patent protection, which when compared with other IP indices, suggests that not enough international patenting is taking place. Finally, I test the predictions of the model using a newly available technology-to-industry concordance on bilateral patent flows and show that firms are increasingly sensitive to foreign IP protection. Countries that choose to maximize their IP protection can increase the number of foreign patents by almost 10%.
View Full Paper PDF
Working Paper

Measuring The Trade Balance In Advanced Technology Products

January 1989

Authors: Robert H Mcguckin, Thomas A Abbott Iii, Paul E Herrick, Leroy Norfolk

Working Paper Number:

CES-89-01

Because of the dramatic decline in the United States Trade Balance since the early 1970's, many economists and policy makers have become increasingly concerned about the ability of U.S. manufacturers to compete with foreign producers. Initially concern was limited to a few basic industries such as shoes, clothing, and steel; but more recently foreign producers have been effectively competing with U.S. manufacturers in automobiles, electronics, and other consumer products. It now seems that foreign producers are even challenging the dominance of America in high technology industries. The most recent publication from the International Trade Administration shows that the U.S. Trade Balance in high technology industries fell from a $24 billion surplus in 1982, to a $2.6 billion deficit in 1986, before rebounding to a $591 million surplus in 1987. As part of the efforts of the U.S. Census Bureau to provide policy makers and other interested parties with the most complete and accurate information possible, we recently completed a review of the methodology and data used to construct trade statistics in the area of high technology trade. Our findings suggest that the statistics presented by the International Trade Administration, although technically correct, do not provide an accurate picture of international trade in high or advanced technology products because of the level of aggregation used in their construction. The ITA statistics are based on the Department of Commerce's DOC3 definition of high technology industries. The DOC3 definition requires that each product classified in a high tech industry be designated high tech. As a result, many products which would not individually be considered high tech are included in the statistics. After developing a disaggregate, product- based measure of international trade in Advanced Technology Products (ATP), we find that although the trade balance in these products did decline over the 1982-1987 period, the decline is much smaller (about $5 billion) than reported by ITA (approximately $24 billion). This paper discusses the methodology used to define the ATP measure, contrasts it to the DOC3 measure, and provides a comparison of the resulting statistics. After discussing alternative approaches to identifying advanced technology products, Section 2 describes the advanced technologies in the classification. (Appendix A, provides definitions and examples of the products which embody these technologies. In addition, Appendix B, available on request, provides a comprehensive list of Advanced Technology Products by technology grouping.) Having described the ATPs, Section 3 examines annual trade statistics for ATP products, in 1982, 1986, and 1987, and compares these statistics with equivalent ones based on the DOC3 measure. The differences between the two measures over the 1982- 87 period stem from changes in the balance of trade of items included in the DOC3 measure but excluded by the Census ATP measure; i.e. the differences are due to changes in the trade balance of "low tech" products which are produced in "high tech" industries. This finding corroborates a principal argument for construction of the ATP measure, that the weakness of the DOC3 measure of high technology trade is the level of aggregation used in its construction. It also suggests that at the level of individual products the high technology sectors of the economy continue to enjoy a strong comparative advantage and are surprisingly healthy. Nonetheless, some areas of weakness are identified, such as low tech products in high tech industries. (Appendix C, supplements this material by providing a detailed listing of traded products included and excluded from the Advanced Technology definition for each DOC3 high tech commodity grouping. These Tables enable the reader to directly assess the Census classification.)
View Full Paper PDF

AN 'ALGORITHMIC LINKS WITH PROBABILITIES' CONCORDANCE FOR TRADEMARKS: FOR DISAGGREGATED ANALYSIS OF TRADEMARK & ECONOMIC DATA

September 2013

Working Paper Number:

CES-13-49

Abstract

Document Tags and Keywords

The 10 most similar working papers to the working paper 'AN 'ALGORITHMIC LINKS WITH PROBABILITIES' CONCORDANCE FOR TRADEMARKS: FOR DISAGGREGATED ANALYSIS OF TRADEMARK & ECONOMIC DATA' are listed below in order of similarity.

September 2012

Working Paper Number:

CES-12-16

March 2016

Working Paper Number:

CES-16-15

March 2020

Working Paper Number:

CES-20-11

April 2007

Working Paper Number:

CES-07-14

April 2018

Working Paper Number:

CES-18-22

September 2020

Working Paper Number:

CES-20-28

March 2022

Working Paper Number:

CES-22-09

January 2016

Working Paper Number:

CES-16-41

September 2014

Working Paper Number:

CES-14-28

January 1989

Working Paper Number:

CES-89-01