The Industry R&D Survey: Patent Database Link Project
November 2006
Working Paper Number:
CES-06-28
Abstract
Document Tags and Keywords
Keywords
Keywords are automatically generated using KeyBERT, a powerful and innovative
keyword extraction tool that utilizes BERT embeddings to ensure high-quality and contextually relevant
keywords.
By analyzing the content of working papers, KeyBERT identifies terms and phrases that capture the essence of the
text, highlighting the most significant topics and trends. This approach not only enhances searchability but
provides connections that go beyond potentially domain-specific author-defined keywords.
:
manufacturing,
industrial,
company,
invention,
manufacturer,
corporation,
corporate,
venture,
incorporated,
innovation,
inventory,
patent,
patenting,
trademark,
developed,
innovative,
innovation productivity,
corp,
firm patenting
Tags
Tags are automatically generated using a pretrained language model from spaCy, which excels at
several tasks, including entity tagging.
The model is able to label words and phrases by part-of-speech,
including "organizations." By filtering for frequent words and phrases labeled as "organizations", papers are
identified to contain references to specific institutions, datasets, and other organizations.
:
Metropolitan Statistical Area,
Standard Statistical Establishment List,
Standard Industrial Classification,
Service Annual Survey,
Longitudinal Research Database,
National Science Foundation,
Current Population Survey,
Longitudinal Business Database,
Chicago Census Research Data Center,
Patent and Trademark Office,
Business Register,
Limited Liability Company
Similar Working Papers
Similarity between working papers are determined by an unsupervised neural
network model
know as Doc2Vec.
Doc2Vec is a model that represents entire documents as fixed-length vectors, allowing for the
capture of semantic meaning in a way that relates to the context of words within the document. The model learns to
associate a unique vector with each document while simultaneously learning word vectors, enabling tasks such as
document classification, clustering, and similarity detection by preserving the order and structure of words. The
document vectors are compared using cosine similarity/distance to determine the most similar working papers.
Papers identified with 🔥 are in the top 20% of similarity.
The 10 most similar working papers to the working paper 'The Industry R&D Survey: Patent Database Link Project' are listed below in order of similarity.
-
Working PaperNBER Patent Data-BR Bridge: User Guide and Technical Documentation
October 2010
Working Paper Number:
CES-10-36
This note provides details about the construction of the NBER Patent Data-BR concordance, and is intended for researchers planning to use this concordance. In addition to describing the matching process used to construct the concordance, this note provides a discussion of the benefits and limitations of this concordance.View Full Paper PDF
-
Working PaperCharacteristics of the Top R&D Performing Firms in the U.S.: Evidence from the Survey of Industrial R&D
September 2010
Working Paper Number:
CES-10-33
Innovation drives economic growth and productivity growth, and as such, indicators of innovative activity such as research and development (R&D) expenditures are of paramount importance. We combine Census confidential microdata from two sources in order to examine the characteristics of the top R&D performing firms in the U.S. economy. We use the Survey of Industrial Research and Development (SIRD) to identify the top 200 R&D performing firms in 2003 and, to the extent possible, to trace the evolution of these firms from 1957 to 2007. The Longitudinal Business Database (LBD) further extends our knowledge about these firms and enables us to make comparisons to the U.S. economy. By linking the SIRD and the LBD we are able to create a detailed portrait of the evolution of the top R&D performing firms in the U.S.View Full Paper PDF
-
Working PaperDocumenting the Business Register and Related Economic Business Data
March 2016
Working Paper Number:
CES-16-17
The Business Register (BR) is a comprehensive database of business establishments in the United States and provides resources for the U.S. Census Bureau's economic programs for sample selection, research, and survey operations. It is maintained using information from several federal agencies including the Census Bureau, Internal Revenue Service, Bureau of Labor Statistics, and the Social Security Administration. This paper provides a detailed description of the sources and functions of the BR. An overview of the BR as a linking tool and bridge to other Census Bureau data for additional business characteristics is also given.View Full Paper PDF
-
Working PaperImproving Patent Assignee-Firm Bridge with Web Search Results
August 2022
Working Paper Number:
CES-22-31
This paper constructs a patent assignee-firm longitudinal bridge between U.S. patent assignees and firms using firm-level administrative data from the U.S. Census Bureau. We match granted patents applied between 1976 and 2016 to the U.S. firms recorded in the Longitudinal Business Database (LBD) in the Census Bureau. Building on existing algorithms in the literature, we first use the assignee name, address (state and city), and year information to link the two datasets. We then introduce a novel search-aided algorithm that significantly improves the matching results by 7% and 2.9% at the patent and the assignee level, respectively. Overall, we are able to match 88.2% and 80.1% of all U.S. patents and assignees respectively. We contribute to the existing literature by 1) improving the match rates and quality with the web search-aided algorithm, and 2) providing the longest and longitudinally consistent crosswalk between patent assignees and LBD firms.View Full Paper PDF
-
Working PaperNewly Recovered Microdata on U.S. Manufacturing Plants from the 1950s and 1960s: Some Early Glimpses
September 2011
Working Paper Number:
CES-11-29
Longitudinally-linked microdata on U.S. manufacturing plants are currently available to researchers for 1963, 1967, and 1972-2009. In this paper, we provide a first look at recently recovered manufacturing microdata files from the 1950s and 1960s. We describe their origins and background, discuss their contents, and begin to explore their sample coverage. We also begin to examine whether the available establishment identifier(s) allow record linking. Our preliminary analyses suggest that longitudinally-linked Annual Survey of Manufactures microdata from the mid-1950s through the present ' containing 16 years of additional data ' appears possible though challenging. While a great deal of work remains, we see tremendous value in extending the manufacturing microdata series back into time. With these data, new lines of research become possible and many others can be revisited.View Full Paper PDF
-
Working PaperUSING LINKED CENSUS R&D-LRD DATA TO ANALYZE THE EFFECT OF R&D INVESTMENT ON TOTAL FACTOR PRODUCTIVITY GROWTH
January 1989
Working Paper Number:
CES-89-02
Previous studies have demonstrated that productivity growth is positively correlated with the intensity of R&D investment. However, existing studies of this relationship at the micro (firm or line of business) level have been subject to some important limitations. The most serious of these has been an inability to adequately control for the diversified activities of corporations. This study makes use of linked Census R&D - LRD data, which provides comprehensive information on each firms' operations at the 4-digit SIC level. A marked improvement in explaining the association between R&D and TFP occurs when we make appropriate use of the data by firm by industry. Significant relationships between the intensities of investment in total, basic, and company-funded R&D, and TFP growth are confirmed.View Full Paper PDF
-
Working PaperAllocation of Company Research and Development Expenditures to Industries Using a Tobit Model
November 2015
Working Paper Number:
CES-15-42
This paper uses Census microdata and a regression-based approach to assign multi-division firms' pre-2008 Research and Development (R&D) expenditures to more than one industry. Since multi-division firms conduct R&D in more than one industry, assigning R&D to corresponding industries provides a more accurate representation of where R&D actually takes place and provides a consistent time-series with the National Science Foundation R&D by line of business information. Firm R&D is allocated to industries on the basis of observed industry payroll, as befits the historic importance of payroll in Census assignments of firms to industry. The results demonstrate that the method of assigning R&D to industries on the basis of payroll works well in earlier years, but becomes less effective over time as firms outsource their manufacturing function.View Full Paper PDF
-
Working PaperBusiness Dynamics of Innovating Firms: Linking U.S. Patents with Administrative Data on Workers and Firms
July 2015
Working Paper Number:
CES-15-19
This paper discusses the construction of a new longitudinal database tracking inventors and patent-owning firms over time. We match granted patents between 2000 and 2011 to administrative databases of firms and workers housed at the U.S. Census Bureau. We use inventor information in addition to the patent assignee firm name to and improve on previous efforts linking patents to firms. The triangulated database allows us to maximize match rates and provide validation for a large fraction of matches. In this paper, we describe the construction of the database and explore basic features of the data. We find patenting firms, particularly young patenting firms, disproportionally contribute jobs to the U.S. economy. We find patenting is a relatively rare event among small firms but that most patenting firms are nevertheless small, and that patenting is not as rare an event for the youngest firms compared to the oldest firms. While manufacturing firms are more likely to patent than firms in other sectors, we find most patenting firms are in the services and wholesale sectors. These new data are a product of collaboration within the U.S. Department of Commerce, between the U.S. Census Bureau and the U.S. Patent and Trademark Office.View Full Paper PDF
-
Working PaperGrowth Through Heterogeneous Innovations
June 2012
Working Paper Number:
CES-12-08
We study how exploration versus exploitation innovations impact economic growth through a tractable endogenous growth framework that contains multiple innovation sizes, multiproduct firms, and entry/exit. Firms invest in exploration R&D to acquire new product lines and exploitation R&D to improve their existing product lines. We model and show empirically that exploration R&D does not scale as strongly with firm size as exploitation R&D. The resulting framework conforms to many regularities regarding innovation and growth differences across the firm size distribution. We also incorporate patent citations into our theoretical framework. The framework generates a simple test using patent citations that indicates that entrants and small firms have relatively higher growth spillover effects.View Full Paper PDF
-
Working PaperA Portrait of Firms that Invest in R&D
January 2016
Working Paper Number:
CES-16-41
We focus on the evolution and behavior of firms that invest in research and development (R&D). We build upon the cross-sectional analysis in Foster and Grim (2010) that identified the characteristics of top R&D spending firms and follow up by charting the behavior of these firms over time. Our focus is dynamic in nature as we merge micro-level cross-sectional data from the Survey of Industrial Research and Development (SIRD) and the Business Research & Development and Innovation Survey (BRDIS) with the Longitudinal Business Database (LBD). The result is a panel firm-level data set from 1992 to 2011 that tracks firms' performances as they enter and exit the R&D surveys. Using R&D expenditures to proxy R&D performance, we find the top R&D performing firms in the U.S. across all years to be large, old, multinational enterprises. However, we also find that the composition of R&D performing firms is gradually shifting more towards smaller domestic firms with expenditures being less sensitive to scale effects. We find a high degree of persistence for these firms over time. We chart the history of R&D performing firms and compare them to all firms in the economy and find substantial differences in terms of age, size, firm structure and international activity; these differences persist when looking at future firm outcomes.View Full Paper PDF