A Guide To R&D Data At The Center For Economic Studies U.S. Bureau Of The Census
August 1994
Working Paper Number:
CES-94-09
Abstract
Document Tags and Keywords
Keywords
Keywords are automatically generated using KeyBERT, a powerful and innovative
keyword extraction tool that utilizes BERT embeddings to ensure high-quality and contextually relevant
keywords.
By analyzing the content of working papers, KeyBERT identifies terms and phrases that capture the essence of the
text, highlighting the most significant topics and trends. This approach not only enhances searchability but
provides connections that go beyond potentially domain-specific author-defined keywords.
:
investment,
data,
report,
quarterly,
sale,
company,
survey data,
survey,
study,
respondent,
earnings,
research,
firms census,
expenditure,
gdp,
firms export,
record,
firms exporting
Tags
Tags are automatically generated using a pretrained language model from spaCy, which excels at
several tasks, including entity tagging.
The model labels words and phrases by entity type,
including "organizations." Filtering for frequent words and phrases labeled as "organizations" identifies
the specific institutions, datasets, and other organizations that a paper references.
:
Standard Industrial Classification,
Service Annual Survey,
Longitudinal Research Database,
National Science Foundation,
Center for Economic Studies,
Auxiliary Establishment Survey
Similar Working Papers
Similarity between working papers is determined by an unsupervised neural
network model
known as Doc2Vec.
Doc2Vec is a model that represents entire documents as fixed-length vectors, allowing for the
capture of semantic meaning in a way that relates to the context of words within the document. The model learns to
associate a unique vector with each document while simultaneously learning word vectors, enabling tasks such as
document classification, clustering, and similarity detection by preserving the order and structure of words. The
document vectors are compared using cosine similarity/distance to determine the most similar working papers.
Papers identified with 🔥 are in the top 20% of similarity.
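The cosine-similarity comparison described above can be sketched as follows. This is a minimal illustration, not the site's actual pipeline: the paper identifiers, the three-dimensional vectors, and the helper names `cosine_similarity` and `rank_similar` are all hypothetical, and real Doc2Vec vectors would come from a trained model rather than being typed in by hand.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        # A zero vector has no direction; treat it as unrelated.
        return 0.0
    return dot / (norm_u * norm_v)


def rank_similar(query_id, vectors, top_n=10):
    """Rank every other document by cosine similarity to the query document."""
    query = vectors[query_id]
    scores = [
        (doc_id, cosine_similarity(query, vec))
        for doc_id, vec in vectors.items()
        if doc_id != query_id
    ]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:top_n]


# Hypothetical document vectors (assumption: stand-ins for learned Doc2Vec output).
vectors = {
    "paper-a": [1.0, 0.0, 0.0],
    "paper-b": [0.9, 0.1, 0.0],
    "paper-c": [0.0, 1.0, 0.0],
    "paper-d": [-1.0, 0.0, 0.0],
}

ranking = rank_similar("paper-a", vectors, top_n=3)
for doc_id, score in ranking:
    print(f"{doc_id}: {score:.3f}")
```

Sorting by descending similarity puts the closest documents first, so taking the first ten entries of the ranking would give a "10 most similar" list of the kind shown on this page.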
The 10 most similar working papers to the working paper 'A Guide To R&D Data At The Center For Economic Studies U.S. Bureau Of The Census' are listed below in order of similarity.
-
Working Paper: Characteristics of the Top R&D Performing Firms in the U.S.: Evidence from the Survey of Industrial R&D
September 2010
Working Paper Number:
CES-10-33
Innovation drives economic growth and productivity growth, and as such, indicators of innovative activity such as research and development (R&D) expenditures are of paramount importance. We combine Census confidential microdata from two sources in order to examine the characteristics of the top R&D performing firms in the U.S. economy. We use the Survey of Industrial Research and Development (SIRD) to identify the top 200 R&D performing firms in 2003 and, to the extent possible, to trace the evolution of these firms from 1957 to 2007. The Longitudinal Business Database (LBD) further extends our knowledge about these firms and enables us to make comparisons to the U.S. economy. By linking the SIRD and the LBD we are able to create a detailed portrait of the evolution of the top R&D performing firms in the U.S.
-
Working Paper: The Role of Industry Classification in the Estimation of Research and Development Expenditures
November 2014
Working Paper Number:
CES-14-45
This paper uses data from the National Science Foundation's surveys on business research and development (R&D) expenditures that have been linked with data from the Census Bureau's Longitudinal Business Database to produce consistent NAICS-based R&D time-series data based on the main product produced by the firm for 1976 to 2008. The results show that R&D spending has shifted away from domestic manufacturing industries in recent years. This is due in part to a shift in U.S. payrolls away from manufacturing establishments for R&D-performing firms. These findings support the notion of an increasingly fragmented production system for R&D-intensive manufacturing firms, whereby U.S. firms control output and provide intellectual property inputs in the form of R&D, but production takes place outside of the firms' U.S. establishments.
-
Working Paper: The Census of Construction Industries Database
August 1998
Working Paper Number:
CES-98-10
The Census of Construction Industries (CCI) is conducted every five years as part of the quinquennial Economic Census. The Census of Construction Industries covers all establishments with payroll that are engaged primarily in contract construction or construction on their own account for sale as defined in the Standard Industrial Classification Manual. As previously administered, the CCI is a partial census including all multi-establishments and all establishments with payroll above $480,000, one out of every five establishments with payroll between $480,000 and $120,000 and one out of eight remaining establishments. The resulting database contains for each year approximately 200,000 establishments in the building construction, heavy construction and special trade construction industrial classifications. This paper compares the content, survey procedures, and sample response of the 1982, 1987 and 1992 Censuses of Construction.
-
Working Paper: Allocation of Company Research and Development Expenditures to Industries Using a Tobit Model
November 2015
Working Paper Number:
CES-15-42
This paper uses Census microdata and a regression-based approach to assign multi-division firms' pre-2008 Research and Development (R&D) expenditures to more than one industry. Since multi-division firms conduct R&D in more than one industry, assigning R&D to corresponding industries provides a more accurate representation of where R&D actually takes place and provides a consistent time-series with the National Science Foundation R&D by line of business information. Firm R&D is allocated to industries on the basis of observed industry payroll, as befits the historic importance of payroll in Census assignments of firms to industry. The results demonstrate that the method of assigning R&D to industries on the basis of payroll works well in earlier years, but becomes less effective over time as firms outsource their manufacturing function.
-
Working Paper: New Uses of Health and Pension Information
January 2002
Working Paper Number:
tp-2002-03
-
Working Paper: R&D, Attrition and Multiple Imputation in BRDIS
January 2017
Working Paper Number:
CES-17-13
Multiple imputation in business establishment surveys like BRDIS, an annual business survey in which some companies are sampled every year or multiple years, may enhance the estimates of total R&D in addition to helping researchers estimate models with subpopulations of small sample size. Considering a panel of BRDIS companies throughout the years 2008 to 2013 linked to LBD data, this paper uses the conclusions obtained with missing data visualization and other explorations to come up with a strategy to conduct multiple imputation appropriate to address the item nonresponse in R&D expenditures. Because survey design characteristics are behind much of the item and unit nonresponse, multiple imputation of missing data in BRDIS changes the estimates of total R&D significantly and alters the conclusions reached by models of the determinants of R&D investment obtained with complete case analysis.
-
Working Paper: A Portrait of Firms that Invest in R&D
January 2016
Working Paper Number:
CES-16-41
We focus on the evolution and behavior of firms that invest in research and development (R&D). We build upon the cross-sectional analysis in Foster and Grim (2010) that identified the characteristics of top R&D spending firms and follow up by charting the behavior of these firms over time. Our focus is dynamic in nature as we merge micro-level cross-sectional data from the Survey of Industrial Research and Development (SIRD) and the Business Research & Development and Innovation Survey (BRDIS) with the Longitudinal Business Database (LBD). The result is a panel firm-level data set from 1992 to 2011 that tracks firms' performances as they enter and exit the R&D surveys. Using R&D expenditures to proxy R&D performance, we find the top R&D performing firms in the U.S. across all years to be large, old, multinational enterprises. However, we also find that the composition of R&D performing firms is gradually shifting more towards smaller domestic firms with expenditures being less sensitive to scale effects. We find a high degree of persistence for these firms over time. We chart the history of R&D performing firms and compare them to all firms in the economy and find substantial differences in terms of age, size, firm structure and international activity; these differences persist when looking at future firm outcomes.
-
Working Paper: Evaluation And Use Of The Pollution Abatement Costs And Expenditures Survey Micro Data
January 1996
Working Paper Number:
CES-96-01
The Pollution Abatement Costs and Expenditures Survey (PACE) is an annual survey of manufacturing establishments' operating costs and capital investment expenditures for pollution abatement purposes. This paper provides a description and evaluation of the PACE micro data available at the Center for Economic Studies (CES). The paper provides an overview of the survey, how the sample is drawn, how the survey questionnaire has changed over time, an assessment of the data quality, and suggestions for the use of the data, as well as its limitations. Also included are suggestions for modifying the survey design and data processing procedures. The PACE data series, linked to the economic data in CES's Longitudinal Research Database (LRD), covers the years 1979-1993, excluding 1983 and 1987.
-
Working Paper: The Industry R&D Survey: Patent Database Link Project
November 2006
Working Paper Number:
CES-06-28
This paper details the construction of a firm-year panel dataset combining the NBER Patent Dataset with the Industry R&D Survey conducted by the Census Bureau and National Science Foundation. The developed platform offers an unprecedented view of the R&D-to-patenting innovation process and a close analysis of the strengths and limitations of the Industry R&D Survey. The files are linked through a name-matching algorithm customized for uniting the firm names to which patents are assigned with the firm names in Census Bureau's SSEL business registry. Through the Census Bureau's file structure, this R&D platform can be linked to the operating performances of each firm's establishments, further facilitating innovation-to-productivity studies.
-
Working Paper: Newly Recovered Microdata on U.S. Manufacturing Plants from the 1950s and 1960s: Some Early Glimpses
September 2011
Working Paper Number:
CES-11-29
Longitudinally-linked microdata on U.S. manufacturing plants are currently available to researchers for 1963, 1967, and 1972-2009. In this paper, we provide a first look at recently recovered manufacturing microdata files from the 1950s and 1960s. We describe their origins and background, discuss their contents, and begin to explore their sample coverage. We also begin to examine whether the available establishment identifier(s) allow record linking. Our preliminary analyses suggest that longitudinally-linked Annual Survey of Manufactures microdata from the mid-1950s through the present (containing 16 years of additional data) appears possible though challenging. While a great deal of work remains, we see tremendous value in extending the manufacturing microdata series back into time. With these data, new lines of research become possible and many others can be revisited.