The Longitudinal Business Database
July 2002
Working Paper Number:
CES-02-17
Abstract
Document Tags and Keywords
Keywords
Keywords are automatically generated using KeyBERT, a powerful and innovative
keyword extraction tool that utilizes BERT embeddings to ensure high-quality and contextually relevant
keywords.
By analyzing the content of working papers, KeyBERT identifies terms and phrases that capture the essence of the
text, highlighting the most significant topics and trends. This approach not only enhances searchability but
provides connections that go beyond potentially domain-specific author-defined keywords.
:
economist,
data,
statistical,
data census,
survey,
study,
agency,
respondent,
sector,
statistician,
longitudinal,
trend,
employment data,
economic census,
gdp,
population,
census years,
census bureau,
censuses surveys,
census use,
census survey,
datasets,
individuals census
Tags
Tags are automatically generated using a pretrained language model from spaCy, which excels at
several tasks, including entity tagging.
The model is able to label words and phrases by part-of-speech,
including "organizations." By filtering for frequent words and phrases labeled as "organizations", papers are
identified to contain references to specific institutions, datasets, and other organizations.
:
Annual Survey of Manufactures,
Standard Statistical Establishment List,
Internal Revenue Service,
Standard Industrial Classification,
Service Annual Survey,
Longitudinal Research Database,
Center for Economic Studies,
Office of Management and Budget,
Permanent Plant Number,
County Business Patterns,
Company Organization Survey,
Financial, Insurance and Real Estate Industries,
National Longitudinal Survey of Youth,
Longitudinal Business Database,
Employer Identification Numbers,
Survey of Income and Program Participation,
Postal Service,
Economic Census,
North American Industry Classification System,
PSID,
Business Register,
Public Administration
Similar Working Papers
Similarity between working papers are determined by an unsupervised neural
network model
know as Doc2Vec.
Doc2Vec is a model that represents entire documents as fixed-length vectors, allowing for the
capture of semantic meaning in a way that relates to the context of words within the document. The model learns to
associate a unique vector with each document while simultaneously learning word vectors, enabling tasks such as
document classification, clustering, and similarity detection by preserving the order and structure of words. The
document vectors are compared using cosine similarity/distance to determine the most similar working papers.
Papers identified with 🔥 are in the top 20% of similarity.
The 10 most similar working papers to the working paper 'The Longitudinal Business Database' are listed below in order of similarity.
-
Working PaperRedesigning the Longitudinal Business Database🔥
May 2021
Working Paper Number:
CES-21-08
In this paper we describe the U.S. Census Bureau's redesign and production implementation of the Longitudinal Business Database (LBD) first introduced by Jarmin and Miranda (2002). The LBD is used to create the Business Dynamics Statistics (BDS), tabulations describing the entry, exit, expansion, and contraction of businesses. The new LBD and BDS also incorporate information formerly provided by the Statistics of U.S. Businesses program, which produced similar year-to-year measures of employment and establishment flows. We describe in detail how the LBD is created from curation of the input administrative data, longitudinal matching, retiming of economic census-year births and deaths, creation of vintage consistent industry codes and noise factors, and the creation and cleaning of each year of LBD data. This documentation is intended to facilitate the proper use and understanding of the data by both researchers with approved projects accessing the LBD microdata and those using the BDS tabulations.View Full Paper PDF
-
Working PaperNEW DATA FOR DYNAMIC ANALYSIS: THE LONGITUDINAL ESTABLISHMENT AND ENTERPRISE MICRODATA (LEEM) FILE🔥
December 1999
Working Paper Number:
CES-99-18
Until now, research on U.S. business activities over time has been hindered by the lack of accurate and comprehensive longitudinal data. The new Longitudinal Establishment and Enterprise Microdata (LEEM) are tremendously rich data that open up numerous possibilities for dynamic analyses of businesses in the U.S. economy. It is the first nationwide high-quality longitudinal database that covers the majority of employer businesses from all sectors of the economy. Due to the confidential nature of these data, the file is located at the Center for Economic Studies in the U.S. Bureau of the Census. To access the data, researchers must submit an acceptable proposal to CES and become sworn Census researchers. This paper describes the LEEM file, the variables contained on the file, and current uses of the data.View Full Paper PDF
-
Working PaperLongitudinal Establishment And Enterprise Microdata (LEEM) Documentation
May 1998
Working Paper Number:
CES-98-09
This paper introduces and documents the new Longitudinal Enterprise and Establishment Microdata (LEEM) database, which has been constructed by Census' Economic Planning and Coordination Division under contract to the Office of Advocacy of the U.S. Small Business Administration. The LEEM links three years (1990, 1994, and 1995) of basic data for each private sector establishment with payroll in any of those years, along with data on the firm to which the establishment belongs each year. The LEEM data will facilitate both broader and more detailed analysis of patterns of job creation and destruction in the U.S., as well as research on the structure and dynamics of U.S. businesses. This paper provides documentation of the construction of LEEM data, summary data on most variables in the database, comparisons of the annual data with that of the nearly identical County Business Patterns, and distributions of establishments and their employment by the size of their firms. This is followed by a simple analysis of changes over time in the attributes of surviving establishments, and a brief discussion of turnover (business births and deaths) in the population and gross changes in employment associated with both establishment turnover and with surviving establishments. It concludes with a summary of the strengths and weaknesses of the LEEM.View Full Paper PDF
-
Working PaperNewly Recovered Microdata on U.S. Manufacturing Plants from the 1950s and 1960s: Some Early Glimpses
September 2011
Working Paper Number:
CES-11-29
Longitudinally-linked microdata on U.S. manufacturing plants are currently available to researchers for 1963, 1967, and 1972-2009. In this paper, we provide a first look at recently recovered manufacturing microdata files from the 1950s and 1960s. We describe their origins and background, discuss their contents, and begin to explore their sample coverage. We also begin to examine whether the available establishment identifier(s) allow record linking. Our preliminary analyses suggest that longitudinally-linked Annual Survey of Manufactures microdata from the mid-1950s through the present ' containing 16 years of additional data ' appears possible though challenging. While a great deal of work remains, we see tremendous value in extending the manufacturing microdata series back into time. With these data, new lines of research become possible and many others can be revisited.View Full Paper PDF
-
Working PaperDocumenting the Business Register and Related Economic Business Data
March 2016
Working Paper Number:
CES-16-17
The Business Register (BR) is a comprehensive database of business establishments in the United States and provides resources for the U.S. Census Bureau's economic programs for sample selection, research, and survey operations. It is maintained using information from several federal agencies including the Census Bureau, Internal Revenue Service, Bureau of Labor Statistics, and the Social Security Administration. This paper provides a detailed description of the sources and functions of the BR. An overview of the BR as a linking tool and bridge to other Census Bureau data for additional business characteristics is also given.View Full Paper PDF
-
Working PaperMeasuring the Dynamics of Young and Small Businesses: Integrating the Employer and Nonemployer Universes
February 2006
Working Paper Number:
CES-06-04
We develop a preliminary version of an Integrated Longitudinal Business Database (ILBD) that combines administrative records and survey-based data for virtually all employer and nonemployer business units in the United States. In the process, we confront conceptual and practical issues that arise in measuring the importance and dynamic behavior of younger and smaller businesses. We also document some basic facts about younger and smaller businesses. In doing so, we exploit the ability of the ILBD to follow business transitions between employer and nonemployer status, and vice-versa. This aspect of the ILBD opens a new frontier for the study of business formation and the precursors to job creation in the U.S. economy.View Full Paper PDF
-
Working PaperFIRM AGE AND SIZE IN THE LONGITUDINAL EMPLOYER-HOUSEHOLD DYNAMICS DATA
March 2014
Working Paper Number:
CES-14-16
The Census Bureau's Quarterly Workforce Dynamics (QWI) and OnTheMap now provide detailed workforce statistics by employer age and size. These data allow a first look at the demographics of workers at small and young businesses as well as detailed analysis of how hiring, turnover, job creation/destruction vary throughout a firm's lifespan. Both the QWI and OnTheMap are tabulated from the Longitudinal Employer-Household Dynamics (LEHD) linked employer-employee data. Firm age and size information was added to the LEHD data through integration of Business Dynamics Statistics (BDS) microdata into the LEHD jobs frame. This paper describes how these two new firm characteristics were added to the microdata and how they are tabulated in QWI and OnTheMapView Full Paper PDF
-
Working PaperLEHD Data Documentation LEHD-OVERVIEW-S2008-rev1
December 2011
Working Paper Number:
CES-11-43
View Full Paper PDF
-
Working PaperDescribing the Form 5500-Business Register Match
January 2003
Working Paper Number:
tp-2003-05
View Full Paper PDF
-
Working PaperThe Role of Retail Chains: National, Regional, and Industry Results
December 2005
Working Paper Number:
CES-05-30
We use the establishment level data in the Longitudinal Business Database to measure changes in market structure in the U.S. Retail Trade sector during the period, 1976 to 2000. We use firm ownership information to construct measures of firm entry and exit and also to categorize four types of retail firms: single location, and local, regional, and national chains. We use detailed location data to examine market structure in both national and county markets. We summarize the county level results into three groups: metropolitan, micropolitan, and rural. We find that retail activity is increasingly occurring at establishments owned by chain firms, especially large national chains. On average, we find that all types of retail firms are increasing in size during the period. We also find that larger markets experience more firm turnover. Finally, we see that entry and exit rates vary across two-digit retail industries.View Full Paper PDF