The Bureau of Labor Statistics and the Bureau of the Census each maintain a business register, a universe of all U.S. business establishments and their characteristics, created from independent sources. Both registers serve critical functions such as supplying aggregate data inputs for certain national statistics generated by the Bureau of Economic Analysis. This paper examines key micro-level differences across these two business registers.
-
Longitudinal Establishment And Enterprise Microdata (LEEM) Documentation
May 1998
Working Paper Number:
CES-98-09
This paper introduces and documents the new Longitudinal Enterprise and Establishment Microdata (LEEM) database, which has been constructed by Census' Economic Planning and Coordination Division under contract to the Office of Advocacy of the U.S. Small Business Administration. The LEEM links three years (1990, 1994, and 1995) of basic data for each private sector establishment with payroll in any of those years, along with data on the firm to which the establishment belongs each year. The LEEM data will facilitate both broader and more detailed analysis of patterns of job creation and destruction in the U.S., as well as research on the structure and dynamics of U.S. businesses. This paper provides documentation of the construction of LEEM data, summary data on most variables in the database, comparisons of the annual data with that of the nearly identical County Business Patterns, and distributions of establishments and their employment by the size of their firms. This is followed by a simple analysis of changes over time in the attributes of surviving establishments, and a brief discussion of turnover (business births and deaths) in the population and gross changes in employment associated with both establishment turnover and with surviving establishments. It concludes with a summary of the strengths and weaknesses of the LEEM.
View Full
Paper PDF
-
NEW DATA FOR DYNAMIC ANALYSIS: THE LONGITUDINAL ESTABLISHMENT AND ENTERPRISE MICRODATA (LEEM) FILE
December 1999
Working Paper Number:
CES-99-18
Until now, research on U.S. business activities over time has been hindered by the lack of accurate and comprehensive longitudinal data. The new Longitudinal Establishment and Enterprise Microdata (LEEM) are tremendously rich data that open up numerous possibilities for dynamic analyses of businesses in the U.S. economy. It is the first nationwide high-quality longitudinal database that covers the majority of employer businesses from all sectors of the economy. Due to the confidential nature of these data, the file is located at the Center for Economic Studies in the U.S. Bureau of the Census. To access the data, researchers must submit an acceptable proposal to CES and become sworn Census researchers. This paper describes the LEEM file, the variables contained on the file, and current uses of the data.
View Full
Paper PDF
-
Documenting the Business Register and Related Economic Business Data
March 2016
Working Paper Number:
CES-16-17
The Business Register (BR) is a comprehensive database of business establishments in the United States and provides resources for the U.S. Census Bureau's economic programs for sample selection, research, and survey operations. It is maintained using information from several federal agencies including the Census Bureau, Internal Revenue Service, Bureau of Labor Statistics, and the Social Security Administration. This paper provides a detailed description of the sources and functions of the BR. An overview of the BR as a linking tool and bridge to other Census Bureau data for additional business characteristics is also given.
View Full
Paper PDF
-
Published Versus Sample Statistics From The ASM: Implications For The LRD
January 1991
Working Paper Number:
CES-91-01
In principle, the Longitudinal Research Database ( LRD ) which links the establishments in the Annual Survey of Manufactures (ASM) is ideal for examining the dynamics of firm and aggregate behavior. However, the published ASM aggregates are not simply the appropriately weighted sums of establishment data in the LRD . Instead, the published data equal the sum of LRD-based sample estimates and nonsample estimates. The latter reflect adjustments related to sampling error and the imputation of small-establishment data. Differences between the LRD and the ASM raise questions for users of both data sets. For ASM users, time-series variation in the difference indicates potential problems in consistently and reliably estimating the nonsample portion of the ASM. For LRD users, potential sample selection problems arise due to the systematic exclusion of data from small establishments. Microeconomic studies based on the LRD can yield misleading inferences to the extent that small establishments behave differently. Similarly, new economic aggregates constructed from the LRD can yield incorrect estimates of levels and growth rates. This paper documents cross-sectional and time-series differences between ASM and LRD estimates of levels and growth rates of total employment, and compares them with employment estimates provided by Bureau of Labor Statistics and County Business Patterns data. In addition, this paper explores potential adjustments to economic aggregates constructed from the LRD. In particular, the paper reports the results of adjusting LRD-based estimates of gross job creation and destruction to be consistent with net job changes implied by the published ASM figures.
View Full
Paper PDF
-
LEHD Data Documentation LEHD-OVERVIEW-S2008-rev1
December 2011
Working Paper Number:
CES-11-43
View Full
Paper PDF
-
Gross Job Flows for the U.S. Manufacturing Sector: Measurement from the Longitudinal Research Database
December 2006
Working Paper Number:
CES-06-30
Measures of job creation and destruction are now produced regularly by the U.S. statistical agencies. The Bureau of Labor Statistics releases via the Business Employment Dynamics (BED) on a quarterly basis measures of job creation and destruction for the U.S. nonfarm business sector and related disaggregation by industrial sector and size class. The U.S. Census Bureau has developed the Longitudinal Business Database (LBD) covering the nonfarm business sector that has been used to produce research analysis and special tabulations including tabulations of job creation and destruction. Both of these data programs build upon the measurement methods and data analysis of job creation and destruction measures from the Longitudinal Research Database (LRD) developed and published by Davis, Haltiwanger and Schuh (1996). In this paper, the LRD based estimates of job creation and destruction are updated and made available for consistent annual and quarterly series from 1972-1998. While the BED and LBD programs are more comprehensive in scope than the LRD, the extensive development of the LRD permits the construction of measures of job creation and destruction for a rich array of employer characteristics including industry, size, business age, ownership structure, location and wage structure. The updated series that are released with this working paper provide measures along each of these dimensions. The paper describes in detail the changes in the processing of the Annual Survey of Manufactures over the 1972-1998 period that are important to incorporate by users of the LRD at Census Research Data Centers as well as users of products from the LRD such as job creation and destruction.
View Full
Paper PDF
-
Matching Compustat Data to the Longitudinal Business Database, 1976-2020
September 2025
Working Paper Number:
CES-25-65
This paper details the methodology for creating an updated Compustat-Longitudinal Business Database (LBD) bridge, facilitating linkage between company identifiers in Compustat and firm identifiers in the LBD. In addition to data from Compustat, we incorporate historical data on public companies from various public and private sources, including information on executive names. Our methodology involves a series of stages using fuzzy name and address matching, including EIN, telephone number, and industry code matching. Qualified researchers with approved proposals can access this bridge though the Federal Statistical Research Data Centers. The Compustat-SSL bridge serves as a crucial resource for longitudinal studies on U.S. businesses, corporate governance, and executive compensation.
View Full
Paper PDF
-
Successor/Predecessor Firms
March 2002
Working Paper Number:
tp-2002-04
The goal of this research was to investigate the value added from using worker flows to identify the spurious births and deaths of businesses. We identify four types of "at risk" businesses from ES202 using the successor/predecessor flag and mimic the same categories using UI wage record data. We use two critical decision rules in the analysis: a successor firm has to have at least 80% of employment coming from the donor firm and (in two of the four categories) at least 5 employees have to come from the donor firm. We examine the sensitivity of the categories based on the percentage definition, and find that the results stay very similar, with the exception of the identification of the pure successor. We examine the sensitivity based on the count threshold, and find that there are enormous differences, particularly with identifying spinoff businesses.
View Full
Paper PDF
-
Using Census Business Data to Augment the MEPS-IC
December 2005
Working Paper Number:
CES-05-26
This paper has two aims: first to describe methods, issues, and outcomes involved in matching data from the Insurance Component of the Medical Expenditure Panel Survey (MEPSIC) to other business microdata collected by the U.S. Census Bureau, and second to present some simple results that illustrate the usefulness of such combined data. We present the results of linking the MEPS-IC with data from the 1997 Economic Censuses (EC), but also discuss other possible sources of business data. An issue in any linkage is whether the linked sample remains representative and large enough to be useful. The EC data are attractive because, given the survey's broad coverage and large sample, most of the MEPS-IC sample can be matched to it. We use the combined EC/MEPS-IC data to construct productivity measures that are useful auxiliary data in examining employers' health insurance offering decisions.
View Full
Paper PDF
-
Newly Recovered Microdata on U.S. Manufacturing Plants from the 1950s and 1960s: Some Early Glimpses
September 2011
Working Paper Number:
CES-11-29
Longitudinally-linked microdata on U.S. manufacturing plants are currently available to researchers for 1963, 1967, and 1972-2009. In this paper, we provide a first look at recently recovered manufacturing microdata files from the 1950s and 1960s. We describe their origins and background, discuss their contents, and begin to explore their sample coverage. We also begin to examine whether the available establishment identifier(s) allow record linking. Our preliminary analyses suggest that longitudinally-linked Annual Survey of Manufactures microdata from the mid-1950s through the present ' containing 16 years of additional data ' appears possible though challenging. While a great deal of work remains, we see tremendous value in extending the manufacturing microdata series back into time. With these data, new lines of research become possible and many others can be revisited.
View Full
Paper PDF