The Business Register (BR) is a comprehensive database of business establishments in the United States and provides resources for the U.S. Census Bureau's economic programs for sample selection, research, and survey operations. It is maintained using information from several federal agencies including the Census Bureau, Internal Revenue Service, Bureau of Labor Statistics, and the Social Security Administration. This paper provides a detailed description of the sources and functions of the BR. An overview of the BR as a linking tool and bridge to other Census Bureau data for additional business characteristics is also given.
-
The Longitudinal Business Database
July 2002
Working Paper Number:
CES-02-17
As the largest federal statistical agency and primary collector of data on businesses, households and individuals, the Census Bureau each year conducts numerous surveys intended to provide statistics on a wide range of topics about the population and economy of the United States. The Census Bureau's decennial population and quinquennial economic censuses are unique, providing information on all U.S. households and business establishments, respectively.
View Full
Paper PDF
-
Longitudinal Establishment And Enterprise Microdata (LEEM) Documentation
May 1998
Working Paper Number:
CES-98-09
This paper introduces and documents the new Longitudinal Enterprise and Establishment Microdata (LEEM) database, which has been constructed by Census' Economic Planning and Coordination Division under contract to the Office of Advocacy of the U.S. Small Business Administration. The LEEM links three years (1990, 1994, and 1995) of basic data for each private sector establishment with payroll in any of those years, along with data on the firm to which the establishment belongs each year. The LEEM data will facilitate both broader and more detailed analysis of patterns of job creation and destruction in the U.S., as well as research on the structure and dynamics of U.S. businesses. This paper provides documentation of the construction of LEEM data, summary data on most variables in the database, comparisons of the annual data with that of the nearly identical County Business Patterns, and distributions of establishments and their employment by the size of their firms. This is followed by a simple analysis of changes over time in the attributes of surviving establishments, and a brief discussion of turnover (business births and deaths) in the population and gross changes in employment associated with both establishment turnover and with surviving establishments. It concludes with a summary of the strengths and weaknesses of the LEEM.
View Full
Paper PDF
-
NEW DATA FOR DYNAMIC ANALYSIS: THE LONGITUDINAL ESTABLISHMENT AND ENTERPRISE MICRODATA (LEEM) FILE
December 1999
Working Paper Number:
CES-99-18
Until now, research on U.S. business activities over time has been hindered by the lack of accurate and comprehensive longitudinal data. The new Longitudinal Establishment and Enterprise Microdata (LEEM) are tremendously rich data that open up numerous possibilities for dynamic analyses of businesses in the U.S. economy. It is the first nationwide high-quality longitudinal database that covers the majority of employer businesses from all sectors of the economy. Due to the confidential nature of these data, the file is located at the Center for Economic Studies in the U.S. Bureau of the Census. To access the data, researchers must submit an acceptable proposal to CES and become sworn Census researchers. This paper describes the LEEM file, the variables contained on the file, and current uses of the data.
View Full
Paper PDF
-
A Guide to the MEPS-IC Government List Sample Microdata
September 2011
Working Paper Number:
CES-11-27
The Medical Expenditure Panel Survey-Insurance Component (MEPS-IC) is conducted to provide nationally representative estimates on employer sponsored health insurance. MEPSIC data are collected from private sector employers, as well as state and local governments. While similar information is gathered from these two sectors, differences in the survey process exist. The goal of this paper is to provide details on the public sector including types of state and local government employers, sample design, general information on the data collected in the MEPS-IC, and additional sources of information.
View Full
Paper PDF
-
The Industry R&D Survey: Patent Database Link Project
November 2006
Working Paper Number:
CES-06-28
This paper details the construction of a firm-year panel dataset combining the NBER Patent Dataset with the Industry R&D Survey conducted by the Census Bureau and National Science Foundation. The developed platform offers an unprecedented view of the R&D-to-patenting innovation process and a close analysis of the strengths and limitations of the Industry R&D Survey. The files are linked through a name-matching algorithm customized for uniting the firm names to which patents are assigned with the firm names in Census Bureau's SSEL business registry. Through the Census Bureau's file structure, this R&D platform can be linked to the operating performances of each firm's establishments, further facilitating innovation-to-productivity studies.
View Full
Paper PDF
-
An Analysis of Key Differences in Micro Data: Results from the Business List Comparison Project
September 2008
Working Paper Number:
CES-08-28
The Bureau of Labor Statistics and the Bureau of the Census each maintain a business register, a universe of all U.S. business establishments and their characteristics, created from independent sources. Both registers serve critical functions such as supplying aggregate data inputs for certain national statistics generated by the Bureau of Economic Analysis. This paper examines key micro-level differences across these two business registers.
View Full
Paper PDF
-
Redesigning the Longitudinal Business Database
May 2021
Working Paper Number:
CES-21-08
In this paper we describe the U.S. Census Bureau's redesign and production implementation of the Longitudinal Business Database (LBD) first introduced by Jarmin and Miranda (2002). The LBD is used to create the Business Dynamics Statistics (BDS), tabulations describing the entry, exit, expansion, and contraction of businesses. The new LBD and BDS also incorporate information formerly provided by the Statistics of U.S. Businesses program, which produced similar year-to-year measures of employment and establishment flows. We describe in detail how the LBD is created from curation of the input administrative data, longitudinal matching, retiming of economic census-year births and deaths, creation of vintage consistent industry codes and noise factors, and the creation and cleaning of each year of LBD data. This documentation is intended to facilitate the proper use and understanding of the data by both researchers with approved projects accessing the LBD microdata and those using the BDS tabulations.
View Full
Paper PDF
-
Multinational Firms in the U.S. Economy: Insights from Newly Integrated Microdata
September 2022
Working Paper Number:
CES-22-39
This paper describes the construction of two confidential crosswalk files enabling a comprehensive identification of multinational rms in the U.S. economy. The effort combines firm-level surveys on direct investment conducted by the U.S. Bureau of Economic Analysis (BEA) and the U.S. Census Bureau's Business Register (BR) spanning the universe of employer businesses from 1997 to 2017. First, the parent crosswalk links BEA firm-level surveys on U.S. direct investment abroad and the BR. Second, the affiliate crosswalk links BEA firm-level surveys on foreign direct investment in the United States and the BR. Using these newly available links, we distinguish between U.S.- and foreign-owned multinational firms and describe their prevalence and economic activities in the national economy, by sector, and by geography.
View Full
Paper PDF
-
Just Passing Through: Characterizing U.S. Pass-Through Business Owners
January 2017
Working Paper Number:
CES-17-69
We investigate the use of administrative data on the owners of partnerships and S-corporations to develop new statistics that characterize business owners. Income from these types of entities is "passed through" to owners to be taxed on the owners' tax returns. The information returns associated with such pass-through entities (Form K1 records) make it possible to link individual owners to the businesses they own. These linkages can be leveraged to associate measures of the demographic and human capital characteristics of business owners with the characteristics of the businesses they own. This paper describes measurement issues associated with administrative records on these pass-through entities and their integration with other Census data products. In addition, we document a number of interesting trends in business ownership among pass-through entities. We show a substantial decline in both entry and exit with less churn among both owners and owned businesses. We also show that the owners of pass-through entities are older, more likely to be male, and more likely to be white compared to the working population.
View Full
Paper PDF
-
Methodology on Creating the U.S. Linked Retail Health Clinic (LiRHC) Database
March 2023
Working Paper Number:
CES-23-10
Retail health clinics (RHCs) are a relatively new type of health care setting and understanding the role they play as a source of ambulatory care in the United States is important. To better understand these settings, a joint project by the Census Bureau and National Center for Health Statistics used data science techniques to link together data on RHCs from Convenient Care Association, County Business Patterns Business Register, and National Plan and Provider Enumeration System to create the Linked RHC (LiRHC, pronounced 'lyric') database of locations throughout the United States during the years 2018 to 2020. The matching methodology used to perform this linkage is described, as well as the benchmarking, match statistics, and manual review and quality checks used to assess the resulting matched data. The large majority (81%) of matches received quality scores at or above 75/100, and most matches were linked in the first two (of eight) matching passes, indicating high confidence in the final linked dataset. The LiRHC database contained 2,000 RHCs and found that 97% of these clinics were in metropolitan statistical areas and 950 were in the South region of the United States. Through this collaborative effort, the Census Bureau and National Center for Health Statistics strive to understand how RHCs can potentially impact population health as well as the access and provision of health care services across the nation.
View Full
Paper PDF