Between the 2007 and 2012 Economic Censuses (EC), the count of franchise-affiliated establishments declined by 9.8%. One reason for this decline was a reduction in resources that the Census Bureau was able to dedicate to the manual evaluation of survey responses in the franchise section of the EC. Extensive manual evaluation in 2007 resulted in many establishments, whose survey forms indicated they were not franchise-affiliated, being recoded as franchise-affiliated. No such evaluation could be undertaken in 2012. In this paper, we examine the potential of using external data harvested from the web in combination with machine learning methods to automate the process of evaluating responses to the franchise section of the 2017 EC. Our method allows us to quickly and accurately identify and recode establishments that have been mistakenly classified as not being franchise-affiliated, increasing the unweighted number of franchise-affiliated establishments in the 2017 EC by 22%-42%.
-
Methodology on Creating the U.S. Linked Retail Health Clinic (LiRHC) Database
March 2023
Working Paper Number:
CES-23-10
Retail health clinics (RHCs) are a relatively new type of health care setting, and understanding the role they play as a source of ambulatory care in the United States is important. To better understand these settings, a joint project by the Census Bureau and the National Center for Health Statistics used data science techniques to link together data on RHCs from the Convenient Care Association, the County Business Patterns Business Register, and the National Plan and Provider Enumeration System to create the Linked RHC (LiRHC, pronounced 'lyric') database of locations throughout the United States during the years 2018 to 2020. The matching methodology used to perform this linkage is described, as well as the benchmarking, match statistics, and manual review and quality checks used to assess the resulting matched data. The large majority (81%) of matches received quality scores at or above 75/100, and most matches were linked in the first two (of eight) matching passes, indicating high confidence in the final linked dataset. The LiRHC database contained 2,000 RHCs; 97% of these clinics were in metropolitan statistical areas and 950 were in the South region of the United States. Through this collaborative effort, the Census Bureau and the National Center for Health Statistics strive to understand how RHCs can potentially impact population health as well as the access and provision of health care services across the nation.
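As an illustration of what a multi-pass linkage of this kind can look like, the following is a minimal Python sketch, not the LiRHC project's actual procedure. It assumes each record is a dictionary with an `id` field, and the function name `multi_pass_link` and the match keys used in each pass are hypothetical. Passes run from strictest to loosest, and records linked in an earlier pass are withheld from later ones, so the pass number can serve as a rough indicator of match confidence.

```python
def multi_pass_link(left, right, passes):
    """Link records in successive passes, strictest match keys first.

    left, right: lists of dicts, each with an "id" field (assumed layout).
    passes: list of tuples of field names used as the exact-match key.
    Returns (matches, unmatched_left), where matches is a list of
    (left_id, right_id, pass_number) tuples.
    """
    matches = []
    unmatched_left = list(left)
    available_right = list(right)

    for pass_no, keys in enumerate(passes, start=1):
        # Index the still-available right-hand records by this pass's key.
        index = {}
        for rec in available_right:
            index.setdefault(tuple(rec[k] for k in keys), []).append(rec)

        still_unmatched = []
        for rec in unmatched_left:
            candidates = index.get(tuple(rec[k] for k in keys), [])
            if candidates:
                # Take the first candidate; each right record is used once.
                partner = candidates.pop(0)
                matches.append((rec["id"], partner["id"], pass_no))
            else:
                still_unmatched.append(rec)

        # Records linked in this pass are not offered to later passes.
        used = {right_id for _, right_id, _ in matches}
        available_right = [r for r in available_right if r["id"] not in used]
        unmatched_left = still_unmatched

    return matches, unmatched_left
```

A call such as `multi_pass_link(left, right, [("name", "zip"), ("name", "state")])` would first try an exact name-and-ZIP match and then fall back to name-and-state for whatever remains.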
-
Making a Motivated Manager: A Census Data Investigation into Efficiency Differences Between Franchisee and Franchisor-Owned Restaurants
January 2016
Working Paper Number:
CES-16-54
While there has been significant research on the reasons for franchising, little work has examined the effects of franchising on establishment performance. This paper attempts to fill that gap. We use restricted-access US Census Bureau microdata from the 2007 Census of Retail Trade to examine establishment-level productivity of franchisee- and franchisor-owned restaurants. We do this by employing a two-stage data envelopment analysis (DEA) model where the first stage uses DEA to measure each establishment's efficiency. The DEA efficiency score is then used as the second-stage dependent variable. The results show a strong and robust effect attributed to franchisee ownership for full-service restaurants, but a smaller and insignificant difference for limited-service restaurants. We believe the differences in task programmability between limited- and full-service restaurants result in a very different role for managers/franchisees and are the driving factor behind the different results.
-
Multinational Firms in the U.S. Economy: Insights from Newly Integrated Microdata
September 2022
Working Paper Number:
CES-22-39
This paper describes the construction of two confidential crosswalk files enabling a comprehensive identification of multinational firms in the U.S. economy. The effort combines firm-level surveys on direct investment conducted by the U.S. Bureau of Economic Analysis (BEA) and the U.S. Census Bureau's Business Register (BR) spanning the universe of employer businesses from 1997 to 2017. First, the parent crosswalk links BEA firm-level surveys on U.S. direct investment abroad and the BR. Second, the affiliate crosswalk links BEA firm-level surveys on foreign direct investment in the United States and the BR. Using these newly available links, we distinguish between U.S.- and foreign-owned multinational firms and describe their prevalence and economic activities in the national economy, by sector, and by geography.
-
Squeezing More Out of Your Data: Business Record Linkage with Python
November 2018
Working Paper Number:
CES-18-46
Integrating data from different sources has become a fundamental component of modern data analytics. Record linkage methods represent an important class of tools for accomplishing such integration. In the absence of common disambiguated identifiers, researchers often must resort to "fuzzy" matching, which allows imprecision in the characteristics used to identify common entities across different datasets. While the record linkage literature has identified numerous individually useful fuzzy matching techniques, there is little consensus on a way to integrate those techniques within a single framework. To this end, we introduce the Multiple Algorithm Matching for Better Analytics (MAMBA), an easy-to-use, flexible, scalable, and transparent software platform for business record linkage applications using Census microdata. MAMBA leverages multiple string comparators to assess the similarity of records, using a machine learning algorithm to disambiguate matches. This software represents a transparent tool for researchers seeking to link external business data to the Census Business Register files.
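To make the idea of combining several string comparators concrete, here is a minimal Python sketch using only the standard library. It is not MAMBA's code or API: the function names, the choice of comparators, and the feature names are all assumptions made for illustration. Each candidate pair of business names is turned into a small feature vector, which a classifier trained on labeled matches (not shown here) could then use to decide whether the pair refers to the same entity.

```python
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lowercase, replace punctuation with spaces, collapse whitespace."""
    cleaned = "".join(
        ch if ch.isalnum() or ch.isspace() else " " for ch in name.lower()
    )
    return " ".join(cleaned.split())


def sequence_ratio(a: str, b: str) -> float:
    """Character-level similarity from difflib, between 0 and 1."""
    return SequenceMatcher(None, a, b).ratio()


def token_jaccard(a: str, b: str) -> float:
    """Share of distinct words the two names have in common."""
    tokens_a, tokens_b = set(a.split()), set(b.split())
    if not tokens_a and not tokens_b:
        return 1.0
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def prefix_share(a: str, b: str) -> float:
    """Length of the common prefix relative to the shorter name."""
    if not a or not b:
        return 0.0
    common = 0
    for ch_a, ch_b in zip(a, b):
        if ch_a != ch_b:
            break
        common += 1
    return common / min(len(a), len(b))


def comparator_features(name_a: str, name_b: str) -> dict:
    """Feature vector for one candidate pair (hypothetical feature names)."""
    a, b = normalize(name_a), normalize(name_b)
    return {
        "sequence_ratio": sequence_ratio(a, b),
        "token_jaccard": token_jaccard(a, b),
        "prefix_share": prefix_share(a, b),
    }
```

Keeping the comparators separate, rather than averaging them into one score, leaves it to the downstream model to learn how much weight each comparator deserves for a given pair of data sources.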
-
Competition, Productivity, and Survival of Grocery Stores in the Great Depression
April 2018
Working Paper Number:
CES-18-24
We study the grocery industry in Washington, DC, during the Great Depression using data from the 1929 Census of Distribution, a 1929-1930 survey by the Federal Trade Commission, and a 1935 business directory. We first document the differences between chains and independents in the Washington, DC, grocery market circa 1929 to better understand chains' competitive advantages. Second, we study correlates of survival from 1929 to 1935, a period of major contraction and upheaval. We find that more productive stores survived at higher rates, as did stores with greater assortment and lower prices. Presaging the supermarket revolution, combination stores were much more likely to survive to 1935 than other grocery formats.
-
Redesigning the Longitudinal Business Database
May 2021
Working Paper Number:
CES-21-08
In this paper we describe the U.S. Census Bureau's redesign and production implementation of the Longitudinal Business Database (LBD) first introduced by Jarmin and Miranda (2002). The LBD is used to create the Business Dynamics Statistics (BDS), tabulations describing the entry, exit, expansion, and contraction of businesses. The new LBD and BDS also incorporate information formerly provided by the Statistics of U.S. Businesses program, which produced similar year-to-year measures of employment and establishment flows. We describe in detail how the LBD is created from curation of the input administrative data, longitudinal matching, retiming of economic census-year births and deaths, creation of vintage consistent industry codes and noise factors, and the creation and cleaning of each year of LBD data. This documentation is intended to facilitate the proper use and understanding of the data by both researchers with approved projects accessing the LBD microdata and those using the BDS tabulations.
-
Supersize It: The Growth of Retail Chains and the Rise of the "Big Box" Retail Format
August 2008
Working Paper Number:
CES-08-23R
This paper documents and explains the recent rise of "big-box" general merchandisers. Data from the Census of Retail Trade for 1977-2007 show that general-merchandise chains grew much faster than specialist retail chains, and that general merchandisers that added the most stores also made the biggest increases to their product offerings. We explain these facts with a stylized model in which a retailer's scale economies interact with consumer gains from one-stop shopping to generate a complementarity between a retailer's scale and scope.
-
Modeling Single Establishment Firm Returns to the 2007 Economic Census
September 2011
Working Paper Number:
CES-11-28
The Economic Census is one of the most important activities that the U.S. Census Bureau performs. It is critical for updating firm ownership/structure and industry information for a large number of businesses in the Census Bureau's Business Register, impacting most other economic programs. Also, it feeds into Bureau of Economic Analysis products, such as benchmark input-output accounts and Gross Domestic Product. The overall check-in rate for the 2007 Economic Census was just over 86%. Establishments owned by multi-location companies returned over 90% of their forms, as compared to the roughly two million single-establishment firms sampled in the Census that returned just over 80%. We model the check-in rate for single-establishment firms by using a large number of variables that might be correlated with whether or not a firm returns a form in the Economic Census. These variables are broadly categorized as the characteristics of firms, measures of external factors, and features of the survey design. We use the model for two purposes. First, by including many of the factors that may be correlated with returns, we aim to focus limited advertising and outreach resources on low-return segments of the population. Second, we use the model to investigate the efficacy of an unplanned intervention expected to increase return rates: using certified mailing for one of the form follow-ups.
-
Raising the Barcode Scanner: Technology and Productivity in the Retail Sector
May 2011
Working Paper Number:
CES-11-16R
Barcodes and barcode scanners transformed the grocery industry in the 1970s. I use store-level data from the 1972, 1977, and 1982 Census of Retail Trade, matched to data on store scanner installations, to estimate scanners' effect on labor productivity. I find that early scanners increased a store's labor productivity, on average, by approximately 4.5 percent in the first few years. The effect was larger in stores carrying more packaged products, consistent with the presence of network externalities. Short-run gains were small relative to fixed costs, suggesting that the impediment to widespread adoption of the new technology was profitability, not coordination problems.
-
Creditor Control Rights and Resource Allocation within Firms
November 2015
Working Paper Number:
CES-15-39
We examine the within-firm resource allocation effects of creditor interventions and their relationship to performance gains at firms violating financial covenants. By linking firm-level data to establishment-level data from the U.S. Census Bureau, we show that covenant violations are followed by large reductions in employment and more frequent establishment sales and closures. These operational cuts are concentrated in violating firms' noncore business lines and unproductive establishments. We conclude that refocusing activities and improving productive efficiency are important mechanisms through which creditors enhance violating firms' performance.