Modeling Single Establishment Firm Returns to the 2007 Economic Census
September 2011
Working Paper Number:
CES-11-28
Abstract
Document Tags and Keywords
Keywords
Keywords are automatically generated using KeyBERT, a powerful and innovative
keyword extraction tool that utilizes BERT embeddings to ensure high-quality and contextually relevant
keywords.
By analyzing the content of working papers, KeyBERT identifies terms and phrases that capture the essence of the
text, highlighting the most significant topics and trends. This approach not only enhances searchability but
also provides connections that go beyond potentially domain-specific author-defined keywords.
estimation,
market,
information census,
company,
survey,
aggregate,
agency,
respondent,
sector,
firms census,
establishment,
incorporated,
economic census,
businesses census,
gdp,
population,
census bureau,
census use,
race census
Tags
Tags are automatically generated using a pretrained language model from spaCy, which excels at
several tasks, including entity tagging.
The model is able to label words and phrases by entity type,
including "organizations." By filtering for frequent words and phrases labeled as "organizations", references in the
papers to specific institutions, datasets, and other organizations are identified.
Census of Manufactures,
Annual Survey of Manufactures,
Standard Statistical Establishment List,
Internal Revenue Service,
Bureau of Labor Statistics,
Center for Economic Studies,
Bureau of Economic Analysis,
Longitudinal Business Database,
Decennial Census,
Postal Service,
Economic Census,
North American Industry Classification System,
Census Bureau Business Register,
Business Register,
Survey of Business Owners
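The tag filtering described above can be sketched roughly as follows. This is a minimal illustration, not the site's actual pipeline: it assumes the entity tagger has already produced (text, label) pairs, that organizations carry spaCy's "ORG" label, and that "frequent" means a simple count threshold. The function name `frequent_org_tags`, the `min_count` parameter, and the sample pairs are all hypothetical.

```python
from collections import Counter


def frequent_org_tags(entities, min_count=2):
    """Return organization names that occur at least `min_count` times.

    `entities` is a list of (text, label) pairs, such as an entity
    tagger might emit. The threshold is an assumed notion of "frequent".
    """
    counts = Counter(text for text, label in entities if label == "ORG")
    # most_common() orders by descending count, so frequent tags come first.
    return [text for text, n in counts.most_common() if n >= min_count]


# Hypothetical tagger output for one paper.
entities = [
    ("Census Bureau", "ORG"),
    ("Internal Revenue Service", "ORG"),
    ("2007", "DATE"),
    ("Census Bureau", "ORG"),
    ("Internal Revenue Service", "ORG"),
    ("Census Bureau", "ORG"),
    ("Postal Service", "ORG"),
]
tags = frequent_org_tags(entities)
```

Here the single mention of "Postal Service" falls below the assumed threshold and the date entity is ignored because it is not labeled as an organization.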
Similar Working Papers
Similarity between working papers is determined by an unsupervised neural
network model
known as Doc2Vec.
Doc2Vec is a model that represents entire documents as fixed-length vectors, allowing for the
capture of semantic meaning in a way that relates to the context of words within the document. The model learns to
associate a unique vector with each document while simultaneously learning word vectors, enabling tasks such as
document classification, clustering, and similarity detection by preserving the order and structure of words. The
document vectors are compared using cosine similarity/distance to determine the most similar working papers.
Papers identified with 🔥 are in the top 20% of similarity.
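As a rough sketch of the comparison step, the snippet below ranks candidate papers by cosine similarity to a target document vector. It assumes the document vectors have already been learned elsewhere; the three-dimensional toy vectors, the paper names, and the helper names `cosine_similarity` and `rank_similar` are hypothetical and only illustrate the ranking, not the Doc2Vec training itself.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # treat a zero vector as having no similarity
    return dot / (norm_u * norm_v)


def rank_similar(target, candidates, top_n=10):
    """Return the `top_n` (name, score) pairs, most similar first."""
    scored = [(name, cosine_similarity(target, vec))
              for name, vec in candidates.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]


# Toy document vectors (assumed; real ones would come from the model).
target = [1.0, 0.0, 1.0]
candidates = {
    "paper_a": [1.0, 0.0, 1.0],
    "paper_b": [1.0, 1.0, 0.0],
    "paper_c": [0.0, 1.0, 0.0],
}
ranked = rank_similar(target, candidates)
```

Because cosine similarity depends only on the angle between vectors, documents of very different lengths can still score as close matches.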
The 10 most similar working papers to the working paper 'Modeling Single Establishment Firm Returns to the 2007 Economic Census' are listed below in order of similarity.
-
Working Paper: Coverage and Agreement of Administrative Records and 2010 American Community Survey Demographic Data
November 2014
Working Paper Number:
carra-2014-14
The U.S. Census Bureau is researching possible uses of administrative records in decennial census and survey operations. The 2010 Census Match Study and American Community Survey (ACS) Match Study represent recent efforts by the Census Bureau to evaluate the extent to which administrative records provide data on persons and addresses in the 2010 Census and 2010 ACS. The 2010 Census Match Study also examines demographic response data collected in administrative records. Building on this analysis, we match data from the 2010 ACS to federal administrative records and third party data as well as to previous census data and examine administrative records coverage and agreement of ACS age, sex, race, and Hispanic origin responses. We find high levels of coverage and agreement for sex and age responses and variable coverage and agreement across race and Hispanic origin groups. These results are similar to findings from the 2010 Census Match Study.
-
Working Paper: The Use of Administrative Records and the American Community Survey to Study the Characteristics of Undercounted Young Children in the 2010 Census
May 2018
Working Paper Number:
carra-2018-05
Children under age five are historically one of the most difficult segments of the population to enumerate in the U.S. decennial census. The persistent undercount of young children is highest among Hispanics and racial minorities. In this study, we link 2010 Census data to administrative records from government and third party data sources, such as Medicaid enrollment data and tenant rental assistance program records from the Department of Housing and Urban Development, to identify differences between children reported and not reported in the 2010 Census. In addition, we link children in administrative records to the American Community Survey to identify various characteristics of households with children under age five who may have been missed in the last census. This research contributes to what is known about the demographic, socioeconomic, and household characteristics of young children undercounted by the census. Our research also informs the potential benefits of using administrative records and surveys to supplement the U.S. Census Bureau child population enumeration efforts in future decennial censuses.
-
Working Paper: Past Experience and Future Success: New Evidence on Owner Characteristics and Firm Performance
September 2010
Working Paper Number:
CES-10-24
Because the ability of entrepreneurs to start their own businesses is key to the success of the U.S. economy and to the economic mobility of many disadvantaged demographic groups, understanding why entrepreneurship activity varies across groups and geography is an increasingly important issue. As a step in this direction we employ a novel set of metrics of business success to the growing literature and find great variation across groups and metrics. For example, we find that black-owned firms grow slower than white or Asian-owned firms. However, once we condition on firm survival, the differences disappear. Interestingly, we also find differences across groups in their start-up histories. For example, Asian-owned firms are less likely than white-owned firms to have started-out as nonemployers but firms owned by all other minority groups, as well as women-owned firms, are more likely to start-out without employees.
-
Working Paper: Measuring the Effect of COVID-19 on U.S. Small Businesses: The Small Business Pulse Survey
May 2020
Working Paper Number:
CES-20-16
In response to the novel coronavirus (COVID-19) pandemic, the Census Bureau developed and fielded an entirely new survey intended to measure the effect on small businesses. The Small Business Pulse Survey (SBPS) will run weekly from April 26 to June 27, 2020. Results from the SBPS will be published weekly through a visualization tool with downloadable data. We describe the motivation for SBPS, summarize how the content for the survey was developed, and discuss some of the initial results from the survey. We also describe future plans for the SBPS collections and for our research using the SBPS data. Estimates from the first week of the SBPS indicate large to moderate negative effects of COVID-19 on small businesses, and yet the majority expect to return to usual level of operations within the next six months. Reflecting the Census Bureau's commitment to scientific inquiry and transparency, the micro data from the SBPS will be available to qualified researchers on approved projects in the Federal Statistical Research Data Center network.
-
Working Paper: A Comparison of Employee Benefits Data from the MEPS-IC and Form 5500
September 2008
Working Paper Number:
CES-08-32
This paper compares data on employers' health and pension offerings from two sources: publicly available administrative data from Form 5500 filings and survey data from the Insurance Component of the Medical Expenditure Panel Survey (MEPS-IC). The basic findings are that the 5500 filings cover too few health plans to be very useful as a substitute or supplement to the MEPS-IC measure of whether or not employers offer health insurance. The pension information in the 5500 filings is potentially more useful as a supplement to the MEPS-IC for research purposes where additional pension information would be useful in studying employers' decisions to offer health insurance.
-
Working Paper: Small Business Pulse Survey Estimates by Owner Characteristics and Rural/Urban Designation
September 2021
Working Paper Number:
CES-21-24
In response to requests from policymakers for additional context for Small Business Pulse Survey (SBPS) measures of the impact of COVID-19 on small businesses, we researched developing estimates by owner characteristics and rural/urban locations. Leveraging geographic coding on the Business Register, we create estimates of the effect of the pandemic on small businesses by urban and rural designations. A more challenging exercise entails linking micro-level data from the SBPS with ownership data from the Annual Business Survey (ABS) to create estimates of the effect of the pandemic on small businesses by owner race, sex, ethnicity, and veteran status. Given important differences in survey design and concerns about nonresponse bias, we face significant challenges in producing estimates for owner demographics. We discuss our attempts to meet these challenges and provide discussion about caution that must be used in interpreting the results. The estimates produced for this paper are available for download. Reflecting the Census Bureau's commitment to scientific inquiry and transparency, the micro data from the SBPS will be available to qualified researchers on approved projects in the Federal Statistical Research Data Center network.
-
Working Paper: Automating Response Evaluation For Franchising Questions On The 2017 Economic Census
July 2019
Working Paper Number:
CES-19-20
Between the 2007 and 2012 Economic Censuses (EC), the count of franchise-affiliated establishments declined by 9.8%. One reason for this decline was a reduction in resources that the Census Bureau was able to dedicate to the manual evaluation of survey responses in the franchise section of the EC. Extensive manual evaluation in 2007 resulted in many establishments, whose survey forms indicated they were not franchise-affiliated, being recoded as franchise-affiliated. No such evaluation could be undertaken in 2012. In this paper, we examine the potential of using external data harvested from the web in combination with machine learning methods to automate the process of evaluating responses to the franchise section of the 2017 EC. Our method allows us to quickly and accurately identify and recode establishments that have been mistakenly classified as not being franchise-affiliated, increasing the unweighted number of franchise-affiliated establishments in the 2017 EC by 22%-42%.
-
Working Paper: Decennial Census Return Rates: The Role of Social Capital
January 2017
Working Paper Number:
CES-17-39
This paper explores how useful information about social and civic engagement (social capital) might be to the U.S. Census Bureau in their efforts to improve predictions of mail return rates for the Decennial Census (DC) at the census tract level. Through construction of Hard-to-count (HRC) scores and multivariate analysis, we find that if information about social capital were available, predictions of response rates would be marginally improved.
-
Working Paper: The Impact of Household Surveys on 2020 Census Self-Response
July 2022
Working Paper Number:
CES-22-24
Households who were sampled in 2019 for the American Community Survey (ACS) had lower self-response rates to the 2020 Census. The magnitude varied from -1.5 percentage points for households sampled in January 2019 to -15.1 percentage points for households sampled in December 2019. Similar effects are found for the Current Population Survey (CPS) as well.
-
Working Paper: When Race and Hispanic Origin Reporting are Discrepant Across Administrative Records and Third Party Sources: Exploring Methods to Assign Responses
December 2015
Working Paper Number:
carra-2015-08
The U.S. Census Bureau is researching uses of administrative records and third party data in survey and decennial census operations. One potential use of administrative records is to utilize these data when race and Hispanic origin responses are missing. When federal and third party administrative records are compiled, race and Hispanic origin responses are not always the same for an individual across sources. We explore different methods to assign one race and one Hispanic response when these responses are discrepant. We also describe the characteristics of individuals with matching, non-matching, and missing race and Hispanic origin data by demographic, household, and contextual variables. We find that minorities, especially Hispanics, are more likely to have non-matching Hispanic origin and race responses in administrative records and third party data compared to the 2010 Census. Minority groups and individuals ages 0-17 are more likely to have missing race or Hispanic origin data in administrative records and third party data. Larger households tend to have more missing race data in administrative records and third party data than smaller households.