CREAT: Census Research Exploration and Analysis Tool

The Characteristics of Business Owners Database, 1992

May 1999

Written by: Brian Headd

Working Paper Number:

CES-99-08

Abstract

This report describes the Characteristics of Business Owners (CBO), 1992 microdata available to researchers at the Center for Economic Studies and the CBO survey. The Bureau of the Census has conducted the 1982, 1987, and 1992 CBOs for the U.S. Small Business Administration, the Minority Business Development Agency, and the general public. For the 1992 CBO, there were three surveys, a sole proprietor survey, an owner survey for each owner in partnerships and S corporations, and a firm survey for each partnership and S corporation. For database purposes, the owner questions on the sole proprietors survey and owner survey were merged, and the firm questions on the sole proprietors survey and firm survey were merged. The owner database has 116,589 records, and the firm survey has 78,147 records. The CBO reports on owners about their background such as owner type (race, and ethnicity), age, education, work experience, veteran status, etc. The CBO reports on firms (with and without employees) about their economic details such as industry, financing, home-based, exporting, franchising, profits, etc. In addition, the CBO was conducted in 1996 on firms in existence in 1992 allowing for some survivability analysis. The CBO over samples women and minority owners to allow researchers to more reliably study these owners. This survey is an extension of the Survey of Minority-Owned Business Enterprises (SMOBE) and Survey of Women-Owned Businesses (WOB) within the economic census. The CBO is available as a report, special tabulations, or microdata for approved researchers.

Document Tags and Keywords

Keywords Keywords are automatically generated using KeyBERT, a powerful and innovative keyword extraction tool that utilizes BERT embeddings to ensure high-quality and contextually relevant keywords.

By analyzing the content of working papers, KeyBERT identifies terms and phrases that capture the essence of the text, highlighting the most significant topics and trends. This approach not only enhances searchability but provides connections that go beyond potentially domain-specific author-defined keywords.
:
company, sale, enterprise, survey, corporation, employed, employee, ownership, owner, owned businesses, business owners, shareholder, proprietor, establishment, business data, franchising, partnership, founder

Tags Tags are automatically generated using a pretrained language model from spaCy, which excels at several tasks, including entity tagging.

The model is able to label words and phrases by part-of-speech, including "organizations." By filtering for frequent words and phrases labeled as "organizations", papers are identified to contain references to specific institutions, datasets, and other organizations.
:
Small Business Administration, Bureau of Labor Statistics, Standard Industrial Classification, Internal Revenue Service, Characteristics of Business Owners, Social Security Administration, Center for Economic Studies, Current Population Survey, Chicago Census Research Data Center

Similar Working Papers Similarity between working papers are determined by an unsupervised neural network model know as Doc2Vec.

Doc2Vec is a model that represents entire documents as fixed-length vectors, allowing for the capture of semantic meaning in a way that relates to the context of words within the document. The model learns to associate a unique vector with each document while simultaneously learning word vectors, enabling tasks such as document classification, clustering, and similarity detection by preserving the order and structure of words. The document vectors are compared using cosine similarity/distance to determine the most similar working papers. Papers identified with 🔥 are in the top 20% of similarity.

The 10 most similar working papers to the working paper 'The Characteristics of Business Owners Database, 1992' are listed below in order of similarity.