CREAT: Census Research Exploration and Analysis Tool

Social, Economic, Spatial, and Commuting Patterns of Self-Employed Jobholders

April 2007

Working Paper Number:

tp-2007-03

Abstract

A significant number of employees within the United States identify themselves as selfemployed, and they are distinct from the larger group identified as private jobholders. While socioeconomic and spatial information on these individuals is readily available in standard datasets, such as the 2000 Decennial Census Long Form, it is possible to gain further information on their wage earnings by using data from administrative wage records. This study takes advantage of firm-based data from Unemployment Insurance administrative wage records linked with the Census Bureau's household-based data in order to examine self-employed jobholders - both as a whole and as subgroups defined according to their earned wage status - by their demographic characteristics as well as their economic, commuting, and spatial location outcomes. Additionally, this report evaluates whether self-employed jobholders and the defined subgroups should be included explicitly in future labor-workforce analyses and transportation modeling. The analyses in this report use the sample of self-employed workers who lived in Los Angeles County, California.

Document Tags and Keywords

Keywords Keywords are automatically generated using KeyBERT, a powerful and innovative keyword extraction tool that utilizes BERT embeddings to ensure high-quality and contextually relevant keywords.

By analyzing the content of working papers, KeyBERT identifies terms and phrases that capture the essence of the text, highlighting the most significant topics and trends. This approach not only enhances searchability but provides connections that go beyond potentially domain-specific author-defined keywords.
:
work census, payroll, employee, employed, proprietorship, incorporated, employment data, metropolitan, workforce, businesses census, clerical, residential, socioeconomic, citizen, unemployment rates, residence, commute

Tags Tags are automatically generated using a pretrained language model from spaCy, which excels at several tasks, including entity tagging.

The model is able to label words and phrases by part-of-speech, including "organizations." By filtering for frequent words and phrases labeled as "organizations", papers are identified to contain references to specific institutions, datasets, and other organizations.
:
Bureau of Labor Statistics, Service Annual Survey, Current Population Survey, Decennial Census, Census Industry Code, Unemployment Insurance, Consolidated Metropolitan Statistical Areas, American Community Survey, Longitudinal Employer Household Dynamics, LEHD Program, Census 2000, Survey of Business Owners, 2010 Census

Similar Working Papers Similarity between working papers are determined by an unsupervised neural network model know as Doc2Vec.

Doc2Vec is a model that represents entire documents as fixed-length vectors, allowing for the capture of semantic meaning in a way that relates to the context of words within the document. The model learns to associate a unique vector with each document while simultaneously learning word vectors, enabling tasks such as document classification, clustering, and similarity detection by preserving the order and structure of words. The document vectors are compared using cosine similarity/distance to determine the most similar working papers. Papers identified with 🔥 are in the top 20% of similarity.

The 10 most similar working papers to the working paper 'Social, Economic, Spatial, and Commuting Patterns of Self-Employed Jobholders' are listed below in order of similarity.