A significant number of employees within the United States identify themselves as selfemployed,
and they are distinct from the larger group identified as private jobholders. While
socioeconomic and spatial information on these individuals is readily available in standard
datasets, such as the 2000 Decennial Census Long Form, it is possible to gain further information
on their wage earnings by using data from administrative wage records. This study takes
advantage of firm-based data from Unemployment Insurance administrative wage records linked
with the Census Bureau's household-based data in order to examine self-employed jobholders -
both as a whole and as subgroups defined according to their earned wage status - by their
demographic characteristics as well as their economic, commuting, and spatial location
outcomes. Additionally, this report evaluates whether self-employed jobholders and the defined
subgroups should be included explicitly in future labor-workforce analyses and transportation
modeling. The analyses in this report use the sample of self-employed workers who lived in Los
Angeles County, California.
-
Social, Economic, Spatial, and Commuting Patterns of Informal Jobholders
April 2007
Working Paper Number:
tp-2007-02
A significant number of employees within the United States can be considered "informal" or
"off-the-books" workers. These workers, who by definition do not appear in administrative wage
records, are distinct from the larger group of private jobholders who do appear in administrative
records. However, while socioeconomic and spatial information on these individuals is readily
available in standard datasets, such as the 2000 Decennial Census Long Form, it is not possible
to identify the informal workers by only using such data because of the lack of accurate, formal
wage records. This study takes advantage of firm-based data that originates in Unemployment
Insurance administrative wage records linked with the Census Bureau's household-based data in
order to examine informal jobholders by their demographic characteristics as well as their
economic, commuting, and spatial location outcomes. In addition this report evaluates whether
informal jobholders should be included explicitly in future labor-workforce analyses and
transportation modeling. The analyses in this report use the sample of workers who lived in Los
Angeles County, California.
View Full
Paper PDF
-
Social, Economic, Spatial, and Commuting Patterns of Dual Jobholders
April 2007
Working Paper Number:
tp-2007-01
Individuals who hold multiple jobs have complex working lives and complex commuting
patterns. Economic and spatial information on these individuals is not readily available in
standard datasets, such as the 2000 Decennial Census Long Form, because the survey questions
were not designed to collect details on multiple jobs. This study takes advantage of firm-based
data from the Unemployment Insurance administrative wage records, linked with the Census
Bureau's household-based data, to examine multiple jobholders - and specifically a sentinel
group of dual jobholders. The study uses a sample from Los Angeles County, California and
examines the dual jobholders by their demographic characteristics as well as their economic,
commuting, and spatial location outcomes. In addition this report evaluates whether multiple
jobholders should be included explicitly in future labor-workforce analyses and transportation
modeling.
View Full
Paper PDF
-
Further Evidence from Census 2000 About Earnings by Detailed Occupation for Men and Women: The Role of Race and Hispanic Origin
November 2011
Working Paper Number:
CES-11-37
A 2004 report by the author reviewed data from Census 2000 and concluded "There is a substantial gap in median earnings between men and women that is unexplained, even after controlling for work experience (to the extent it can be represented by age and presence of children), education, and occupation." This paper extends the analysis and concludes that once those characteristics are controlled for, no further explanatory power is attributable to race or Hispanic origin.
View Full
Paper PDF
-
Exploring Differences in Employment between Household and Establishment Data
April 2009
Working Paper Number:
CES-09-09
Using a large data set that links individual Current Population Survey (CPS) records to employer-reported administrative data, we document substantial discrepancies in basic measures of employment status that persist even after controlling for known definitional differences between the two data sources. We hypothesize that reporting discrepancies should be most prevalent for marginal workers and marginal jobs, and find systematic associations between the incidence of reporting discrepancies and observable person and job characteristics that are consistent with this hypothesis. The paper discusses the implications of the reported findings for both micro and macro labor market analysis
View Full
Paper PDF
-
A COMPARISON OF PERSON-REPORTED INDUSTRY TO EMPLOYER-REPORTED INDUSTRY
IN SURVEY AND ADMINISTRATIVE DATA
September 2013
Working Paper Number:
CES-13-47
The Census Bureau collects industry information through surveys and administrative data and creates associated public-use statistics. In this paper, we compare person-reported industry in the American Community Survey (ACS) to employer-reported industry from the Quarterly Census of Employment and Wages (QCEW) that is part of the Census Bureau's Longitudinal Employer-Household Dynamics (LEHD) program. This research provides necessary information on the use of administrative data as a supplement to survey data industry information, and the findings will be useful for anyone using industry information from either source. Our project is part of a larger effort to compare information on jobs from household survey data to employer-reported information. This research is the first to compare ACS job data to firm-based administrative data. We find an overall industry sector match rate of 75 percent, and a 61 percent match rate at the 4-digit Census Industry Code (CIC) level. Industry match rates vary by sector and by whether industry sector is classified using ACS or LEHD industry information. The educational services and health care and social assistance sectors have among the highest match rates. The management of companies and enterprises sector has the lowest match rate, using either ACS-reported or LEHD-reported sector. For individuals with imputed industry data, the industry sector match rate is only 14 percent. Our findings suggest that the industry distribution and the sample in a particular industry sector will differ depending on whether ACS or LEHD data are used.
View Full
Paper PDF
-
Poverty Estimates for Places in the United States
September 2005
Working Paper Number:
CES-05-12
This paper first describes some historical poverty trends, overall and for demographic groups and broad locations within the U.S. from an ongoing household survey, and then presents some specific information on poverty for localities by size, from the most recent decennial census (2000). Rural poverty exceeded urban poverty in 1969 and 1979, but urban poverty in 1999 was higher than rural poverty. Non-metropolitan area poverty exceeded metropolitan area poverty in each of the four censuses, but within each of those areas, rural poverty is now less than urban poverty. Within metropolitan areas, poverty is highest for those in central cities. For urbanized areas (50,000 or more population), the poverty rate is lower as the area gets larger, with the exception of the very largest-sized areas. This higher poverty for the largest places is accounted for entirely by the higher poverty rate for the central city or cities in those urban agglomerations, as the poverty rates for the parts of the urbanized areas not in the central place continue to fall as the area itself gets larger. Some of the critical relationships affecting the poverty rate of places appear to be the location of certain types of people - female householders, non-citizens, people of color, and college graduates.
View Full
Paper PDF
-
Counting The Self-Employed From Two Perspectives: Household Vs. Business Sample Data
August 1995
Working Paper Number:
CES-95-11
This study compares the number and attributes of self-employed workers using the Characteristics of Business Owners and Current Population Survey data series. Both sources of data have been widely used in empirical studies of entrepreneurship/self-employment. Substantial and inexplicable differences were found in the two data series' estimates of the number of self-employed men and women for both reference years. In terms of individual attributes, the CBO and CPS appear to report reasonably similar profiles of self-employed individuals in terms of marital status and geographic location, and similar systematic gender differences in the industrial distributions of these individuals. However, in terms of other attributes captured by both data series, including age, the two series exhibit notable dissimilarities.
View Full
Paper PDF
-
Using Administrative Earnings Records to Assess Wage Data Quality in the March Current Population Survey and the Survey of Income and Program Participation
November 2002
Working Paper Number:
tp-2002-22
The March Current Population Survey (CPS) and the Survey of Income and Program
Participation (SIPP) produce different aggregates and distributions of annual wages. An excess of
high wages and shortage of low wages occurs in the March CPS. SIPP shows the opposite, an
excess of low wages and shortage of high wages. Exactly-matched Detailed Earnings Records
(DER) from the Social Security Administration allow comparing March CPS and SIPP people's
wages using data independent of the surveys. Findings include the following. March CPS and
SIPP people differ little in their true wage characteristics. March CPS and SIPP represent a
worker's percentile rank better than the dollar amount of wages. Workers with one job and low
work effort have underestimated March CPS wages. March CPS has a higher level of
"underground" wages than SIPP, and increasingly so in the 1990s. March CPS has a higher level
of self-employment income "misclassified" as wages than SIPP, and increasingly so in the 1990s.
These trends may explain one-third of March CPS's 6-percentage-point increase in aggregate
wages relative to independent estimates from 1993 to 1995. Finally, the paper delineates March
CPS occupations disproportionately likely to be absent from the administrative data entirely or to
"misclassify" self-employment income as wages.
View Full
Paper PDF
-
Design Comparison of LODES and ACS Commuting Data Products
October 2014
Working Paper Number:
CES-14-38
The Census Bureau produces two complementary data products, the American Community Survey (ACS) commuting and workplace data and the Longitudinal Employer-Household Dynamics (LEHD) Origin-Destination Employment Statistics (LODES), which can be used to answer questions about spatial, economic, and demographic questions relating to workplaces and home-to-work flows. The products are complementary in the sense that they measure similar activities but each has important unique characteristics that provide information that the other measure cannot. As a result of questions from data users, the Census Bureau has created this document to highlight the major design differences between these two data products. This report guides users on the relative advantages of each data product for various analyses and helps explain differences that may arise when using the products.2,3
As an overview, these two data products are sourced from different inputs, cover different populations and time periods, are subject to different sets of edits and imputations, are released under different confidentiality protection mechanisms, and are tabulated at different geographic and characteristic levels. As a general rule, the two data products should not be expected to match exactly for arbitrary queries and may differ substantially for some queries.
Within this document, we compare the two data products by the design elements that were deemed most likely to contribute to differences in tabulated data. These elements are: Collection, Coverage, Geographic and Longitudinal Scope, Job Definition and Reference Period, Job and Worker Characteristics, Location Definitions (Workplace and Residence), Completeness of Geographic Information and Edits/Imputations, Geographic Tabulation Levels, Control Totals, Confidentiality Protection and Suppression, and Related
Public-Use Data Products.
An in-depth data analysis'in aggregate or with the microdata'between the two data products will be the subject of a future technical report. The Census Bureau has begun a pilot project to integrate ACS microdata with LEHD administrative data to develop an enhanced frame of employment status, place of work, and commuting. The Census Bureau will publish quality metrics for person match rates, residence and workplace match rates, and commute distance comparisons.
View Full
Paper PDF
-
Migration and Dispersal of Hispanic and Asian Groups: An Analysis of the 2006-2008 Multiyear American Community Survey
October 2011
Working Paper Number:
CES-11-33
This report seeks to evaluate selective migration processes of Hispanic and Asian nationality groups in the US from established settlement areas, using recent migration data from the American Community Survey. The underlying goal is to detect migration tendencies leading toward an increased dispersion of these groups associated with their migration processes. Using descriptive statistics, maps, and migration models, we assess how migration processes in the 2006-8 period are leading to the dispersal of Hispanic and Asian race ethnic groups across metropolitan areas, with special attention to the roles of co-ethnic communities and spatial assimilation. These analyses employ migration data available from the 3-year 2006-8 American Community Survey using restricted data from the US Census Bureau's Research Data Centers. This use of the restricted ACS files permitted the first post 2000 analysis of inter-metropolitan migration for Hispanic groups (Mexicans, Puerto Ricans, Cubans, Salvadorans, Dominicans) and Asian groups (Chinese, Indians, Filipinos, Vietnamese, Koreans) using the detailed demographic and geographic attributes available with these files. The data and analysis presented here provide a benchmark for further research of this kind with the American Community Survey in light of the fact that migration data will no longer be available from the US decennial census. The study examines migration from these groups' major settlement areas to other metropolitan area destinations as they are affected by the attraction of co-ethnic communities and by a migrant selectivity pattern consistent with the perspective of spatial assimilation. The migration processes themselves were evaluated in terms of two components: the out--migration rates of residents, and the destination selection of movers. From the perspective of co-ethnic community attraction, it was hypothesized that the outmigration rates from high co-ethnic settlement areas would be lower than those from areas where the group had a smaller overall presence and that the destination selections of out-migrants would be positively affected by the presence of high co-ethnic population shares in destination areas. From the spatial assimilation perspective, it was hypothesized that out-migration from high coethnic areas would least likely occur for group members with lowest education, poor facility with English, and recently arrived in the US; whereas the selection of destinations with large coethnic population shares would be most likely to occur for these same population categories. The results strongly confirm that co-ethnic community attraction continues to reduce outmigration of groups from major settlement origins and positively influences their destination selections. A series of multivariate migrant destination selection models confirm a consistent draw of ethnically similar destinations across individual Hispanic and Asian groups when other economic, demographic and structural metropolitan attributes are taken into account. In contrast, results regarding spatial assimilation are typically mixed or nonexistent in characterizing both out-migration and mover destination selectivity patterns. In fact, we find contrary evidence for some Asian groups for whom it is the most educated, and native born migrants who show a penchant for selecting destinations with greater co-ethnic population shares. Among the greatest destinations for Indians, for example, are Philadelphia, Seattle, Dallas, Boston and Atlanta- areas with higher than average Indian population shares, and areas that also house knowledge-based industries. The selection of co-ethnic destinations among Hispanic group migrants appears somewhat impervious to education attainment and Hispanic and Mexican group movers, who are foreign born and who arrived since 2000, are least, rather than most, prone to select co-ethnic destinations. The mover destination models make plain that employment growth at destination provides a strong draw for all Hispanic groups. This suggests that recent growth in low skilled jobs in parts of the country with small Hispanic populations are nonetheless attracting newly arrived, and less skilled Mexicans and other Hispanics who might have previously been especially lured to destinations with large co-ethnic population shares.
View Full
Paper PDF