-
An Economic Analysis of Privacy Protection and Statistical Accuracy as Social Choices
August 2018
Working Paper Number: CES-18-35
Statistical agencies face a dual mandate to publish accurate statistics while protecting respondent privacy. Increasing privacy protection requires decreased accuracy. Recognizing this as a resource allocation problem, we propose an economic solution: operate where the marginal cost of increasing privacy equals the marginal benefit. Our model of production, from computer science, assumes data are published using an efficient differentially private algorithm. Optimal choice weighs the demand for accurate statistics against the demand for privacy. Examples from U.S. statistical programs show how our framework can guide decision-making. Further progress requires a better understanding of willingness-to-pay for privacy and statistical accuracy.
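The trade-off between privacy and accuracy described above can be illustrated with the Laplace mechanism, a standard differentially private algorithm. This is a minimal sketch, not the paper's method: the abstract does not say which algorithm the authors use, and the names `private_count` and `expected_abs_error`, the counting query, and the sensitivity of 1 are all assumptions made for illustration. A smaller privacy-loss parameter epsilon means stronger privacy and a larger expected error.

```python
import random


def expected_abs_error(epsilon: float, sensitivity: float = 1.0) -> float:
    """Expected absolute error of the Laplace mechanism.

    Laplace noise with scale b has mean absolute value b, and the
    mechanism sets b = sensitivity / epsilon.
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    return sensitivity / epsilon


def private_count(true_count: float, epsilon: float, rng: random.Random,
                  sensitivity: float = 1.0) -> float:
    """Return true_count plus Laplace(0, sensitivity / epsilon) noise.

    The noise is drawn as the difference of two independent exponential
    variates, which follows a Laplace distribution with the same scale.
    (Hypothetical helper for illustration only.)
    """
    scale = expected_abs_error(epsilon, sensitivity)
    rate = 1.0 / scale
    return true_count + rng.expovariate(rate) - rng.expovariate(rate)


if __name__ == "__main__":
    rng = random.Random(0)
    for eps in (0.1, 1.0, 10.0):
        released = private_count(1000, eps, rng)
        print(f"epsilon={eps}: expected |error|={expected_abs_error(eps)}, "
              f"one released value={released:.2f}")
```

In this sketch, halving epsilon doubles the expected error, which is the sense in which more privacy protection costs accuracy. Where to operate along that curve is the social choice the paper analyzes.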
-
Does Federally-Funded Job Training Work? Nonexperimental Estimates of WIA Training Impacts Using Longitudinal Data on Workers and Firms
January 2018
Working Paper Number: CES-18-02
We study the job training provided under the US Workforce Investment Act (WIA) to adults and dislocated workers in two states. Our substantive contributions center on impacts estimated non-experimentally using administrative data. These impacts compare WIA participants who do and do not receive training. In addition to the usual impacts on earnings and employment, we link our state data to the Longitudinal Employer-Household Dynamics (LEHD) data at the US Census Bureau, which allows us to estimate impacts on the characteristics of the firms at which participants find employment. We find moderate positive impacts on employment, earnings and desirable firm characteristics for adults, but not for dislocated workers. Our primary methodological contribution consists of assessing the value of the additional conditioning information provided by the LEHD relative to the data available in state Unemployment Insurance (UI) earnings records. We find that value to be zero.
-
Two Perspectives on Commuting: A Comparison of Home to Work Flows Across Job-Linked Survey and Administrative Files
January 2017
Working Paper Number: CES-17-34
Commuting flows and workplace employment data have a wide constituency of users, including urban and regional planners, social science and transportation researchers, and businesses. The U.S. Census Bureau releases two national data products that give the magnitude and characteristics of home to work flows. The American Community Survey (ACS) tabulates households' responses on employment, workplace, and commuting behavior. The Longitudinal Employer-Household Dynamics (LEHD) program tabulates administrative records on jobs in the LEHD Origin-Destination Employment Statistics (LODES). Design differences across the datasets lead to divergence in a comparable statistic: county-to-county aggregate commute flows. To understand differences in the public use data, this study compares ACS and LEHD source files, using identifying information and probabilistic matching to join person and job records. In our assessment, we compare commuting statistics for job frames linked on person, employment status, employer, and workplace, and we identify person and job characteristics as well as design features of the data frames that explain aggregate differences. We find a lower rate of within-county commuting and farther commutes in LODES. We attribute these greater distances to differences in workplace reporting and to uncertainty of establishment assignments in LEHD for workers at multi-unit employers. Minor contributing factors include differences in residence location and ACS workplace edits. The results of this analysis and the data infrastructure developed will support further work to understand and enhance commuting statistics in both datasets.
-
Racial Disparity in an Era of Increasing Income Inequality
January 2017
Working Paper Number: carra-2017-01
Using unique linked data, we examine income inequality and mobility across racial and ethnic groups in the United States. Our data encompass the universe of tax filers in the U.S. for the period 2000 to 2014, matched with individual-level race and ethnicity information from multiple censuses and American Community Survey data. We document both income inequality and mobility trends over the period. We find significant stratification in terms of average incomes by race and ethnic group and distinct differences in within-group income inequality. The groups with the highest incomes - Whites and Asians - also have the highest levels of within-group inequality and the lowest levels of within-group mobility. The reverse is true for the lowest-income groups: Blacks, American Indians, and Hispanics have lower within-group inequality and immobility. On the other hand, our low-income groups are also highly immobile when looking at overall, rather than within-group, mobility. These same groups also have a higher probability of experiencing downward mobility compared with Whites and Asians. We also find that within-group income inequality increased for all groups between 2000 and 2014, and the increase was especially large for Whites. In regression analyses using individual-level panel data, we find persistent differences by race and ethnicity in incomes over time. We also examine young tax filers (ages 25-35) and investigate the long-term effects of local economic and racial residential segregation conditions at the start of their careers. We find persistent long-run effects of racial residential segregation at career entry on the incomes of certain groups. The picture that emerges from our analysis is of a rigid income structure, with mainly Whites and Asians confined to the top and Blacks, American Indians, and Hispanics confined to the bottom.
-
Documenting the Business Register and Related Economic Business Data
March 2016
Working Paper Number: CES-16-17
The Business Register (BR) is a comprehensive database of business establishments in the United States and provides resources for the U.S. Census Bureau's economic programs for sample selection, research, and survey operations. It is maintained using information from several federal agencies including the Census Bureau, Internal Revenue Service, Bureau of Labor Statistics, and the Social Security Administration. This paper provides a detailed description of the sources and functions of the BR. An overview of the BR as a linking tool and bridge to other Census Bureau data for additional business characteristics is also given.
-
Urban-Suburban Migration in the United States, 1955-2000
February 2016
Working Paper Number: CES-16-08
This study uses census microdata from 1960 to 2010 to look at the rates of suburbanization in the 100 largest metro areas. Breaking down the population by race and ethnicity, and then by income, shows that more affluent people were more likely to move to the suburbs. The White non-Hispanic population has long been the most suburbanized group: a majority of the White population in the 100 largest metro areas lived in suburbs by 1960, while most of the Black non-Hispanic population lived in urban core areas as late as 2000. The Hispanic and Asian populations went from majority urban to majority suburban during this period.
-
Business Dynamics Statistics of High Tech Industries
January 2016
Working Paper Number: CES-16-55
Modern market economies are characterized by the reallocation of resources from less productive, less valuable activities to more productive, more valuable ones. Businesses in the High Technology sector play a particularly important role in this reallocation by introducing new products and services that impact the entire economy. Tracking the performance of this sector is therefore of primary importance, especially in light of recent evidence that suggests a slowdown in business dynamism in High Tech industries. The Census Bureau produces the Business Dynamics Statistics (BDS), a suite of data products that track job creation, job destruction, startups, and exits by firm and establishment characteristics including sector, firm age, and firm size. In this paper we describe the methodologies used to produce a new extension to the BDS focused on businesses in High Technology industries.
-
When Race and Hispanic Origin Reporting are Discrepant Across Administrative Records and Third Party Sources: Exploring Methods to Assign Responses
December 2015
Working Paper Number: carra-2015-08
The U.S. Census Bureau is researching uses of administrative records and third party data in survey and decennial census operations. One potential use of administrative records is to utilize these data when race and Hispanic origin responses are missing. When federal and third party administrative records are compiled, race and Hispanic origin responses are not always the same for an individual across sources. We explore different methods to assign one race and one Hispanic response when these responses are discrepant. We also describe the characteristics of individuals with matching, non-matching, and missing race and Hispanic origin data by demographic, household, and contextual variables. We find that minorities, especially Hispanics, are more likely to have non-matching Hispanic origin and race responses in administrative records and third party data compared to the 2010 Census. Minority groups and individuals ages 0-17 are more likely to have missing race or Hispanic origin data in administrative records and third party data. Larger households tend to have more missing race data in administrative records and third party data than smaller households.
-
An outside view: What do observers say about others' races and Hispanic origins?
August 2015
Working Paper Number: carra-2015-05
Outsiders' views of a person's race or Hispanic origin can impact how she sees herself, how she reports her race and Hispanic origins, and her social and economic experiences. The way outsiders describe non-strangers in terms of their race and Hispanic origin may reveal popular assumptions about which race/Hispanic categories are salient for Americans, which kinds of people are seen as multiracial, and the types of cues people use when identifying another person's race. We study patterns of observer identification using a unique, large, linked data source with two measures of a person's race and Hispanic origin. One measure (from Census 2000 or the 2010 Census) was provided by a household respondent and the other (from the other census year) was provided by a census proxy reporter (e.g., a neighbor) who responded on behalf of a non-responsive household. We ask: Does an outsider's report of a person's race and Hispanic origin match a household report? We find that in about 90% of our 3.7 million (nonrepresentative) cases, proxy reports of a person's race and Hispanic origin match responses given by the household in a different census year. Match rates are high for the largest groups: non-Hispanic whites, blacks, and Asians and for Hispanics, though proxies have difficulty replicating the race responses of Hispanics. Matches are much less common for people in smaller groups (American Indian/Alaska Native, Pacific Islander, Some Other Race, and multiracial). We also ask: What predicts a matched response and what predicts a particular unmatched response? We find evidence of the persistence of hypodescent for blacks and hyperdescent for American Indians. Biracial Asian-whites and Pacific Islander-whites are more often seen by others as non-Hispanic white than as people of color. Proxy reporters tend to identify children as multiple race and elders as single race, whether they are or not.
The race/Hispanic composition of the tract is more powerfully predictive of a particular unmatched response than are tract-level measures of socioeconomic status; unmatched responses are often consistent with the race/Hispanic characteristics of the neighborhood.
-
Evaluating Race and Hispanic Origin Responses of Medicaid Participants Using Census Data
April 2015
Working Paper Number: carra-2015-01
Health and health care disparities associated with race or Hispanic origin are complex and continue to challenge researchers and policy makers. With the intention of improving the measurement and monitoring of these disparities, provisions of the Patient Protection and Affordable Care Act (ACA) of 2010 require states to collect, report and analyze data on demographic characteristics of applicants and participants in Medicaid and other federally supported programs. By linking Medicaid records to the 2010 Census, the American Community Survey, and Census 2000, this new large-scale study examines and documents the extent to which pre-ACA Medicaid administrative records match self-reported race and Hispanic origin in Census data. Linked records allow comparisons between individuals with matching and non-matching race and Hispanic origin data across several demographic, socioeconomic and neighborhood characteristics, such as age, gender, language proficiency, education and Census tract variables. Identification of the groups most likely to have non-matching and missing race and Hispanic origin data in Medicaid relative to Census data can inform strategies to improve the quality of demographic data collected from Medicaid populations.