-
An outside view: What do observers say about others' races and Hispanic origins?
August 2015
Working Paper Number:
carra-2015-05
Outsiders' views of a person's race or Hispanic origin can impact how she sees herself, how she reports her race and Hispanic origins, and her social and economic experiences. The way outsiders describe non-strangers in terms of their race and Hispanic origin may reveal popular assumptions about which race/Hispanic categories are salient for Americans, which kinds of people are seen as multiracial, and the types of cues people use when identifying another person's race. We study patterns of observer identification using a unique, large, linked data source with two measures of a person's race and Hispanic origin. One measure (from Census 2000 or the 2010 Census) was provided by a household respondent and the other (from the other census year) was provided by a census proxy reporter (e.g., a neighbor) who responded on behalf of a non-responsive household. We ask: Does an outsider's report of a person's race and Hispanic origin match a household report? We find that in about 90% of our 3.7 million (nonrepresentative) cases, proxy reports of a person's race and Hispanic origin match responses given by the household in a different census year. Match rates are high for the largest groups: non-Hispanic whites, blacks, and Asians and for Hispanics, though proxies are not very able to replicate the race responses of Hispanics. Matches are much less common for people in smaller groups (American Indian/Alaska Native, Pacific Islander, Some Other Race, and multiracial). We also ask: What predicts a matched response and what predicts a particular unmatched response? We find evidence of the persistence of hypodescent for blacks and hyperdescent for American Indians. Biracial Asian-whites and Pacific Islander-whites are more often seen by others as non-Hispanic white than as people of color. Proxy reporters tend to identify children as multiple race and elders as single race, whether they are or not. The race/Hispanic composition of the tract is more powerfully predictive of a particular unmatched response than are tract-level measures of socioeconomic status; unmatched responses are often consistent with the race/Hispanic characteristics of the neighborhood.
View Full
Paper PDF
-
Business Dynamics of Innovating Firms: Linking U.S. Patents with Administrative Data on Workers and Firms
July 2015
Working Paper Number:
CES-15-19
This paper discusses the construction of a new longitudinal database tracking inventors and patent-owning firms over time. We match granted patents between 2000 and 2011 to administrative databases of firms and workers housed at the U.S. Census Bureau. We use inventor information in addition to the patent assignee firm name to and improve on previous efforts linking patents to firms. The triangulated database allows us to maximize match rates and provide validation for a large fraction of matches. In this paper, we describe the construction of the database and explore basic features of the data. We find patenting firms, particularly young patenting firms, disproportionally contribute jobs to the U.S. economy. We find patenting is a relatively rare event among small firms but that most patenting firms are nevertheless small, and that patenting is not as rare an event for the youngest firms compared to the oldest firms. While manufacturing firms are more likely to patent than firms in other sectors, we find most patenting firms are in the services and wholesale sectors. These new data are a product of collaboration within the U.S. Department of Commerce, between the U.S. Census Bureau and the U.S. Patent and Trademark Office.
View Full
Paper PDF
-
Assimilation and Coverage of the
Foreign-Born Population in Administrative Records
April 2015
Working Paper Number:
carra-2015-02
The U.S. Census Bureau is researching ways to incorporate administrative data in decennial census and survey operations. Critical to this work is an understanding of the coverage of the population by administrative records. Using federal and third party administrative data linked to the American Community Survey (ACS), we evaluate the extent to which administrative records provide data on foreign-born individuals in the ACS and employ multinomial logistic regression techniques to evaluate characteristics of those who are in administrative records relative to those who are not. We find that overall, administrative records provide high coverage of foreign-born individuals in our sample for whom a match can be determined. The odds of being in administrative records are found to be tied to the processes of immigrant assimilation - naturalization, higher English proficiency, educational attainment, and full-time employment are associated with greater odds of being in administrative records. These findings suggest that as immigrants adapt and integrate into U.S. society, they are more likely to be involved in government and commercial processes and programs for which we are including data. We further explore administrative records coverage for the two largest race/ethnic groups in our sample - Hispanic and non-Hispanic single-race Asian foreign born, finding again that characteristics related to assimilation are associated with administrative records coverage for both groups. However, we observe that neighborhood context impacts Hispanics and Asians differently.
View Full
Paper PDF
-
Evaluating Race and Hispanic Origin Responses of Medicaid Participants Using Census Data
April 2015
Working Paper Number:
carra-2015-01
Health and health care disparities associated with race or Hispanic origin are complex and continue to challenge researchers and policy makers. With the intention of improving the measurement and monitoring of these disparities, provisions of the Patient Protection and Affordable Care Act (ACA) of 2010 require states to collect, report and analyze data on demographic characteristics of applicants and participants in Medicaid and other federally supported programs. By linking Medicaid records to 2010 Census, American Community Survey, and Census 2000, this new large-scale study examines and documents the extent to which pre-ACA Medicaid administrative records match self-reported race and Hispanic origin in Census data. Linked records allow comparisons between individuals with matching and non-matching race and Hispanic origin data across several demographic, socioeconomic and neighborhood characteristics, such as age, gender, language proficiency, education and Census tract variables. Identification of the groups most likely to have non-matching and missing race and Hispanic origin data in Medicaid relative to Census data can inform strategies to improve the quality of demographic data collected from Medicaid populations.
View Full
Paper PDF
-
The EITC over the business cycle: Who benefits?
December 2014
Working Paper Number:
carra-2014-15
In this paper, I examine the impact of the Great Recession on Earned Income Tax Credit (EITC) eligibility. Because the EITC is structurally tied to earnings, the direction of this impact is not immediately obvious. Families who experience complete job loss for an entire tax year lose eligibility, while those experiencing underemployment (part-year employment, a reduction in hours, or spousal unemployment in married households) may become eligible. Determining the direction and magnitude of the impact is important for a number of reasons. The EITC has become the largest cash-transfer program in the U.S., and many low-earning families rely on it as a means of support in tough times. The program has largely been viewed as a replacement for welfare, enticing former welfare recipients into the labor force. However, the effectiveness of the EITC during a period of very high unemployment has not been assessed. To answer these questions, I first use the Current Population Survey (CPS) matched to Internal Revenue Service data from tax years 2005 to 2010 to assess patterns of employment and eligibility over the Great Recession for different labor-force groups. Results indicate that overall, EITC eligibility increased over the recession, but only among groups that were cushioned from total household earnings loss by marriage. I also use the 2006 CPS matched to tax data from 2005 through 2011 to examine changes in eligibility experienced by individuals over time. In assessing three competing causes of eligibility loss, I find that less-educated, unmarried women experienced a greater hazard of eligibility loss due a yearlong lack of earnings compared with other labor-market groups. I discuss the implications of these findings on the view of the EITC as a safety-net program.
View Full
Paper PDF
-
Coverage and Agreement of Administrative Records and 2010 American Community Survey Demographic Data
November 2014
Working Paper Number:
carra-2014-14
The U.S. Census Bureau is researching possible uses of administrative records in decennial census and survey operations. The 2010 Census Match Study and American Community Survey (ACS) Match Study represent recent efforts by the Census Bureau to evaluate the extent to which administrative records provide data on persons and addresses in the 2010 Census and 2010 ACS. The 2010 Census Match Study also examines demographic response data collected in administrative records. Building on this analysis, we match data from the 2010 ACS to federal administrative records and third party data as well as to previous census data and examine administrative records coverage and agreement of ACS age, sex, race, and Hispanic origin responses. We find high levels of coverage and agreement for sex and age responses and variable coverage and agreement across race and Hispanic origin groups. These results are similar to findings from the 2010 Census Match Study.
View Full
Paper PDF
-
Do Doubled-up Families Minimize Household-level Tax Burden?
September 2014
Working Paper Number:
carra-2014-13
This paper examines a method of tax avoidance not previously studied: the sorting of dependent children among related filers who have 'doubled up' in a household for economic reasons. Using the Current Population Survey Annual Social and Economic Supplement (CPS ASEC) linked with 1040 data from the Internal Revenue Service (IRS), we examine households with children and at least two adult tax filers to determine whether the household minimizes income tax burden, and thus maximizes refunds, by optimally claiming dependents. We examine specifically the relationship between the Earned Income Tax Credit (EITC) and the sorting of dependent children among filers in households. We find the following: The propensity to sort increases as the number of filers who are potentially eligible for the EITC increases; sorting probability increases as the optimal household EITC amount increases; and among households with at least one EITC-eligible filer, the propensity to sort increases as the difference between modeled household EITC amount and the optimal amount increases. We also exploit the 2009 change in EITC benefit for families with three or more children, finding that the propensity to sort to exactly three children increased among EITC-eligible filers after the rule change. The results of this analysis improve our understanding of filing behavior, particularly how households form filing units and pool resources, and have implications for poverty measurement in complex households This presentation was given at the CARRA Seminar, July 16, 2014
View Full
Paper PDF
-
Creating Linked Historical Data: An Assessment of the Census Bureau's Ability to Assign Protected Identification Keys to the 1960 Census
September 2014
Working Paper Number:
carra-2014-12
In order to study social phenomena over the course of the 20th century, the Census Bureau is investigating the feasibility of digitizing historical census records and linking them to contemporary data. However, historical censuses have limited personally identifiable information available to match on. In this paper, I discuss the problems associated with matching older censuses to contemporary data files, and I describe the matching process used to match a small sample of the 1960 census to the Social Security Administration Numeric Identification System.
View Full
Paper PDF
-
Person Matching in Historical Files using the Census Bureau's Person Validation System
September 2014
Working Paper Number:
carra-2014-11
The recent release of the 1940 Census manuscripts enables the creation of longitudinal data spanning the whole of the twentieth century. Linked historical and contemporary data would allow unprecedented analyses of the causes and consequences of health, demographic, and economic change. The Census Bureau is uniquely equipped to provide high quality linkages of person records across datasets. This paper summarizes the linkage techniques employed by the Census Bureau and discusses utilization of these techniques to append protected identification keys to the 1940 Census.
View Full
Paper PDF
-
America's Churning Races: Race and Ethnic Response Changes between Census 2000 and the 2010 Census
August 2014
Working Paper Number:
carra-2014-09
Race and ethnicity responses can change over time and across contexts - a component of population change not usually taken into account. To what extent do race and/or Hispanic origin responses change? Is change more common to/from some race/ethnic groups than others? Does the propensity to change responses vary by characteristics of the individual? To what extent do these changes affect researchers? We use internal Census Bureau data from the 2000 and 2010 censuses in which individuals' responses have been linked across years. Approximately 9.8 million people (about 6 percent) in our large, non-representative linked data have a different race and/or Hispanic origin response in 2010 than they did in 2000. Several groups experienced considerable fluidity in racial identification: American Indians and Alaska Natives, Native Hawaiians and Other Pacific Islanders, and multiple-race response groups, as well as Hispanics when reporting a race. In contrast, race and ethnic responses for single-race non-Hispanic whites, blacks, and Asians were relatively consistent over the decade, as were ethnicity responses by Hispanics. People who change their race and/or Hispanic origin response(s) are doing so in a wide variety of ways, as anticipated by previous research. For example, people's responses change from multiple races to a single race, from a single race to multiple races, from one single race to another, and some people add or drop a Hispanic response. The inflow of people to each race/Hispanic group is in many cases similar in size to the outflow from the same group, such that cross-sectional data would show a small net change. We find response changes across ages, sexes, regions, and response modes, with variation across groups. Researchers should consider the implications of changing race and Hispanic origin responses when conducting analyses and interpreting results.
View Full
Paper PDF