Scholars deploy census-based measures of neighborhood context throughout the social sciences and epidemiology. Decades of research confirm that variation in how individuals are aggregated into geographic units to create variables that control for social, economic or political contexts can dramatically alter analyses. While most researchers are aware of the problem, they have lacked the tools to determine its magnitude in the literature and in their own projects. By using confidential access to the complete 2010 U.S. Decennial Census, we are able to construct'for all persons in the US'individual-specific contexts, which we group according to the Census-assigned block, block group, and tract. We compare these individual-specific measures to the published statistics at each scale, and we then determine the magnitude of variation in context for an individual with respect to the published measures using a simple statistic, the standard deviation of individual context (SDIC). For three key measures (percent Black, percent Hispanic, and Entropy'a measure of ethno-racial diversity), we find that block-level Census statistics frequently do not capture the actual context of individuals within them. More problematic, we uncover systematic spatial patterns in the contextual variables at all three scales. Finally, we show that within-unit variation is greater in some parts of the country than in others. We publish county-level estimates of the SDIC statistics that enable scholars to assess whether mis-specification in context variables is likely to alter analytic findings when measured at any of the three common Census units.
-
Improving Estimates of Neighborhood Change with Constant Tract Boundaries
May 2022
Working Paper Number:
CES-22-16
Social scientists routinely rely on methods of interpolation to adjust available data to their research needs. This study calls attention to the potential for substantial error in efforts to harmonize data to constant boundaries using standard approaches to areal and population interpolation. We compare estimates from a standard source (the Longitudinal Tract Data Base) to true values calculated by re-aggregating original 2000 census microdata to 2010 tract areas. We then demonstrate an alternative approach that allows the re-aggregated values to be publicly disclosed, using 'differential privacy' (DP) methods to inject random noise to protect confidentiality of the raw data. The DP estimates are considerably more accurate than the interpolated estimates. We also examine conditions under which interpolation is more susceptible to error. This study reveals cause for greater caution in the use of interpolated estimates from any source. Until and unless DP estimates can be publicly disclosed for a wide range of variables and years, research on neighborhood change should routinely examine data for signs of estimation error that may be substantial in a large share of tracts that experienced complex boundary changes.
View Full
Paper PDF
-
Associations Between Public Housing and Individual Earnings in New Orleans
October 2015
Working Paper Number:
CES-15-32
This study uses a sample of the civilian labor force aged 16-64 constructed from the Decennial Census and American Community Survey, along with data from the HUD dataset Picture of Subsidized Households, to compare the likelihood for job earnings in relation to public housing developments in the New Orleans MSA before and after Hurricane Katrina. Results from a series of hierarchical linear models (HLM) indicate significant relationships are altered between time periods, including those from public and mixed-income developments, suggesting a fluid relationship between neighborhoods and economic outcomes during physical, demographic and economic restructuring.
View Full
Paper PDF
-
Metropolitan Segregation: No Breakthrough in Sight
May 2022
Working Paper Number:
CES-22-14
The 2020 Census offers new information on changes in residential segregation in metropolitan regions across the country as they continue to become more diverse. We take a long view, assessing trends since 1980 and extrapolating to the future. These new data mostly reinforce patterns that were observed a decade ago: high but slowly declining black-white segregation, and less intense but hardly changing segregation of Hispanics and Asians from whites. Enough time has passed since the civil rights era of the 1960s and 1970s to draw this conclusion: segregation will continue to divide Americans well into the 21st Century.
View Full
Paper PDF
-
WHITE-LATINO RESIDENTIAL ATTAINMENTS AND SEGREGATION
IN SIX CITIES: ASSESSING THE ROLE OF MICRO-LEVEL FACTORS
January 2016
Working Paper Number:
CES-16-51
This study examines the residential outcomes of Latinos in major metropolitan areas using new methods to connect micro-level analyses of residential attainments to overall patterns of segregation in the metropolitan area. Drawing on new formulations of standard measures of evenness, we conduct micro-level multivariate analyses using the restricted-use census microdata files to predict segregation-relevant neighborhood outcomes for individuals by race. We term the dependent variables segregation-relevant neighborhood outcomes because the differences in average outcomes for each group on these variables determine the values of the aggregate measures of evenness. This approach allows me to use standardization and components analysis to quantitatively assess the separate contributions that differences in social characteristics and differences in rates of return make towards determining the overall disparity in residential outcomes ' that is, the level of segregation ' between Whites and Latinos. Based on our micro-level residential attainment analyses we find that for Latinos, acculturation and gains in socioeconomic status are associated with greater residential contact with Whites, in agreement with spatial assimilation theory, which promotes lower segregation. However, our standardization and components analyses reveals that a substantial portion of White-Latino disparities in residential contact with Whites can be attributed to differences in rates of return; that is White-Latino differences in the ability to translate acculturation and gains in socioeconomic status into more residential contact with Whites. This is further elaborated upon by assessing the changes in contact with Whites for Whites and Latinos after manipulating single variables while holding all others constant. This can be interpreted as the role of discrimination which is emphasized by place stratification theory. Therefore we conclude that while members of minority groups make gains in residential outcomes that reduce segregation by attaining parity with Whites on social characteristics as spatial assimilation theory would predict, a substantial disparity will persist as Latinos cannot translate those gains into greater contact with Whites at the rate that Whites can. At the aggregate level of analysis, this means that White-Latino segregation remains substantial even when groups are equalized on social and economic characteristics.
View Full
Paper PDF
-
SYNTHETIC DATA FOR SMALL AREA ESTIMATION IN THE AMERICAN COMMUNITY SURVEY
April 2013
Working Paper Number:
CES-13-19
Small area estimates provide a critical source of information used to study local populations. Statistical agencies regularly collect data from small areas but are prevented from releasing detailed geographical identifiers in public-use data sets due to disclosure concerns. Alternative data dissemination methods used in practice include releasing summary/aggregate tables, suppressing detailed geographic information in public-use data sets, and accessing restricted data via Research Data Centers. This research examines an alternative method for disseminating microdata that contains more geographical details than are currently being released in public-use data files. Specifically, the method replaces the observed survey values with imputed, or synthetic, values simulated from a hierarchical Bayesian model. Confidentiality protection is enhanced because no actual values are released. The method is demonstrated using restricted data from the 2005-2009 American Community Survey. The analytic validity of the synthetic data is assessed by comparing small area estimates obtained from the synthetic data with those obtained from the observed data.
View Full
Paper PDF
-
Peer Income Exposure Across the Income Distribution
February 2025
Working Paper Number:
CES-25-16
Children from families across the income distribution attend public schools, making schools and classrooms potential sites for interaction between more- and less-affluent children. However, limited information exists regarding the extent of economic integration in these contexts. We merge educational administrative data from Oregon with measures of family income derived from IRS records to document student exposure to economically diverse school and classroom peers. Our findings indicate that affluent children in public schools are relatively isolated from their less affluent peers, while low- and middle-income students experience relatively even peer income distributions. Students from families in the top percentile of the income distribution attend schools where 20 percent of their peers, on average, come from the top five income percentiles. A large majority of the differences in peer exposure that we observe arise from the sorting of students across schools; sorting across classrooms within schools plays a substantially smaller role.
View Full
Paper PDF
-
Factors that Influence Change in Hispanic Identification: Evidence from Linked Decennial Census and American Community Survey Data
October 2018
Working Paper Number:
CES-18-45
This study explores patterns of ethnic boundary crossing as evidenced by changes in Hispanic origin responses across decennial census and survey data. We identify socioeconomic, cultural, and demographic factors associated with Hispanic response change. In addition, we assess whether changes in the Hispanic origin question between the 2000 and 2010 censuses influenced changes in Hispanic reporting. We use a unique large dataset that links a person's unedited responses to the Hispanic origin question across Census 2000, the 2010 Census and the 2006-2010 American Community Survey five-year file. We find that most of the individuals in the sample identified consistently as Hispanic regardless of changes in the wording of the Hispanic origin question. Individuals who changed in or out of a Hispanic identification, as well as those who consistently identified as non-Hispanic (of Hispanic ancestry), differed in socioeconomic and cultural characteristics from individuals who consistently reported as Hispanic. The likelihood of changing their Hispanic origin response is higher among U.S.-born individuals, those reporting mixed Hispanic and non-Hispanic ancestries, those who speak only English at home, and those who live in tracts that are predominantly non-Hispanic. Racial identification and detailed Hispanic background also influence changes in Hispanic origin responses. Finally, changes in mode and relationship to the reference person in the household are associated with changes in Hispanic origin responses, suggesting that data collection elements also can influence Hispanic origin response change.
View Full
Paper PDF
-
Structural versus Ethnic Dimensions of Housing Segregation
March 2016
Working Paper Number:
CES-16-22
Racial residential segregation is still very high in many American cities. Some portion of segregation is attributable to socioeconomic differences across racial lines; some portion is caused by purely racial factors, such as preferences about the racial composition of one's neighborhood or discrimination in the housing market. Social scientists have had great difficulty disaggregating segregation into a portion that can be explained by interracial differences in socioeconomic characteristics (what we call structural factors) versus a portion attributable to racial and ethnic factors. What would such a measure look like? In this paper, we draw on a new source of data to develop an innovative structural segregation measure that shows the amount of segregation that would remain if we could assign households to housing units based only on non-racial socioeconomic characteristics. This inquiry provides vital building blocks for the broader enterprise of understanding and remedying housing segregation.
View Full
Paper PDF
-
The Relationship of Personal and Neighborhood Characteristics to Immigrant Fertility
August 2002
Working Paper Number:
CES-02-20
We find that fertility varies by immigrant generation, with significant declines between the first and subsequent generations for groups with large immigrant population. However, we find that personal characteristics--such as educational attainment, marital status, and income levels--are much more important than immigrant generation in understanding fertility outcomes. In fact, generations are not independently important once these personal characteristics are controlled for. We maintain that declining fertility levels among the descendants of Mexican and Central American immigrants are primarily the result of higher educational attainment levels, lower rates of marriage, and lower poverty. For example, a four-year increase in educational attainment decreases children ever born (CEB) by half a child. We conclude that immigrant generation serves as a proxy for changes in other personal characteristics that decrease fertility. Neighborhood characteristics have some bearing on fertility, but the correlations are relatively weak. Among Mexican and Central American immigrants and their descendants, the most consistent predictor of children ever born (CEB) at the neighborhood level is the percentage of Hispanic adults. However, no neighborhood characteristics bear any statistical relationship to current fertility, the measure that emphasizes recent births. This pattern of evidence suggests that the observed relationships between neighborhood characteristics and fertility are based on selection into the neighborhood rather than on neighborhood influences as such.
View Full
Paper PDF
-
Changes in Neighborhood Inequality, 2000-2010
March 2016
Working Paper Number:
CES-16-18
Recent work has suggested that higher income inequality may be a desirable attribute of a neighborhood in that it represents diversity, even though high (and rising) inequality appears to be detrimental to the nation as a whole. The research reported here has determined the key characteristics of a census tract that are associated with the level of inequality in 2000 or 2010, and those associated with changes in income inequality between 2000 and 2010. For the change, the strongest influence is a negative effect for the level of income inequality in 2000; that is, higher income inequality in 2000 leads to a decline over the decade, ceteris paribus. Neighborhoods with higher proportions or levels of the following population and housing characteristics tend to have both higher income inequality and a larger increase in income inequality between 2000 and 2010: individuals in poverty, those with a bachelor's degree, older individuals, householders living alone, and median rent, and lower median housing value and household income. Among these, perhaps the most important determinant is the percent in poverty in 2000. Furthermore, as the baseline level of demographic and economic diversity increases, the better the baseline and change characteristics explain the change in the Gini index from 2000 to 2010.
View Full
Paper PDF