This paper describes a novel database and an associated suicide event prediction model that surmount longstanding barriers in suicide risk factor research. The database comingles person-level records from the National Violent Death Reporting System (NVDRS) and the American Community Survey (ACS) to establish a case-control study sample that includes all identified suicide cases, while faithfully reflecting general population sociodemographics, in sixteen USA states during the years 2005 2011. It supports a statistical model of individual suicide risk that accommodates person-level factors and the moderation of these factors by their community rates. Named the United States Multi-Level Suicide Data Set (US-MSDS), the database was developed outside the RDC laboratory using publicly available ACS microdata, and reconstructed inside the laboratory using restricted access ACS microdata. Analyses of the latter version yielded findings that largely amplified but also extended those obtained from analyses of the former. This experience shows that the analytic precision achievable using restricted access ACS data can play an important role in conducting social research, although it also indicates that publicly available ACS data have considerable value in conducting preliminary analyses and preparing to use an RDC laboratory. The database development strategy may interest scientists investigating sociodemographic risk factors for other types of low-frequency mortality.
-
Mortality in a Multi-State Cohort of Former State Prisoners, 2010-2015
February 2022
Working Paper Number:
CES-22-06
Previous studies report that individuals who have been imprisoned have higher mortality rates than their demographic counterparts in the general population, particularly non-Hispanic white former prisoners. Most of these studies have been based on a single state's prison system, and the extent to which their findings can be generalized has not been established. In this study we explore the role that race/Hispanic origin, other demographic characteristics, and custodial/ criminal history factors have on post-release mortality, including on the timing of deaths. We also assess whether conditional release to community supervision or reimprisonment may explain the higher post-release mortality found among non-Hispanic whites. In the second part of the analysis, we estimate standardized mortality ratios (SMRs) by sex, age group, and race/Hispanic origin using as reference the U.S. general population. The data come from state prison releases from the Bureau of Justice Statistics' (BJS) National Corrections Reporting Program (NCRP). The NCRP records were linked to the Census Numident to identify deaths occurring within five years from prison release. We also linked NCRP records to previous decennial censuses and survey responses to obtain self-reported race and Hispanic origin if available. We found that non-Hispanic white former prisoners were more likely to die within five years after prison release and more likely to die in the initial weeks after release compared to racial minorities and Hispanics. Reimprisonment, age at release, and a history of multiple prison terms had a similar influence on the odds of dying across all race/Hispanic origin groups. Other factors, such as the type of release and the duration of the last term in prison, were associated with higher risks of mortality for some groups but not for others.
View Full
Paper PDF
-
Leaving Home: Modeling the Effect of Civic and Economic Structure on Individual Migration Patterns
June 2002
Working Paper Number:
CES-02-16
This research analyzes the effect of community structure upon individuals' probabilities of moving between 1985 and 1990. Using the full Census sample long form microdata for 1990, we re-allocate adult persons in 1990 to their 1985 county of residence. Then, using origin county macro-structural variables (derived from the Economic Census microdata) and individual characteristics (from Decennial Census microdata), we develop a two level hierarchical linear model. In level 1, we construct a logistic equation modeling individual probabilities of moving. In level 2, we model the contextual effects of origin community structure on these models. These contextual effects fall into two categories: 1) economic conditions that comprise the usual aggregate 'push' factors and 2) civic community factors that act to retain people in their community. Results specify the relationship between community context and individual migration patterns, and demonstrate effects of local economic structure and local civic structure on these individual probabilities. Most notably, we find that civic attributes of communities are associated with a propensity to stay in place, net of community economic factors and individual characteristics.
View Full
Paper PDF
-
An In-Depth Examination of Requirements for Disclosure Risk Assessment
October 2023
Authors:
Ron Jarmin,
John M. Abowd,
Ian M. Schmutte,
Jerome P. Reiter,
Nathan Goldschlag,
Victoria A. Velkoff,
Michael B. Hawes,
Robert Ashmead,
Ryan Cumings-Menon,
Sallie Ann Keller,
Daniel Kifer,
Philip Leclerc,
Rolando A. RodrÃguez,
Pavel Zhuravlev
Working Paper Number:
CES-23-49
The use of formal privacy to protect the confidentiality of responses in the 2020 Decennial Census of Population and Housing has triggered renewed interest and debate over how to measure the disclosure risks and societal benefits of the published data products. Following long-established precedent in economics and statistics, we argue that any proposal for quantifying disclosure risk should be based on pre-specified, objective criteria. Such criteria should be used to compare methodologies to identify those with the most desirable properties. We illustrate this approach, using simple desiderata, to evaluate the absolute disclosure risk framework, the counterfactual framework underlying differential privacy, and prior-to-posterior comparisons. We conclude that satisfying all the desiderata is impossible, but counterfactual comparisons satisfy the most while absolute disclosure risk satisfies the fewest. Furthermore, we explain that many of the criticisms levied against differential privacy would be levied against any technology that is not equivalent to direct, unrestricted access to confidential data. Thus, more research is needed, but in the near-term, the counterfactual approach appears best-suited for privacy-utility analysis.
View Full
Paper PDF
-
SYNTHETIC DATA FOR SMALL AREA ESTIMATION IN THE AMERICAN COMMUNITY SURVEY
April 2013
Working Paper Number:
CES-13-19
Small area estimates provide a critical source of information used to study local populations. Statistical agencies regularly collect data from small areas but are prevented from releasing detailed geographical identifiers in public-use data sets due to disclosure concerns. Alternative data dissemination methods used in practice include releasing summary/aggregate tables, suppressing detailed geographic information in public-use data sets, and accessing restricted data via Research Data Centers. This research examines an alternative method for disseminating microdata that contains more geographical details than are currently being released in public-use data files. Specifically, the method replaces the observed survey values with imputed, or synthetic, values simulated from a hierarchical Bayesian model. Confidentiality protection is enhanced because no actual values are released. The method is demonstrated using restricted data from the 2005-2009 American Community Survey. The analytic validity of the synthetic data is assessed by comparing small area estimates obtained from the synthetic data with those obtained from the observed data.
View Full
Paper PDF
-
Evaluating Race and Hispanic Origin Responses of Medicaid Participants Using Census Data
April 2015
Working Paper Number:
carra-2015-01
Health and health care disparities associated with race or Hispanic origin are complex and continue to challenge researchers and policy makers. With the intention of improving the measurement and monitoring of these disparities, provisions of the Patient Protection and Affordable Care Act (ACA) of 2010 require states to collect, report and analyze data on demographic characteristics of applicants and participants in Medicaid and other federally supported programs. By linking Medicaid records to 2010 Census, American Community Survey, and Census 2000, this new large-scale study examines and documents the extent to which pre-ACA Medicaid administrative records match self-reported race and Hispanic origin in Census data. Linked records allow comparisons between individuals with matching and non-matching race and Hispanic origin data across several demographic, socioeconomic and neighborhood characteristics, such as age, gender, language proficiency, education and Census tract variables. Identification of the groups most likely to have non-matching and missing race and Hispanic origin data in Medicaid relative to Census data can inform strategies to improve the quality of demographic data collected from Medicaid populations.
View Full
Paper PDF
-
Geographic Disparities in Alzheimer's Disease and Related Dementia Mortality in the US: Comparing Impacts of Place of Birth and Place of Residence
January 2025
Working Paper Number:
CES-25-11
Objective: Building on the hypothesis that early-life exposures might influence the onset of Alzheimer's Disease and Related Dementia (ADRD), this study delves into geographic variations in ADRD mortality in the US. By considering both state of residence and state of birth, we aim to discern the comparative significance of these geospatial factors.
Methods: We conducted a secondary data analysis of the National Longitudinal Mortality Study (NLMS), that has 3.5 million records from 1973-2011 and over 0.5 million deaths. We focused on individuals born in or before 1930, tracked in NLMS cohorts from 1979-2000. Employing multi-level logistic regression, with individuals nested within states of residence and/or states of birth, we assessed the role of geographical factors in ADRD mortality variation.
Results: We found that both state of birth and state of residence account for a modest portion of ADRD mortality variation. Specifically, state of residence explains 1.19% of the total variation in ADRD mortality, whereas state of birth explains only 0.6%. When combined, both state of residence and state of birth account for only 1.05% of the variation, suggesting state of residence could matter more in ADRD mortality outcomes.
Conclusion: Findings of this study suggest that state of residence explains more variation in ADRD mortality than state of birth. These results indicate that factors in later life may present more impactful intervention points for curbing ADRD mortality. While early-life environmental exposures remain relevant, their role as primary determinants of ADRD in later life appears to be less pronounced in this study.
View Full
Paper PDF
-
Who are the people in my neighborhood? The 'contextual fallacy' of measuring individual context with census geographies
February 2018
Working Paper Number:
CES-18-11
Scholars deploy census-based measures of neighborhood context throughout the social sciences and epidemiology. Decades of research confirm that variation in how individuals are aggregated into geographic units to create variables that control for social, economic or political contexts can dramatically alter analyses. While most researchers are aware of the problem, they have lacked the tools to determine its magnitude in the literature and in their own projects. By using confidential access to the complete 2010 U.S. Decennial Census, we are able to construct'for all persons in the US'individual-specific contexts, which we group according to the Census-assigned block, block group, and tract. We compare these individual-specific measures to the published statistics at each scale, and we then determine the magnitude of variation in context for an individual with respect to the published measures using a simple statistic, the standard deviation of individual context (SDIC). For three key measures (percent Black, percent Hispanic, and Entropy'a measure of ethno-racial diversity), we find that block-level Census statistics frequently do not capture the actual context of individuals within them. More problematic, we uncover systematic spatial patterns in the contextual variables at all three scales. Finally, we show that within-unit variation is greater in some parts of the country than in others. We publish county-level estimates of the SDIC statistics that enable scholars to assess whether mis-specification in context variables is likely to alter analytic findings when measured at any of the three common Census units.
View Full
Paper PDF
-
The Case of the Missing Ethnicity: Indians without Tribes in the 21st Century
June 2011
Working Paper Number:
CES-11-17
Among American Indians and Alaska Natives, most aspects of ethnicity are tightly associated with the person's tribal origins. Language, history, foods, land, and traditions differ among the hundreds of tribes indigenous to the United States. Why did almost one million of them fail to respond to the tribal affiliation part of the Census 2000 race question? We investigate four hypotheses about why one-third of multiracial American Indians and one-sixth of single-race American Indians did not report a tribe: (1) survey item non-response which undermines all fillin- the-blank questions, (2) a non-salient tribal identity, (3) a genealogy-based affiliation, and (4) mestizo identity which does not require a tribe. We use multivariate logistic regression models and high-density restricted-use Census 2000 data. We find support for the first two hypotheses and note that the predictors and results differ substantially for single race versus multiple race American Indians.
View Full
Paper PDF
-
The Rural/Urban Volunteering Divide
June 2025
Working Paper Number:
CES-25-42
Are rural residents more likely to volunteer than those living in urban places? Although early sociological theory posited that rural residents were more likely to experience social bonds connecting them to their community, increasing their odds of volunteer engagement, empirical support is limited. Drawing upon the full population of rural and urban respondents to the United States Census Bureau's Current Population Survey (CPS) Volunteering Supplement (2002-2015), we found that rural respondents are more likely to report volunteering compared to urban respondents, although these differences are decreasing over time. Moreover, we found that propensities for rural and urban volunteerism vary based on differences in both individual and place-based characteristics; further, the size of these effects differ across rural and urban places. These findings have important implications for theory and empirical analysis.
View Full
Paper PDF
-
Using Small-Area Estimation (SAE) to Estimate Prevalence of Child Health Outcomes at the Census Regional-, State-, and County-Levels
November 2022
Working Paper Number:
CES-22-48
In this study, we implement small-area estimation to assess the prevalence of child health outcomes at the county, state, and regional levels, using national survey data.
View Full
Paper PDF