CREAT - Census Bureau

Using Small-Area Estimation (SAE) to Estimate Prevalence of Child Health Outcomes at the Census Regional-, State-, and County-Levels

November 2022

Written by: Rachel M. Hantman, Anja Zgodic, Jan M. Eberth, Alexander C. McLain

Working Paper Number:

CES-22-48

Abstract

In this study, we implement small-area estimation to assess the prevalence of child health outcomes at the county, state, and regional levels, using national survey data.

Document Tags and Keywords

Keywords:

report, census data, census research, survey, disclosure, confidentiality, information, rural, percentile, census bureau, coverage, health, parental, prevalence, public, publicly, child, rurality, census disclosure

Tags:

Service Annual Survey, Research Data Center, American Community Survey, Department of Health and Human Services, Special Sworn Status, National Institutes of Health, Census Bureau Disclosure Review Board, Disclosure Review Board, Centers for Disease Control and Prevention, Federal Statistical Research Data Center

Similar Working Papers

The 10 most similar working papers to the working paper 'Using Small-Area Estimation (SAE) to Estimate Prevalence of Child Health Outcomes at the Census Regional-, State-, and County-Levels' are listed below in order of similarity.

Working Paper

SYNTHETIC DATA FOR SMALL AREA ESTIMATION IN THE AMERICAN COMMUNITY SURVEY

April 2013

Authors: Joseph W. Sakshaug, Trivellore Raghunathan

Working Paper Number:

CES-13-19

Small area estimates provide a critical source of information used to study local populations. Statistical agencies regularly collect data from small areas but are prevented from releasing detailed geographical identifiers in public-use data sets due to disclosure concerns. Alternative data dissemination methods used in practice include releasing summary/aggregate tables, suppressing detailed geographic information in public-use data sets, and accessing restricted data via Research Data Centers. This research examines an alternative method for disseminating microdata that contains more geographical details than are currently being released in public-use data files. Specifically, the method replaces the observed survey values with imputed, or synthetic, values simulated from a hierarchical Bayesian model. Confidentiality protection is enhanced because no actual values are released. The method is demonstrated using restricted data from the 2005-2009 American Community Survey. The analytic validity of the synthetic data is assessed by comparing small area estimates obtained from the synthetic data with those obtained from the observed data.
View Full Paper PDF
Working Paper

Connected and Uncooperative: The Effects of Homogenous and Exclusive Social Networks on Survey Response Rates and Nonresponse Bias

January 2024

Authors: Jonathan Eggleston, Chase Sawyer

Working Paper Number:

CES-24-01

Social capital, the strength of people's friendship networks and community ties, has been hypothesized as an important determinant of survey participation. Investigating this hypothesis has been difficult given data constraints. In this paper, we provide insights by investigating how response rates and nonresponse bias in the American Community Survey are correlated with county-level social network data from Facebook. We find that areas of the United States where people have more exclusive and homogenous social networks have higher nonresponse bias and lower response rates. These results provide further evidence that the effects of social capital may not be simply a matter of whether people are socially isolated or not, but also what types of social connections people have and the sociodemographic heterogeneity of their social networks.
View Full Paper PDF
Working Paper

Where Are Your Parents? Exploring Potential Bias in Administrative Records on Children

March 2024

Authors: Jennifer Bernard, Kelsey Drotning, Katie R. Genadek

Working Paper Number:

CES-24-18

This paper examines potential bias in the Census Household Composition Key's (CHCK) probabilistic parent-child linkages. By linking CHCK data to the American Community Survey (ACS), we reveal disparities in parent-child linkages among specific demographic groups and find that characteristics of children that can and cannot be linked to the CHCK vary considerably from the larger population. In particular, we find that children from low-income, less educated households and of Hispanic origin are less likely to be linked to a mother or a father in the CHCK. We also highlight some data considerations when using the CHCK.
View Full Paper PDF
Working Paper

Gradient Boosting to Address Statistical Problems Arising from Non-Linkage of Census Bureau Datasets

June 2024

Authors: Narayan Sastry, Todd Gardner, Matthew Cefalu, John Sullivan, Elizabeth Fussell

Working Paper Number:

CES-24-27

This article introduces the twangRDC package, which contains functions to address non-linkage in US Census Bureau datasets. The Census Bureau's Person Identification Validation System facilitates data linkage by assigning unique person identifiers to federal, third party, decennial census, and survey data. Not all records in these datasets can be linked to the reference file and as such not all records will be assigned an identifier. This article is a tutorial for using the twangRDC to generate nonresponse weights to account for non-linkage of person records across US Census Bureau datasets.
View Full Paper PDF
Working Paper

Some Open Questions on Multiple-Source Extensions of Adaptive-Survey Design Concepts and Methods

February 2023

Authors: Stephanie Coffey, PhD., Jaya Damineni, John Eltinge, PhD., Anup Mathur, PhD., Kayla Varela, Allison Zotti

Working Paper Number:

CES-23-03

Adaptive survey design is a framework for making data-driven decisions about survey data collection operations. This paper discusses open questions related to the extension of adaptive principles and capabilities when capturing data from multiple data sources. Here, the concept of 'design' encompasses the focused allocation of resources required for the production of high-quality statistical information in a sustainable and cost-effective way. This conceptual framework leads to a discussion of six groups of issues including: (i) the goals for improvement through adaptation; (ii) the design features that are available for adaptation; (iii) the auxiliary data that may be available for informing adaptation; (iv) the decision rules that could guide adaptation; (v) the necessary systems to operationalize adaptation; and (vi) the quality, cost, and risk profiles of the proposed adaptations (and how to evaluate them). A multiple data source environment creates significant opportunities, but also introduces complexities that are a challenge in the production of high-quality statistical information.
View Full Paper PDF
Working Paper

Disconnected Geography: A Spatial Analysis of Disconnected Youth in the United States

January 2016

Authors: Jeremy W Bray, Brooks Depro, Dorren McMahon, Marion Siegle, Lee Mobley

Working Paper Number:

CES-16-37

Since the Great Recession, US policy and advocacy groups have sought to better understand its effect on a group of especially vulnerable young adults who are not enrolled in school or training programs and not participating in the labor market, so called 'disconnected youth.' This article distinguishes between disconnected youth and unemployed youth and examines the spatial clustering of these two groups across counties in the US. The focus is to ascertain whether there are differences in underlying contextual factors among groups of counties that are mutually exclusive and spatially disparate (non-adjacent), comprising two types of spatial clusters ' high rates of disconnected youth and high rates of unemployed youth. Using restricted, household-level census data inside the Census Research Data Center (RDC) under special permission by the US Census Bureau, we were able to define these two groups using detailed household questionnaires that are not available to researchers outside the RDC. The geospatial patterns in the two types of clusters suggest that places with high concentrations of disconnected youth are distinctly different in terms of underlying characteristics from places with high concentrations of unemployed youth. These differences include, among other things, arrests for synthetic drug production, enclaves of poor in rural areas, persistent poverty in areas, educational attainment in the populace, children in poverty, persons without health insurance, the social capital index, and elders who receive disability benefits. This article provides some preliminary evidence regarding the social forces underlying the two types of observed geospatial clusters and discusses how they differ.
View Full Paper PDF
Working Paper

Neighborhood Effects on High-School Drop-Out Rates and Teenage Childbearing: Tests for Non-Linearities, Race-Specific Effects, Interactions with Family Characteristics, and Endogenous Causation using Geocoded California Census Microdata

May 2008

Authors: Rhiannon Patterson

Working Paper Number:

CES-08-12

This paper examines the relationship between neighborhood characteristics and the likelihood that a youth will drop out of high school or have a child during the teenage years. Using a dataset that is uniquely wellsuited to the study of neighborhood effects, the impact of the neighborhood poverty rate and the percentage of professionals in the local labor force on youth outcomes in California is examined. The first section of the paper tests for non-linearities in the relationship between indicators of neighborhood distress and youth outcomes. Some evidence is found for a break-point at low levels of poverty. Suggestive but inconclusive evidence is also found for a second breakpoint, at very high levels of poverty, for African-American youth only. The second part of the paper examines interactions between family background characteristics and neighborhood effects, and finds that White youth are most sensitive to neighborhood effects, while the effect of parental education depends on the neighborhood measure in question. Among White youth, those from single-parent households are more vulnerable to neighborhood conditions. The third section of the paper finds that for White youth and Hispanic youth, the relevant neighborhood variables appear to be the own-race poverty rates and the percentage of professionals of youths' own race. The final section of the paper estimates a tract-fixed effects model, using the results from the third section to define multiple relevant poverty rates within each tract. The fixed-effects specification suggests that for White and Hispanic youth in California, neighborhood effects remain significant, even with the inclusion of controls for any unobserved family and neighborhood characteristics that are constant within tracts.
View Full Paper PDF
Working Paper

Examining Multi-Level Correlates of Suicide by Merging NVDRS and ACS Data

January 2017

Authors: David A Boulifard, Bernice A Pescosolido

Working Paper Number:

CES-17-25

This paper describes a novel database and an associated suicide event prediction model that surmount longstanding barriers in suicide risk factor research. The database comingles person-level records from the National Violent Death Reporting System (NVDRS) and the American Community Survey (ACS) to establish a case-control study sample that includes all identified suicide cases, while faithfully reflecting general population sociodemographics, in sixteen USA states during the years 2005 2011. It supports a statistical model of individual suicide risk that accommodates person-level factors and the moderation of these factors by their community rates. Named the United States Multi-Level Suicide Data Set (US-MSDS), the database was developed outside the RDC laboratory using publicly available ACS microdata, and reconstructed inside the laboratory using restricted access ACS microdata. Analyses of the latter version yielded findings that largely amplified but also extended those obtained from analyses of the former. This experience shows that the analytic precision achievable using restricted access ACS data can play an important role in conducting social research, although it also indicates that publicly available ACS data have considerable value in conducting preliminary analyses and preparing to use an RDC laboratory. The database development strategy may interest scientists investigating sociodemographic risk factors for other types of low-frequency mortality.
View Full Paper PDF
Working Paper

The Nature of the Bias When Studying Only Linkable Person Records: Evidence from the American Community Survey

April 2014

Authors: Adela Luque, J. David Brown, Brittany Bond, Amy B. O'Hara, Amy OHara

Working Paper Number:

carra-2014-08

Record linkage across survey and administrative records sources can greatly enrich data and improve their quality. The linkage can reduce respondent burden and nonresponse follow-up costs. This is particularly important in an era of declining survey response rates and tight budgets. Record linkage also creates statistical bias, however. The U.S. Census Bureau links person records through its Person Identification Validation System (PVS), assigning each record a Protected Identification Key (PIK). It is not possible to reliably assign a PIK to every record, either due to insufficient identifying information or because the information does not uniquely match any of the administrative records used in the person validation process. Non-random ability to assign a PIK can potentially inject bias into statistics using linked data. This paper studies the nature of this bias using the 2009 and 2010 American Community Survey (ACS). The ACS is well-suited for this analysis, as it contains a rich set of person characteristics that can describe the bias. We estimate probit models for whether a record is assigned a PIK. The results suggest that young children, minorities, residents of group quarters, immigrants, recent movers, low-income individuals, and non-employed individuals are less likely to receive a PIK using 2009 ACS. Changes to the PVS process in 2010 significantly addressed the young children deficit, attenuated the other biases, and increased the validated records share from 88.1 to 92.6 percent (person-weighted).
View Full Paper PDF
Working Paper

Estimating the U.S. Citizen Voting-Age Population (CVAP) Using Blended Survey Data, Administrative Record Data, and Modeling: Technical Report

April 2023

Authors: J. David Brown, Danielle H. Sandler, Lawrence Warren, Moises Yi, Misty L. Heggeness, Joseph L. Schafer, Matthew Spence, Marta Murray-Close, Carl Lieberman, Genevieve Denoeux, Lauren Medina

Working Paper Number:

CES-23-21

This report develops a method using administrative records (AR) to fill in responses for nonresponding American Community Survey (ACS) housing units rather than adjusting survey weights to account for selection of a subset of nonresponding housing units for follow-up interviews and for nonresponse bias. The method also inserts AR and modeling in place of edits and imputations for ACS survey citizenship item nonresponses. We produce Citizen Voting-Age Population (CVAP) tabulations using this enhanced CVAP method and compare them to published estimates. The enhanced CVAP method produces a 0.74 percentage point lower citizen share, and it is 3.05 percentage points lower for voting-age Hispanics. The latter result can be partly explained by omissions of voting-age Hispanic noncitizens with unknown legal status from ACS household responses. Weight adjustments may be less effective at addressing nonresponse bias under those conditions.
View Full Paper PDF

Using Small-Area Estimation (SAE) to Estimate Prevalence of Child Health Outcomes at the Census Regional-, State-, and County-Levels

November 2022

Working Paper Number:

CES-22-48

Abstract

Document Tags and Keywords

The 10 most similar working papers to the working paper 'Using Small-Area Estimation (SAE) to Estimate Prevalence of Child Health Outcomes at the Census Regional-, State-, and County-Levels' are listed below in order of similarity.

April 2013

Working Paper Number:

CES-13-19

January 2024

Working Paper Number:

CES-24-01

March 2024

Working Paper Number:

CES-24-18

June 2024

Working Paper Number:

CES-24-27

February 2023

Working Paper Number:

CES-23-03

January 2016

Working Paper Number:

CES-16-37

May 2008

Working Paper Number:

CES-08-12

January 2017

Working Paper Number:

CES-17-25

April 2014

Working Paper Number:

carra-2014-08

April 2023

Working Paper Number:

CES-23-21