-
Tip of the Iceberg: Tip Reporting at U.S. Restaurants, 2005-2018
November 2024
Working Paper Number:
CES-24-68
Tipping is a significant form of compensation for many restaurant jobs, but it is poorly measured and therefore not well understood. We combine several large administrative and survey datasets and document patterns in tip reporting that are consistent with systematic under-reporting of tip income. Our analysis indicates that although the vast majority of tipped workers do report earning some tips, the dollar value of tips is under-reported and is sensitive to reporting incentives. In total, we estimate that about eight billion in tips paid at full-service, single-location, restaurants were not captured in tax data annually over the period 2005-2018. Due to changes in payment methods and reporting incentives, tip reporting has increased over time. Our findings have implications for downstream measures dependent on accurate measures of compensation including poverty measurement among tipped restaurant workers.
View Full
Paper PDF
-
The Census Historical Environmental Impacts Frame
October 2024
Working Paper Number:
CES-24-66
The Census Bureau's Environmental Impacts Frame (EIF) is a microdata infrastructure that combines individual-level information on residence, demographics, and economic characteristics with environmental amenities and hazards from 1999 through the present day. To better understand the long-run consequences and intergenerational effects of exposure to a changing environment, we expand the EIF by extending it backward to 1940. The Historical Environmental Impacts Frame (HEIF) combines the Census Bureau's historical administrative data, publicly available 1940 address information from the 1940 Decennial Census, and historical environmental data. This paper discusses the creation of the HEIF as well as the unique challenges that arise with using the Census Bureau's historical administrative data.
View Full
Paper PDF
-
Citizenship Question Effects on Household Survey Response
June 2024
Working Paper Number:
CES-24-31
Several small-sample studies have predicted that a citizenship question in the 2020 Census would cause a large drop in self-response rates. In contrast, minimal effects were found in Poehler et al.'s (2020) analysis of the 2019 Census Test randomized controlled trial (RCT). We reconcile these findings by analyzing associations between characteristics about the addresses in the 2019 Census Test and their response behavior by linking to independently constructed administrative data. We find significant heterogeneity in sensitivity to the citizenship question among households containing Hispanics, naturalized citizens, and noncitizens. Response drops the most for households containing noncitizens ineligible for a Social Security number (SSN). It falls more for households with Latin American-born immigrants than those with immigrants from other countries. Response drops less for households with U.S.-born Hispanics than households with noncitizens from Latin America. Reductions in responsiveness occur not only through lower unit self-response rates, but also by increased household roster omissions and internet break-offs. The inclusion of a citizenship question increases the undercount of households with noncitizens. Households with noncitizens also have much higher citizenship question item nonresponse rates than those only containing citizens. The use of tract-level characteristics and significant heterogeneity among Hispanics, the foreign-born, and noncitizens help explain why the effects found by Poehler et al. were so small. Linking administrative microdata with the RCT data expands what we can learn from the RCT.
View Full
Paper PDF
-
Where Are Your Parents? Exploring Potential Bias in Administrative Records on Children
March 2024
Working Paper Number:
CES-24-18
This paper examines potential bias in the Census Household Composition Key's (CHCK) probabilistic parent-child linkages. By linking CHCK data to the American Community Survey (ACS), we reveal disparities in parent-child linkages among specific demographic groups and find that characteristics of children that can and cannot be linked to the CHCK vary considerably from the larger population. In particular, we find that children from low-income, less educated households and of Hispanic origin are less likely to be linked to a mother or a father in the CHCK. We also highlight some data considerations when using the CHCK.
View Full
Paper PDF
-
Where to Build Affordable Housing?
Evaluating the Tradeoffs of Location
December 2023
Working Paper Number:
CES-23-62R
How does the location of affordable housing affect tenant welfare, the distribution of assistance, and broader societal objectives such as racial integration? Using administrative data on tenants of units funded by the Low-Income Housing Tax Credit (LIHTC), we first show that characteristics such as race and proxies for need vary widely across neighborhoods. Despite fixed eligibility requirements, LIHTC developments in more opportunity-rich neighborhoods house tenants who are higher income, more educated, and far less likely to be Black. To quantify the welfare implications, we build a residential choice model in which households choose from both market-rate and affordable housing options, where the latter must be rationed. While building affordable housing in higher-opportunity neighborhoods costs more, it also increases household welfare and reduces city-wide segregation. The gains in household welfare, however, accrue to more moderate-need, non-Black/Hispanic households at the expense of other households. This change in the distribution of assistance is primarily due to a 'crowding out' effect: households that only apply for assistance in higher-opportunity neighborhoods crowd out those willing to apply regardless of location. Finally, other policy levers'such as lowering the income limits used for means-testing'have only limited effects relative to the choice of location.
View Full
Paper PDF
-
Producing U.S. Population Statistics Using Multiple Administrative Sources
November 2023
Working Paper Number:
CES-23-58
We identify several challenges encountered when constructing U.S. administrative record-based (AR-based) population estimates for 2020. Though the AR estimates are higher than the 2020 Census at the national level, they are over 15 percent lower in 5 percent of counties, suggesting that locational accuracy can be improved. Other challenges include how to achieve comprehensive coverage, maintain consistent coverage across time, filter out nonresidents and people not alive on the reference date, uncover missing links across person and address records, and predict demographic characteristics when multiple ones are reported or when they are missing. We discuss several ways of addressing these issues, e.g., building in redundancy with more sources, linking children to their parents' addresses, and conducting additional record linkage for people without Social Security Numbers and for addresses not initially linked to the Census Bureau's Master Address File. We discuss modeling to predict lower levels of geography for people lacking those geocodes, the probability that a person is a U.S. resident on the reference date, the probability that an address is the person's residence on the reference date, and the probability a person is in each demographic characteristic category. Regression results illustrate how many of these challenges and solutions affect the AR county population estimates.
View Full
Paper PDF
-
Noncitizen Coverage and Its Effects on U.S. Population Statistics
August 2023
Working Paper Number:
CES-23-42
We produce population estimates with the same reference date, April 1, 2020, as the 2020 Census of Population and Housing by combining 31 types of administrative record (AR) and third-party sources, including several new to the Census Bureau with a focus on noncitizens. Our AR census national population estimate is higher than other Census Bureau official estimates: 1.8% greater than the 2020 Demographic Analysis high estimate, 3.0% more than the 2020 Census count, and 3.6% higher than the vintage-2020 Population Estimates Program estimate. Our analysis suggests that inclusion of more noncitizens, especially those with unknown legal status, explains the higher AR census estimate. About 19.8% of AR census noncitizens have addresses that cannot be linked to an address in the 2020 Census collection universe, compared to 5.7% of citizens, raising the possibility that the 2020 Census did not collect data for a significant fraction of noncitizens residing in the United States under the residency criteria used for the census. We show differences in estimates by age, sex, Hispanic origin, geography, and socioeconomic characteristics symptomatic of the differences in noncitizen coverage.
View Full
Paper PDF
-
Estimating the U.S. Citizen Voting-Age Population (CVAP) Using Blended Survey Data, Administrative Record Data, and Modeling: Technical Report
April 2023
Authors:
J. David Brown,
Danielle H. Sandler,
Lawrence Warren,
Moises Yi,
Misty L. Heggeness,
Joseph L. Schafer,
Matthew Spence,
Marta Murray-Close,
Carl Lieberman,
Genevieve Denoeux,
Lauren Medina
Working Paper Number:
CES-23-21
This report develops a method using administrative records (AR) to fill in responses for nonresponding American Community Survey (ACS) housing units rather than adjusting survey weights to account for selection of a subset of nonresponding housing units for follow-up interviews and for nonresponse bias. The method also inserts AR and modeling in place of edits and imputations for ACS survey citizenship item nonresponses. We produce Citizen Voting-Age Population (CVAP) tabulations using this enhanced CVAP method and compare them to published estimates. The enhanced CVAP method produces a 0.74 percentage point lower citizen share, and it is 3.05 percentage points lower for voting-age Hispanics. The latter result can be partly explained by omissions of voting-age Hispanic noncitizens with unknown legal status from ACS household responses. Weight adjustments may be less effective at addressing nonresponse bias under those conditions.
View Full
Paper PDF
-
National Experimental Wellbeing Statistics - Version 1
February 2023
Working Paper Number:
CES-23-04
This is the U.S. Census Bureau's first release of the National Experimental Wellbeing Statistics (NEWS) project. The NEWS project aims to produce the best possible estimates of income and poverty given all available survey and administrative data. We link survey, decennial census, administrative, and third-party data to address measurement error in income and poverty statistics. We estimate improved (pre-tax money) income and poverty statistics for 2018 by addressing several possible sources of bias documented in prior research. We address biases from 1) unit nonresponse through improved weights, 2) missing income information in both survey and administrative data through improved imputation, and 3) misreporting by combining or replacing survey responses with administrative information. Reducing survey error substantially affects key measures of well-being: We estimate median household income is 6.3 percent higher than in survey estimates, and poverty is 1.1 percentage points lower. These changes are driven by subpopulations for which survey error is particularly relevant. For house holders aged 65 and over, median household income is 27.3 percent higher and poverty is 3.3 percentage points lower than in survey estimates. We do not find a significant impact on median household income for householders under 65 or on child poverty. Finally, we discuss plans for future releases: addressing other potential sources of bias, releasing additional years of statistics, extending the income concepts measured, and including smaller geographies such as state and county.
View Full
Paper PDF
-
Maternal and Infant Health Inequality: New Evidence from Linked Administrative Data
November 2022
Working Paper Number:
CES-22-55
We use linked administrative data that combines the universe of California birth records, hospitalizations, and death records with parental income from Internal Revenue Service tax records and the Longitudinal Employer-Household Dynamics file to provide novel evidence on economic inequality in infant and maternal health. We find that birth outcomes vary nonmonotonically with parental income, and that children of parents in the top ventile of the income distribution have higher rates of low birth weight and preterm birth than those in the bottom ventile. However, unlike birth outcomes, infant mortality varies monotonically with income, and infants of parents in the top ventile of the income distribution---who have the worst birth outcomes---have a death rate that is half that of infants of parents in the bottom ventile. When studying maternal health, we find a similar pattern of non-monotonicity between income and severe maternal morbidity, and a monotonic and decreasing relationship between income and maternal mortality. At the same time, these disparities by parental income are small when compared to racial disparities, and we observe virtually no convergence in health outcomes across racial and ethnic groups as income rises. Indeed, infant and maternal health in Black families at the top of the income distribution is markedly worse than that of white families at the bottom of the income distribution. Lastly, we benchmark the health gradients in California to those in Sweden, finding that infant and maternal health is worse in California than in Sweden for most outcomes throughout the entire income distribution.
View Full
Paper PDF