CREAT: Census Research Exploration and Analysis Tool

Papers Containing Keywords(s): 'respondent'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

Internal Revenue Service - 52

Current Population Survey - 52

American Community Survey - 48

Social Security Administration - 44

Survey of Income and Program Participation - 36

Protected Identification Key - 34

Census Bureau Disclosure Review Board - 33

Social Security - 32

Social Security Number - 31

Center for Economic Studies - 28

2020 Census - 26

National Science Foundation - 25

Bureau of Labor Statistics - 24

Decennial Census - 23

Person Validation System - 22

Master Address File - 21

Employer Identification Number - 19

Business Register - 19

Service Annual Survey - 19

Longitudinal Employer Household Dynamics - 19

Cornell University - 19

Personally Identifiable Information - 19

Research Data Center - 19

Disclosure Review Board - 18

Department of Housing and Urban Development - 16

Computer Assisted Personal Interview - 16

North American Industry Classification System - 16

Administrative Records - 15

Person Identification Validation System - 14

Federal Statistical Research Data Center - 14

Housing and Urban Development - 13

Census Bureau Business Register - 13

Office of Management and Budget - 13

Longitudinal Business Database - 12

Metropolitan Statistical Area - 11

Postal Service - 11

Standard Industrial Classification - 11

Ordinary Least Squares - 10

Some Other Race - 10

Agency for Healthcare Research and Quality - 10

Annual Survey of Manufactures - 10

Individual Taxpayer Identification Numbers - 9

Census Bureau Master Address File - 9

Temporary Assistance for Needy Families - 9

Supplemental Nutrition Assistance Program - 9

Standard Statistical Establishment List - 9

Quarterly Census of Employment and Wages - 9

Quarterly Workforce Indicators - 9

Economic Census - 9

W-2 - 8

American Housing Survey - 8

Detailed Earnings Records - 8

Medicaid Services - 8

National Opinion Research Center - 8

Cornell Institute for Social and Economic Research - 8

Indian Health Service - 8

Bureau of Economic Analysis - 8

Department of Health and Human Services - 8

Alfred P Sloan Foundation - 8

Unemployment Insurance - 8

1990 Census - 7

Master Beneficiary Record - 7

Census Household Composition Key - 7

SSA Numident - 7

MAFID - 7

Census Edited File - 7

Health and Retirement Study - 7

Computer Assisted Telephone Interviews and Computer Assisted Personal Interviews - 7

National Center for Health Statistics - 7

Longitudinal Research Database - 7

Center for Administrative Records Research and Applications - 7

Special Sworn Status - 7

Census Numident - 6

Disability Insurance - 6

Census Bureau Person Identification Validation System - 6

Social Science Research Institute - 6

ASEC - 6

Indian Housing Information Center - 6

Medical Expenditure Panel Survey - 6

National Health Interview Survey - 6

PIKed - 6

National Bureau of Economic Research - 6

Sloan Foundation - 6

Chicago Census Research Data Center - 6

Census of Manufactures - 6

Public Use Micro Sample - 6

Securities and Exchange Commission - 6

LEHD Program - 6

Adjusted Gross Income - 5

Centers for Medicare - 5

University of Chicago - 5

Statistics Canada - 5

National Academy of Sciences - 5

University of Michigan - 5

Social and Economic Supplement - 5

American Statistical Association - 5

Federal Reserve Bank - 5

National Longitudinal Survey of Youth - 5

Census 2000 - 5

Permanent Plant Number - 5

Data Management System - 4

MAF-ARF - 4

Master Earnings File - 4

NUMIDENT - 4

Department of Labor - 4

Department of Economics - 4

AHRQ Medical Expenditure Panel Survey Insurance Component - 4

National Institute on Aging - 4

Survey of Business Owners - 4

Kauffman Foundation - 4

Business Dynamics Statistics - 4

General Accounting Office - 4

Local Employment Dynamics - 4

Journal of Economic Literature - 4

Financial, Insurance and Real Estate Industries - 4

PSID - 4

Earned Income Tax Credit - 3

Current Population Survey Annual Social and Economic Supplement - 3

Core Based Statistical Area - 3

Customs and Border Protection - 3

Department of Justice - 3

Michigan Institute for Teaching and Research in Economics - 3

Office of Personnel Management - 3

University of Minnesota - 3

Federal Insurance Contributions Act - 3

Center for Administrative Records Research - 3

Urban Institute - 3

Geographic Information Systems - 3

National Institutes of Health - 3

Characteristics of Business Owners - 3

Small Business Administration - 3

Census Bureau Center for Economic Studies - 3

Probability Density Function - 3

Management and Organizational Practices Survey - 3

County Business Patterns - 3

American Economic Association - 3

Census Bureau Longitudinal Business Database - 3

Georgetown University - 3

Duke University - 3

Minnesota Population Center - 3

Total Factor Productivity - 3

Census of Manufacturing Firms - 3

Department of Education - 3

Organization for Economic Cooperation and Development - 3

Pollution Abatement Costs and Expenditures - 3

survey - 80

population - 44

data - 42

household - 42

census bureau - 35

statistical - 32

data census - 29

census data - 27

agency - 25

microdata - 22

imputation - 22

estimating - 22

report - 21

record - 21

datasets - 20

use census - 19

resident - 17

census survey - 17

coverage - 15

earnings - 15

hispanic - 15

statistician - 15

census research - 15

citizen - 14

survey income - 14

census response - 14

irs - 14

analysis - 14

ethnicity - 13

employed - 13

federal - 13

survey data - 13

sampling - 12

medicaid - 12

research census - 12

household survey - 11

assessed - 11

insurance - 11

1040 - 11

enrollment - 11

research - 11

study - 11

expenditure - 10

income data - 10

tax - 10

assessing - 10

census records - 10

confidentiality - 10

2010 census - 10

researcher - 10

income survey - 9

percentile - 9

disclosure - 9

privacy - 9

economic census - 9

sample - 9

estimation - 9

economist - 9

linked census - 8

minority - 8

poverty - 8

filing - 8

immigrant - 8

ssa - 8

census employment - 8

survey census - 8

surveys censuses - 8

earner - 8

information - 8

payroll - 8

employ - 8

workforce - 8

censuses surveys - 8

firms census - 8

residential - 7

residence - 7

taxpayer - 7

census linked - 7

salary - 7

immigration - 7

public - 7

ethnic - 7

race - 7

census use - 7

prevalence - 7

aggregate - 7

longitudinal - 7

socioeconomic - 6

revenue - 6

census household - 6

estimator - 6

unemployed - 6

records census - 6

work census - 6

quarterly - 6

employee - 6

econometric - 6

recession - 6

sector - 6

reporting - 6

healthcare - 6

race census - 6

database - 6

trend - 6

matching - 6

average - 6

yearly - 6

census years - 5

disparity - 5

bias - 5

citizenship - 5

state - 5

statistical disclosure - 5

racial - 5

disadvantaged - 5

housing - 5

discrepancy - 5

department - 5

employment data - 5

neighborhood - 5

labor - 5

ancestry - 5

medicare - 5

employee data - 5

enrollee - 5

statistical agencies - 5

earn - 5

environmental - 4

latino - 4

information census - 4

social - 4

migration - 4

census 2020 - 4

publicly - 4

family - 4

eligible - 4

welfare - 4

income households - 4

employment statistics - 4

linkage - 4

metropolitan - 4

worker - 4

establishment - 4

manufacturing - 4

business data - 4

native - 4

insurance coverage - 4

enrolled - 4

white - 4

disability - 4

health insurance - 4

aging - 4

gdp - 4

income individuals - 4

pandemic - 3

associate - 3

migrant - 3

unobserved - 3

eligibility - 3

empirical - 3

household income - 3

home - 3

housing survey - 3

census business - 3

imputation model - 3

policymakers - 3

estimates employment - 3

sale - 3

uninsured - 3

inference - 3

company - 3

analyst - 3

labor statistics - 3

insured - 3

matched - 3

census file - 3

workplace - 3

income year - 3

manufacturer - 3

Viewing papers 1 through 10 of 105


  • Working Paper

    The Census Historical Environmental Impacts Frame

    October 2024

    Working Paper Number:

    CES-24-66

    The Census Bureau's Environmental Impacts Frame (EIF) is a microdata infrastructure that combines individual-level information on residence, demographics, and economic characteristics with environmental amenities and hazards from 1999 through the present day. To better understand the long-run consequences and intergenerational effects of exposure to a changing environment, we expand the EIF by extending it backward to 1940. The Historical Environmental Impacts Frame (HEIF) combines the Census Bureau's historical administrative data, publicly available 1940 address information from the 1940 Decennial Census, and historical environmental data. This paper discusses the creation of the HEIF as well as the unique challenges that arise with using the Census Bureau's historical administrative data.
    View Full Paper PDF
  • Working Paper

    Nonresponse and Coverage Bias in the Household Pulse Survey: Evidence from Administrative Data

    October 2024

    Working Paper Number:

    CES-24-60

    The Household Pulse Survey (HPS) conducted by the U.S. Census Bureau is a unique survey that provided timely data on the effects of the COVID-19 Pandemic on American households and continues to provide data on other emergent social and economic issues. Because the survey has a response rate in the single digits and only has an online response mode, there are concerns about nonresponse and coverage bias. In this paper, we match administrative data from government agencies and third-party data to HPS respondents to examine how representative they are of the U.S. population. For comparison, we create a benchmark of American Community Survey (ACS) respondents and nonrespondents and include the ACS respondents as another point of reference. Overall, we find that the HPS is less representative of the U.S. population than the ACS. However, performance varies across administrative variables, and the existing weighting adjustments appear to greatly improve the representativeness of the HPS. Additionally, we look at household characteristics by their email domain to examine the effects on coverage from limiting email messages in 2023 to addresses from the contact frame with at least 90% deliverability rates, finding no clear change in the representativeness of the HPS afterwards.
    View Full Paper PDF
  • Working Paper

    Incorporating Administrative Data in Survey Weights for the 2018-2022 Survey of Income and Program Participation

    October 2024

    Working Paper Number:

    CES-24-58

    Response rates to the Survey of Income and Program Participation (SIPP) have declined over time, raising the potential for nonresponse bias in survey estimates. A potential solution is to leverage administrative data from government agencies and third-party data providers when constructing survey weights. In this paper, we modify various parts of the SIPP weighting algorithm to incorporate such data. We create these new weights for the 2018 through 2022 SIPP panels and examine how the new weights affect survey estimates. Our results show that before weighting adjustments, SIPP respondents in these panels have higher socioeconomic status than the general population. Existing weighting procedures reduce many of these differences. Comparing SIPP estimates between the production weights and the administrative data-based weights yields changes that are not uniform across the joint income and program participation distribution. Unlike other Census Bureau household surveys, there is no large increase in nonresponse bias in SIPP due to the COVID-19 Pandemic. In summary, the magnitude and sign of nonresponse bias in SIPP is complicated, and the existing weighting procedures may change the sign of nonresponse bias for households with certain incomes and program benefit statuses.
    View Full Paper PDF
  • Working Paper

    Earnings Through the Stages: Using Tax Data to Test for Sources of Error in CPS ASEC Earnings and Inequality Measures

    September 2024

    Authors: Ethan Krohn

    Working Paper Number:

    CES-24-52

    In this paper, I explore the impact of generalized coverage error, item non-response bias, and measurement error on measures of earnings and earnings inequality in the CPS ASEC. I match addresses selected for the CPS ASEC to administrative data from 1040 tax returns. I then compare earnings statistics in the tax data for wage and salary earnings in samples corresponding to seven stages of the CPS ASEC survey production process. I also compare the statistics using the actual survey responses. The statistics I examine include mean earnings, the Gini coefficient, percentile earnings shares, and shares of the survey weight for a range of percentiles. I examine how the accuracy of the statistics calculated using the survey data is affected by including imputed responses for both those who did not respond to the full CPS ASEC and those who did not respond to the earnings question. I find that generalized coverage error and item nonresponse bias are dominated by measurement error, and that an important aspect of measurement error is households reporting no wage and salary earnings in the CPS ASEC when there are such earnings in the tax data. I find that the CPS ASEC sample misses earnings at the high end of the distribution from the initial selection stage and that the final survey weights exacerbate this.
    View Full Paper PDF
  • Working Paper

    Citizenship Question Effects on Household Survey Response

    June 2024

    Working Paper Number:

    CES-24-31

    Several small-sample studies have predicted that a citizenship question in the 2020 Census would cause a large drop in self-response rates. In contrast, minimal effects were found in Poehler et al.'s (2020) analysis of the 2019 Census Test randomized controlled trial (RCT). We reconcile these findings by analyzing associations between characteristics about the addresses in the 2019 Census Test and their response behavior by linking to independently constructed administrative data. We find significant heterogeneity in sensitivity to the citizenship question among households containing Hispanics, naturalized citizens, and noncitizens. Response drops the most for households containing noncitizens ineligible for a Social Security number (SSN). It falls more for households with Latin American-born immigrants than those with immigrants from other countries. Response drops less for households with U.S.-born Hispanics than households with noncitizens from Latin America. Reductions in responsiveness occur not only through lower unit self-response rates, but also by increased household roster omissions and internet break-offs. The inclusion of a citizenship question increases the undercount of households with noncitizens. Households with noncitizens also have much higher citizenship question item nonresponse rates than those only containing citizens. The use of tract-level characteristics and significant heterogeneity among Hispanics, the foreign-born, and noncitizens help explain why the effects found by Poehler et al. were so small. Linking administrative microdata with the RCT data expands what we can learn from the RCT.
    View Full Paper PDF
  • Working Paper

    The Icing on the Cake: The Effects of Monetary Incentives on Income Data Quality in the SIPP

    January 2024

    Working Paper Number:

    CES-24-03

    Accurate measurement of key income variables plays a crucial role in economic research and policy decision-making. However, the presence of item nonresponse and measurement error in survey data can cause biased estimates. These biases can subsequently lead to sub-optimal policy decisions and inefficient allocation of resources. While there have been various studies documenting item nonresponse and measurement error in economic data, there have not been many studies investigating interventions that could reduce item nonresponse and measurement error. In our research, we investigate the impact of monetary incentives on reducing item nonresponse and measurement error for labor and investment income in the Survey of Income and Program Participation (SIPP). Our study utilizes a randomized incentive experiment in Waves 1 and 2 of the 2014 SIPP, which allows us to assess the effectiveness of incentives in reducing item nonresponse and measurement error. We find that households receiving incentives had item nonresponse rates that are 1.3 percentage points lower for earnings and 1.5 percentage points lower for Social Security income. Measurement error was 6.31 percentage points lower at the intensive margin for interest income, and 16.48 percentage points lower for dividend income compared to non-incentive recipient households. These findings provide valuable insights for data producers and users and highlight the importance of implementing strategies to improve data quality in economic research.
    View Full Paper PDF
  • Working Paper

    Incorporating Administrative Data in Survey Weights for the Basic Monthly Current Population Survey

    January 2024

    Working Paper Number:

    CES-24-02

    Response rates to the Current Population Survey (CPS) have declined over time, raising the potential for nonresponse bias in key population statistics. A potential solution is to leverage administrative data from government agencies and third-party data providers when constructing survey weights. In this paper, we take two approaches. First, we use administrative data to build a non-parametric nonresponse adjustment step while leaving the calibration to population estimates unchanged. Second, we use administratively linked data in the calibration process, matching income data from the Internal Return Service and state agencies, demographic data from the Social Security Administration and the decennial census, and industry data from the Census Bureau's Business Register to both responding and nonresponding households. We use the matched data in the household nonresponse adjustment of the CPS weighting algorithm, which changes the weights of respondents to account for differential nonresponse rates among subpopulations. After running the experimental weighting algorithm, we compare estimates of the unemployment rate and labor force participation rate between the experimental weights and the production weights. Before March 2020, estimates of the labor force participation rates using the experimental weights are 0.2 percentage points higher than the original estimates, with minimal effect on unemployment rate. After March 2020, the new labor force participation rates are similar, but the unemployment rate is about 0.2 percentage points higher in some months during the height of COVID-related interviewing restrictions. These results are suggestive that if there is any nonresponse bias present in the CPS, the magnitude is comparable to the typical margin of error of the unemployment rate estimate. Additionally, the results are overall similar across demographic groups and states, as well as using alternative weighting methodology. Finally, we discuss how our estimates compare to those from earlier papers that calculate estimates of bias in key CPS labor force statistics. This paper is for research purposes only. No changes to production are being implemented at this time.
    View Full Paper PDF
  • Working Paper

    Connected and Uncooperative: The Effects of Homogenous and Exclusive Social Networks on Survey Response Rates and Nonresponse Bias

    January 2024

    Working Paper Number:

    CES-24-01

    Social capital, the strength of people's friendship networks and community ties, has been hypothesized as an important determinant of survey participation. Investigating this hypothesis has been difficult given data constraints. In this paper, we provide insights by investigating how response rates and nonresponse bias in the American Community Survey are correlated with county-level social network data from Facebook. We find that areas of the United States where people have more exclusive and homogenous social networks have higher nonresponse bias and lower response rates. These results provide further evidence that the effects of social capital may not be simply a matter of whether people are socially isolated or not, but also what types of social connections people have and the sociodemographic heterogeneity of their social networks.
    View Full Paper PDF
  • Working Paper

    The 2010 Census Confidentiality Protections Failed, Here's How and Why

    December 2023

    Working Paper Number:

    CES-23-63

    Using only 34 published tables, we reconstruct five variables (census block, sex, age, race, and ethnicity) in the confidential 2010 Census person records. Using the 38-bin age variable tabulated at the census block level, at most 20.1% of reconstructed records can differ from their confidential source on even a single value for these five variables. Using only published data, an attacker can verify that all records in 70% of all census blocks (97 million people) are perfectly reconstructed. The tabular publications in Summary File 1 thus have prohibited disclosure risk similar to the unreleased confidential microdata. Reidentification studies confirm that an attacker can, within blocks with perfect reconstruction accuracy, correctly infer the actual census response on race and ethnicity for 3.4 million vulnerable population uniques (persons with nonmodal characteristics) with 95% accuracy, the same precision as the confidential data achieve and far greater than statistical baselines. The flaw in the 2010 Census framework was the assumption that aggregation prevented accurate microdata reconstruction, justifying weaker disclosure limitation methods than were applied to 2010 Census public microdata. The framework used for 2020 Census publications defends against attacks that are based on reconstruction, as we also demonstrate here. Finally, we show that alternatives to the 2020 Census Disclosure Avoidance System with similar accuracy (enhanced swapping) also fail to protect confidentiality, and those that partially defend against reconstruction attacks (incomplete suppression implementations) destroy the primary statutory use case: data for redistricting all legislatures in the country in compliance with the 1965 Voting Rights Act.
    View Full Paper PDF
  • Working Paper

    Producing U.S. Population Statistics Using Multiple Administrative Sources

    November 2023

    Working Paper Number:

    CES-23-58

    We identify several challenges encountered when constructing U.S. administrative record-based (AR-based) population estimates for 2020. Though the AR estimates are higher than the 2020 Census at the national level, they are over 15 percent lower in 5 percent of counties, suggesting that locational accuracy can be improved. Other challenges include how to achieve comprehensive coverage, maintain consistent coverage across time, filter out nonresidents and people not alive on the reference date, uncover missing links across person and address records, and predict demographic characteristics when multiple ones are reported or when they are missing. We discuss several ways of addressing these issues, e.g., building in redundancy with more sources, linking children to their parents' addresses, and conducting additional record linkage for people without Social Security Numbers and for addresses not initially linked to the Census Bureau's Master Address File. We discuss modeling to predict lower levels of geography for people lacking those geocodes, the probability that a person is a U.S. resident on the reference date, the probability that an address is the person's residence on the reference date, and the probability a person is in each demographic characteristic category. Regression results illustrate how many of these challenges and solutions affect the AR county population estimates.
    View Full Paper PDF