CREAT: Census Research Exploration and Analysis Tool

Papers Containing Tag(s): 'SSA Numident'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

Protected Identification Key - 20

Social Security Administration - 18

Social Security Number - 18

Internal Revenue Service - 15

Current Population Survey - 14

American Community Survey - 14

2010 Census - 11

Social Security - 10

Census Bureau Disclosure Review Board - 10

Person Validation System - 10

Decennial Census - 9

Person Identification Validation System - 8

Census Numident - 8

Personally Identifiable Information - 7

Center for Administrative Records Research and Applications - 7

Employer Identification Numbers - 6

Individual Taxpayer Identification Numbers - 6

Census Bureau Person Identification Validation System - 6

Disclosure Review Board - 6

North American Industry Classification System - 5

W-2 - 5

Administrative Records - 5

Business Register - 5

Master Address File - 5

Service Annual Survey - 5

National Opinion Research Center - 5

Center for Economic Studies - 4

Office of Management and Budget - 4

Survey of Income and Program Participation - 4

Longitudinal Employer Household Dynamics - 4

Department of Housing and Urban Development - 4

MAFID - 4

1940 Census - 4

Longitudinal Business Database - 3

Census Bureau Business Register - 3

Disability Insurance - 3

Accommodation and Food Services - 3

Housing and Urban Development - 3

Supplemental Nutrition Assistance Program - 3

Census Household Composition Key - 3

American Housing Survey - 3

Bureau of Labor Statistics - 3

National Science Foundation - 3

Ordinary Least Squares - 3

Department of Defense - 3

Research Data Center - 3

Minnesota Population Center - 3

Viewing papers 1 through 10 of 20


  • Working Paper

    Tip of the Iceberg: Tip Reporting at U.S. Restaurants, 2005-2018

    November 2024

    Working Paper Number:

    CES-24-68

    Tipping is a significant form of compensation for many restaurant jobs, but it is poorly measured and therefore not well understood. We combine several large administrative and survey datasets and document patterns in tip reporting that are consistent with systematic under-reporting of tip income. Our analysis indicates that although the vast majority of tipped workers do report earning some tips, the dollar value of tips is under-reported and is sensitive to reporting incentives. In total, we estimate that about eight billion in tips paid at full-service, single-location, restaurants were not captured in tax data annually over the period 2005-2018. Due to changes in payment methods and reporting incentives, tip reporting has increased over time. Our findings have implications for downstream measures dependent on accurate measures of compensation including poverty measurement among tipped restaurant workers.
    View Full Paper PDF
  • Working Paper

    Nonresponse and Coverage Bias in the Household Pulse Survey: Evidence from Administrative Data

    October 2024

    Working Paper Number:

    CES-24-60

    The Household Pulse Survey (HPS) conducted by the U.S. Census Bureau is a unique survey that provided timely data on the effects of the COVID-19 Pandemic on American households and continues to provide data on other emergent social and economic issues. Because the survey has a response rate in the single digits and only has an online response mode, there are concerns about nonresponse and coverage bias. In this paper, we match administrative data from government agencies and third-party data to HPS respondents to examine how representative they are of the U.S. population. For comparison, we create a benchmark of American Community Survey (ACS) respondents and nonrespondents and include the ACS respondents as another point of reference. Overall, we find that the HPS is less representative of the U.S. population than the ACS. However, performance varies across administrative variables, and the existing weighting adjustments appear to greatly improve the representativeness of the HPS. Additionally, we look at household characteristics by their email domain to examine the effects on coverage from limiting email messages in 2023 to addresses from the contact frame with at least 90% deliverability rates, finding no clear change in the representativeness of the HPS afterwards.
    View Full Paper PDF
  • Working Paper

    Incorporating Administrative Data in Survey Weights for the 2018-2022 Survey of Income and Program Participation

    October 2024

    Working Paper Number:

    CES-24-58

    Response rates to the Survey of Income and Program Participation (SIPP) have declined over time, raising the potential for nonresponse bias in survey estimates. A potential solution is to leverage administrative data from government agencies and third-party data providers when constructing survey weights. In this paper, we modify various parts of the SIPP weighting algorithm to incorporate such data. We create these new weights for the 2018 through 2022 SIPP panels and examine how the new weights affect survey estimates. Our results show that before weighting adjustments, SIPP respondents in these panels have higher socioeconomic status than the general population. Existing weighting procedures reduce many of these differences. Comparing SIPP estimates between the production weights and the administrative data-based weights yields changes that are not uniform across the joint income and program participation distribution. Unlike other Census Bureau household surveys, there is no large increase in nonresponse bias in SIPP due to the COVID-19 Pandemic. In summary, the magnitude and sign of nonresponse bias in SIPP is complicated, and the existing weighting procedures may change the sign of nonresponse bias for households with certain incomes and program benefit statuses.
    View Full Paper PDF
  • Working Paper

    Where Are Your Parents? Exploring Potential Bias in Administrative Records on Children

    March 2024

    Working Paper Number:

    CES-24-18

    This paper examines potential bias in the Census Household Composition Key's (CHCK) probabilistic parent-child linkages. By linking CHCK data to the American Community Survey (ACS), we reveal disparities in parent-child linkages among specific demographic groups and find that characteristics of children that can and cannot be linked to the CHCK vary considerably from the larger population. In particular, we find that children from low-income, less educated households and of Hispanic origin are less likely to be linked to a mother or a father in the CHCK. We also highlight some data considerations when using the CHCK.
    View Full Paper PDF
  • Working Paper

    Self-Employment Income Reporting on Surveys

    April 2023

    Working Paper Number:

    CES-23-19

    We examine the relation between administrative income data and survey reports for self-employed and wage-earning respondents from 2000 - 2015. The self-employed report 40 percent more wages and self-employment income in the survey than in tax administrative records; this estimate nets out differences between these two sources that are also shared by wage-earners. We provide evidence that differential reporting incentives are an important explanation of the larger self-employed gap by exploiting a well-known artifact ' self-employed respondents exhibit substantial bunching at the first EITC kink in their administrative records. We do not observe the same behavior in their survey responses even after accounting for survey measurement concerns.
    View Full Paper PDF
  • Working Paper

    The Long-run Effects of the 1930s Redlining Maps on Children

    December 2022

    Working Paper Number:

    CES-22-56

    We estimate the long-run effects of the 1930s Home Owners Loan Corporation (HOLC) redlining maps by linking children in the full count 1940 Census to 1) the universe of IRS tax data in 1974 and 1979 and 2) the long form 2000 Census. We use two identification strategies to estimate the potential long-run effects of differential access to credit along HOLC boundaries. The first strategy compares cross-boundary differences along HOLC boundaries to a comparison group of boundaries that had statistically similar pre-existing differences as the actual boundaries. A second approach only uses boundaries that were least likely to have been chosen by the HOLC based on our statistical model. We find that children living on the lower-graded side of HOLC boundaries had significantly lower levels of educational attainment, reduced income in adulthood, and lived in neighborhoods during adulthood characterized by lower educational attainment, higher poverty rates, and higher rates of single-headed households.
    View Full Paper PDF
  • Working Paper

    The Long Run Impacts of Court-Ordered Desegregation

    April 2022

    Working Paper Number:

    CES-22-11

    Court ordered desegregation plans were implemented in hundreds of US school districts nationwide from the 1960s through the 1980s, and were arguably the most substantive national attempt to improve educational access for African American children in modern American history. Using large Census samples that are linked to Social Security records containing county of birth, we implement event studies that estimate the long run effects of exposure to desegregation orders on human capital and labor market outcomes. We find that African Americans who were relatively young when a desegregation order was implemented in their county of birth, and therefore had more exposure to integrated schools, experienced large improvements in adult human capital and labor market outcomes relative to Blacks who were older when a court order was locally implemented. There are no comparable changes in outcomes among whites in counties undergoing an order, or among Blacks who were beyond school ages when a local order was implemented. These effects are strongly concentrated in the South, with largely null findings in other regions. Our data and methodology provide the most comprehensive national assessment to date on the impacts of court ordered desegregation, and strongly indicate that these policies were in fact highly effective at improving the long run socioeconomic outcomes of many Black students.
    View Full Paper PDF
  • Working Paper

    Nonemployer Statistics by Demographics (NES-D): Exploring Longitudinal Consistency and Sub-national Estimates

    December 2019

    Working Paper Number:

    CES-19-34

    Until recently, the quinquennial Survey of Business Owners (SBO) was the only source of information for U.S. employer and nonemployer businesses by owner demographic characteristics such as race, ethnicity, sex and veteran status. Now, however, the Nonemployer Statistics by Demographics series (NES-D) will replace the SBO's nonemployer component with reliable, and more frequent (annual) business demographic estimates with no additional respondent burden, and at lower imputation rates and costs. NES-D is not a survey; rather, it exploits existing administrative and census records to assign demographic characteristics to the universe of approximately 25 million (as of 2016) nonemployer businesses. Although only in the second year of its research phase, NES-D is rapidly moving towards production, with a planned prototype or experimental version release of 2017 nonemployer data in 2020, followed by annual releases of the series. After the first year of research, we released a working paper (Luque et al., 2019) that assessed the viability of estimating nonemployer demographics exclusively with administrative records (AR) and census data. That paper used one year of data (2015) to produce preliminary tabulations of business counts at the national level. This year we expand that research in multiple ways by: i) examining the longitudinal consistency of administrative and census records coverage, and of our AR-based demographics estimates, ii) evaluating further coverage from additional data sources, iii) exploring estimates at the sub-national level, iv) exploring estimates by industrial sector, v) examining demographics estimates of business receipts as well as of counts, and vi) implementing imputation of missing demographic values. Our current results are consistent with the main findings in Luque et al. (2019), and show that high coverage and demographic assignment rates are not the exception, but the norm. Specifically, we find that AR coverage rates are high and stable over time for each of the three years we examine, 2014-2016. We are able to identify owners for approximately 99 percent of nonemployer businesses (excluding C-corporations), 92 to 93 percent of identified nonemployer owners have no missing demographics, and only about 1 percent are missing three or more demographic characteristics in each of the three years. We also find that our demographics estimates are stable over time, with expected small annual changes that are consistent with underlying population trends in the U.S.. Due to data limitations, these results do not include C-corporations, which represent only 2 percent of nonemployer businesses and 4 percent of receipts. Without added respondent burden and at lower imputation rates and costs, NES-D will provide high-quality business demographics estimates at a higher frequency (annual vs. every 5 years) than the SBO.
    View Full Paper PDF
  • Working Paper

    Nonemployer Statistics by Demographics (NES-D): Using Administrative and Census Records Data in Business Statistics

    January 2019

    Working Paper Number:

    CES-19-01

    The quinquennial Survey of Business Owners or SBO provided the only comprehensive source of information in the United States on employer and nonemployer businesses by the sex, race, ethnicity and veteran status of the business owners. The annual Nonemployer Statistics series (NES) provides establishment counts and receipts for nonemployers but contains no demographic information on the business owners. With the transition of the employer component of the SBO to the Annual Business Survey, the Nonemployer Statistics by Demographics series or NES-D represents the continuation of demographics estimates for nonemployer businesses. NES-D will leverage existing administrative and census records to assign demographic characteristics to the universe of approximately 24 million nonemployer businesses (as of 2015). Demographic characteristics include key demographics measured by the SBO (sex, race, Hispanic origin and veteran status) as well as other demographics (age, place of birth and citizenship status) collected but not imputed by the SBO if missing. A spectrum of administrative and census data sources will provide the nonemployer universe and demographics information. Specifically, the nonemployer universe originates in the Business Register; the Census Numident will provide sex, age, place of birth and citizenship status; race and Hispanic origin information will be obtained from multiple years of the decennial census and the American Community Survey; and the Department of Veteran Affairs will provide administrative records data on veteran status. The use of blended data in this manner will make possible the production of NES-D, an annual series that will become the only source of detailed and comprehensive statistics on the scope, nature and activities of U.S. businesses with no paid employment by the demographic characteristics of the business owner. Using the 2015 vintage of nonemployers, initial results indicate that demographic information is available for the overwhelming majority of the universe of nonemployers. For instance, information on sex, age, place of birth and citizenship status is available for over 95 percent of the 24 million nonemployers while race and Hispanic origin are available for about 90 percent of them. These results exclude owners of C-corporations, which represent only 2 percent of nonemployer firms. Among other things, future work will entail imputation of missing demographics information (including that of C-corporations), testing the longitudinal consistency of the estimates, and expanding the set of characteristics beyond the demographics mentioned above. Without added respondent burden and at lower imputation rates and costs, NES-D will meet the needs of stakeholders as well as the economy as a whole by providing reliable estimates at a higher frequency (annual vs. every 5 years) and with a more timely dissemination schedule than the SBO.
    View Full Paper PDF
  • Working Paper

    LEHD Infrastructure S2014 files in the FSRDC

    September 2018

    Authors: Lars Vilhuber

    Working Paper Number:

    CES-18-27R

    The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau, with the support of several national research agencies, maintains a set of infrastructure files using administrative data provided by state agencies, enhanced with information from other administrative data sources, demographic and economic (business) surveys and censuses. The LEHD Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their interaction in the U.S. economy. This document describes the structure and content of the 2014 Snapshot of the LEHD Infrastructure files as they are made available in the Census Bureau's secure and restricted-access Research Data Center network. The document attempts to provide a comprehensive description of all researcher-accessible files, of their creation, and of any modifications made to the files to facilitate researcher access.
    View Full Paper PDF