CREAT: Census Research Exploration and Analysis Tool

Papers Containing Keywords(s): 'statistical'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

Center for Economic Studies - 39

National Science Foundation - 39

Internal Revenue Service - 35

Bureau of Labor Statistics - 31

Cornell University - 31

American Community Survey - 27

Current Population Survey - 26

Census Bureau Disclosure Review Board - 25

North American Industry Classification System - 24

Social Security Administration - 23

Longitudinal Employer Household Dynamics - 21

Survey of Income and Program Participation - 20

Standard Industrial Classification - 18

Research Data Center - 18

Longitudinal Business Database - 17

Service Annual Survey - 17

Annual Survey of Manufactures - 15

Employer Identification Numbers - 15

Bureau of Economic Analysis - 15

Alfred P Sloan Foundation - 15

Longitudinal Research Database - 15

Federal Statistical Research Data Center - 14

Economic Census - 14

Quarterly Workforce Indicators - 13

Business Register - 13

Disclosure Review Board - 12

Census of Manufactures - 12

Total Factor Productivity - 12

Quarterly Census of Employment and Wages - 12

Social Security Number - 12

Ordinary Least Squares - 12

County Business Patterns - 12

Protected Identification Key - 11

Metropolitan Statistical Area - 11

2010 Census - 11

Special Sworn Status - 11

Decennial Census - 10

Social Security - 9

Office of Management and Budget - 9

Business Dynamics Statistics - 9

Unemployment Insurance - 9

National Longitudinal Survey of Youth - 8

Statistics Canada - 8

National Center for Health Statistics - 8

Cornell Institute for Social and Economic Research - 8

LEHD Program - 8

National Bureau of Economic Research - 7

Standard Statistical Establishment List - 7

Public Use Micro Sample - 7

Duke University - 7

American Statistical Association - 7

Chicago Census Research Data Center - 7

Census Bureau Longitudinal Business Database - 7

National Academy of Sciences - 6

Person Identification Validation System - 6

Census Bureau Business Register - 6

Department of Labor - 6

Detailed Earnings Records - 6

Person Validation System - 6

Master Address File - 6

Federal Reserve Bank - 6

Local Employment Dynamics - 6

PSID - 6

1940 Census - 5

Census Edited File - 5

Characteristics of Business Owners - 5

Housing and Urban Development - 5

W-2 - 5

Computer Assisted Personal Interview - 5

Personally Identifiable Information - 5

Census of Manufacturing Firms - 5

National Institutes of Health - 5

Department of Commerce - 5

Sloan Foundation - 5

Permanent Plant Number - 5

Department of Education - 4

Cobb-Douglas - 4

Department of Economics - 4

United States Census Bureau - 4

Some Other Race - 4

Social and Economic Supplement - 4

Financial, Insurance and Real Estate Industries - 4

Health and Retirement Study - 4

American Economic Association - 4

Small Business Administration - 4

Individual Characteristics File - 4

National Health Interview Survey - 4

Bureau of Labor - 4

National Institute on Aging - 4

Summary Earnings Records - 4

Company Organization Survey - 4

Journal of Economic Literature - 4

National Research Council - 3

Stanford University - 3

Annual Business Survey - 3

MAFID - 3

Federal Insurance Contribution Act - 3

Postal Service - 3

Department of Housing and Urban Development - 3

Supplemental Nutrition Assistance Program - 3

Agency for Healthcare Research and Quality - 3

Urban Institute - 3

American Housing Survey - 3

Temporary Assistance for Needy Families - 3

Department of Agriculture - 3

University of Michigan - 3

Employer Characteristics File - 3

Centers for Disease Control and Prevention - 3

North American Industry Classi - 3

Securities and Exchange Commission - 3

Review of Economics and Statistics - 3

Organization for Economic Cooperation and Development - 3

University of Maryland - 3

survey - 46

data - 43

respondent - 39

estimating - 33

population - 30

census bureau - 28

agency - 27

microdata - 27

report - 26

statistician - 23

datasets - 20

data census - 19

aggregate - 18

analysis - 17

percentile - 17

economist - 17

census data - 17

estimation - 17

earnings - 16

disclosure - 16

confidentiality - 15

use census - 14

database - 14

privacy - 14

researcher - 14

quarterly - 12

payroll - 12

employed - 12

workforce - 12

record - 12

public - 12

econometric - 12

statistical agencies - 12

longitudinal - 12

study - 11

employ - 11

estimator - 11

research - 11

imputation - 10

expenditure - 10

labor - 10

salary - 10

aggregation - 10

information - 10

socioeconomic - 9

manufacturing - 9

economic census - 9

statistical disclosure - 9

publicly - 9

recession - 9

employee - 9

census disclosure - 8

labor statistics - 8

sector - 8

federal - 8

market - 8

resident - 8

macroeconomic - 8

inference - 8

prevalence - 7

revenue - 7

census employment - 7

sample - 7

survey data - 7

sampling - 7

census survey - 7

research census - 7

enterprise - 7

sale - 7

censuses surveys - 7

census research - 7

employee data - 7

ethnicity - 6

poverty - 6

income data - 6

average - 6

assessed - 6

trend - 6

production - 6

company - 6

analyst - 6

empirical - 6

establishment - 6

industrial - 6

eligibility - 5

hispanic - 5

enrollment - 5

minority - 5

disadvantaged - 5

2010 census - 5

discrepancy - 5

gdp - 5

employment statistics - 5

census years - 5

social - 5

reporting - 5

model - 5

incorporated - 5

business data - 5

yearly - 5

aging - 5

growth - 5

state - 4

irs - 4

productivity growth - 4

productivity measures - 4

measures productivity - 4

imputation model - 4

census responses - 4

ssa - 4

survey income - 4

demand - 4

household surveys - 4

mobility - 4

income year - 4

citizen - 4

regression - 4

policymakers - 4

assessing - 4

employment data - 4

metropolitan - 4

tenure - 4

measure - 4

employer household - 4

corporation - 4

ethnic - 3

disparity - 3

eligible - 3

disability - 3

enrolled - 3

efficiency - 3

bias - 3

decade - 3

population survey - 3

matching - 3

racial - 3

intergenerational - 3

residence - 3

regressing - 3

information census - 3

corporate - 3

unobserved - 3

competitor - 3

surveys censuses - 3

economic statistics - 3

department - 3

linked census - 3

employment count - 3

work census - 3

regressors - 3

coverage - 3

produce - 3

family - 3

establishments data - 3

longitudinal employer - 3

workforce indicators - 3

poorer - 3

worker - 3

merger - 3

classified - 3

industrial classification - 3

classification - 3

classifying - 3

Viewing papers 1 through 10 of 96


  • Working Paper

    School-Based Disability Identification Varies by Student Family Income

    December 2025

    Working Paper Number:

    CES-25-74

    Currently, 18 percent of K-12 students in the United States receive additional supports through the identification of a disability. Socioeconomic status is viewed as central to understanding who gets identified as having a disability, yet limited large-scale evidence examines how disability identification varies for students from different income backgrounds. Using unique data linking information on Oregon students and their family income, we document pronounced income-based differences in how students are categorized for two school-based disability supports: special education services and Section 504 plans. We find that a quarter of students in the lowest income percentile receive supports through special education, compared with less than seven percent of students in the top income percentile. This pattern may partially reflect differences in underlying disability-related needs caused by poverty. However, we find the opposite pattern for 504 plans, where students in the top income percentiles are two times more likely to receive 504 plan supports. We further document substantial variation in these income-based differences by disability category, by race/ethnicity, and by grade level. Together, these patterns suggest that disability-related needs alone cannot account for the income-based differences that we observe and highlight the complex ways that income shapes the school and family processes that lead to variability in disability classification and services.
    View Full Paper PDF
  • Working Paper

    Gifted Identification Across the Distribution of Family Income

    December 2025

    Working Paper Number:

    CES-25-73

    Currently, 6.1 percent of K-12 students in the United States receive gifted education. Using education and IRS data that provide information on students and their family income, we show pronounced differences in who schools identify as gifted across the distribution of family income. Under 4 percent of students in the lowest income percentile are identified as gifted, compared with 20 percent of those in the top income percentile. Income-based differences persist after accounting for student test scores and exist across students of different sexes and racial/ethnic groups, underscoring the importance of family resources for gifted identification in schools.
    View Full Paper PDF
  • Working Paper

    Manufacturing Dispersion: How Data Cleaning Choices Affect Measured Misallocation and Productivity Growth in the Annual Survey of Manufactures

    September 2025

    Working Paper Number:

    CES-25-67

    Measurement of dispersion of productivity levels and productivity growth rates across businesses is a key input for answering a variety of important economic questions, such as understanding the allocation of economic inputs across businesses and over time. While item nonresponse is a readily quantifiable issue, we show there is also misreporting by respondents in the Annual Survey of Manufactures (ASM). Aware of these measurement issues, the Census Bureau edits and imputes survey responses before tabulation and dissemination. However, edit and imputation methods that are suitable for publishing aggregate totals may not be suitable for estimating other measures from the microdata. We show that the methods used dramatically affect estimates of productivity dispersion, allocative efficiency, and aggregate productivity growth. Using a Bayesian approach for editing and imputation, we model the joint distributions of all variables needed to estimate these measures, and we quantify the degree of uncertainty in the estimates due to imputations for faulty or missing data.
    View Full Paper PDF
  • Working Paper

    Job Tasks, Worker Skills, and Productivity

    September 2025

    Working Paper Number:

    CES-25-63

    We present new empirical evidence suggesting that we can better understand productivity dispersion across businesses by accounting for differences in how tasks, skills, and occupations are organized. This aligns with growing attention to the task content of production. We link establishment-level data from the Bureau of Labor Statistics Occupational Employment and Wage Statistics survey with productivity data from the Census Bureau's manufacturing surveys. Our analysis reveals strong relationships between establishment productivity and task, skill, and occupation inputs. These relationships are highly nonlinear and vary by industry. When we account for these patterns, we can explain a substantial share of productivity dispersion across establishments.
    View Full Paper PDF
  • Working Paper

    Revisiting the Unintended Consequences of Ban the Box

    August 2025

    Working Paper Number:

    CES-25-58

    Ban-the-Box (BTB) policies intend to help formerly incarcerated individuals find employment by delaying when employers can ask about criminal records. We revisit the finding in Doleac and Hansen (2020) that BTB causes statistical discrimination against minority men. We correct miscoded BTB laws and show that estimates from the Current Population Survey (CPS) remain quantitatively similar, while those from the American Community Survey (ACS) now fail to reject the null hypothesis of no effect of BTB on employment. In contrast to the published estimates, these ACS results are statistically significantly different from the CPS results, indicating a lack of robustness across datasets. We do not find evidence that these differences are due to sample composition or survey weights. There is limited evidence that these divergent results are explained by the different frequencies of these surveys. Differences in sample sizes may also lead to different estimates; the ACS has a much larger sample and more statistical power to detect effects near the corrected CPS estimates.
    View Full Paper PDF
  • Working Paper

    A Simulated Reconstruction and Reidentification Attack on the 2010 U.S. Census

    August 2025

    Working Paper Number:

    CES-25-57

    For the last half-century, it has been a common and accepted practice for statistical agencies, including the United States Census Bureau, to adopt different strategies to protect the confidentiality of aggregate tabular data products from those used to protect the individual records contained in publicly released microdata products. This strategy was premised on the assumption that the aggregation used to generate tabular data products made the resulting statistics inherently less disclosive than the microdata from which they were tabulated. Consistent with this common assumption, the 2010 Census of Population and Housing in the U.S. used different disclosure limitation rules for its tabular and microdata publications. This paper demonstrates that, in the context of disclosure limitation for the 2010 Census, the assumption that tabular data are inherently less disclosive than their underlying microdata is fundamentally flawed. The 2010 Census published more than 150 billion aggregate statistics in 180 table sets. Most of these tables were published at the most detailed geographic level'individual census blocks, which can have populations as small as one person. Using only 34 of the published table sets, we reconstructed microdata records including five variables (census block, sex, age, race, and ethnicity) from the confidential 2010 Census person records. Using only published data, an attacker using our methods can verify that all records in 70% of all census blocks (97 million people) are perfectly reconstructed. We further confirm, through reidentification studies, that an attacker can, within census blocks with perfect reconstruction accuracy, correctly infer the actual census response on race and ethnicity for 3.4 million vulnerable population uniques (persons with race and ethnicity different from the modal person on the census block) with 95% accuracy. Having shown the vulnerabilities inherent to the disclosure limitation methods used for the 2010 Census, we proceed to demonstrate that the more robust disclosure limitation framework used for the 2020 Census publications defends against attacks that are based on reconstruction. Finally, we show that available alternatives to the 2020 Census Disclosure Avoidance System would either fail to protect confidentiality, or would overly degrade the statistics' utility for the primary statutory use case: redrawing the boundaries of all of the nation's legislative and voting districts in compliance with the 1965 Voting Rights Act.
    View Full Paper PDF
  • Working Paper

    Earnings Measurement Error, Nonresponse and Administrative Mismatch in the CPS

    July 2025

    Working Paper Number:

    CES-25-48

    Using the Current Population Survey Annual Social and Economic Supplement matched to Social Security Administration Detailed Earnings Records, we link observations across consecutive years to investigate a relationship between item nonresponse and measurement error in the earnings questions. Linking individuals across consecutive years allows us to observe switching from response to nonresponse and vice versa. We estimate OLS, IV, and finite mixture models that allow for various assumptions separately for men and women. We find that those who respond in both years of the survey exhibit less measurement error than those who respond in one year. Our findings suggest a trade-off between survey response and data quality that should be considered by survey designers, data collectors, and data users.
    View Full Paper PDF
  • Working Paper

    Tapping Business and Household Surveys to Sharpen Our View of Work from Home

    June 2025

    Working Paper Number:

    CES-25-36

    Timely business-level measures of work from home (WFH) are scarce for the U.S. economy. We review prior survey-based efforts to quantify the incidence and character of WFH and describe new questions that we developed and fielded for the Business Trends and Outlook Survey (BTOS). Drawing on more than 150,000 firm-level responses to the BTOS, we obtain four main findings. First, nearly a third of businesses have employees who work from home, with tremendous variation across sectors. The share of businesses with WFH employees is nearly ten times larger in the Information sector than in Accommodation and Food Services. Second, employees work from home about 1 day per week, on average, and businesses expect similar WFH levels in five years. Third, feasibility aside, businesses' largest concern with WFH relates to productivity. Seven percent of businesses find that onsite work is more productive, while two percent find that WFH is more productive. Fourth, there is a low level of tracking and monitoring of WFH activities, with 70% of firms reporting they do not track employee days in the office and 75% reporting they do not monitor employees when they work from home. These lessons serve as a starting point for enhancing WFH-related content in the American Community Survey and other household surveys.
    View Full Paper PDF
  • Working Paper

    Revisiting Methods to Assign Responses when Race and Hispanic Origin Reporting are Discrepant Across Administrative Records and Third Party Sources

    May 2024

    Authors: James M. Noon

    Working Paper Number:

    CES-24-26

    The Best Race and Ethnicity Administrative Records Composite file ('Best Race file') is an composite file which combines Census, federal, and Third Party Data (TPD) sources and applies business rules to assign race and ethnicity values to person records. The first version of the Best Race administrative records composite was first constructed in 2015 and subsequently updated each year to include more recent vintages, when available, of the data sources originally included in the composite file. Where updates were available for data sources, the most recent information for persons was retained, and the business rules were reapplied to assign a single race and single Hispanic origin value to each person record. The majority of person records on the Best Race file have consistent race and ethnicity information across data sources. Where there are discrepancies in responses across data sources, we apply a series of business rules to assign a single race and ethnicity to each record. To improve the quality of the Best Race administrative records composite, we have begun revising the business rules which were developed several years ago. This paper discusses the original business rules as well as the implemented changes and their impact on the composite file.
    View Full Paper PDF
  • Working Paper

    Mobility, Opportunity, and Volatility Statistics (MOVS): Infrastructure Files and Public Use Data

    April 2024

    Working Paper Number:

    CES-24-23

    Federal statistical agencies and policymakers have identified a need for integrated systems of household and personal income statistics. This interest marks a recognition that aggregated measures of income, such as GDP or average income growth, tell an incomplete story that may conceal large gaps in well-being between different types of individuals and families. Until recently, longitudinal income data that are rich enough to calculate detailed income statistics and include demographic characteristics, such as race and ethnicity, have not been available. The Mobility, Opportunity, and Volatility Statistics project (MOVS) fills this gap in comprehensive income statistics. Using linked demographic and tax records on the population of U.S. working-age adults, the MOVS project defines households and calculates household income, applying an equivalence scale to create a personal income concept, and then traces the progress of individuals' incomes over time. We then output a set of intermediate statistics by race-ethnicity group, sex, year, base-year state of residence, and base-year income decile. We select the intermediate statistics most useful in developing more complex intragenerational income mobility measures, such as transition matrices, income growth curves, and variance-based volatility statistics. We provide these intermediate statistics as part of a publicly released data tool with downloadable flat files and accompanying documentation. This paper describes the data build process and the output files, including a brief analysis highlighting the structure and content of our main statistics.
    View Full Paper PDF