CREAT: Census Research Exploration and Analysis Tool

Papers Containing Tag(s): 'Service Annual Survey'

The following papers contain search terms that you selected. From the papers listed below, you can navigate to the PDF, the profile page for that working paper, or see all the working papers written by an author. You can also explore tags, keywords, and authors that occur frequently within these papers.
Click here to search again

Frequently Occurring Concepts within this Search

Center for Economic Studies - 37

Internal Revenue Service - 31

Longitudinal Business Database - 31

North American Industry Classification System - 30

Business Register - 30

Employer Identification Numbers - 27

Bureau of Labor Statistics - 27

American Community Survey - 25

Standard Industrial Classification - 24

National Science Foundation - 24

Social Security Administration - 23

Standard Statistical Establishment List - 23

Research Data Center - 23

Current Population Survey - 22

Longitudinal Employer Household Dynamics - 20

Economic Census - 20

Metropolitan Statistical Area - 19

Protected Identification Key - 18

Survey of Income and Program Participation - 17

Social Security Number - 16

Social Security - 15

Disclosure Review Board - 15

Chicago Census Research Data Center - 15

Cornell University - 15

Federal Statistical Research Data Center - 14

Annual Survey of Manufactures - 14

Census Bureau Disclosure Review Board - 13

County Business Patterns - 13

Longitudinal Research Database - 13

Decennial Census - 12

Master Address File - 12

Unemployment Insurance - 12

Bureau of Economic Analysis - 12

Census Bureau Business Register - 11

2010 Census - 11

Alfred P Sloan Foundation - 10

Quarterly Workforce Indicators - 10

Business Dynamics Statistics - 10

Small Business Administration - 10

Person Validation System - 9

University of Chicago - 9

Census of Manufacturing Firms - 9

Permanent Plant Number - 9

Quarterly Census of Employment and Wages - 8

Census Bureau Longitudinal Business Database - 8

DOB - 8

Special Sworn Status - 8

Federal Reserve Bank - 8

American Housing Survey - 8

Medical Expenditure Panel Survey - 8

National Bureau of Economic Research - 8

Company Organization Survey - 7

National Center for Health Statistics - 7

Employment History File - 7

Employer Characteristics File - 7

Census of Manufactures - 7

Center for Administrative Records Research and Applications - 7

Office of Management and Budget - 6

Core Based Statistical Area - 6

National Opinion Research Center - 6

Individual Characteristics File - 6

Successor Predecessor File - 6

Longitudinal Firm Trade Transactions Database - 6

Patent and Trademark Office - 6

Business Employment Dynamics - 6

Retail Trade - 6

Person Identification Validation System - 6

American Economic Association - 6

Business Master File - 6

LEHD Program - 6

Computer Assisted Personal Interview - 5

Centers for Disease Control and Prevention - 5

Composite Person Record - 5

Local Employment Dynamics - 5

Office of Personnel Management - 5

Census Numident - 5

Federal Tax Information - 5

Postal Service - 5

Department of Agriculture - 5

Public Administration - 5

North American Industry Classi - 5

Department of Commerce - 5

Cornell Institute for Social and Economic Research - 5

Review of Economics and Statistics - 5

American Economic Review - 5

Business Register Bridge - 5

SSA Numident - 5

Financial, Insurance and Real Estate Industries - 5

Health and Retirement Study - 4

Housing and Urban Development - 4

Department of Housing and Urban Development - 4

Census Bureau Person Identification Validation System - 4

National Institutes of Health - 4

University of Maryland - 4

Statistics Canada - 4

Federal Reserve System - 4

Integrated Longitudinal Business Database - 4

Arts, Entertainment - 4

1940 Census - 4

Department of Homeland Security - 4

MIT Press - 4

Geographic Information Systems - 4

University of Michigan - 4

Bureau of Labor - 4

National Longitudinal Survey of Youth - 4

Public Use Micro Sample - 4

Administrative Records - 4

Ordinary Least Squares - 4

CDF - 4

Cumulative Density Function - 4

Agency for Healthcare Research and Quality - 4

Consolidated Metropolitan Statistical Areas - 4

Detailed Earnings Records - 3

Federal Register - 3

W-2 - 3

National Institute on Aging - 3

National Center for Science and Engineering Statistics - 3

Professional Services - 3

IBM - 3

COVID-19 - 3

Survey of Business Owners - 3

Economic Research Service - 3

Customs and Border Protection - 3

Indian Health Service - 3

Individual Taxpayer Identification Numbers - 3

Personally Identifiable Information - 3

Code of Federal Regulations - 3

Department of Labor - 3

Wholesale Trade - 3

Educational Services - 3

Agriculture, Forestry - 3

Sloan Foundation - 3

American Statistical Association - 3

International Trade Research Report - 3

Business Research and Development and Innovation Survey - 3

University of California Los Angeles - 3

Energy Information Administration - 3

Environmental Protection Agency - 3

Minnesota Population Center - 3

Establishment Micro Properties - 3

Yale University - 3

PSID - 3

Survey of Consumer Finances - 3

Characteristics of Business Owners - 3

survey - 29

data - 29

respondent - 20

datasets - 19

enterprise - 18

census data - 18

record - 18

data census - 18

statistical - 17

payroll - 17

census bureau - 16

agency - 16

microdata - 16

employed - 16

employee - 15

database - 14

report - 13

analysis - 13

manufacturing - 13

workforce - 12

econometric - 12

company - 11

incorporated - 11

population - 11

sector - 11

establishment - 11

estimating - 11

sale - 11

matching - 10

industrial - 10

corporation - 9

identifier - 9

use census - 9

researcher - 9

quarterly - 9

expenditure - 9

census employment - 8

employee data - 8

inventory - 8

employment data - 8

economist - 8

statistician - 8

proprietorship - 8

research - 8

matched - 7

coverage - 7

employ - 7

census research - 7

innovation - 7

patent - 7

longitudinal - 7

gdp - 7

revenue - 7

production - 7

manufacturer - 7

residential - 7

department - 6

insurance - 6

information - 6

work census - 6

employment statistics - 6

employer household - 6

longitudinal employer - 6

organizational - 6

patenting - 6

business data - 6

censuses surveys - 6

census survey - 6

growth - 6

census file - 6

research census - 6

aggregate - 6

earnings - 6

labor - 6

information census - 5

disclosure - 5

ssa - 5

metropolitan - 5

geographic - 5

healthcare - 5

survey data - 5

acquisition - 5

economic census - 5

worker - 5

clerical - 5

study - 5

residence - 5

housing - 5

invention - 5

indian - 4

socioeconomic - 4

assessed - 4

rural - 4

market - 4

export - 4

corp - 4

associate - 4

firms patents - 4

recession - 4

warehousing - 4

businesses census - 4

census use - 4

technology - 4

technological - 4

venture - 4

entrepreneurship - 4

citizen - 4

records census - 4

linkage - 4

linked census - 4

irs - 4

job - 4

estimation - 4

estimates employment - 4

macroeconomic - 4

corporate - 3

subsidiary - 3

proprietor - 3

firm data - 3

ethnicity - 3

hispanic - 3

area - 3

region - 3

geography - 3

disparity - 3

sampling - 3

empirical - 3

environmental - 3

sample - 3

workforce indicators - 3

confidentiality - 3

prevalence - 3

investment - 3

import - 3

trademark - 3

patents firms - 3

reporting - 3

establishments data - 3

census years - 3

enrollment - 3

household surveys - 3

provided census - 3

competitiveness - 3

exporting - 3

exporter - 3

wholesale - 3

firms export - 3

importer - 3

tariff - 3

entrepreneurial - 3

hiring - 3

workplace - 3

employment dynamics - 3

privacy - 3

statistical disclosure - 3

housing survey - 3

aggregation - 3

commute - 3

imputation - 3

expense - 3

imputed - 3

firm patenting - 3

innovative - 3

ancestry - 3

census records - 3

labor statistics - 3

health insurance - 3

aging - 3

demand - 3

census business - 3

Viewing papers 21 through 30 of 81


  • Working Paper

    Reservation Nonemployer and Employer Establishments: Data from U.S. Census Longitudinal Business Databases

    December 2018

    Working Paper Number:

    CES-18-50

    The presence of businesses on American Indian reservations has been difficult to analyze due to limited data. Akee, Mykerezi, and Todd (AMT; 2017) geocoded confidential data from the U.S. Census Longitudinal Business Database to identify whether employer establishments were located on or off American Indian reservations and then compared federally recognized reservations and nearby county areas with respect to their per capita number of employers and jobs. We use their methods and the U.S. Census Integrated Longitudinal Business Database to develop parallel results for nonemployer establishments and for the combination of employer and nonemployer establishments. Similar to AMT's findings, we find that reservations and nearby county areas have a similar sectoral distribution of nonemployer and nonemployer-plus-employer establishments, but reservations have significantly fewer of them in nearly all sectors, especially when the area population is below 15,000. By contrast to AMT, the average size of reservation nonemployer establishments, as measured by revenue (instead of the jobs measure AMT used for employers), is smaller than the size of nonemployers in nearby county areas, and this is true in most industries as well. The most significant exception is in the retail sector. Geographic and demographic factors, such as population density and per capita income, statistically account for only a small portion of these differences. However, when we assume that nonemployer establishments create the equivalent of one job and use combined employer-plus-nonemployer jobs to measure establishment size, the employer job numbers dominate and we parallel AMT's finding that, due to large job counts in the Arts/Entertainment/Recreation and Public Administration sectors, reservations on average have slightly more jobs per resident than nearby county areas.
    View Full Paper PDF
  • Working Paper

    Squeezing More Out of Your Data: Business Record Linkage with Python

    November 2018

    Working Paper Number:

    CES-18-46

    Integrating data from different sources has become a fundamental component of modern data analytics. Record linkage methods represent an important class of tools for accomplishing such integration. In the absence of common disambiguated identifiers, researchers often must resort to ''fuzzy" matching, which allows imprecision in the characteristics used to identify common entities across dfferent datasets. While the record linkage literature has identified numerous individually useful fuzzy matching techniques, there is little consensus on a way to integrate those techniques within a single framework. To this end, we introduce the Multiple Algorithm Matching for Better Analytics (MAMBA), an easy-to-use, flexible, scalable, and transparent software platform for business record linkage applications using Census microdata. MAMBA leverages multiple string comparators to assess the similarity of records using a machine learning algorithm to disambiguate matches. This software represents a transparent tool for researchers seeking to link external business data to the Census Business Register files.
    View Full Paper PDF
  • Working Paper

    LEHD Infrastructure S2014 files in the FSRDC

    September 2018

    Authors: Lars Vilhuber

    Working Paper Number:

    CES-18-27R

    The Longitudinal Employer-Household Dynamics (LEHD) Program at the U.S. Census Bureau, with the support of several national research agencies, maintains a set of infrastructure files using administrative data provided by state agencies, enhanced with information from other administrative data sources, demographic and economic (business) surveys and censuses. The LEHD Infrastructure Files provide a detailed and comprehensive picture of workers, employers, and their interaction in the U.S. economy. This document describes the structure and content of the 2014 Snapshot of the LEHD Infrastructure files as they are made available in the Census Bureau's secure and restricted-access Research Data Center network. The document attempts to provide a comprehensive description of all researcher-accessible files, of their creation, and of any modifications made to the files to facilitate researcher access.
    View Full Paper PDF
  • Working Paper

    Disclosure Limitation and Confidentiality Protection in Linked Data

    January 2018

    Working Paper Number:

    CES-18-07

    Confidentiality protection for linked administrative data is a combination of access modalities and statistical disclosure limitation. We review traditional statistical disclosure limitation methods and newer methods based on synthetic data, input noise infusion and formal privacy. We discuss how these methods are integrated with access modalities by providing three detailed examples. The first example is the linkages in the Health and Retirement Study to Social Security Administration data. The second example is the linkage of the Survey of Income and Program Participation to administrative data from the Internal Revenue Service and the Social Security Administration. The third example is the Longitudinal Employer-Household Dynamics data, which links state unemployment insurance records for workers and firms to a wide variety of censuses and surveys at the U.S. Census Bureau. For examples, we discuss access modalities, disclosure limitation methods, the effectiveness of those methods, and the resulting analytical validity. The final sections discuss recent advances in access modalities for linked administrative data.
    View Full Paper PDF
  • Working Paper

    The Need to Account for Complex Sampling Features when Analyzing Establishment Survey Data: An Illustration using the 2013 Business Research and Development and Innovation Survey (BRDIS)

    January 2017

    Working Paper Number:

    CES-17-62

    The importance of correctly accounting for complex sampling features when generating finite population inferences based on complex sample survey data sets has now been clearly established in a variety of fields, including those in both statistical and non statistical domains. Unfortunately, recent studies of analytic error have suggested that many secondary analysts of survey data do not ultimately account for these sampling features when analyzing their data, for a variety of possible reasons (e.g., poor documentation, or a data producer may not provide the information in a publicuse data set). The research in this area has focused exclusively on analyses of household survey data, and individual respondents. No research to date has considered how analysts are approaching the data collected in establishment surveys, and whether published articles advancing science based on analyses of establishment behaviors and outcomes are correctly accounting for complex sampling features. This article presents alternative analyses of real data from the 2013 Business Research and Development and Innovation Survey (BRDIS), and shows that a failure to account for the complex design features of the sample underlying these data can lead to substantial differences in inferences about the target population of establishments for the BRDIS.
    View Full Paper PDF
  • Working Paper

    Effects of a Government-Academic Partnership: Has the NSF-Census Bureau Research Network Helped Improve the U.S. Statistical System?

    January 2017

    Working Paper Number:

    CES-17-59R

    The National Science Foundation-Census Bureau Research Network (NCRN) was established in 2011 to create interdisciplinary research nodes on methodological questions of interest and significance to the broader research community and to the Federal Statistical System (FSS), particularly the Census Bureau. The activities to date have covered both fundamental and applied statistical research and have focused at least in part on the training of current and future generations of researchers in skills of relevance to surveys and alternative measurement of economic units, households, and persons. This paper discusses some of the key research findings of the eight nodes, organized into six topics: (1) Improving census and survey data collection methods; (2) Using alternative sources of data; (3) Protecting privacy and confidentiality by improving disclosure avoidance; (4) Using spatial and spatio-temporal statistical modeling to improve estimates; (5) Assessing data cost and quality tradeoffs; and (6) Combining information from multiple sources. It also reports on collaborations across nodes and with federal agencies, new software developed, and educational activities and outcomes. The paper concludes with an evaluation of the ability of the FSS to apply the NCRN's research outcomes and suggests some next steps, as well as the implications of this research-network model for future federal government renewal initiatives.
    View Full Paper PDF
  • Working Paper

    Reservation Employer Establishments: Data from the U.S. Census Longitudinal Business Database

    January 2017

    Working Paper Number:

    CES-17-57

    The presence of employers and jobs on American Indian reservations has been difficult to analyze due to limited data. We are the first to geocode confidential data on employer establishments from the U.S. Census Longitudinal Business Database to identify location on or off American Indian reservations. We identify the per capita establishment count and jobs in reservation-based employer establishments for most federally recognized reservations. Comparisons to nearby non-reservation areas in the lower 48 states across 18 industries reveal that reservations have a similar sectoral distribution of employer establishments but have significantly fewer of them in nearly all sectors, especially when the area population is below 15,000 (as it is on the vast majority of reservations and for the majority of the reservation population). By contrast, the total number of jobs provided by reservation establishments is, on average, at par with or somewhat higher than in nearby county areas but is concentrated among casino-related and government employers. An implication is that average job numbers per establishment are higher in these sectors on reservations, including those with populations below 15,000, while the remaining industries are typically sparser within reservations (in firm count and jobs per capita). Geographic and demographic factors, such as population density and per capita income, statistically account for some but not all of these differences.
    View Full Paper PDF
  • Working Paper

    Recalculating... : How Uncertainty in Local Labor Market Definitions Affects Empirical Findings

    January 2017

    Working Paper Number:

    CES-17-49R

    This paper evaluates the use of commuting zones as a local labor market definition. We revisit Tolbert and Sizer (1996) and demonstrate the sensitivity of definitions to two features of the methodology: a cluster dissimilarity cutoff, or the count of clusters, and uncertainty in the input data. We show how these features impact empirical estimates using a standard application of commuting zones and an example from related literature. We conclude with advice to researchers on how to demonstrate the robustness of empirical findings to uncertainty in the definition of commuting zones
    View Full Paper PDF
  • Working Paper

    Examining Multi-Level Correlates of Suicide by Merging NVDRS and ACS Data

    January 2017

    Working Paper Number:

    CES-17-25

    This paper describes a novel database and an associated suicide event prediction model that surmount longstanding barriers in suicide risk factor research. The database comingles person-level records from the National Violent Death Reporting System (NVDRS) and the American Community Survey (ACS) to establish a case-control study sample that includes all identified suicide cases, while faithfully reflecting general population sociodemographics, in sixteen USA states during the years 2005 2011. It supports a statistical model of individual suicide risk that accommodates person-level factors and the moderation of these factors by their community rates. Named the United States Multi-Level Suicide Data Set (US-MSDS), the database was developed outside the RDC laboratory using publicly available ACS microdata, and reconstructed inside the laboratory using restricted access ACS microdata. Analyses of the latter version yielded findings that largely amplified but also extended those obtained from analyses of the former. This experience shows that the analytic precision achievable using restricted access ACS data can play an important role in conducting social research, although it also indicates that publicly available ACS data have considerable value in conducting preliminary analyses and preparing to use an RDC laboratory. The database development strategy may interest scientists investigating sociodemographic risk factors for other types of low-frequency mortality.
    View Full Paper PDF
  • Working Paper

    R&D, Attrition and Multiple Imputation in BRDIS

    January 2017

    Working Paper Number:

    CES-17-13

    Multiple imputation in business establishment surveys like BRDIS, an annual business survey in which some companies are sampled every year or multiple years, may enhance the estimates of total R&D in addition to helping researchers estimate models with subpopulations of small sample size. Considering a panel of BRDIS companies throughout the years 2008 to 2013 linked to LBD data, this paper uses the conclusions obtained with missing data visualization and other explorations to come up with a strategy to conduct multiple imputation appropriate to address the item nonresponse in R&D expenditures. Because survey design characteristics are behind much of the item and unit nonresponse, multiple imputation of missing data in BRDIS changes the estimates of total R&D significantly and alters the conclusions reached by models of the determinants of R&D investment obtained with complete case analysis.
    View Full Paper PDF