-
Evaluating the Use of Commercial Data to Improve Survey Estimates of Property Taxes
August 2016
Working Paper Number:
carra-2016-06
While commercial data sources offer promise to statistical agencies for use in production of official statistics, challenges can arise as the data are not collected for statistical purposes. This paper evaluates the use of 2008-2010 property tax data from CoreLogic, Inc. (CoreLogic), aggregated from county and township governments from around the country, to improve 2010 American Community Survey (ACS) estimates of property tax amounts for single-family homes. Particularly, the research evaluates the potential to use CoreLogic to reduce respondent burden, to study survey response error and to improve adjustments for survey nonresponse. The research found that the coverage of the CoreLogic data varies between counties as does the correspondence between ACS and CoreLogic property taxes. This geographic variation implies that different approaches toward using CoreLogic are needed in different areas of the country. Further, large differences between CoreLogic and ACS property taxes in certain counties seem to be due to conceptual differences between what is collected in the two data sources. The research examines three counties, Clark County, NV, Philadelphia County, PA and St. Louis County, MO, and compares how estimates would change with different approaches using the CoreLogic data. Mean county property tax estimates are highly sensitive to whether ACS or CoreLogic data are used to construct estimates. Using CoreLogic data in imputation modeling for nonresponse adjustment of ACS estimates modestly improves the predictive power of imputation models, although estimates of county property taxes and property taxes by mortgage status are not very sensitive to the imputation method.
View Full
Paper PDF
-
Using Partially Synthetic Microdata to Protect Sensitive Cells in Business Statistics
February 2016
Working Paper Number:
CES-16-10
We describe and analyze a method that blends records from both observed and synthetic microdata into public-use tabulations on establishment statistics. The resulting tables use synthetic data only in potentially sensitive cells. We describe different algorithms, and present preliminary results when applied to the Census Bureau's Business Dynamics Statistics and Synthetic Longitudinal Business Database, highlighting accuracy and protection afforded by the method when compared to existing public-use tabulations (with suppressions).
View Full
Paper PDF
-
Statistics on the International Trade Administration's Global Markets Program
September 2015
Working Paper Number:
CES-15-17
Recent mandates for evidence-based policy choices from both the Executive and Legislative branches of the federal government underscore the importance of understanding the relationship between program participation and business outcomes. In this paper, we examine the correlations between participation in an export-promotion program and business outcomes. We use this experience to provide more general lessons learned about combining program data on treatments with Census Bureau micro data that can be used as a control. Note this paper does not evaluate a program, but instead provides critical information about a program.
The mission of the Commercial Service/Global Markets program is to help companies either start or increase their exports of goods and services. It pursues this mission through advocacy, events, and counseling. This study looks at a very small part of the overall program. While we cannot rule-out several sources of bias in our results, we do observe several consistent patterns across our models. In particular, program participation is positively correlated with export growth and change and, for small businesses, also with positive employment growth. However, overall, and for large firms in particular, there is a negative correlation with employment growth and counseling. The paper concludes with a 'Lessons Learned' section that highlights areas where measurement can be improved.
View Full
Paper PDF
-
Business Dynamics of Innovating Firms: Linking U.S. Patents with Administrative Data on Workers and Firms
July 2015
Working Paper Number:
CES-15-19
This paper discusses the construction of a new longitudinal database tracking inventors and patent-owning firms over time. We match granted patents between 2000 and 2011 to administrative databases of firms and workers housed at the U.S. Census Bureau. We use inventor information in addition to the patent assignee firm name to and improve on previous efforts linking patents to firms. The triangulated database allows us to maximize match rates and provide validation for a large fraction of matches. In this paper, we describe the construction of the database and explore basic features of the data. We find patenting firms, particularly young patenting firms, disproportionally contribute jobs to the U.S. economy. We find patenting is a relatively rare event among small firms but that most patenting firms are nevertheless small, and that patenting is not as rare an event for the youngest firms compared to the oldest firms. While manufacturing firms are more likely to patent than firms in other sectors, we find most patenting firms are in the services and wholesale sectors. These new data are a product of collaboration within the U.S. Department of Commerce, between the U.S. Census Bureau and the U.S. Patent and Trademark Office.
View Full
Paper PDF
-
Matching Addresses between Household Surveys and Commercial Data
July 2015
Working Paper Number:
carra-2015-04
Matching third-party data sources to household surveys can benefit household surveys in a number of ways, but the utility of these new data sources depends critically on our ability to link units between data sets. To understand this better, this report discusses potential modifications to the existing match process that could potentially improve our matches. While many changes to the matching procedure produce marginal improvements in match rates, substantial increases in match rates can only be achieved by relaxing the definition of a successful match. In the end, the results show that the most important factor determining the success of matching procedures is the quality and composition of the data sets being matched.
View Full
Paper PDF
-
The Impact of Heterogeneous NOx Regulations on Distributed Electricity Generation in U.S. Manufacturing
April 2015
Working Paper Number:
CES-15-12
The US EPA's command-and-control NOx policies of the early 1990s are associated with a 3.1 percentage point reduction in the likelihood of manufacturing plants vertically integrating the electricity generation process. During the same period California adopted a cap-and-trade program for NOx emissions that resulted in no significant impact on distributed electricity generation in manufacturing. These results suggest that traditional command-and-control approaches to air pollution may exacerbate other market failures such as the energy efficiency gap, because distributed generation is generally recognized as a more energy efficient means of producing electricity
View Full
Paper PDF
-
Evaluating the Long-Term Effect of NIST MEP Services on Establishment Performance
March 2015
Working Paper Number:
CES-15-09
This work examines the effects of receipt of business assistance services from the Manufacturing Extension Partnership (MEP) on manufacturing establishment performance. Several measures of performance are considered: (1) change in value-added per employee (a measure of productivity); (2) change in sales per worker; (3) change in employment; and (4) establishment survival. To analyze these relationships, we merged program records from the MEP's client and project information files with administrative records from the Census of Manufacturers and other Census databases over the periods 1997'2002 and 2002'2007 to compare the outcomes and performance of 'served' and 'unserved' manufacturing establishments. The approach builds on, updates, and expands upon earlier studies comparing matched MEP client and non-client performance over time periods ending in 1992 and 2002. Our results generally indicate that MEP services had positive and significant impacts on establishment productivity and sales per worker for the 2002'2007 period with some exceptions based on employment size, industry, and type of service provided. MEP services also increased the probability of establishment survival for the 1997'2007 period. Regardless of econometric model specification, MEP clients with 1'19 employees have statistically significant and higher levels of labor productivity growth. We also observed significant productivity differences associated with MEP services by broad sector, with higher impacts over the 2002'2007 time period in the durable goods manufacturing sector. The study further finds that establishments receiving MEP assistance are more likely to survive than those that do not receive MEP assistance. Detailed findings of the study, as well as caveats and limitations, are discussed in the paper.
View Full
Paper PDF
-
INNOVATION OUTPUT CHOICES AND CHARACTERISTICS OF FIRMS IN THE U.S.
October 2014
Working Paper Number:
CES-14-42
This paper uses new business micro data from the Business Research and Development and Innovation Survey (BRDIS) for the years 2008-2011 to relate the discrete innovation choices made by U.S. companies to features of the company that have long been considered to be important correlates of innovation. We use multinomial logit to model those choices. Bloch and Lopez-Bassols (2009) used the Community Innovation Surveys (CIS) to classify companies according dual, technological or output-based innovation constructs. We found that for each of those constructs of innovation combinations considered, manufacturing and engaging in intellectual property transfer increase the odds of choosing innovation strategies that involve more than one type of categories (for example, both goods and services, or both tech and non-tech) and radical innovations, controlling form size, productivity, time and type of R&D. Company size and company productivity as well as time do not lean the choices in any particular direction. These associations are robust across the three multinomial choice models that we have considered. In contrast with other studies, we have been able to use companies that do and companies that do not innovate, and this has allowed to rule out to some extent selectivity bias.
View Full
Paper PDF
-
RECOVERING THE ITEM-LEVEL EDIT AND IMPUTATION FLAGS IN THE 1977-1997 CENSUSES OF MANUFACTURES
September 2014
Working Paper Number:
CES-14-37
As part of processing the Census of Manufactures, the Census Bureau edits some data items and imputes for missing data and some data that is deemed erroneous. Until recently it was difficult for researchers using the plant-level microdata to determine which data items were changed or imputed during the editing and imputation process, because the edit/imputation processing flags were not available to researchers. This paper describes the process of reconstructing the edit/imputation flags for variables in the 1977, 1982, 1987, 1992, and 1997 Censuses of Manufactures using recently recovered Census Bureau files. Thepaper also reports summary statistics for the percentage of cases that are imputed for key variables. Excluding plants with fewer than 5 employees, imputation rates for several key variables range from 8% to 54% for the manufacturing sector as a whole, and from 1% to 72% at the 2-digit SIC industry level.
View Full
Paper PDF
-
Creating Linked Historical Data: An Assessment of the Census Bureau's Ability to Assign Protected Identification Keys to the 1960 Census
September 2014
Working Paper Number:
carra-2014-12
In order to study social phenomena over the course of the 20th century, the Census Bureau is investigating the feasibility of digitizing historical census records and linking them to contemporary data. However, historical censuses have limited personally identifiable information available to match on. In this paper, I discuss the problems associated with matching older censuses to contemporary data files, and I describe the matching process used to match a small sample of the 1960 census to the Social Security Administration Numeric Identification System.
View Full
Paper PDF