This paper develops and applies a method for directly estimating a multivariate, autoregressive moving-average (ARMA) model with mixed-frequency, time-series data. Unlike standard, single-frequency methods, the method does not require the data to be transformed to a single frequency (by temporally aggregating higher-frequency data to lower frequencies for interpolating lower-frequency data to higher frequencies) or the model to be restricted by frequency. Subject to computational constraints, the method can handle any number of variable and frequencies. In addition, variable can be treated as temporally aggregated and observed with errors and delays. The key to the method is to view lower-frequency data as periodically missing and to use the missing-data variant of the Kalman filter.
In the application, a bivariate, ARMA model is estimated with monthly observations on total employment and quarterly observations on real GNP, in the U.S., for January 1958 to December 1978. The estimated model is, then, used to compute monthly forecasts of the variables for 1 to 12 months ahead, for January 1979 to December 1988. Compared with GNP forecasts, in particular, for similar periods produced by established econometric and time series models, present GNP forecasts are generally more accurate for 1 to 4 months ahead and about equally or slightly less accurate for 5 to 12 months ahead. The application, thus, shows that the present method is tractable and able to effectively exploit cross-frequency sample information, in ARMA estimate and forecasting, which standard methods cannot exploit at all.
-
Building the Census Bureau Index of Economic Activity (IDEA)
March 2023
Working Paper Number:
CES-23-15
The Census Bureau Index of Economic Activity (IDEA) is constructed from 15 of the Census Bureau's primary monthly economic time series. The index is intended to provide a single time series reflecting, to the extent possible, the variation over time in the whole set of component series. The component series provide monthly measures of activity in retail and wholesale trade, manufacturing, construction, international trade, and business formations. Most of the input series are Principal Federal Economic Indicators. The index is constructed by applying the method of principal components analysis (PCA) to the time series of monthly growth rates of the seasonally adjusted component series, after standardizing the growth rates to series with mean zero and variance 1. Similar PCA approaches have been used for the construction of other economic indices, including the Chicago Fed National Activity Index issued by the Federal Reserve Bank of Chicago, and the Weekly Economic Index issued by the Federal Reserve Bank of New York. While the IDEA is constructed from time series of monthly data, it is calculated and published every business day, and so is updated whenever a new monthly value is released for any of its component series. Since release dates of data values for a given month vary across the component series, with slight variations in the monthly release date for any one component series, updates to the index are frequent. It is unavoidably the case that, at almost all updates, some of the component series lack observations for the current (most recent) data month. To address this situation, component series that are one month behind are predicted (nowcast) for the current index month, using a multivariate autoregressive time series model. This report discusses the input series to the index, the construction of the index by PCA, and the nowcasting procedure used. The report then examines some properties of the index and its relation to quarterly U.S. Gross Domestic Product and to some monthly non-Census Bureau economic indicators.
View Full
Paper PDF
-
Dynamically Consistent Noise Infusion and Partially Synthetic Data as Confidentiality Protection Measures for Related Time Series
July 2012
Working Paper Number:
CES-12-13
The Census Bureau's Quarterly Workforce Indicators (QWI) provide detailed quarterly statistics on employment measures such as worker and job flows, tabulated by worker characteristics in various combinations. The data are released for several levels of NAICS industries and geography, the lowest aggregation of the latter being counties. Disclosure avoidance methods are required to protect the information about individuals and businesses that contribute to the underlying data. The QWI disclosure avoidance mechanism we describe here relies heavily on the use of noise infusion through a permanent multiplicative noise distortion factor, used for magnitudes, counts, differences and ratios. There is minimal suppression and no complementary suppressions. To our knowledge, the release in 2003 of the QWI was the first large-scale use of noise infusion in any official statistical product. We show that the released statistics are analytically valid along several critical dimensions { measures are unbiased and time series properties are preserved. We provide an analysis of the degree to which confidentiality is protected. Furthermore, we show how the judicious use of synthetic data, injected into the tabulation process, can completely eliminate suppressions, maintain analytical validity, and increase the protection of the underlying confidential data.
View Full
Paper PDF
-
Business Applications as a Leading Economic Indicator?
May 2021
Working Paper Number:
CES-21-09R
How are applications to start new businesses related to aggregate economic activity? This paper explores the properties of three monthly business application series from the U.S. Census Bureau's Business Formation Statistics as economic indicators: all business applications, business applications that are relatively likely to turn into new employer businesses ('likely employers'), and the residual series -- business applications that have a relatively low rate of becoming employers ('likely non-employers'). Growth in applications for likely employers significantly leads total nonfarm employment growth and has a strong positive correlation with it. Furthermore, growth in applications for likely employers leads growth in most of the monthly Principal Federal Economic Indicators (PFEIs). Motivated by our findings, we estimate a dynamic factor model (DFM) to forecast nonfarm employment growth over a 12-month period using the PFEIs and the likely employers series. The latter improves the model's forecast, especially in the years following the turning points of the Great Recession and the COVID-19 pandemic. Overall, applications for likely employers are a strong leading indicator of monthly PFEIs and aggregate economic activity, whereas applications for likely non-employers provide early information about changes in increasingly prevalent self-employment activity in the U.S. economy.
View Full
Paper PDF
-
JOB-TO-JOB (J2J) Flows: New Labor Market Statistics From Linked Employer-Employee Data
September 2014
Working Paper Number:
CES-14-34
Flows of workers across jobs are a principal mechanism by which labor markets allocate workers to optimize productivity. While these job flows are both large and economically important, they represent a significant gap in available economic statistics. A soon to be released data product from the U.S. Census Bureau will fill this gap. The Job-to-Job (J2J) flow statistics provide estimates of worker flows across jobs, across different geographic labor markets, by worker and firm characteristics, including direct job-to-job flows as well as job changes with intervening nonemployment. In this paper, we describe the creation of the public-use data product on job-to-job flows. The data underlying the statistics are the matched employer-employee data from the U.S. Census Bureau's Longitudinal Employer-Household Dynamics program. We describe definitional issues and the identification strategy for tracing worker movements between employers in administrative data. We then compare our data with related series and discuss similarities and differences. Lastly, we describe disclosure avoidance techniques for the public use file, and our methodology for estimating national statistics when there is partially missing geography.
View Full
Paper PDF
-
Labor Reallocation, Employment, and Earnings: Vector Autoregression Evidence
January 2017
Working Paper Number:
CES-17-11R
Analysis of the labor market has given increasing attention to the reallocation of jobs across employers and workers across jobs. However, whether and how job reallocation and labor market 'churn' affects the health of the labor market remains an open question. In this paper, we present time series evidence for the U.S. 1993-2013 and consider the relationship between labor reallocation, employment, and earnings using a vector autoregression (VAR) framework. We find that an increase in labor market churn by 1 percentage point predicts that, in the next quarter, employment will increase by 100 to 560 thousand jobs, lowering the unemployment rate by 0.05 to 0.25 percentage points. Job destruction does not predict future changes in employment but a 1 percentage point increase in job destruction leads to an increase in future unemployment 0.14 to 0.42 percentage points. We find mixed results on the relationship between labor reallocation rates and earnings: we nd that, especially for earnings derived from administrative records data, a 1 percentage point increase to either job destruction or churn leads to increased earnings of less than 2 percent. Results vary substantially depending on the earnings measure we use, and so the evidence inconsistent on whether productivity-enhancing aspects of churn and job destruction provide earnings gains for workers in aggregate. Our findings on churn leading to increased employment and a lower unemployment rate are consistent with models of replacement hiring and vacancy chains.
View Full
Paper PDF
-
Technology Locks, Creative Destruction And Non-Convergence In Productivity Levels
April 1995
Working Paper Number:
CES-95-06
This paper presents a simple solution to a new model that seeks to explain the distribution of plants across productivity levels within an industry, and empirically confirms some key predictions using the U.S. textile industry. In the model, plants are locked into a given productivity level, until they exit or retool. Convex costs of adjustment captures the fact that more productive plants expand faster. Provided there is technical change, productivity levels do not converge; the model achieves persistent dispersion in productivity levels within the context of a distortion free competitive equilibrium. The equilibrium, however, is rather turbulent; plants continually come on line with the cutting edge technology, gradually expand and finally exit or retool when they cease to recover their variable costs. The more productive plants create jobs, while the less productive destroy them. The model establishes a close link between productivity growth and dispersion in productivity levels; more rapid productivity growth leads to more widespread dispersion. This prediction is empirically confirmed. Additionally, the model provides an explanation for S-shaped diffusion.
View Full
Paper PDF
-
Estimation of Job-to-Job Flow Rates under Partially Missing Geography
September 2012
Working Paper Number:
CES-12-29
Integration of data from different regions presents challenges for the calculation of entitylevel longitudinal statistics with a strong geographic component: for example, movements between employers, migration, business dynamics, and health statistics. In this paper, we consider the estimation of worker-level employment statistics when the geographies (in our application, US states) over which such measures are defined are partially missing. We focus on the recent pilot set of job-to-job flow statistics produced by the US Census Bureau's Longitudinal Employer- Household Dynamics (LEHD) program, which measure the frequency of worker movements between jobs and into and out of nonemployment. LEHD's coverage of the labor force gradually increases during the 1990s and 2000s because some states have a longer time series than others, so employment transitions involving missing states are only partially or not at all observed. We propose and implement a method for estimating national-level job-to-job flow statistics that involves dropping observed states to recover the relationship between missing states and directly tabulated job-to-job flow rates. Using the estimated relationship between the observable characteristics of the missing states and changes in the employment measures, we provide estimates of the rates of job-to-job, and job-to-nonemployment, job-to-nonemploymentto- job flows were all states uniformly available.
View Full
Paper PDF
-
Expanding the Frontier of Economic Statistics Using Big Data: A Case Study of Regional Employment
July 2024
Working Paper Number:
CES-24-37
Big data offers potentially enormous benefits for improving economic measurement, but it also presents challenges (e.g., lack of representativeness and instability), implying that their value is not always clear. We propose a framework for quantifying the usefulness of these data sources for specific applications, relative to existing official sources. We specifically weigh the potential benefits of additional granularity and timeliness, while examining the accuracy associated with any new or improved estimates, relative to comparable accuracy produced in existing official statistics. We apply the methodology to employment estimates using data from a payroll processor, considering both the improvement of existing state-level estimates, but also the production of new, more timely, county-level estimates. We find that incorporating payroll data can improve existing state-level estimates by 11% based on out-of-sample mean absolute error, although the improvement is considerably higher for smaller state-industry cells. We also produce new county-level estimates that could provide more timely granular estimates than previously available. We develop a novel test to determine if these new county-level estimates have errors consistent with official series. Given the level of granularity, we cannot reject the hypothesis that the new county estimates have an accuracy in line with official measures, implying an expansion of the existing frontier. We demonstrate the practical importance of these experimental estimates by investigating a hypothetical application during the COVID-19 pandemic, a period in which more timely and granular information could have assisted in implementing effective policies. Relative to existing estimates, we find that the alternative payroll data series could help identify areas of the country where employment was lagging. Moreover, we also demonstrate the value of a more timely series.
View Full
Paper PDF
-
NOISE INFUSION AS A CONFIDENTIALITY PROTECTION MEASURE FOR GRAPH-BASED STATISTICS
September 2014
Working Paper Number:
CES-14-30
We use the bipartite graph representation of longitudinally linked em-ployer-employee data, and the associated projections onto the employer and em-ployee nodes, respectively, to characterize the set of potential statistical summar-ies that the trusted custodian might produce. We consider noise infusion as the primary confidentiality protection method. We show that a relatively straightfor-ward extension of the dynamic noise-infusion method used in the U.S. Census Bureau's Quarterly Workforce Indicators can be adapted to provide the same confidentiality guarantees for the graph-based statistics: all inputs have been modified by a minimum percentage deviation (i.e., no actual respondent data are used) and, as the number of entities contributing to a particular statistic increases, the accuracy of that statistic approaches the unprotected value. Our method also ensures that the protected statistics will be identical in all releases based on the same inputs.
View Full
Paper PDF
-
Agent Heterogeneity and Learning: An Application to Labor Markets
October 2002
Working Paper Number:
tp-2002-20
I develop a matching model with heterogeneous workers, rms, and worker-firm
matches, and apply it to longitudinal linked data on employers and employees. Workers
vary in their marginal product when employed and their value of leisure when unemployed.
Firms vary in their marginal product and cost of maintaining a vacancy. The
marginal product of a worker-firm match also depends on a match-specific interaction
between worker and rm that I call match quality. Agents have complete information
about worker and rm heterogeneity, and symmetric but incomplete information about
match quality. They learn its value slowly by observing production outcomes. There
are two key results. First, under a Nash bargain, the equilibrium wage is linear in a
person-specific component, a firm-specific component, and the posterior mean of beliefs
about match quality. Second, in each period the separation decision depends only on
the posterior mean of beliefs and person and rm characteristics. These results have
several implications for an empirical model of earnings with person and rm eects.
The rst implies that residuals within a worker-firm match are a martingale; the second
implies the distribution of earnings is truncated.
I test predictions from the matching model using data from the Longitudinal
Employer-Household Dynamics (LEHD) Program at the US Census Bureau. I present
both xed and mixed model specifications of the equilibrium wage function, taking
account of structural aspects implied by the learning process. In the most general
specification, earnings residuals have a completely unstructured covariance within a
worker-firm match. I estimate and test a variety of more parsimonious error structures,
including the martingale structure implied by the learning process. I nd considerable
support for the matching model in these data.
View Full
Paper PDF