(Open Access) A PANIC Attack on Unit Roots and Cointegration (2004) | Jushan Bai

A PANIC ATTACK ON UNIT ROOTS AND COINTEGRATION

Jushan Bai∗

Serena Ng†

December 2001

Abstract

This paper develops a new methodology that makes use of the factor structure of large dimensional panels to understand the nature of non-stationarity in the data. We refer to it as PANIC: a ‘Panel Analysis of Non-stationarity in Idiosyncratic and Common components’. PANIC consists of univariate and panel tests with a number of novel features. It can detect whether the non-stationarity is pervasive, or variable-specific, or both. It tests the components of the data instead of the observed series. Inference is therefore more accurate when the components have different orders of integration. PANIC also permits the construction of valid panel tests even when cross-section correlation invalidates pooling of statistics constructed using the observed data. The key to PANIC is consistent estimation of the components even when the regressions are individually spurious. We provide a rigorous theory for estimation and inference. In Monte Carlo simulations, the tests have very good size and power. PANIC is applied to a panel of inflation series.

Keywords: Panel data, common factors, common trends, principal components

∗ Dept. of Economics, Boston College, Chestnut Hill, MA 02467. Email: Jushan.Bai@bc.edu

† Dept. of Economics, Johns Hopkins University, Baltimore, MD 21218. Email: Serena.Ng@jhu.edu

This paper was presented at the NSF 2001 Summer Symposium on Econometrics and Statistics in Berkeley, California, the CEPR/Banca d’Italia Conference in Rome, and at NYU. We thank the seminar participants and the discussants (Andrew Harvey and Marco Lippi) for many helpful comments. We also thank Todd Clark for providing us with the inflation data. The first author acknowledges financial support from the NSF (grant SBR 9709508).

1 Introduction

Knowledge of whether a series is stationary or non-stationary is important for a wide range of economic analysis. As such, unit root testing is extensively conducted in empirical work. But in spite of the development of many elegant theories, the power of univariate unit root tests is severely constrained in practice by the short span of macroeconomic time series. Panel unit root tests have since been developed with the goal of increasing power through pooling information across units. But pooling is valid only if the units are independent, an assumption that is perhaps unreasonable given that many economic models imply, and the data support, the comovement of economic variables.

In this paper, we propose a new approach to understanding non-stationarity in the data, both on a series by series basis, and from the viewpoint of a panel. Rather than treating the cross-section correlation as a nuisance, we exploit these comovements to develop new univariate statistics and valid pooled tests for the null hypothesis of non-stationarity. Our tests are applied to two components of the data, one with the characteristic that it is strongly correlated with many series, and one with the characteristic that it is largely unit specific. More precisely, we consider a factor analytic model:

$$X_{it} = D_{it} + \lambda_i' F_t + e_{it}$$

where $D_{it}$ is a polynomial trend function of order $p$, $F_t$ is an $r \times 1$ vector of common factors, and $\lambda_i$ is a vector of factor loadings. The series $X_{it}$ is the sum of a deterministic component $D_{it}$, a common component $\lambda_i' F_t$, and an error $e_{it}$ that is largely idiosyncratic. A factor model with $N$ variables will have $N$ idiosyncratic components but a small number of common factors.

A series with a factor structure is non-stationary if one or more of the common factors are non-stationary, or the idiosyncratic error is non-stationary, or both. Except by assumption, there is nothing that restricts $F_t$ to be all I(1) or all I(0). There is also nothing that rules out the possibility that $F_t$ and $e_{it}$ are integrated of different orders. These are not merely cases of theoretical interest, but also of empirical relevance. As an example, let $X_{it}$ be real output of country $i$. It may consist of a global trend component $F_{1t}$, a global cyclical component $F_{2t}$, and an idiosyncratic component ($e_{it}$) that may or may not be stationary. As another example, the inflation rate of durable goods may consist of a component that is common to all prices, and a component that is specific to durable goods.

It is well known that the sum of two time series can have dynamic properties very different from the individual series themselves. If one component is I(1) and one is I(0), it could be difficult to establish that a unit root exists from observations on $X_{it}$ alone, especially if the stationary component is large. Unit root tests on $X_{it}$ can be expected to be oversized while stationarity tests will have no power. The issue is documented in Schwert (1989), and formally analyzed in Pantula (1991), Ng and Perron (2001), among others, in the context of a negative moving-average component in the first-differenced data.

[Footnote: This is a static factor model, and is to be distinguished from the dynamic factor model being analyzed in Forni, Hallin, Lippi and Reichlin (2000).]
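To see where the negative moving-average component comes from, consider a simplified case that is purely illustrative and not taken from the paper: a single series $X_t = F_t + e_t$ with $\Delta F_t = u_t$, where $u_t$ and $e_t$ are mutually independent white noise processes with variances $\sigma_u^2$ and $\sigma_e^2$ (these symbols are assumptions introduced here). Differencing gives:

```latex
\begin{align*}
\Delta X_t &= u_t + e_t - e_{t-1}, \\
\operatorname{Var}(\Delta X_t) &= \sigma_u^2 + 2\sigma_e^2, \\
\operatorname{Cov}(\Delta X_t, \Delta X_{t-1}) &= -\sigma_e^2, \\
\operatorname{Corr}(\Delta X_t, \Delta X_{t-1})
  &= -\frac{\sigma_e^2}{\sigma_u^2 + 2\sigma_e^2}
  \;\longrightarrow\; -\tfrac{1}{2}
  \quad \text{as } \sigma_e^2/\sigma_u^2 \to \infty .
\end{align*}
```

Autocovariances beyond lag one are zero in this case, so $\Delta X_t$ behaves like an MA(1) process with a negative coefficient that moves toward $-1$ as the stationary component grows relative to the I(1) component.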

Instead of testing for the presence of unit roots in $X_{it}$, the approach proposed in this paper is to test the common factors and the idiosyncratic components separately. We refer to such a Panel Analysis of Non-stationarity in the Idiosyncratic and Common components as PANIC. PANIC allows us to determine if non-stationarity comes from a pervasive or an idiosyncratic source. To our knowledge, there does not exist a test in the literature for this purpose. PANIC can also potentially resolve three econometric problems. The first is the size issue relating to summing series with different orders of integration just mentioned. The second is a consequence of the fact that the idiosyncratic components in a factor model can only be weakly correlated across $i$ by design. In contrast, $X_{it}$ will be strongly correlated across units if the data obey a factor structure. Thus, pooled tests based upon $e_{it}$ are more likely to satisfy the cross-section independence assumption required for pooling. The third relates to power, and follows from the fact that pooled tests exploit cross-section information and are more powerful than univariate unit root tests.

Since the factors and the idiosyncratic components are both unobserved, and our objective is to test if they have unit roots, the key to our analysis is consistent estimation of these components irrespective of their stationarity properties. To this end, we propose a robust common-idiosyncratic (I-C) decomposition of the data using large dimensional panels, that is, datasets in which the number of observations in the time ($T$) and the cross-section ($N$) dimensions are both large. Loosely speaking, the large $N$ permits consistent estimation of the common variation whether or not it is stationary, while a large $T$ enables application of the relevant central limit theorems so that limiting distributions of the tests can be obtained. Robustness is achieved by a ‘differencing and re-cumulating’ estimation procedure so that I(1) and I(0) errors can be accommodated. Our results add to the growing literature on large dimensional factor analysis by showing how consistent estimates of the factors can be obtained using the method of principal components even without imposing stationarity on the errors.

Our framework differs from conventional multivariate time series models in which $N$ is small. In small $N$ analysis of cointegration, common trends and cycles, the estimation methodology being employed typically depends on whether the variables considered are all I(1) or all I(0). Pretesting for unit roots is thus necessary. Because $N$ is small, what is extracted is the trend or the cycle common to just a small number of variables. Not only is the information in many potentially relevant series left unexploited, consistent estimation of common factors is in fact not possible when the number of variables is small. In our analysis with $N$ and $T$ large, the common variation can be extracted without appealing to stationarity assumptions and/or cointegration restrictions. This makes it possible to decouple the extraction of common trends and cycles from the issue of testing stationarity.

[Footnote: See, for example, King, Plosser, Stock and Watson (1991), Engle and Kozicki (1993), and Gonzalo and Granger (1995).]

The rest of the paper is organized as follows. In Section 2, we describe the PANIC procedures and present asymptotic results for the Dickey-Fuller t test of the unit root hypothesis. As an intermediate result, we establish uniform consistency of the factor estimates even when the individual regressions are spurious. As this result is important in its own right, we devote Section 3 to the large sample properties of the factor estimates. Section 4 uses simulations to illustrate the properties of the factor estimates and the tests in finite samples. PANIC is then applied to a panel of inflation data. Proofs are given in the Appendix.

2 PANIC

The data $X_{it}$ are assumed to be generated by

$$X_{it} = c_i + \beta_i t + \lambda_i' F_t + e_{it}, \quad t = 1, \ldots, T, \tag{1}$$

$$F_{mt} = \alpha_m F_{mt-1} + u_{mt}, \quad m = 1, \ldots, r, \tag{2}$$

$$e_{it} = \rho_i e_{it-1} + \epsilon_{it}, \quad i = 1, \ldots, N. \tag{3}$$

Factor $m$ is stationary if $\alpha_m < 1$. The idiosyncratic error $e_{it}$ is stationary if $\rho_i < 1$. The objective is to understand the stationarity property of $F_{mt}$ and $e_{it}$ when these are all unobserved and must be estimated, which we do by the method of principal components.
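As a rough illustration, the data-generating process in (1) to (3) can be simulated as follows. This is a minimal sketch rather than the authors' code: the function name `simulate_panel`, the Gaussian innovations, the zero initial conditions, and the use of a single `alpha` for all factors and a single `rho` for all units are assumptions made here for brevity.

```python
import numpy as np


def simulate_panel(N=20, T=200, r=1, alpha=1.0, rho=0.5, trend=False, seed=0):
    """Simulate X_it = c_i + beta_i*t + lambda_i'F_t + e_it with AR(1) F and e.

    Illustrative assumptions (not from the paper): Gaussian innovations,
    one alpha shared by all factors, one rho shared by all units,
    and zero initial conditions.
    """
    rng = np.random.default_rng(seed)
    c = rng.normal(size=N)                               # intercepts c_i
    beta = rng.normal(size=N) if trend else np.zeros(N)  # trend slopes beta_i
    lam = rng.normal(size=(N, r))                        # loadings lambda_i
    u = rng.normal(size=(T, r))                          # factor innovations
    eps = rng.normal(size=(T, N))                        # idiosyncratic innovations

    F = np.zeros((T, r))
    e = np.zeros((T, N))
    F[0], e[0] = u[0], eps[0]
    for t in range(1, T):
        F[t] = alpha * F[t - 1] + u[t]    # equation (2)
        e[t] = rho * e[t - 1] + eps[t]    # equation (3)

    time = np.arange(1, T + 1)[:, None]   # t = 1, ..., T as a column
    X = c + time * beta + F @ lam.T + e   # equation (1), shape (T, N)
    return X, F, e, lam, c
```

Setting `alpha=1.0` with `rho` below one gives a pervasive unit root with stationary idiosyncratic errors, while `alpha` below one with `rho=1.0` gives the opposite case.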

When $e_{it}$ is I(0), the principal components estimators for $F_t$ and $\lambda_i$ have been shown to be consistent when all the factors are I(0) and when some or all of them are I(1). But consistent estimation of the factors when $e_{it}$ is I(1) has not been considered in the literature. Indeed, when $e_{it}$ has a unit root, a regression of $X_{it}$ on $F_t$ is spurious even if $F_t$ was observed, and the estimates of $\lambda_i$ and thus of $e_{it}$ will not be consistent. The validity of PANIC thus hinges on the ability to obtain estimates of $F_t$ and $e_{it}$ that preserve their orders of integration, both when $e_{it}$ is I(1) and when it is I(0). We now outline a set of procedures that accomplish this goal. Essentially, the trick is to apply the method of principal components to the first differenced data. We show in this section that inference about unit roots is not affected by the fact that $F_t$ and $e_{it}$ are not observed. We defer the discussion on the theoretical underpinnings of PANIC and the properties of factor estimates to Section 3 so as to keep unit root testing the main focus of this section.

We consider two specifications of the deterministic trend function, leading to what will be referred to as the intercept only model and the linear trend model. We assume the number of common factors ($r$) is known. To simplify the proof, we let $\epsilon_{it}$ and $u_{mt}$ be serially uncorrelated. This allows us to consider the t statistic on the first order autoregressive parameter developed in Dickey and Fuller (1979). More general errors can be permitted, provided they satisfy the assumptions stated in Section 3. Remarks to this effect will be made below.

2.1 The Intercept Only Case

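The factor model in the intercept only case is

$$X_{it} = c_i + \lambda_i' F_t + e_{it}. \tag{4}$$

We assume $E(\Delta F_t) = 0$. This is without loss of generality because if $F_t = a + \xi_t$ such that $E(\Delta \xi_t) = 0$, then $X_{it} = c_i + \lambda_i' a + \lambda_i' \xi_t + e_{it}$. The first differenced model $\Delta X_{it} = \lambda_i' \Delta \xi_t + \Delta e_{it}$ is thus observationally equivalent to $\Delta X_{it} = \lambda_i' \Delta F_t + \Delta e_{it}$. Denote

$$x_{it} = \Delta X_{it}, \quad f_t = \Delta F_t, \quad \text{and} \quad z_{it} = \Delta e_{it}. \tag{5}$$

Then the model in first-differenced form is:

$$x_{it} = \lambda_i' f_t + z_{it}. \tag{6}$$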

The test statistics are constructed as follows:

1. Difference the data and estimate $f_t$ and $\lambda_i$ from (6) by the method of principal components. To be precise, let $x$ be the $(T-1) \times N$ data matrix such that the $i$th column is $(x_{i2}, x_{i3}, \ldots, x_{iT})'$, $i = 1, 2, \ldots, N$. Let $f = (f_2, f_3, \ldots, f_T)'$ and $\Lambda = (\lambda_1, \ldots, \lambda_N)'$. The principal component estimator of $f$, denoted $\hat f$, is $\sqrt{T-1}$ times the $r$ eigenvectors corresponding to the first $r$ largest eigenvalues of the $(T-1) \times (T-1)$ matrix $xx'$. The estimated loading matrix is $\hat\Lambda = x' \hat f/(T-1)$. Define $\hat z_{it} = x_{it} - \hat\lambda_i' \hat f_t$.

2. Given $\hat f_t$, define for each $m = 1, \ldots, r$,

$$\hat F_{mt} = \sum_{s=2}^{t} \hat f_{ms}.$$

[Footnote: Consistent estimation of $r$ is possible using the method of Bai and Ng (2002) with data in differences. It can be shown that this will not affect the limiting distribution of the test statistics when the number of factors is estimated.]
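Steps 1 and 2 can be sketched in a few lines of NumPy. This is an illustrative sketch under stated assumptions, not the authors' implementation: the function name `panic_components` and its return layout are hypothetical, `X` is assumed to be a $T \times N$ array with no missing values, and `r` is taken as known. The last computation re-cumulates the estimated differenced factors, in the spirit of the ‘differencing and re-cumulating’ procedure described in the Introduction.

```python
import numpy as np


def panic_components(X, r):
    """Principal components on first-differenced data, then re-cumulation.

    X : (T, N) array of observed series (assumed complete, no missing values).
    r : number of common factors, taken as known.
    Returns f_hat (T-1, r), lam_hat (N, r), z_hat (T-1, N), F_hat (T-1, r).
    """
    x = np.diff(X, axis=0)                    # x_it = X_it - X_i,t-1, shape (T-1, N)
    Tm1 = x.shape[0]

    # Eigen-decomposition of the symmetric (T-1) x (T-1) matrix x x'.
    eigval, eigvec = np.linalg.eigh(x @ x.T)  # eigenvalues in ascending order
    top = np.argsort(eigval)[::-1][:r]        # indices of the r largest eigenvalues

    f_hat = np.sqrt(Tm1) * eigvec[:, top]     # sqrt(T-1) times the eigenvectors
    lam_hat = x.T @ f_hat / Tm1               # estimated loadings, x' f_hat / (T-1)
    z_hat = x - f_hat @ lam_hat.T             # estimated differenced idiosyncratic part

    F_hat = np.cumsum(f_hat, axis=0)          # partial sums of f_hat over s = 2, ..., t
    return f_hat, lam_hat, z_hat, F_hat
```

The eigenvectors are identified only up to sign (more generally, up to rotation), so `F_hat` should be compared with any true factor only up to such a transformation. Forming the $(T-1) \times (T-1)$ matrix is convenient when $T$ is moderate; for long panels, a singular value decomposition of `x` would give the same leading eigenvectors.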


References

Distribution of the Estimators for Autoregressive Time Series with a Unit Root

A simple, positive semi-definite, heteroskedasticity and autocorrelation consistent covariance matrix

Statistical analysis of cointegration vectors

Testing for unit roots in heterogeneous panels

Unit root tests in panel data: asymptotic and finite-sample properties
