scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Determining the Number of Factors in Approximate Factor Models

01 Jan 2002-Econometrica (John Wiley & Sons, Ltd)-Vol. 70, Iss: 1, pp 191-221
TL;DR: In this article, the convergence rate for the factor estimates that will allow for consistent estimation of the number of factors is established, and some panel criteria are proposed to obtain the convergence rates.
Abstract: In this paper we develop some econometric theory for factor models of large dimensions. The focus is the determination of the number of factors (r), which is an unresolved issue in the rapidly growing literature on multifactor models. We first establish the convergence rate for the factor estimates that will allow for consistent estimation of r. We then propose some panel criteria and show that the number of factors can be consistently estimated using the criteria. The theory is developed under the framework of large cross-sections (N) and large time dimensions (T). No restriction is imposed on the relation between N and T. Simulations show that the proposed criteria have good finite sample properties in many configurations of the panel data encountered in practice.

Summary (3 min read)

Introduction

  • The growing number of studies focused on child and family migration has brought the phenomena of living life across international borders increased attention in recent years (Parrenas 2005; Ní Laoire et al.
  • Free movement of EU migrant workers and their families has different characteristics than asylum seeker and refugee migrations to and within the EU, in which the impacts of family separation are well documented (see for example Spicer 2008, Szilassy and Arendas 2007).
  • There is a need for more in-depth study on the ways in which children may be involved in migration decisionmaking in their families, the processes of family separation they experience and the ways in which they manage their transnational lives.
  • Children in group two experienced separation from one or both parents and/or siblings until they migrated themselves.

Migrant Worker Families in Europe

  • In 2004, in an action that paralleled the actions of Sweden, the United Kingdom and Ireland did not place restrictions on migration from countries that acceded to the EU that year.
  • Children from Poland and other European countries made up a sizable proportion of migrants to Scotland and Ireland (see, for example, the data from the Scottish Government, Pupil Census – Supplementary Data 2012) and from the national census 2011 in Ireland (Central Statistics Office – CSO 2012).
  • 88) demonstrate, however, that migrant parents with children may measure ‘success’ in the new society in terms of how well the child is doing in school, learning the new language, and making new friends, also known as Adams and Shambleau (2006.
  • What the small body of research focused on children’s experiences of intra-EU migration as part of migrant worker families suggests is that there are multiple ways in which children manage and cope with processes of intra-EU family migration (see Darmody, Tyrrell and Song 2011; Devine 2011; Ní Laoire et al.

Research Design

  • The data that are analysed and discussed in this paper were gathered as part of two studies on intra-EU migration that were conducted independently.
  • The data from the Scotland study that are discussed in this article were collected between 2008 and 2010 during fieldwork with 65 members of Polish migrant families in Scotland (41 children and 24 adults).
  • Interviews and discussions between the researcher and children took place in English2 although children often communicated in Polish with each other in group activities or when carrying out individual tasks in group contexts.
  • A distinctive feature of both projects was to pay particular attention to the views of children (Christiansen and James 2000) and children’s competence as research participants was recognised (Morrow 2008).
  • The authors recognise that this is a limitation of the study.

Migration Decision-Making in EU Migrant Worker Families

  • 5 The methods employed depended on a number of factors: children’s choice, children’s age and understanding, the research location, group or individual activity.
  • Accession to the EU has brought many employment opportunities and free movement for the people of Poland and other new EU countries.
  • This reflects the broader positioning of children within much research on family migration until recently (White et al. 2011) and the lack of acknowledgement that children may participate in making migration decisions in some circumstances (Bushin 2009).
  • A negative aspect that sometimes follows children’s lack of involvement in decision- making is that children find hard to understand the choices of their parents, especially if they are very young (Bonizzoni 2009).
  • Philip (male, age 15, Ireland study) said that his parents, particularly his mother, were keen to remain living in Ireland but they had retained a property in their home city in Poland in case his father had to return to work there.

Processes of Step-Migration and Separation in Intra-EU Migrant Worker Families

  • Much of the research in Europe on transnational families has focused on those from outside the EU (Bryceson and Vuorela, 2002).
  • Many participating children have also experienced separation at the time of their own migration, leaving grandparents, uncles, cousins and other close relatives and friends behind.
  • Vicky (female, age 11, Scotland study) wanted to come to Scotland: ‘I missed my dad [when he lived in Scotland and she lived in Poland], he was at home maybe once a year.
  • In Poland I have friends but maybe I do not have them anymore.’.
  • Children also reflected on these friendships and compared them to friendships ‘back home’.

Living Transnational Family Lives

  • For both children and parents, relatives and/or friends in Poland frequently provided an important source of emotional support through telephone/internet chats and emails.
  • Robert is very attached to his grandparents in Poland as he lived three years with them after his mother migrated for work to Scotland.
  • Maintaining transnational connections in some migrant families allows them to provide not only emotional support from a distance but also practical support, including various forms of childcare.
  • After three years, my younger son came to join me.
  • I take care of the children and bring them to school and back, do homework with them.

Conclusion

  • Migratory processes and experiences of intra-EU migrant worker families who recently migrated from Poland to Scotland and Ireland.the authors.
  • By considering family migration to be a process or set of varying processes rather than a singular event (after Halfacree and Boyle 1993), the nuances of intraEU family migration decision-making, step-migration and separation can be revealed.
  • The studies show that many migrant children seemed to function well despite periods of separation from parents and/or other family members.
  • The authors studies confirm White’s (2011) argument that emotional reasons can prevail over apparent economic rationality.

Did you find this useful? Give us your feedback

Content maybe subject to copyright    Report

DETERMINING THE NUMBER OF FACTORS IN APPROXIMATE
FACTOR MODELS
Jushan Bai
Serena Ng
Department of Economics
Boston College
Chestnut Hill
MA 02467
December 2000
Abstract
In this paper we develop some econometric theory for factor models of large di-
mensions. The focus is the determination of the number of factors (r), which is an
unresolved issue in the rapidly growing literature on multifactor models. We first
establish the convergence rate for the factor estimates that will allow for consistent
estimation of r. We then propose some panel
C
p
criteria and show that the number of
factors can be consistently estimated using the criteria. The theory is developed under
the framework of large cross-sections (N
) and large time dimensions (
T ). No restric-
tion is imposed on the relation between N and
T . Simulations show that the proposed
criteria have good finite sample properties in many configurations of the panel data
encountered in practice.
JEL Classification: C13, C33, C43
Keywords: Factor analysis, asset pricing, principal components, model selection.
Email: Jushan.Bai@bc.edu Phone: 617-552-3689
Email: Serena.Ng@bc.edu Phone: 617-552-2182
We thank two referees for their very constructive comments, which led to a much improved presentation. The
first author acknowledges financial support from the National Science Foundation under grant SBR-9709508.
We would like to thank participants in the econometrics seminars at Harvard-MIT, Cornell University, the
University of Rochester, and the University of Pennsylvania for help suggestions and comments. Remaining
errors are our own.

1 Introduction
The idea that variations in a large number of economic variables can b e modeled by a
small number of reference variables is appealing and is used in many economic analysis.
For example, asset returns are often modeled as a function of a small number of factors.
Stock and Watson (1989) used one reference variable to model the comovements of four
main macroeconomic aggregates. Cross-country variations are also found to have common
components, see Gregory and Head (1999) and Forni, Hallin, Lippi and Reichlin (2000).
More recently, Sto ck and Watson (1999) showed that the forecast mean squared error of a
large number of macroeconomic variables can be reduced by including diffusion indexes, or
factors, in structural as well as non-structural forecasting models. In demand analysis, engel
curves can be expressed in terms of a finite numb er of factors. Lewbel (1991) showed that if
a demand system has one common factor, budget shares should be independent of the level
of income. In such a case, the number of factors is an object of economic interest since if
more than one factor is found, homothetic preferences can be rejected. Factor analysis also
provides a convenient way to study the aggregate implications of microeconomic behavior,
as shown in Forni and Lippi (1997).
Central to both the theoretical and the empirical validity of factor models is the correct
specification of the number of factors. To date, this crucial parameter is often assumed rather
than determined by the data.
1
This paper develops a formal statistical procedure that can
consistently estimate the number of factors from observed data. We demonstrate that the
penalty for overfitting must be a function of both N
and T
(the cross-section dimension and
the time dimension, respectively) in order to consistently estimate the number of factors.
Consequently the usual AIC and BIC which are functions of
N
or
T
alone do not work when
the both dimensions of the panel are large. Our theory is developed under the assumption
that both
N and
T
converge to infinity. This flexibility is of empirical relevance because the
time dimension of datasets relevant to factor analysis, although small relative to the cross
section dimension, is too large to justify the assumption of a fixed
T .
A small number of papers in the literature have also considered the problem of deter-
mining the number of factors, but the present analysis differs from these works in important
ways. Lewbel (1991) and Donald (1997) used the rank of a matrix to test for the num-
ber of factors, but these theories assume either N
or T
is fixed. Cragg and Donald (1997)
1
Lehmann and Modest (1988), for example, tested the APT for 5, 10 and 15 factors. Stock and Watson
(1989) assumed there is one factor underlying the coincident index. Ghysels and Ng (1998) tested the affine
term structure model assuming two factors.
1

considered the use of information criterion when the factors are functions of a set of observ-
able explanatory variables, but the data still have a fixed dimension. For large dimensional
panels, Connor and Korajczyk (1993) developed a test for the number of factors in asset
returns, but their test is derived under sequential limit asymptotics, i.e., N
converges to
infinity with a fixed
T
and then
T
converges to infinity. Furthermore, because their test is
based on a comparison of variances over different time periods, covariance stationarity and
homoskedasticity are not only technical assumptions, but are crucial for the validity of their
test. Under the assumption that
N
for fixed
T , Forni and Reichlin (1998) suggested a
graphical approach to identify the number of factors, but no theory is available. Assuming
N, T with
N/T
, Stock and Watson (1998) showed that a modification to
the BIC can be used to select the number of factors optimal for forecasting a single series.
Their criterion is restrictive not only because it requires N >> T , but also because there
can be factors that are pervasive for a set of data and yet have no predictive ability for an
individual data series. Thus, their rule may not be appropriate outside of the forecasting
framework. Forni, Hallin, Lippi and Reichlin (1999) suggested a multivariate variant of the
AIC but neither the theoretical nor the empirical properties of the criterion are known.
We set up the determination of factors as a model selection problem. In consequence, the
proposed criteria depend on the usual trade-off between good fit and parsimony. However,
the problem is non-standard not only because account needs to be taken of the sample size
in both the cross section and the time series dimensions, but also because the factors are
not observed. The theory we developed does not rely on sequential limit, nor does it impose
any restrictions between
N and T
. The results hold under heteroskedasticity in both the
time and the cross-section dimensions. The results also hold under weak serial dependence
and cross-section dependence. Simulations show that the criteria have good finite sample
properties.
The rest of the paper is organized as follows. Section 2 sets up the preliminaries and
introduces notation and assumptions. Estimation of the factors is considered in Section 3
and the estimation of the number of factors is studied in Section 4. Specific criteria are
considered in Section 5 and their finite sample prop erties are considered in Section 6, along
with an empirical application to asset returns. Concluding remarks are provided in Section
7. All the proofs are given in the Appendix.
2

2 Factor Models
Let X
it
be the observed data for the i
th
cross section unit at time
t
, for
i
= 1
, . . . N, and
t
= 1, . . . T . Consider the following model
X
it
= λ
0
i
F
t
+ e
it
, (1)
where F
t
is a vector of common factors, λ
i
is a vector of factor loadings associated with
F
t
, and
e
it
is the idiosyncratic component of X
it
. The product λ
0
i
F
t
is called the common
component of
X
it
. Equation (1) is then the factor representation of the data. Note that the
factors, their loadings, as well as the idiosyncratic errors are not observable.
Factor analysis allows for dimension reduction and is a useful statistical tool. Many
economic analyses fit naturally into the framework given by (1).
1.
Arbitrage pricing theory. In the finance literature, the arbitrage pricing theory (APT)
of Ross (1976) assumes that a small number of factors can be used to explain a large numb er
of asset returns. In this case, X
it
represents the return of asset i at time
t
, F
t
represents
the vector of factor returns and e
it
is the idiosyncratic component of returns. Although
analytical convenience makes it appealing to assume one factor, there is growing evidence
against the adequacy of a single factor in explaining asset returns.
2
The shifting interest
towards use of multifactor models inevitably calls for a formal procedure to determine the
number of factors. The analysis to follow allows the number of factors to be determined
even when
N and T are both large. This is especially suited for financial applications when
data are widely available for a large number of assets over an increasingly long span. Once
the number of factors is determined, the factor returns
F
t
can also be consistently estimated
(up to a invertible transformation).
2. The rank of a demand system
. Let p be a price vector for J
goods and services, e
h
be
total spending on the
J
goods by household h
. Consumer theory postulates that Marshallian
demand for good j by consumer
h
is
X
jh
= g
j
(p, e
h
). Let w
jh
= X
jh
/e
h
be the budget share
for household h on the
j
th
good. The rank of a demand system holding prices fixed is the
smallest integer r such that w
j
(e) = λ
j1
G
1
(e
) + . . . λ
jr
G
r
(
e
). Demand systems are of the
form (1) where the r factors, common across goods, are
F
h
= [G
1
(
e
h
) . . . G
r
(e
h
)]
0
. When
the number of households, H
, converges to infinity with a fixed J
,
G
1
(
e) . . . G
r
(
e) can be
estimated simultaneously, such as by non-parametric methods developed in Donald (1997).
2
Cochrane (1999) stressed that financial economists now recognize that there are multiple sources of risk,
or factors, that give rise to high returns. Backus, Forsei, Mozumdar and Wu (1997) made similar conclusions
in the context of the market for foreign assets.
3

Their approach will not work when the number of goods, J
, also converges to infinity. How-
ever, when
J is large, the theory developed in this paper still provides a consistent estimation
of the rank of the demand system and without the need for nonparametric estimation of the
G(
·) functions. This flexibility can be useful since some datasets have detailed information
on a large number of consumption goods. Once the rank of the demand system is deter-
mined, the nonparametric functions evaluated at
e
h
allows F
h
to be consistently estimable
(up to a transformation). Then functions G
1
(
e) . . . G
r
(e
) may then be recovered (also up to
a matrix transformation) from
b
F
h
(h = 1
, .., H
) via nonparametric estimation.
3. Forecasting with diffusion indices. Stock and Watson (1999) considered forecasting
inflation with diffusion indices (“factors”) constructed from a large number of macroeconomic
series. The underlying premise is that the movement of a large number of macroeconomic
series may be driven by a small number of unobservable factors. Consider the forecasting
equation for a scalar series
y
t+1
= α
0
F
t
+
β
0
W
t
+
t
.
The variables W
t
are observable. Although we do not observe F
t
, we observe X
it
,
i = 1
, . . . N
.
Suppose
X
it
bears relation with
F
t
as in (1). In the present context, we interpret (1) as
the reduced-form representation of
X
it
in terms of the unobservable factors. We can first
estimate F
t
from (1). Denote it by
b
F
t
. We can then regress y
t
on
b
F
t
1
and W
t
1
to obtain
the coefficients
b
α
and
b
β, from which a forecast
b
y
T +1|T
=
b
α
0
b
F
T
+
b
βW
T
can be formed. Stock and Watson (1998, 1999) showed that this approach of forecasting
outperforms many competing forecasting methods. But as p ointed out earlier, the dimension
of
F in Stock and Watson (1998) was determined using a criterion that minimizes the mean
squared forecast errors of
y. This may not be the same as the number of factors underlying
X
it
, which is the focus of this paper.
2.1 Notation and Preliminaries
Let F
0
t
,
λ
0
i
and r
denote the true common factors, the factor loadings, and the true number
of factors, respectively. Note that
F
0
t
is r dimensional. We assume that r does not depend
on
N. At a given
t
, we have
X
t
= Λ
0
F
0
t
+ e
t
.
(N × 1) (N ×
r) (
r ×1) (
N × 1)
(2)
4

Citations
More filters
Journal ArticleDOI
TL;DR: In this paper, a simple alternative test where the standard unit root regressions are augmented with the cross section averages of lagged levels and first-differences of the individual series is also considered.
Abstract: A number of panel unit root tests that allow for cross section dependence have been proposed in the literature, notably by Bai and Ng (2002), Moon and Perron (2003), and Phillips and Sul (2002) who use orthogonalization type procedures to asymptotically eliminate the cross dependence of the series before standard panel unit root tests are applied to the transformed series. In this paper we propose a simple alternative test where the standard DF (or ADF) regressions are augmented with the cross section averages of lagged levels and first-differences of the individual series. A truncated version of the CADF statistics is also considered. New asymptotic results are obtained both for the individual CADF statistics, and their simple averages. It is shown that the CADF_i statistics are asymptotically similar and do not depend on the factor loadings under joint asymptotics where N (cross section dimension) and T (time series dimension) tends to infinity, such that N/T tends to k, where k is a fixed finite non-zero constant. But they are asymptotically correlated due to their dependence on the common factor. Despite this it is shown that the limit distribution of the average CADF statistic exists and its critical values are tabulated. The small sample properties of the proposed tests are investigated by Monte Carlo experiments, for a variety of models. It is shown that the cross sectionally augmented panel unit root tests have satisfactory size and power even for relatively small values of N and T. This is particularly true of cross sectionally augmented and truncated versions of the simple average t-test of Im, Pesaran and Shin, and Choi's inverse normal combination test.

6,169 citations

Journal ArticleDOI
TL;DR: In this paper, a simple alternative where the standard ADF regressions are augmented with the cross section averages of lagged levels and first-differences of the individual series is proposed, and it is shown that the individual CADF statistics are asymptotically similar and do not depend on the factor loadings.
Abstract: A number of panel unit root tests that allow for cross section dependence have been proposed in the literature that use orthogonalization type procedures to asymptotically eliminate the cross dependence of the series before standard panel unit root tests are applied to the transformed series. In this paper we propose a simple alternative where the standard ADF regressions are augmented with the cross section averages of lagged levels and first-differences of the individual series. New asymptotic results are obtained both for the individual CADF statistics, and their simple averages. It is shown that the individual CADF statistics are asymptotically similar and do not depend on the factor loadings. The limit distribution of the average CADF statistic is shown to exist and its critical values are tabulated. Small sample properties of the proposed test are investigated by Monte Carlo experiments. The proposed test is applied to a panel of 17 OECD real exchange rate series as well as to log real earnings of households in the PSID data.

6,022 citations

Journal ArticleDOI
Peter Pedroni1
TL;DR: This paper examined properties of residual-based tests for the null of no cointegration for dynamic panels in which both the short-run dynamics and the long-run slope coefficients are permitted to be heterogeneous across individual members of the panel.
Abstract: We examine properties of residual-based tests for the null of no cointegration for dynamic panels in which both the short-run dynamics and the long-run slope coefficients are permitted to be heterogeneous across individual members of the panel. The tests also allow for individual heterogeneous fixed effects and trend terms, and we consider both pooled within dimension tests and group mean between dimension tests. We derive limiting distributions for these and show that they are normal and free of nuisance parameters. We also provide Monte Carlo evidence to demonstrate their small sample size and power performance, and we illustrate their use in testing purchasing power parity for the post–Bretton Woods period.I thank Rich Clarida, Bob Cumby, Mahmoud El-Gamal, Heejoon Kang, Chiwha Kao, Andy Levin, Klaus Neusser, Masao Ogaki, David Papell, Pierre Perron, Abdel Senhadji, Jean-Pierre Urbain, Alan Taylor, and three anonymous referees for helpful comments on various earlier versions of this paper. The paper has also benefited from presentations at the 1994 North American Econometric Society Summer Meetings in Quebec City, the 1994 European Econometric Society Summer Meetings in Maastricht, and workshop seminars at the Board of Governors of the Federal Reserve, INSEE-CREST Paris, IUPUI, Ohio State, Purdue, Queens University Belfast, Rice University–University of Houston, and Southern Methodist University. Finally, I thank the following students who provided assistance in the earlier stages of the project: Younghan Kim, Rasmus Ruffer, and Lining Wan.

4,189 citations

Posted Content
TL;DR: In this article, the authors proposed a new approach to estimation and inference in panel data models with a multifactor error structure where the unobserved common factors are correlated with exogenously given individual-specific regressors, and the factor loadings differ over the cross-section units.
Abstract: This paper presents a new approach to estimation and inference in panel data models with a multifactor error structure where the unobserved common factors are (possibly) correlated with exogenously given individual-specific regressors, and the factor loadings differ over the cross section units. The basic idea behind the proposed estimation procedure is to filter the individual-specific regressors by means of (weighted) cross-section aggregates such that asymptotically as the cross-section dimension (N) tends to infinity the differential effects of unobserved common factors are eliminated. The estimation procedure has the advantage that it can be computed by OLS applied to an auxiliary regression where the observed regressors are augmented by (weighted) cross sectional averages of the dependent variable and the individual specific regressors. Two different but related problems are addressed: one that concerns the coefficients of the individual-specific regressors, and the other that focusses on the mean of the individual coefficients assumed random. In both cases appropriate estimators, referred to as common correlated effects (CCE) estimators, are proposed and their asymptotic distribution as N with T (the time-series dimension) fixed or as N and T (jointly) are derived under different regularity conditions. One important feature of the proposed CCE mean group (CCEMG) estimator is its invariance to the (unknown but fixed) number of unobserved common factors as N and T (jointly). The small sample properties of the various pooled estimators are investigated by Monte Carlo experiments that confirm the theoretical derivations and show that the pooled estimators have generally satisfactory small sample properties even for relatively small values of N and T.

3,170 citations

Journal ArticleDOI
TL;DR: In this article, a new approach to estimation and inference in panel data models with a general multifactor error structure is presented, where the unobserved factors and the individual-specific errors are allowed to follow arbitrary stationary processes, and the number of unobserved factors need not be estimated.
Abstract: This paper presents a new approach to estimation and inference in panel data models with a general multifactor error structure. The unobserved factors and the individual-specific errors are allowed to follow arbitrary stationary processes, and the number of unobserved factors need not be estimated. The basic idea is to filter the individual-specific regressors by means of cross-section averages such that asymptotically as the cross-section dimension (N) tends to infinity, the differential effects of unobserved common factors are eliminated. The estimation procedure has the advantage that it can be computed by least squares applied to auxiliary regressions where the observed regressors are augmented with cross-sectional averages of the dependent variable and the individual-specific regressors. A number of estimators (referred to as common correlated effects (CCE) estimators) are proposed and their asymptotic distributions are derived. The small sample properties of mean group and pooled CCE estimators are investigated by Monte Carlo experiments, showing that the CCE estimators have satisfactory small sample properties even under a substantial degree of heterogeneity and dynamics, and for relatively small values of N and T.

2,906 citations

References
More filters
Book
14 Sep 1984
TL;DR: In this article, the distribution of the Mean Vector and the Covariance Matrix and the Generalized T2-Statistic is analyzed. But the distribution is not shown to be independent of sets of Variates.
Abstract: Preface to the Third Edition.Preface to the Second Edition.Preface to the First Edition.1. Introduction.2. The Multivariate Normal Distribution.3. Estimation of the Mean Vector and the Covariance Matrix.4. The Distributions and Uses of Sample Correlation Coefficients.5. The Generalized T2-Statistic.6. Classification of Observations.7. The Distribution of the Sample Covariance Matrix and the Sample Generalized Variance.8. Testing the General Linear Hypothesis: Multivariate Analysis of Variance9. Testing Independence of Sets of Variates.10. Testing Hypotheses of Equality of Covariance Matrices and Equality of Mean Vectors and Covariance Matrices.11. Principal Components.12. Cononical Correlations and Cononical Variables.13. The Distributions of Characteristic Roots and Vectors.14. Factor Analysis.15. Pattern of Dependence Graphical Models.Appendix A: Matrix Theory.Appendix B: Tables.References.Index.

9,693 citations

Book
01 Jan 1997
TL;DR: In this paper, Campbell, Lo, and MacKinlay present an attempt by three well-known and well-respected scholars to fill an acknowledged void in the empirical finance literature, a text covering the burgeoning field of empirical finance.
Abstract: This book is an ambitious effort by three well-known and well-respected scholars to fill an acknowledged void in the literature—a text covering the burgeoning field of empirical finance. As the authors note in the preface, there are several excellent books covering financial theory at a level suitable for a Ph.D. class or as a reference for academics and practitioners, but there is little or nothing similar that covers econometric methods and applications. Perhaps the closest existing text is the recent addition to the Wiley Series in Financial and Quantitative Analysis. written by Cuthbertson (1996). The major difference between the books is that Cuthbertson focuses exclusively on asset pricing in the stock, bond, and foreign exchange markets, whereas Campbell, Lo, and MacKinlay (henceforth CLM) consider empirical applications throughout the field of finance, including corporate finance, derivatives markets, and market microstructure. The level of anticipation preceding publication can be partly measured by the fact that at least three reviews (including this one) have appeared since the book arrived. Moreover, in their reviews, both Harvey (1998) and Tiso (1998) comment on the need for such a text, a sentiment that has been echoed by numerous finance academics.

7,169 citations

Journal ArticleDOI
TL;DR: Ebsco as mentioned in this paper examines the arbitrage model of capital asset pricing as an alternative to the mean variance pricing model introduced by Sharpe, Lintner and Treynor.

6,763 citations

Journal Article

1,913 citations

Journal ArticleDOI
TL;DR: In this article, a generalized dynamic factor model with infinite dynamics and nonorthogonal idiosyncratic components is proposed, which generalizes the static approximate factor model of Chamberlain and Rothschild (1983), as well as the exact factor model a la Sargent and Sims (1977).
Abstract: This paper proposes a factor model with infinite dynamics and nonorthogonal idiosyncratic components. The model, which we call the generalized dynamic-factor model, is novel to the literature and generalizes the static approximate factor model of Chamberlain and Rothschild (1983), as well as the exact factor model a la Sargent and Sims (1977). We provide identification conditions, propose an estimator of the common components, prove convergence as both time and cross-sectional size go to infinity at appropriate rates, and present simulation results. We use our model to construct a coincident index for the European Union. Such index is defined as the common component of real GDP within a model including several macroeconomic variables for each European country.

1,832 citations

Frequently Asked Questions (14)
Q1. What are the contributions mentioned in the paper "Determining the number of factors in approximate factor models" ?

In this paper the authors develop some econometric theory for factor models of large dimensions. The authors then propose some panel Cp criteria and show that the number of factors can be consistently estimated using the criteria. 

Many issues in factor analysis await further research. But using Theorem 1, it maybe possible to obtain these limiting distributions. It can be shown that ŷT+1|T is not only a consistent but a√ T consistent estimator of yT+1, conditional on the information up to time T ( provided that N is of no smaller order of magnitude than T ). Stock and Watson ( 1998 ) suggest how dynamics can be introduced into factor models when both N and T are large, although their empirical applications assume a static factor structure. 

The drawback of the approach is that, because the number of parameters increases with N ,3 computational difficulties make it necessary to abandoninformation on many series even though they are available. 

because their test is based on a comparison of variances over different time periods, covariance stationarity and homoskedasticity are not only technical assumptions, but are crucial for the validity of their test. 

The main advantage of these three panel information criteria (ICp) is that they do not depend on the choice of kmax through σ̂2, which could be desirable in practice. 

For large dimensional panels, Connor and Korajczyk (1993) developed a test for the number of factors in asset returns, but their test is derived under sequential limit asymptotics, i.e., N converges to infinity with a fixed T and then T converges to infinity. 

A likelihood ratio test can also, in theory, be used to select the number of factors if, in addition, normality of et is assumed. 

More recently, Stock and Watson (1999) showed that the forecast mean squared error of a large number of macroeconomic variables can be reduced by including diffusion indexes, or factors, in structural as well as non-structural forecasting models. 

Assuming N, T → ∞ with √ N/T → ∞, Stock and Watson (1998) showed that a modification tothe BIC can be used to select the number of factors optimal for forecasting a single series. 

In this case, Xit represents the return of asset i at time t, Ft represents the vector of factor returns and eit is the idiosyncratic component of returns. 

However the proof of Theorem 2 mainly uses the fact that ̂Ft satisfies Theorem 1, and does not rely on the principle components per se. 

Stock and Watson (1998) suggest how dynamics can be introduced into factor models when both N and T are large, although their empirical applications assume a static factor structure. 

when J is large, the theory developed in this paper still provides a consistent estimation of the rank of the demand system and without the need for nonparametric estimation of theG(·) functions. 

The shifting interesttowards use of multifactor models inevitably calls for a formal procedure to determine the number of factors.