scispace - formally typeset
Open AccessJournal ArticleDOI

A Comprehensive Look at the Empirical Performance of Equity Premium Prediction

Ivo Welch, +1 more
- 01 Jul 2008 - 
- Vol. 21, Iss: 4, pp 1455-1508
TLDR
The authors comprehensively reexamine the performance of variables that have been suggested by the academic literature to be good predictors of the equity premium and find that by and large, these models have predicted poorly both in-sample and out-of-sample (OOS) for 30 years now.
Abstract
Our article comprehensively reexamines the performance of variables that have been suggested by the academic literature to be good predictors of the equity premium. We find that by and large, these models have predicted poorly both in-sample (IS) and out-of-sample (OOS) for 30 years now; these models seem unstable, as diagnosed by their out-of-sample predictions and other statistics; and these models would not have helped an investor with access only to available information to profitably time the market.

read more

Content maybe subject to copyright    Report

NBER WORKING PAPER SERIES
A COMPREHENSIVE LOOK AT THE EMPIRICAL PERFORMANCE
OF EQUITY PREMIUM PREDICTION
Amit Goval
Ivo Welch
Working Paper 10483
http://www.nber.org/papers/w10483
NATIONAL BUREAU OF ECONOMIC RESEARCH
1050 Massachusetts Avenue
Cambridge, MA 02138
May 2004
Thanks to Malcolm Baker, Ray Ball, Francis Diebold, Owen Lamont, Sydney Ludvigson, Jeff Wurgler, and
Yihong Xia for comments; and Todd Clark for providing us with some critical McCraken values. The views
expressed herein are those of the author(s) and not necessarily those of the National Bureau of Economic
Research.
©2004 by Amit Goval and Ivo Welch. All rights reserved. Short sections of text, not to exceed two
paragraphs, may be quoted without explicit permission provided that full credit, including © notice, is given
to the source.

A Comprehensive Look at the Empirical Performance of Equity Premium Prediction
Amit Goval and Ivo Welch
NBER Working Paper No. 10483
May 2004
JEL No. G12, G14
ABSTRACT
Given the historically high equity premium, is it now a good time to invest in the stock market?
Economists have suggested a whole range of variables that investors could or should use to predict:
dividend price ratios, dividend yields, earnings-price ratios, dividend payout ratios, net issuing
ratios, book-market ratios, interest rates (in various guises), and consumption-based macroeconomic
ratios (cay). The typical paper reports that the variable predicted well in an *in-sample* regression,
implying forecasting ability.
Our paper explores the *out-of-sample* performance of these variables, and finds that not a single
one would have helped a real-world investor outpredicting the then-prevailing historical equity
premium mean. Most would have outright hurt. Therefore, we find that, for all practical purposes,
the equity premium has not been predictable, and any belief about whether the stock market is now
too high or too low has to be based on theoretical prior, not on the empirically variables we have
explored.
Amit Goval
Goizueta Business School
Emory University
Ivo Welch
Yale University
School of Management
46 Hillhouse Ave.
New Haven, CT 06520-8200
and NBER
ivo.welch@yale.edu

1 Introduction
Attempts to predict stock market returns or the equity premium have a long tradition
in finance. For example, as early as 1920, Dow (1920) explored the role of dividend
ratios. Nowadays, a typical specification regresses an independent lagged predictor on
the stock market rate of return or, as we shall do, on the equity premium,
Rm(t) Rf(t) = γ
0
+ γ
1
·
[
x(t 1)
]
+ (t) . (1)
γ
1
is interpreted as a measure of how significant x is in predicting the equity premium.
The most prominent x variables explored in the literature are
The dividend-price ratio and the dividend yield: Ball (1978), Rozeff (1984), Shiller (1984),
Campbell (1987), Campbell and Shiller (1988), Campbell and Shiller (1989), Fama
and French (1988a), Hodrick (1992), Campbell and Viceira (2002), Campbell and
Yogo (2003), Lewellen (2004), and Menzly, Santos, and Veronesi (2004). Cochrane
(1997) surveys the dividend ratio prediction literature.
The earnings price ratio and dividend-earnings (payout) ratio: Lamont (1998).
The interest and inflation rates: The short term interest rate: Campbell (1987) and Ho-
drick (1992). The term spread and the default spread: Avramov (2002), Campbell
(1987), Fama and French (1989), and Keim and Stambaugh (1986). The inflation
rate: Campbell and Vuolteenaho (2003), Fama (1981), Fama and Schwert (1977),
and Lintner (1975). Some papers explore multiple interest rate related variables,
as well as dividend related variables (e.g., Ang and Bekaert (2003)).
The book-to-market ratio: Kothari and Shanken (1997) and Pontiff and Schall (1998).
The consumption, wealth, and income ratio: Lettau and Ludvigson (2001).
The aggregate net issuing activity: Baker and Wurgler (2000).
In turn, a large theoretical and normative literature has developed that stipulates
how investors should allocate their wealth as a function of state variables—and promi-
nently the just-mentioned variables.
1

Our own paper is intentionally simple: as in Goyal and Welch (2003), we posit that
a real-world investor would not have had access to any ex-post information, either to
construct variables or to the entire-sample gamma regression coefficients. An investor
would have had to estimate the prediction equation only with data available strictly
before or at the prediction point, and then make an out-of-sample prediction. There-
fore, instead of running one single in-sample regression and comparing the fitted to the
actual value (or, equivalently, compute the R
2
or F-statistic), we must run rolling fore-
casting regressions and compare the performance of the regression predictions against
the equivalent predictions from simply projecting the then-prevailing historical equity
premium mean. Unlike Goyal and Welch (2003), our current paper expands the set of
variables and horizons to be comprehensive. We are interested in how well any of the
popular variables, which were proposed in existing literature as important in-sample
predictors of the equity premium, hold up out-of-sample.
1
Our paper not only tries out different time horizons and forecasting periods, but also
diagnoses when these variables were of help, using a graphical diagnostic first proposed
in Goyal and Welch (2003). We are also interested in the contradictory results in the
literature: different papers have identified different methods/variables to be important.
Our paper shows that many of the differences can be traced back to choices of sample
period and data frequency: these are not innocuous, but often the primary driver for
the significance of in-sample results.
Altogether, we find our evidence sobering: we could not identify a single variable
that would have been of solid and robust use to a real-world investor (who did not
have access to ex-post information). Our diagnostic shows that any presumed equity
premium forecasting ability was a mirage. Even before the often-considered anoma-
lous 1990s, many of these variables had little if any statistical forecasting power. It is
also usually not a matter of arguing over whether we computed correct statistical stan-
dard errors. Instead, most variables are just worse than the prevailing historical equity
premium average as a predictor, and some even economically significantly so.
1
Goyal and Welch (2003) was not the first paper to explore out-of-sample prediction. There are three
earlier/contemporaneous attempts we are aware of: First, Fama and French (1988a) interpreted out-
of-sample performance to be a success, primarily due to a fortunate sample period. Second, Pesaran
and Timmerman (1995) explore model selection in great detail, exploring dividend-yield, earnings-price
ratios, interest rates, and money in 2
9
= 512 model variations. Their data series is monthly, from
1954–1992. They conclude that investors could have succeeded, especially in the volatile periods of the
1970s. They do not entertain the historical equity premium mean as a null hypothesis. Third, like Goyal
and Welch (2003), Bossaerts and Hillion (1999) interpreted out-of-sample performance to be a failure.
However, Bossaerts and Hillion (1999) relied more on a large cross-section (14 countries) than on a long
out-of-sample time period (1990–1995).
Goyal and Welch (2003) was also not first to critique predictive regressions. In particular, the use
of dividend ratios has been critiqued in many other papers (see, e.g., Goetzmann and Jorion (1993)
and Ang and Bekaert (2003); apologies to everyone whose paper we omit to cite here—the literature is
voluminous).
2

Overall, the performance of these variables is worse than what we would have ex-
pected: given the data snooping of many researchers looking for variables that predict
stock prices, and given that our out-of-sample regressions often rely on the very same
data points that were used to establish the significance of the in-sample regression, so
we are not really conducting a true out-of-sample test—we would have expected at least
about equal performance. But instead, for example, of 51 predictive regressions on an-
nual frequencies, 46 (!) underperformed the prevailing mean on a the RMSE criterion.
As for the rare regression exceptions in which a variable outpredicts the mean, none are
robust across time-specifications and/or data periodicity, few reach statistical signifi-
cance, and none reaches good economic significance, i.e., surpassing even very modest
transaction costs. (The average annual outperformance is 12 basis points.)
In sum, despite good in-sample predictive ability for many of these variables, most
had consistently poor or zero out-of-sample forecasting ability. (They were essentially
noise.) Thus, our paper concludes that the evidence that the equity premium has ever
varied predictably with both prevailing variables and prevailing regression specifica-
tions has always been tenuous: a market-timing trader could not have taken advantage
of these variables to outperform the prevailing moving average—and could/should have
known this. By assuming that the equity premium was “like it always has been,” this
trader would have performed at least as well.
Before we proceed, we wish to point out what our paper does not do: it has nothing
to say about cross-sectional evidence, i.e., whether these variables can predict which
stocks do better than other stocks. It has little to say about models which assume
that agents know all parameters—if the relations are assumed to be known, then out-
of-sample estimates are not required. We are more interested in whether Amit Goyal
and Ivo Welch—agents without full model parameters—should rely on these variables
to time the market.
2 Data
In this section, we describe our data sources and data construction. First, the dependent
variable, the equity premium:
Stock Prices: S&P 500 index monthly prices from 1871 to 1926 are from Robert
Shiller’s website. These are monthly averages for the month. Prices from 1926
to 2003 are from CRSP’s month-end values. Stock Returns are the continuously
compunded returns on the S&P 500 index.
Risk-free Rate: The risk-free rate for the period 1920 to 2003 is the T-bill rate.
Because there was no risk-free short-term debt prior to the 1920’s, we had to
3

Figures
Citations
More filters
Journal ArticleDOI

The Cross-section of Expected Stock Returns

TL;DR: In this paper, the cross-sectional properties of return forecasts derived from Fama-MacBeth regressions were studied, and the authors found that the forecasts vary substantially across stocks and have strong predictive power for actual returns.
Journal ArticleDOI

Predicting Excess Stock Returns Out of Sample: Can Anything Beat the Historical Average?

TL;DR: Goyal and Welch as mentioned in this paper showed that many predictive regressions beat the historical average return, once weak restrictions are imposed on the signs of coefficients and return forecasts, and that the implied predictability of returns is substantial at longer horizons.
Journal ArticleDOI

To Explain or to Predict

TL;DR: The distinction between explanatory and predictive models is discussed in this paper, and the practical implications of the distinction to each step in the model- ing process are discussed as well as a discussion of the differences that arise in the process of modeling for an explanatory ver- sus a predictive goal.
Journal ArticleDOI

Presidential Address: Discount Rates

TL;DR: Discount-rate variation is the central organizing question of current asset-pricing research as discussed by the authors, and a survey of discount-rate theories and applications can be found in the survey.
Journal ArticleDOI

Expected Stock Returns and Variance Risk Premia

TL;DR: This article found that the difference between implied and realized variances, or the variance risk premium, is able to explain more than fifteen percent of the ex-post time series variation in quarterly excess returns on the market portfolio over the 1990 to 2005 sample period, with high premia predicting high (low) future returns.
References
More filters
Posted Content

Comparing Predictive Accuracy

TL;DR: The authors describes the advantages of these studies and suggests how they can be improved and also provides aids in judging the validity of inferences they draw, such as multiple treatment and comparison groups and multiple pre- or post-intervention observations.
ReportDOI

Comparing Predictive Accuracy

TL;DR: In this article, explicit tests of the null hypothesis of no difference in the accuracy of two competing forecasts are proposed and evaluated, and asymptotic and exact finite-sample tests are proposed, evaluated and illustrated.
Journal ArticleDOI

Business conditions and expected returns on stocks and bonds

TL;DR: For example, this paper found that expected returns on common stocks and long-term bonds contain a term or maturity premium that has a clear business-cycle pattern (low near peaks, high near troughs).
Journal ArticleDOI

Dividend yields and expected stock returns

TL;DR: In this article, the power of dividend yields to forecast stock returns, measured by regression R2, increases with the return horizon, and the authors offer a two-part explanation: high autocorrelation causes the variance of expected returns to grow faster than the return-horizon.
Posted Content

The Dividend-Price Ratio and Expectations of Future Dividends and Discount Factors

TL;DR: In this article, a linearization of a rational expectations present value model for corporate stock prices produces a simple relation between the log dividend-price ratio and mathematical expectations of future log real dividend changes and future real discount rates.
Related Papers (5)
Frequently Asked Questions (1)
Q1. What are the contributions mentioned in the paper "Nber working paper series a comprehensive look at the empirical performance of equity premium prediction" ?

Goyal et al. this paper showed that a real-world investor would not have had access to any ex-post information, either to construct variables or to the entire-sample gamma regression coefficients.