(Open Access) A Comprehensive Look at the Empirical Performance of Equity Premium Prediction (2008) | Ivo Welch

Q: What are the contributions mentioned in the paper "Nber working paper series a comprehensive look at the empirical performance of equity premium prediction" ?

Goyal et al. this paper showed that a real-world investor would not have had access to any ex-post information, either to construct variables or to the entire-sample gamma regression coefficients.

NBER WORKING PAPER SERIES

A COMPREHENSIVE LOOK AT THE EMPIRICAL PERFORMANCE

OF EQUITY PREMIUM PREDICTION

Amit Goval

Ivo Welch

Working Paper 10483

http://www.nber.org/papers/w10483

NATIONAL BUREAU OF ECONOMIC RESEARCH

1050 Massachusetts Avenue

Cambridge, MA 02138

May 2004

Thanks to Malcolm Baker, Ray Ball, Francis Diebold, Owen Lamont, Sydney Ludvigson, Jeff Wurgler, and

Yihong Xia for comments; and Todd Clark for providing us with some critical McCraken values. The views

expressed herein are those of the author(s) and not necessarily those of the National Bureau of Economic

Research.

paragraphs, may be quoted without explicit permission provided that full credit, including © notice, is given

to the source.

A Comprehensive Look at the Empirical Performance of Equity Premium Prediction

Amit Goval and Ivo Welch

NBER Working Paper No. 10483

May 2004

JEL No. G12, G14

ABSTRACT

Given the historically high equity premium, is it now a good time to invest in the stock market?

Economists have suggested a whole range of variables that investors could or should use to predict:

dividend price ratios, dividend yields, earnings-price ratios, dividend payout ratios, net issuing

ratios, book-market ratios, interest rates (in various guises), and consumption-based macroeconomic

ratios (cay). The typical paper reports that the variable predicted well in an *in-sample* regression,

implying forecasting ability.

Our paper explores the *out-of-sample* performance of these variables, and finds that not a single

one would have helped a real-world investor outpredicting the then-prevailing historical equity

premium mean. Most would have outright hurt. Therefore, we find that, for all practical purposes,

the equity premium has not been predictable, and any belief about whether the stock market is now

too high or too low has to be based on theoretical prior, not on the empirically variables we have

explored.

Amit Goval

Goizueta Business School

Emory University

Ivo Welch

Yale University

School of Management

46 Hillhouse Ave.

New Haven, CT 06520-8200

and NBER

ivo.welch@yale.edu

1 Introduction

Attempts to predict stock market returns or the equity premium have a long tradition

in ﬁnance. For example, as early as 1920, Dow (1920) explored the role of dividend

ratios. Nowadays, a typical speciﬁcation regresses an independent lagged predictor on

the stock market rate of return or, as we shall do, on the equity premium,

Rm(t) − Rf(t) = γ

+ γ

[

x(t − 1)

]

+ (t) . (1)

is interpreted as a measure of how signiﬁcant x is in predicting the equity premium.

The most prominent x variables explored in the literature are

The dividend-price ratio and the dividend yield: Ball (1978), Rozeﬀ (1984), Shiller (1984),

Campbell (1987), Campbell and Shiller (1988), Campbell and Shiller (1989), Fama

and French (1988a), Hodrick (1992), Campbell and Viceira (2002), Campbell and

Yogo (2003), Lewellen (2004), and Menzly, Santos, and Veronesi (2004). Cochrane

(1997) surveys the dividend ratio prediction literature.

The earnings price ratio and dividend-earnings (payout) ratio: Lamont (1998).

The interest and inﬂation rates: The short term interest rate: Campbell (1987) and Ho-

drick (1992). The term spread and the default spread: Avramov (2002), Campbell

(1987), Fama and French (1989), and Keim and Stambaugh (1986). The inﬂation

rate: Campbell and Vuolteenaho (2003), Fama (1981), Fama and Schwert (1977),

and Lintner (1975). Some papers explore multiple interest rate related variables,

as well as dividend related variables (e.g., Ang and Bekaert (2003)).

The book-to-market ratio: Kothari and Shanken (1997) and Pontiﬀ and Schall (1998).

The consumption, wealth, and income ratio: Lettau and Ludvigson (2001).

The aggregate net issuing activity: Baker and Wurgler (2000).

In turn, a large theoretical and normative literature has developed that stipulates

how investors should allocate their wealth as a function of state variables—and promi-

nently the just-mentioned variables.

Our own paper is intentionally simple: as in Goyal and Welch (2003), we posit that

a real-world investor would not have had access to any ex-post information, either to

construct variables or to the entire-sample gamma regression coeﬃcients. An investor

would have had to estimate the prediction equation only with data available strictly

before or at the prediction point, and then make an out-of-sample prediction. There-

fore, instead of running one single in-sample regression and comparing the ﬁtted to the

actual value (or, equivalently, compute the R

or F-statistic), we must run rolling fore-

casting regressions and compare the performance of the regression predictions against

the equivalent predictions from simply projecting the then-prevailing historical equity

premium mean. Unlike Goyal and Welch (2003), our current paper expands the set of

variables and horizons to be comprehensive. We are interested in how well any of the

popular variables, which were proposed in existing literature as important in-sample

predictors of the equity premium, hold up out-of-sample.

Our paper not only tries out diﬀerent time horizons and forecasting periods, but also

diagnoses when these variables were of help, using a graphical diagnostic ﬁrst proposed

in Goyal and Welch (2003). We are also interested in the contradictory results in the

literature: diﬀerent papers have identiﬁed diﬀerent methods/variables to be important.

Our paper shows that many of the diﬀerences can be traced back to choices of sample

period and data frequency: these are not innocuous, but often the primary driver for

the signiﬁcance of in-sample results.

Altogether, we ﬁnd our evidence sobering: we could not identify a single variable

that would have been of solid and robust use to a real-world investor (who did not

have access to ex-post information). Our diagnostic shows that any presumed equity

premium forecasting ability was a mirage. Even before the often-considered anoma-

lous 1990s, many of these variables had little if any statistical forecasting power. It is

also usually not a matter of arguing over whether we computed correct statistical stan-

dard errors. Instead, most variables are just worse than the prevailing historical equity

premium average as a predictor, and some even economically signiﬁcantly so.

Goyal and Welch (2003) was not the ﬁrst paper to explore out-of-sample prediction. There are three

earlier/contemporaneous attempts we are aware of: First, Fama and French (1988a) interpreted out-

of-sample performance to be a success, primarily due to a fortunate sample period. Second, Pesaran

and Timmerman (1995) explore model selection in great detail, exploring dividend-yield, earnings-price

ratios, interest rates, and money in 2

= 512 model variations. Their data series is monthly, from

1954–1992. They conclude that investors could have succeeded, especially in the volatile periods of the

1970s. They do not entertain the historical equity premium mean as a null hypothesis. Third, like Goyal

and Welch (2003), Bossaerts and Hillion (1999) interpreted out-of-sample performance to be a failure.

However, Bossaerts and Hillion (1999) relied more on a large cross-section (14 countries) than on a long

out-of-sample time period (1990–1995).

Goyal and Welch (2003) was also not ﬁrst to critique predictive regressions. In particular, the use

of dividend ratios has been critiqued in many other papers (see, e.g., Goetzmann and Jorion (1993)

and Ang and Bekaert (2003); apologies to everyone whose paper we omit to cite here—the literature is

voluminous).

Overall, the performance of these variables is worse than what we would have ex-

pected: given the data snooping of many researchers looking for variables that predict

stock prices, and given that our out-of-sample regressions often rely on the very same

data points that were used to establish the signiﬁcance of the in-sample regression, so

we are not really conducting a true out-of-sample test—we would have expected at least

about equal performance. But instead, for example, of 51 predictive regressions on an-

nual frequencies, 46 (!) underperformed the prevailing mean on a the RMSE criterion.

As for the rare regression exceptions in which a variable outpredicts the mean, none are

robust across time-speciﬁcations and/or data periodicity, few reach statistical signiﬁ-

cance, and none reaches good economic signiﬁcance, i.e., surpassing even very modest

transaction costs. (The average annual outperformance is 12 basis points.)

In sum, despite good in-sample predictive ability for many of these variables, most

had consistently poor or zero out-of-sample forecasting ability. (They were essentially

noise.) Thus, our paper concludes that the evidence that the equity premium has ever

varied predictably with both prevailing variables and prevailing regression speciﬁca-

tions has always been tenuous: a market-timing trader could not have taken advantage

of these variables to outperform the prevailing moving average—and could/should have

known this. By assuming that the equity premium was “like it always has been,” this

trader would have performed at least as well.

Before we proceed, we wish to point out what our paper does not do: it has nothing

to say about cross-sectional evidence, i.e., whether these variables can predict which

stocks do better than other stocks. It has little to say about models which assume

that agents know all parameters—if the relations are assumed to be known, then out-

of-sample estimates are not required. We are more interested in whether Amit Goyal

and Ivo Welch—agents without full model parameters—should rely on these variables

to time the market.

2 Data

In this section, we describe our data sources and data construction. First, the dependent

variable, the equity premium:

• Stock Prices: S&P 500 index monthly prices from 1871 to 1926 are from Robert

Shiller’s website. These are monthly averages for the month. Prices from 1926

to 2003 are from CRSP’s month-end values. Stock Returns are the continuously

compunded returns on the S&P 500 index.

• Risk-free Rate: The risk-free rate for the period 1920 to 2003 is the T-bill rate.

Because there was no risk-free short-term debt prior to the 1920’s, we had to

A Comprehensive Look at the Empirical Performance of Equity Premium Prediction

Figures

Citations

The Cross-section of Expected Stock Returns

Predicting Excess Stock Returns Out of Sample: Can Anything Beat the Historical Average?

To Explain or to Predict

Presidential Address: Discount Rates

Expected Stock Returns and Variance Risk Premia

References

Comparing Predictive Accuracy

Comparing Predictive Accuracy

Business conditions and expected returns on stocks and bonds

Dividend yields and expected stock returns

The Dividend-Price Ratio and Expectations of Future Dividends and Discount Factors

Related Papers (5)

Predicting Excess Stock Returns Out of Sample: Can Anything Beat the Historical Average?

Dividend yields and expected stock returns

Business conditions and expected returns on stocks and bonds

The Dividend-Price Ratio and Expectations of Future Dividends and Discount Factors

A simple, positive semi-definite, heteroskedasticity and autocorrelation consistent covariance matrix

Frequently Asked Questions (1)

Q1. What are the contributions mentioned in the paper "Nber working paper series a comprehensive look at the empirical performance of equity premium prediction" ?