scispace - formally typeset
Open AccessJournal ArticleDOI

Bayesian vector autoregressions with stochastic volatility

Harald Uhlig
- 01 Jan 1997 - 
- Vol. 65, Iss: 1, pp 59-73
TLDR
In this article, a Bayesian approach to vector autoregression with stochastic volatility is proposed, where the multiplicative evolution of the precision matrix is driven by a multivariate beta variate.
Abstract
This paper proposes a Bayesian approach to a vector autoregression with stochastic volatility, where the multiplicative evolution of the precision matrix is driven by a multivariate beta variate.Exact updating formulas are given to the nonlinear filtering of the precision matrix.Estimation of the autoregressive parameters requires numerical methods: an importance-sampling based approach is explained here.

read more

Content maybe subject to copyright    Report

Tilburg University
Bayesian Vector Autoregressions with Stochastic Volatility
Uhlig, H.F.H.V.S.
Publication date:
1996
Link to publication in Tilburg University Research Portal
Citation for published version (APA):
Uhlig, H. F. H. V. S. (1996).
Bayesian Vector Autoregressions with Stochastic Volatility
. (CentER Discussion
Paper; Vol. 1996-09). Macroeconomics.
General rights
Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners
and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights.
• Users may download and print one copy of any publication from the public portal for the purpose of private study or research.
• You may not further distribute the material or use it for any profit-making activity or commercial gain
• You may freely distribute the URL identifying the publication in the public portal
Take down policy
If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately
and investigate your claim.
Download date: 10. aug.. 2022

BAYESIAN VECTOR AUTOREGRESSIONS
WITH STOCHASTIC VOLATILITY
1
By Harald Uhlig
February 2, 1996
Abstract
This paper proposes a Bayesian approachtoavector autoregression with
stochastic volatility, where the multiplicativeevolution of the precision matrix
is driven byamultivariate b eta variate. Exact updating formulas are given to
the nonlinear ltering of the precision matrix. Estimation of the autoregressive
parameters requires numerical metho ds: an importance-sampling based approach
is explained here.
i

1
1 Intro duction
This paper introduces Bayesian vector autoregressions with sto chastic volatility. In con-
trast to multivariate autoregressive conditional heteroskedasticity (ARCH), the sto chas-
tic volatility setup here models the error precision matrix as an unobserved component
with sho cks drawn from a multivariate b eta distribution. This allows the interpretation
of a sudden large movement in the data as the result of a draw from a distribution
with a randomly increased but unobserved variance. Exploiting a conjugacy between
Wishart distributions and multivariate singular beta distributions, the integration over
the unobserved shock to the precision matrix can be p erformed in closed form, lead-
ing to a generalization of the standard Kalman-Filter formulas to the nonlinear ltering
problem at hand. Estimating the autoregressive parameters requires numerical methods,
however. The paper focusses on an importance-sampling based approach.
Bayesian vector autoregressions have been studied and popularized by e.g. Litter-
man (1979), Doan, Litterman and Sims (1984) and Doan's RATS Manual (1990). ARCH
models have been introduced by Engle (1982), see the review in Bollerslev, Chou and
Kroner (1992). Sto chastic volatility mo dels provide an alternative approach to model
time variation in the size of uctuations. The stochastic volatility mo del used here
is similar to Shephard (1994), whose model is a univariate, non-Bayesian and non-
autoregressive special case of the model proposed here. In contrast to other Bayesian
approaches to stochastic volatility, see Jacquier, Polson and Rossi (1994), the method
here results in exact up dating formulas for the posterior in the sense that the integra-
tion over the unobserved shocks to the precision matrices is done in closed form. The
conjugacy result needed for this step is established in Uhlig (1994b).
For simplicity, the main ideas are explained in section 2 for the univariate case with
the general case presented in section 3. Section 4 discusses how to analyze the posterior
numerically. Section 5 concludes. Appendix A lists some of the distributions used and
xes notation. Appendix B contains the proofs and one additional theorem. Appendix C
proposes a prior.
2 A Simple Case
Consider the following simple version of the mo del studied in this pap er:
y
t
=
y
t
1
+
h
1
=
2
t
t
;
with
t
N
(0
;
1)
;
(1)
h
t
+1
=
h
t
#
t
=;
with
#
t
B
1
((
+1)
=
2
;
1
=
2)
;
(2)
where all
#
t
's and
t
's drawn independently, where
t
=1
;:::;T
denotes time,
y
t
2
IR
;t
=
0
;:::;T
is data and observable,
>
0,
>
0 are parameters and
B
1
(
p; q
) denotes the
(one-dimensional) b eta-distribution on the interval [0,1].

2
Equation (2) sp ecies the unobserved precision
h
t
of the innovation
h
1
=
2
t
t
to b e
stochastic. The model thus belongs to the family of sto chastic volatility mo dels, see e.g.
Jacquier-Polson and Rossie (1994). The model captures auto correlated heteroskedastic-
ity, a feature often found especially in nancial data series. Another popular specica-
tion whichdoessoistheARCH-family of models. A GARCH(1,1) mo del, for example,
replaces
h
1
=
2
t
in (1) with
t
and replaces (2) with
2
t
+1
=
+

2
t
+

2
t
2
t
;
(3)
where
,
and
are parameters. It thus ties the innovation in the variance to the
size of the current innovation
t
t
. Given
t
1
and
h
t
1
or
t
1
,anunusually large
innovation in (2) can result from a randomly decreased
h
t
as well as a large
t
, whereas
the GARCH-model (3) only allows for an unusually large draw
t
.
To analyze the system (1) to (2) in a Bayesian fashion, one needs to choose a prior
density
0
(
; h
1
) for
and
h
1
, given
y
0
. The goal is to nd the p osterior density
T
(
; h
T
+1
) given data
y
0
;:::;y
T
.We restrict the choice of priors to b e of the following
form. Fix
>
0 and
>
0 (for a more general treatment, see section 3). Cho ose
b
0
2
IR
,
n
0
>
0,
s
2
0
>
0 and a function
g
0
(
)
0 to describ e a prior density prop ortional to
0
(
; h
1
)
/
g
0
(
)
f
NG
(
; h
1
j
b
0
;n
0
;s
0
;
)
;
where
f
NG
denotes the Normal-gamma density, see app endix A. The form of the prior
allows for a exible treatment of a root near or above unity via the function
g
0
(
), see
Uhlig (1994a).
Adapting the Bayesian updating formulas (12), (13), (14) and (15) derived b elowin
section 3 to the simple mo del ab ove results in
n
t
=
n
t
1
+
y
2
t
1
(4)
b
t
=
b
t
1
n
t
1
+
y
t
y
t
1
=n
t
(5)
s
t
=
s
t
1
+
e
2
t
1
y
2
t
1
=n
t
;
(6)
where
e
t
=
y
t
b
t
1
y
t
1
;
and
g
t
(
)=
g
t
1
(
)
(
b
t
)
2
n
t
+
s
t
1
=
2
(7)
for
t
=1
;:::;T
. These deliver the posterior density
T
(
; h
T
+1
)
/
g
T
(
)
f
NG
(
; h
T
+1
j
b
T
;n
T
;s
T
;
)
:

3
Equations (4) and (5) are the recursion formulas or Kalman Filter formulas for geomet-
rically weighted least squares. Dierent observations receive dierentweights according
to the size of
s
t
via equation (7). Equation (6) prescribes to nd the \estimate"
s
t
of
h
t
+1
essentially via a geometric lag on past squared residuals. Notice the formal similarityto
GARCH: ignoring the term (1
y
2
t
1
=n
t
), equation (6) resembles equation (3) rewritten
in terms of observables, using
= 0 and
=
=
.
The key for proving the validity of these updating formulas here or in the next section
is theorem 2 and its pro of (see appendix B): as the unobserved shock
#
t
occurs, one needs
to do a \change of variable" from
d dh
t
d#
t
to
d dh
t
+1
dz
t
for some suitably dened
z
t
. Thanks to the conjugacy b etween the b eta and the gamma distribution, integration
over
dz
t
can be p erformed in closed form, resulting in an integration constant depending
on
and the data. This constant is captured by the function
g
t
(
).
Shephard (1994) nds similar formulas with a classical interpretation for (1) to (2)
without the autoregressive term
y
t
1
.To include autoregressive terms, Shephard (1994)
suggests approximate ltering formulas. In contrast, the Bayesian formulas here are
exact. They do, however, require numerical techniques such as importance-sampling for
the estimation of
. There is no treatment of the multivariate case in Shephard (1994).
For
=
=
(
+1) wehave
=
=1
in equation (6). For
=(
+1)
=
(
+2)
the precision
h
t
is a martingale
E
[
h
t
+1
j
h
t
]=
h
t
on the positive part of the real axis.
Shephard (1994) suggests setting
=
e
r
, where
r
=
E
[log
#
t
]. This avoids the problem,
that otherwise
h
t
!1
a.s. or
h
t
!
0 a.s. (see Nelson (1990)) and makes log
h
t
a
random walk.
For
!1
one obtains a mo del where
h
1
is known a priori,
h
1
=
s
1
0
, and where
h
t
+1
=
h
t
(
+1)
=
(
(
+ 2)). In other words the mo del allows for the greater time
variation in the precision, the smaller the parameter
.
Figure 1 shows parts of the densities for
=#
t
, which are the multiplicative distur-
bances of the variance
2
t
h
1
t
. It shows that (2) typically leads to a slight decrease in
the innovation variance except for occasional and potentially large increases.
3 The General Model
Consider the VAR(k)-mo del with time-varying error precision matrices
Y
t
=
B
(0)
C
t
+
B
(1)
Y
t
1
+
B
(2)
Y
t
2
+
:::
+
B
(
k
)
Y
t
k
+
U
(
H
1
t
)
0
t
;
with
t
N
(0
;I
m
)
;
(8)
H
t
+1
=
U
(
H
t
)
0
t
U
(
H
t
)
=;
with
t
B
m
((
+
c
+
km
)
=
2
;
1
=
2)
;
(9)
where
t
=1
;:::;T
denotes time,
Y
t
;t
=1
k;:::;T
, size
m
1, is observable data,
and
C
t
, size
c
1, denotes deterministic regressors such as a constant and a time trend.
The co ecient matrix
B
(0)
is of size
m
c
, the co ecient matrices
B
(
i
)
;i
=1
;:::;k
are

Citations
More filters
Journal ArticleDOI

What are the effects of monetary policy on output? Results from an agnostic identification procedure

TL;DR: The authors proposed to estimate the effects of monetary policy shocks by a new agnostic method, imposing sign restrictions on the impulse responses of prices, nonborrowed reserves and the federal funds rate in response to a monetary policy shock.
Journal ArticleDOI

Time Varying Structural Vector Autoregressions and Monetary Policy

TL;DR: In this paper, monetary policy and the private sector behavior of the US economy are modeled as a time varying structural vector autoregression, where the sources of time variation are both the coefficients and the variance covariance matrix of the innovations.

Rao-Blackwellised Particle Filtering for Dynamic Bayesian Networks.

TL;DR: In this article, Rao-Blackwellised particle filters (RBPFs) were proposed to increase the efficiency of particle filtering, using a technique known as Rao-blackwellisation.
Patent

Adaptive pattern recognition based control system and method

TL;DR: An adaptive interface for a programmable system, for predicting a desired user function, based on user history, as well as machine internal status and context, is presented for confirmation by the user, and the predictive mechanism is updated based on this feedback as mentioned in this paper.
Book ChapterDOI

Rao-blackwellised particle filtering for dynamic Bayesian networks

TL;DR: In this paper, Rao-Blackwellised particle filters (RBPFs) were proposed to increase the efficiency of particle filtering, using a technique known as Rao-blackwellisation.
References
More filters
Journal ArticleDOI

Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation

Robert F. Engle
- 01 Jul 1982 - 
TL;DR: In this article, a new class of stochastic processes called autoregressive conditional heteroscedastic (ARCH) processes are introduced, which are mean zero, serially uncorrelated processes with nonconstant variances conditional on the past, but constant unconditional variances.
Book

Aspects of multivariate statistical theory

TL;DR: In this paper, the authors present a set of standard tests on Covariance Matrices and Mean Vectors, and test independence between k Sets of Variables and Canonical Correlation Analysis.
Journal ArticleDOI

ARCH modeling in finance: A review of the theory and empirical evidence

TL;DR: An overview of some of the developments in the formulation of ARCH models and a survey of the numerous empirical applications using financial data can be found in this paper, where several suggestions for future research, including the implementation and tests of competing asset pricing theories, market microstructure models, information transmission mechanisms, dynamic hedging strategies, and pricing of derivative assets, are also discussed.
Book

Bayesian inference in statistical analysis

TL;DR: In this article, the effect of non-normality on inference about a population mean with generalizations was investigated. But the authors focused on the effect on the mean with information from more than one source.
Related Papers (5)
Frequently Asked Questions (10)
Q1. What are the contributions in this paper?

This paper proposes a Bayesian approach to a vector autoregression with stochastic volatility, where the multiplicative evolution of the precision matrix is driven by a multivariate beta variate. 

Use as importance sampling density a tdistribution centered at B with Hessian J , whose degrees of freedom are chosen to ensure fatter tails than those of the marginal posterior T;marg(B): choose with 0 < < T + l + ml, preferably close to the upper bound. 

The stochastic volatility model used here is similar to Shephard (1994), whose model is a univariate, non-Bayesian and nonautoregressive special case of the model proposed here. 

This paper introduced Bayesian vector autoregressions with stochastic volatility, deriving in closed form the Bayesian posterior, when the error precision matrix is stochas-9 tically time-varying. 

The key for proving the validity of these updating formulas here or in the next section is theorem 2 and its proof (see appendix B): as the unobserved shock #t occurs, one needs to do a \\change of variable" from d dht d#t to d dht+1 dzt for some suitably de ned zt. 

Note that N 1 t can be computed numerically cheaply viaN 1t = N 1t 1 N 1 t 1XtX 0 tN 1 t 1=(X 0 tN 1 t 1Xt + ) = ;as can be veri ed directly or with rule (T8), p. 324 in Leamer (1978). 

Equation (9) is one of two rather natural generalizations of the multiplication of two real numbers in equation (2) in order to guarantee the symmetry of the resulting matrix Ht+1. 

Numerical methods are needed, since the posterior (11) is proportional to a Normal-Wishart distribution scaled with the function gT (B). 

The weights can di er by orders of magnitude: examining the raw numbers shows that the draw with the largest weight receives 5.3% of the sum of all weights, the 109 most heavily weighted draws constitute 50% of the mass and the 741 draws with the highest weights make up 90%. 

For each t = 1; : : : ; T calculate et = Yt BtXt andNt = Nt 1 +XtX 0 t(12)5 Bt = Bt 1Nt 1 + YtX 0t N 1t(13)St = St 1 + et 1 X 0tN 1 t Xt e0t(14)gt(B) gt 1(B) j (B Bt)Nt(B Bt) 0 + St j 1=2(15)t( ; ) m(( + l + 1)=2)m(( + l)=2) m(l+ )=2 t 1( ; )(16)In equation (16), m( ) is the multivariate gamma function, de ned in Muirhead (1982), De nition 2.1.10.