scispace - formally typeset
Search or ask a question
Journal ArticleDOI

On geometric ergodicity of CHARME models

01 May 2010-Journal of Time Series Analysis (Blackwell Publishing Ltd)-Vol. 31, Iss: 3, pp 141-152
TL;DR: In this paper, the authors consider a CHARME model, a class of generalized mixture of nonlinear nonparametric AR-ARCH time series, and apply the theory of Markov models to derive asymptotic stability of this model.
Abstract: In this paper we consider a CHARME Model, a class of generalized mixture of nonlinear nonparametric AR-ARCH time series. We apply the theory of Markov models to derive asymptotic stability of this model. Indeed, the goal is to provide some sets of conditions under which our model is geometric ergodic and therefore satisfies some mixing conditions. This result can be considered as the basis toward an asymptotic theory for our model.

Summary (1 min read)

1 Introduction

  • In particular, the important property of geometric ergodicity is obtained under some conditions.
  • In practice, it is often not realistic to assume that the observed process has the same trend function m and the same volatility function σ at each time instant.
  • In particular, the case p = 1 is interesting on its own.

2 First conditions for geometric ergodicity of CHARME

  • The authors focus on their CHARME model (1.2) and make the following assumptions A. 1.
  • The process {Qt} with values on {1, · · · ,K} is a first order strictly stationary Markov chain which is irreducible and aperiodic with probability distribution (π1, · · · , πK) and transition probability matrix A = (aij)1≤i,j≤K .

4 Concluding remarks

  • The authors have considered Conditional Heteroscedastic Autoregressive Mixture of Experts models, a form of hidden Markov model with nonlinear autoregressive-ARCH components, which can be useful in, e.g. financial econometrics.
  • Notice that their conditions for the mk and σk are easier to verify than some kind of sub linearity conditions or Lyapounov exponents conditions.
  • Subsequent work based on their results can lead to sufficient conditions for the existence of moments for Xt or for the existence of limit theorems for Xt.
  • This practical aspect is the subject of forthcoming publication.

Did you find this useful? Give us your feedback

Content maybe subject to copyright    Report

On Geometric Ergodicity of CHARME Models
J¨urgen Franke
Jean-Pierre Stockis
Joseph Tadjuidje Kamgaing
University of Kaiserslautern
January 10, 2007
Abstract
In this paper we consider a CHARME Model, a class of generalized mixture of
nonlinear nonparametric AR-ARCH time series. We apply the theory of Markov models
to derive as ymptotic stability of this model. Indeed, the goal is to provide some sets
of conditions under which our model is geometric ergodic and therefore satisfies some
mixing conditions. This result can be considered as the basis toward an asymptotic
theory for our model.
Keywords: Nonparametric AR-ARCH; Mixture Models; Markov chain; Geometric Ergodicity
University of Kaiserslautern, Department of Mathematics,Erwin-Schroedinger-Str., 67663 Kaisers-
lautern, Germany. E-mail address: franke@mathematik.uni-kl.de
The work was supported by the Deutsche Forschungsgemeinschaft (DFG) as part of the priority research
program 1114 Mathematical Methods of Time Series and Digital Image Analysis, the center of excellence
Dependable Adaptive Systems and Mathematical Modeling funded by the state of Rhineland-Palatinate as
well as the ”Graduiertenkolleg Mathematik und Praxis” and by the Fraunhofer ITWM.
1

1 Introduction
Nonparametric conditional heteroscedastic autoregressive (nonlinear CHARN) models of the form
X
t
= m(X
t1
, · · · , X
tp
) + σ(X
t1
, · · · , X
tp
)
t
, (1.1)
m and σ unknown functions,
t
independent identically distributed (i.i.d.) random variables with
mean 0, play an important role in many fields of application, for example in econometrics or
finance, see for example ardle and Tsybakov [2], Franke, Neumann and Stockis [6], Hafner
[15]. Theoretical results about stability properties of this processes are available. In particular, the
important property of geometric ergodicity is obtained under some conditions.
In practice, it is often not realistic to assume that the observed process has the same trend function
m and the same volatility function σ at each time instant. In this paper we are analyzing the
so-called Conditional Heteroscedastic Autoregressive Mixture of Experts, henceforth CHARME,
models. Here, a hidden Markov chain {Q
t
} with values in a finite set of states {1, 2, · · · , K} drives
the dynamics of {X
t
} and our model is defined as follows
X
t
=
K
X
k=1
S
tk
(m
k
(X
t1
, · · · , X
tp
) + σ
k
(X
t1
, · · · , X
tp
)
t
) (1.2)
with
S
tk
=
(
1 for Q
t
= k
0 otherwise
(1.3)
m
k
, σ
k
, k = 1, · · · , K unknown functions,
t
i.i.d. random variables with mean 0.
Notice that for sake of simplicity of notation, we take the same number of comp onents p in each
trend function m
k
and volatility function σ
k
. This is done without loss of generality if we take p
large enough.
We call this mo dels CHARME since many authors using a mixture of models, e.g. in engineering
are calling them mixture of experts as soon as nonparametric functions estimates, typically neural
networks, are considered, see, e.g. M¨uller et al. [8], Jacob et al. [12], or Jiang and Tanner [10].
CHARME is quite useful for modeling time series data which are piecewise stationary such that
their dynamics switch sometimes from one state to another. A typical example is given by stock
returns if the market changes from a quiesce nt to a volatile phase. Tadjuidje [16] gives some
applications of such models to financial data in the context of asset management and risk analysis
where the state functions m
k
, σ
k
, k = 1, · · · , K, are e stimate d by neural networks.
Independently of the type of estimates considered, a crucial c ondition for developing a theory for
estimation and testing in the setting of CHARME is the existence of a stochastic proces s satisfying
(1.2) which is geometric ergodic. In this paper we investigate separately the case p = 1 (s ec tion
1) and the case p 1 (section 2) since they differ somewhat with respect to the formulation and
proof. In particular, the case p = 1 is interesting on its own. We formulate for both cases two
different sets of conditions.
2 First conditions for geometric ergodicity of CHARME
proce sses
We focus on our CHARME model (1.2) and make the following assumptions
A. 1 The process {Q
t
} with values on {1, · · · , K} is a first order strictly stat ionary Markov chain
which is irreducible and aperiodic with probability distribution (π
1
, · · · , π
K
) and transition proba-
bility matrix A = (a
ij
)
1i,jK
.
2

Obviously, {S
t
= (S
t1
, · · · , S
tK
)
0
} inherits the properties of {Q
t
}.
A. 2 Let G
t1
= σ{X
r
, r t 1} be the σ-algebra generated by {X
r
, r t 1} and G
t1
any event
in G
t1
. Then
P (Q
t
= j | Q
t1
= i, G
t1
) = P (Q
t
= j | Q
t1
= i), i, j
This assumption means that the hidden process Q
t
is indep e ndent of the past observations given
its own past, i.e. Q
t1
.
A. 3 Given (Q
t1
, X
t1
, X
t2
, · · · ), Q
t
is uncorrelated with the innovation
t
.
A. 4
t
is independent of X
t1
, X
t2
, · · · .
A. 5 The functions m
k
and σ
k
are bounded on compact sets for all k, there exists a δ such that
σ
k
(u) δ > 0, for all k, u.
A. 6 The i.i.d. random variables
t
have a density f which is continuous and positive everywhere.
These assumptions are reasonable conditions for hidden Markov chain models, see e.g. Francq and
Roussignol [11] or Francq, Roussignol and Zakoian [9].
Now, we restrict ourselves for the rest of this section to the case p = 1, i.e. m
k
, σ
k
are functions on
the real line. We first assume
A. 7 The i.i.d. random variables
t
have mean 0 and variance σ
2
= 1
A. 8
max
l∈{1,··· ,K}
lim sup
|x|−→∞
P
k
a
lk
(m
2
k
(x) + σ
2
k
(x))
x
2
< 1
A.8 is the generalization of the well-known sufficient condition for geometric ergodicity in the case
of model (1.1). Now we need a Markov chain representing the transformed mixture proces s: under
assumptions A.1 to A.4 it is easily seen that if we define, as previously, S
t
= (S
t1
, · · · , S
tK
)
0
, then,
ζ
t
= (S
t
, X
t
)
0
is a Markov chain.
Theorem 1 Under A.1 to A.8, {ζ
t
} is geometrically ergodic.
Proof: We are going to prove that the conditions of Theorem 15.0.1, (iii) of Meyn and Tweedie
[7], pp 354 355, are satisfied.
{ζ
t
} is ϕ-irreducible if we take ϕ as the product of the stationary probability distribution
measure on {1, · · · , K} and the Lebesgue measure on R
This can be proven as follows:
Let A = A
1
× A
2
be such that ϕ(A) > 0. Then A
1
contains at least one integer between 1 and K
and it is enough to prove that there exists t such that
P

S
t+1
X
t+1
{e} × A
2
| S
1
= s
l
, X
1
= x
> 0
3

with e a unit vector with the the kth component equal 1 and s
l
a unit vector with the lth component
equal 1. By definition,
P

S
2
X
2
{e} × A
2
| S
1
= s
l
, X
1
= x
= P (Q
2
= k, X
2
A
2
| S
1
= s
l
, X
1
= x)
= P (X
2
A
2
| Q
2
= k, S
1
= s
l
, X
1
= x) P (Q
2
= k | Q
1
= l, X
1
= x)
= a
lk
P (m
k
(x) + σ
k
(x)
2
A
2
)
= a
lk
Z
A
2
1
σ
k
(x)
f
u m
k
(x)
σ
k
(x)
du
= a
lk
b
k
(x) with b
k
(x) > 0
Further,
P

S
3
X
3
{e} × A
2
| S
1
= s
l
, X
1
= x
=
K
X
j=1
a
lj
a
jk
Z
A
2
Z
R
1
σ
k
(y)
f
u m
k
(y)
σ
k
(y)
1
σ
j
(x)
f
y m
j
(x)
σ
j
(x)
dydu
=
K
X
j=1
a
lj
a
jk
b
jk
(x) with b
jk
(x) > 0
and doing so iteratively, we obtain
P

S
t+1
X
t+1
{e} × A
2
| S
1
= s
l
, X
1
= x
=
K
X
j,··· ,j
t1
a
lj
1
· · · a
j
t1
k
b
j,··· ,j
t1
(x)
which is strictly greater than 0 for some t because of the irreducibility of {Q
t
} and the fact that
b
j,··· ,j
t1
(x) > 0.
Analogously it c an easily be seen that {ζ
t
} is aperiodic.
In the drift criterion of Theorem 15.0.1, (iii) mentioned previously app ears the notion of a
petite set. In our case, it can be shown that each compact set is indeed a s mall set and thus
a petite set, see for example, Bhattacharya and Lee [3] and Lee and Shin [5].
So, to apply the drift criterion, we need to find a function g(ζ) > 1, β > 0 and M > 0 such
that
E
g(ζ
t
) | ζ
t1
=
s
l
x

g

s
l
x

g

s
l
x

β for kζ
t1
k > M
Let
g(ζ
t
) = 1 + X
2
t
.
Then,
E
g(ζ
t
) | ζ
t1
=
s
l
x

g

s
l
x

g

s
l
x

=
P
k
(m
2
k
(x) + σ
2
k
(x))E(S
tk
| S
t1
= s
l
) x
2
1 + x
2
P
k
(m
2
k
(x) + σ
2
k
(x))a
lk
x
2
1
4

and the conclusion is obtained by A.8.
Howeve r, in financial time series which are very often heavy-tailed, the existence of σ
2
= var(
t
) is
not necessarily guaranteed. Therefore, instead of A.7 and A.8 we assume
A. 9 The i.i.d. random variables
t
are such that E(|
t
|
α
) < for some 0 < α 1
A. 10
max
l∈{1,··· ,K}
lim sup
|x|−→∞
P
k
a
lk
(|m
k
(x)|
α
+ σ
α
k
(x)E|
t
|
α
)
|x|
α
< 1
with α as in A.9
Theorem 2 Under Assumptions A.1 to A.6, A.9 and A.10, {ζ
t
} is geometrically ergodic.
Proof: The only part of this proof which is not similar to the proof of Lemma 1 is the drift criterion.
Here we consider
g(ζ) = 1 + |X
t
|
α
.
Then,
E
g(ζ
t
) | ζ
t1
=
s
l
x

g

s
l
x

g

s
l
x

P
k
(|m
k
(x)|
α
+ σ
α
k
(x)E|
t
|
α
)E(|S
tk
|
α
| S
t1
= s
l
) |x|
α
1 + |x|
α
P
k
a
lk
(|m
k
(x)|
α
+ σ
α
k
(x)E|
t
|
α
)
|x|
α
1
and we conclude the proof by using A.10.
3 Geometric ergodicity for higher order CHARME pro-
cesses
We now follow a slightly different route to geometric ergodicity of CHARME processes. We first
state an auxiliary result that we are going to use for proving a condition for geometric ergodicity.
Lemma 1 Let φ, ψ be random variables with values in R
d
, C R
d
a measurable set, g : R
d
R
measurable and bounded on C satisfying g 1. If there exist constants 0 < r < 1, B > 0 such that
E(g(φ) | ψ = x) < rg(x), if x 6∈ C
E(g(φ) | ψ = x) < B, if x C
then, there exist β > 0, b < such that
E(g(φ) | ψ = x) g(x) < βg(x) + bI
C
(x).
5

Citations
More filters
Journal ArticleDOI
TL;DR: In this paper, it is shown that the test statistic under the null of stationarity asymptotically has a chi-squared distribution, whereas under the alternative of local stationarity, it has a noncentral chi-square distribution.

54 citations


Cites background or methods from "On geometric ergodicity of CHARME m..."

  • ...However, here we shall focus on time series whose correlation structure changes slowly over time (early work on time-varying time series include Priestley (1965), Subba Rao (1970) and Hallin (1984))....

    [...]

  • ...…including, under certain assumptions on the innovations, the vector AR models (see Pham and Tran (1985)) and other Markov models which are irreducible (cf. (Feigin & Tweedie, 1985), Mokkadem (1990), Meyn and Tweedie (1993), Bousamma (1998), Franke, Stockis, and Tadjuidje-Kamgaing (2010))....

    [...]

  • ...However, here we shall focus on time series whose correlation structure changes slowly over time (early work on time-varying time series include Priestley (1965), Subba Rao (1970) and Hallin (1984)). As in nonparametric regression and other work on nonparametric statistics we use the rescaling device to develop the asymptotic theory. The same rescaling device has been used for example in nonparametric time series by Robinson (1989) and by Dahlhaus (1997) in his definition of local stationarity....

    [...]

Journal ArticleDOI
TL;DR: In this article, the authors developed testing procedures for the detection of structural changes in nonlinear autoregressive processes, where the regression function is modeled by a single layer feed-forward neural network.
Abstract: In this paper we develop testing procedures for the detection of structural changes in nonlinear autoregressive processes. For the detection procedure we model the regression function by a single layer feedforward neural network. We show that CUSUM-type tests based on cumulative sums of estimated residuals, that have been intensively studied for linear regression, can be extended to this case. The limit distribution under the null hypothesis is obtained, which is needed to construct asymptotic tests. For a large class of alternatives it is shown that the tests have asymptotic power one. In this case, we obtain a consistent change-point estimator which is related to the test statistics. Power and size are further investigated in a small simulation study with a particular emphasis on situations where the model is misspecified, i.e. the data is not generated by a neural network but some other regression function. As illustration, an application on the Nile data set as well as S&P log-returns is given.

36 citations


Cites background or methods from "On geometric ergodicity of CHARME m..."

  • ...Stockis et al. (2010) use the time series in eqn (4) as building blocks in a regime-switching model, the so called CHARME-models, in the context of financial time series....

    [...]

  • ...The idea behind this is that the test would only be applied in situations were this is the case and the approximation by a neural network based process is reasonable....

    [...]

  • ...Since Model (4) is a special case with only one regime, the following result is a straightforward application of Theorem 4 of Stockis et al. (2010)....

    [...]

  • ...In view of possible financial applications, an indirect example of model (1) is given by the nonlinear ARCH processes, see, for example, Stockis et al. (2010), an extension of the b-ARCH model introduced by Guégan and Diebolt (1994) which include the celebrated ARCH model defined by Engle (1982)....

    [...]

  • ...AMS subject classification 2000: 62G10, 62M45, 62G08...

    [...]

Journal ArticleDOI
TL;DR: This work gives general regularity conditions under which the asymptotic null behavior of the corresponding tests in addition to their behavior under alternatives are derived, where conditions become particularly simple for sufficiently smooth estimating and monitoring functions.

35 citations

Posted Content
TL;DR: In this article, the authors examined the main properties of the Markov chain X t = T(X t-1 )+σ (X t -1 ) +σ(x t −1 ǫ t ) under general and tractable assumptions, and derived bounds for the tails of the stationary density of the process {X t } in terms of the common density.
Abstract: We examine the main properties of the Markov chain X t = T(X t-1 )+σ(X t-1 )ɛ t . Under general and tractable assumptions, we derive bounds for the tails of the stationary density of the process {X t } in terms of the common density of the ɛ t 's.

31 citations

References
More filters
Book
01 Jan 1993
TL;DR: This second edition reflects the same discipline and style that marked out the original and helped it to become a classic: proofs are rigorous and concise, the range of applications is broad and knowledgeable, and key ideas are accessible to practitioners with limited mathematical background.
Abstract: Meyn & Tweedie is back! The bible on Markov chains in general state spaces has been brought up to date to reflect developments in the field since 1996 - many of them sparked by publication of the first edition. The pursuit of more efficient simulation algorithms for complex Markovian models, or algorithms for computation of optimal policies for controlled Markov models, has opened new directions for research on Markov chains. As a result, new applications have emerged across a wide range of topics including optimisation, statistics, and economics. New commentary and an epilogue by Sean Meyn summarise recent developments and references have been fully updated. This second edition reflects the same discipline and style that marked out the original and helped it to become a classic: proofs are rigorous and concise, the range of applications is broad and knowledgeable, and key ideas are accessible to practitioners with limited mathematical background.

5,931 citations


"On geometric ergodicity of CHARME m..." refers background or methods in this paper

  • ...On proving Theorems 1 and 2, we have used the drift criterion of Meyn and Tweedie (1993)....

    [...]

  • ...Proof:As in the case p = 1 we are going to prove that the conditions of Theorem 15.0.1,(iii) of Meyn and Tweedie [ 7 ] pp 354 355 are satisfied....

    [...]

  • ...Proof: We are going to prove that the conditions of Theorem 15.0.1,(iii) of Meyn and Tweedie [ 7 ], pp 354 355, are satisfied....

    [...]

Journal ArticleDOI
TL;DR: A new supervised learning procedure for systems composed of many separate networks, each of which learns to handle a subset of the complete set of training cases, which is demonstrated to be able to be solved by a very simple expert network.
Abstract: We present a new supervised learning procedure for systems composed of many separate networks, each of which learns to handle a subset of the complete set of training cases. The new procedure can be viewed either as a modular version of a multilayer supervised network, or as an associative version of competitive learning. It therefore provides a new link between these two apparently different approaches. We demonstrate that the learning procedure divides up a vowel discrimination task into appropriate subtasks, each of which can be solved by a very simple expert network.

4,338 citations


Additional excerpts

  • ...[12], or Jiang and Tanner [10]....

    [...]

BookDOI
01 Jan 1997

3,636 citations


"On geometric ergodicity of CHARME m..." refers background in this paper

  • ...This is, e.g. indicated by estimates of the tail index of the marginal distribution of certain asset returns (compare, e.g. Embrechts et al., 1997)....

    [...]

  • ...Tong, 1983) or qualitative threshold ARCH models (Gouriéroux and Montfort, 1992). The latter are in particular of interest, as they also contain a switching mechanisms which, however, is solely driven by the current states of the time series under consideration. By introducing the hidden Markov chain Qt, we additionally allow for sudden changes of the state of the process driven by external forces. For many applications, Qt will change its value only rarely, i.e. the observed process follows the same regime for quite some time before a change occurs. An example would, e.g. be time series of asset returns which change their behaviour if the market moves from a phase with increasing to one with decreasing trend or from one with generally high to one with low volatility. Tadjuidje Kamgaing (2005) gives some illustrations of such models to financial data in the context of asset management and risk analysis where the state functions are estimated by neural networks. Another noneconomic example would be EEG time series from sleeping people who during one night’s sleep move through different stationary phases where, compared with the timescales, the changes are almost sudden and can be modelled as jumps (compare Liehr et al., 1999). We remark that some of the single phases of a CHARME model may also correspond to explosive regimes provided that this states are visited rarely enough. This feature is well known for several parametric mixture models, e.g. for the flexible coefficient GARCH model of Medeiros and Veiga (2008). Independent of the type of estimates considered, a crucial condition for developing a theory of estimation and testing in the setting of CHARME is the existence of a stochastic process satisfying eqn (2) which is geometrically ergodic....

    [...]

  • ...Tong, 1983) or qualitative threshold ARCH models (Gouriéroux and Montfort, 1992). The latter are in particular of interest, as they also contain a switching mechanisms which, however, is solely driven by the current states of the time series under consideration. By introducing the hidden Markov chain Qt, we additionally allow for sudden changes of the state of the process driven by external forces. For many applications, Qt will change its value only rarely, i.e. the observed process follows the same regime for quite some time before a change occurs. An example would, e.g. be time series of asset returns which change their behaviour if the market moves from a phase with increasing to one with decreasing trend or from one with generally high to one with low volatility. Tadjuidje Kamgaing (2005) gives some illustrations of such models to financial data in the context of asset management and risk analysis where the state functions are estimated by neural networks....

    [...]

Journal ArticleDOI
TL;DR: In this paper, the EM algorithm converges to a local maximum or a stationary value of the (incomplete-data) likelihood function under conditions that are applicable to many practical situations.
Abstract: Two convergence aspects of the EM algorithm are studied: (i) does the EM algorithm find a local maximum or a stationary value of the (incomplete-data) likelihood function? (ii) does the sequence of parameter estimates generated by EM converge? Several convergence results are obtained under conditions that are applicable to many practical situations Two useful special cases are: (a) if the unobserved complete-data specification can be described by a curved exponential family with compact parameter space, all the limit points of any EM sequence are stationary points of the likelihood function; (b) if the likelihood function is unimodal and a certain differentiability condition is satisfied, then any EM sequence converges to the unique maximum likelihood estimate A list of key properties of the algorithm is included

3,414 citations


"On geometric ergodicity of CHARME m..." refers background in this paper

  • ...So, to apply the drift criterion, we need to find a function g(f) > 1, b > 0 and M > 0 such that E gðftÞ ft 1 ¼ sl x g sl x g sl x b for kft 1k > M Let gðftÞ ¼ 1þ X2t : Then, E gðftÞ ft 1 ¼ sl x g sl x g sl x ¼ Pkðm2kðxÞ þ r2kðxÞÞEðStk j St 1 ¼ slÞ x2 1þ x2 P kðm2kðxÞ þ r2kðxÞÞalk x2 1 and the conclusion is obtained by Assumption 8. h Some financial time series are extremely heavy tailed, and even the adequacy of models assuming a finite variance is doubtful....

    [...]

  • ...In a similar manner, consistency of the parameter estimates in the pure nonparametric ARCH mixture model Xt ¼ XK k¼1 Stk rkðXt 1; . . . ; Xt p; hkÞ t can be shown, where t now are i.i.d. random variables with mean 0 and variance 1....

    [...]

  • ...2010, 31 141–152 1 4 4 E gðftÞ ft 1 ¼ sl x g sl x g sl x PkðjmkðxÞja þ rakðxÞEj tjaÞEðjStkja j St 1 ¼ slÞ jxja 1þ jxja P k alkðjmkðxÞj a þ rakðxÞEj tj aÞ jxja 1 and we conclude the proof by using Assumption 10. h REMARK 1....

    [...]

  • ...Stk is unknown; so, a direct way of getting the estimates is not possible....

    [...]

  • ...For illustration of how the previous results may be applied, we consider a special case of the model defined in equation (2), Xt ¼ XK k¼1 StkmkðXt 1; . . . ; Xt p; hkÞ þ r t ð5Þ where Stk is defined as previously, r>0 is fixed, and, for x 2 Rp mkðx; hkÞ ¼ m0k þ XH h¼1 mhkwð< ahk; x > þbhkÞ; k ¼ 1; . . . ; K ; ð6Þ is the output function of a one-layer feedforward neural network with Hk hidden neurons, <Æ> is the scalar product on R p and w the activation function, e.g. the logistic function....

    [...]

Frequently Asked Questions (11)
Q1. What contributions have the authors mentioned in the paper "On geometric ergodicity of charme models" ?

In this paper the authors consider a CHARME Model, a class of generalized mixture of nonlinear nonparametric AR-ARCH time series. Indeed, the goal is to provide some sets of conditions under which their model is geometric ergodic and therefore satisfies some mixing conditions. 

CHARME is quite useful for modeling time series data which are piecewise stationary such that their dynamics switch sometimes from one state to another. 

Then A1 contains at least one integer between 1 and K and it is enough to prove that there exists t such thatP (( St+1 Xt+1 ) ∈ {e} ×A2 |S1 = sl, X1 = x ) > 0with e a unit vector with the the kth component equal 1 and sl a unit vector with the lth component equal 1. 

Let x = (xt, · · · , xt−p+1)′ be a vector of real numbers and sl a K-dimensional unit vector with the l-th component equal 1 and considerE(g(ζt+1) | (Xt, · · · , Xt−p+1)′ = x, St = sl)= 1 + K∑k=1alk(m2k(x) + σ 2 k(x)) + bp−1x 2 t + · · ·+ b1x2t−p+2 (3.1)Now, let us focus onK∑ k=1 alk(m2k(x) + σ 2 k(x))= 

If the process {Xt} is also strictly stationary it is well known that this implies that {Xt} is absolutely regular with geometric decreasing rate, which gives a very useful condition for deriving limit theorems like the central limit theorem. 

0and doing so iteratively, the authors obtainP (( St+1 Xt+1 ) ∈ {e} ×A2 |S1 = sl, X1 = x ) =K∑ j,··· ,jt−1 alj1 · · · ajt−1kbj,··· ,jt−1(x)which is strictly greater than 0 for some t because of the irreducibility of {Qt} and the fact that bj,··· ,jt−1(x) > 0.• Analogously it can easily be seen that {ζt} is aperiodic.• 

In this paper the authors are analyzing the so-called Conditional Heteroscedastic Autoregressive Mixture of Experts, henceforth CHARME, models. 

{St = (St1, · · · , StK)′} inherits the properties of {Qt}.A. 2 Let Gt−1 = σ{Xr, r ≤ t−1} be the σ-algebra generated by {Xr, r ≤ t−1} and Gt−1 any event in Gt−1. 

Geometric ergodicity of ζt can clearly be obtained even if some of the underlying dynamics taken on their own are not geometric ergodic or even stationary, provided the probability to go from a stable dynamic to a non stable dynamic is low enough and the probability to move from a non stable dynamic to a stable dynamic is large enough. 

In particular in their case where {φt} is a Markov chain, it is enough to prove the existence of a petite set C, a function g ≥ 1 and constants 0 < r < 1 and B > 0 such thatE(g(φt) |φt−1 = x) < rg(x), x 6∈ C E(g(φt) |φt−1 = x) < B, x ∈ C. 

Independently of the type of estimates considered, a crucial condition for developing a theory for estimation and testing in the setting of CHARME is the existence of a stochastic process satisfying (1.2) which is geometric ergodic.