scispace - formally typeset
Open AccessJournal ArticleDOI

Flexible Class of Skew‐Symmetric Distributions

Yanyuan Ma, +1 more
- 01 Sep 2004 - 
- Vol. 31, Iss: 3, pp 459-468
Reads0
Chats0
TLDR
In this paper, a flexible class of skew-symmetric distributions for which the probab- ility density function has the form of a product of a symmetric density and a skewing function is proposed.
Abstract
We propose a flexible class of skew-symmetric distributions for which the probab- ility density function has the form of a product of a symmetric density and a skewing function. By constructing an enumerable dense subset of skewing functions on a compact set, we are able to consider a family of distributions, which can capture skewness, heavy tails and multimodality systematically. We present three illustrative examples for the fibreglass data, the simulated data from a mixture of two normal distributions and the Swiss bills dlata.

read more

Content maybe subject to copyright    Report

A Flexible Class of Skew-Symmetri Distributions
(running head: exible skew-symmetri distributions)
YANYUAN MA
North Carolina State University
MARC G. GENTON
North Carolina State University
ABSTRACT. We prop ose a exible lass of skew-symmetri distributions for whih the
probability density funtion has the form of a pro dut of a symmetri density and a skewing
funtion. By onstruting an enumerable dense subset of skewing funtions on a ompat
set, we are able to onsider a family of distributions whih an apture skewness, heavy
tails, and multimo dality systematially. We present three illustrative examples for the
b er-glass data, simulated data from a mixture of two normal distributions, and Swiss
bills data.
Key Words:
dense subset; generalized skew-elliptial; multimodality; skewness; skew-normal.
1 Intro dution
A popular approah to ahieve departures from normality onsists of modifying the probability density
funtion (p df ) of a random vetor in a multipliative fashion. Wang, Boyer, & Genton (2004) showed
that any
p
-dimensional multivariate pdf
g
(
x
) admits, for any xed loation parameter
2
R
p
, a unique
skew-symmetri (SS) representation:
g
(
x
) = 2
f
(
x
)
(
x
)
;
(1)
where
f
:
R
p
!
R
+
is a symmetri p df and
:
R
p
!
[0
;
1℄ is a skewing funtion satisfying
(
x
) =
1
(
x
). Vie-versa, any funtion
g
of the type dened by (1) is a valid pdf. By symmetri, we mean
f
(
x
) =
f
(
x
) and we will use \symmetri pdf " and the prop erty
f
(
x
) =
f
(
x
) interhangeably in
the sequel. Throughout this pap er, we restrit our interest on funtions
f
2
C
0
(
R
p
) and ontinuous
skewing funtions
(
x
), where
C
0
(
R
p
) denotes ontinuous funtions on
R
p
with the prop erty
f
(
x
)
!
0
when
k
x
k
2
! 1
, and
k k
2
denotes the
L
2
norm. Genton & Lop erdo (2002) onsidered the subfamily
of generalized skew-elliptial (GSE) distributions for whih the p df
f
in (1) is elliptially ontoured
rather than only symmetri. Many denitions of skewed distributions found in the literature an be
written in the form of a skew-symmetri distribution (1). For instane, Azzalini & Dalla Valle's (1996)
multivariate skew-normal distribution orresp onds to
f
(
x
) =
p
(
x
;
0
;
) and
(
x
) = (
T
x
), where
p
(
x
;
;
) is the
p
-dimensional multivariate normal pdf with mean vetor
and orrelation matrix ,
1

is the standard normal umulative distribution funtion (df ), and
is a shap e parameter ontrolling
skewness. Similarly, multivariate distributions suh as skew-
t
(Brano & Dey, 2001; Azzalini & Capitanio,
2003; Jones & Faddy, 2003; Sahu, Brano, & Dey, 2003), skew-Cauhy (Arnold & Beaver, 2000) and
other skew-elliptial ones (Azzalini & Capitanio, 1999; Brano & Dey, 2001; Sahu
et al.
, 2003) an be
represented by the skew-symmetri distribution (1) with appropriate hoies of
f
and
.
In this artile, we prop ose a exible lass of distributions (1) by onstruting an enumerable dense
subset of the skewing funtions
on a ompat set. The result is a family of distributions whih
an apture skewness, heavy tails, and multimodality systematially. The onstrution of the subset is
through p olynomials, whih has a similar avor as the seminonparametri (SNP) representation prop osed
by Gallant & Nyhka (1987). The latter is dened as the pro dut of the standard normal p df and the
square of a polynomial. The SNP distribution requires the oeÆients in the polynomial to b e onstrained
in order to yield a valid density. It also relies on rejetion sampling shemes to simulate random samples.
These diÆulties do not o ur with our onstrution.
The ontent of the pap er is organized as follows. In Setion 2, we desribe a subset of skewing
funtions based on o dd p olynomials and prove that it results in a dense subset of the skew-symmetri
distributions. In partiular, we dene exible skew-normal and skew-
t
distributions that an have more
than one mode. This is an essential property for some situations and provides an alternative to modeling
with mixtures of distributions. The exibility and p ossible multimodality of the new lass of distributions
is illustrated in Setion 3. We present three illustrative examples in Setion 4, and a disussion in Setion
5.
2 A dense subset of skew-symmetri distributions
In this setion, we onstrut a dense subset of skew-symmetri distributions through approximating the
skewing funtion
on a ompat set. Any ontinuous skewing funtion
an be written as:
(
x
) =
H
(
w
(
x
))
;
(2)
where
H
:
R
!
[0
;
1℄ is the df of a ontinuous random variable symmetri around 0, and
w
:
R
p
!
R
is an o dd ontinuous funtion, that is
w
(
x
) =
w
(
x
). In fat, for a hosen
H
suh that
H
1
exists,
w
(
x
) =
H
1
(
(
x
)) is a ontinuous odd funtion. This representation has been used by Azzalini &
Capitanio (2003) to dene ertain distributions by p erturbation of symmetry. Note however that the
representation (2) is not unique due to the many possible hoies of
H
.
Let
P
K
(
x
) b e an o dd p olynomial of order
K
. A p olynomial of order
K
in
R
p
is dened as a linear
ombination of terms of the form
Q
p
i
=1
x
r
i
i
, where
k
=
P
p
i
=1
r
i
K
. If eah term has an odd order (all
k
's are o dd), then the polynomial is alled an odd p olynomial, whereas if eah term has an even order
(all
k
's are even), it is alled an even polynomial. We dene exible skew-symmetri (FSS) distributions
2

by restriting (1) to:
2
f
(
x
)
K
(
x
)
;
(3)
where
K
(
x
) =
H
(
P
K
(
x
)) and
H
is any df of a ontinuous random variable symmetri around 0. Note
that there are no onstraints on the oeÆients of the p olynomial
P
K
in order to make (3) a valid
pdf. In partiular, (3) denes exible generalized skew-elliptial (FGSE) distributions when the pdf
f
is
elliptially ontoured. For instane, exible generalized skew-normal (FGSN) distributions are dened
by:
2
p
(
x
;
;
)(
P
K
(
A
(
x
)))
;
(4)
and exible generalized skew-
t
(FGST) distributions are dened by:
2
t
p
(
x
;
;
;
)
T
(
P
K
(
A
(
x
));
)
;
(5)
where we use the Choleski deomposition
1
=
A
T
A
,
t
p
denotes a
p
-dimensional multivariate
t
pdf,
and
T
denotes a univariate
t
df, both with degrees of freedom
. Note that we ould use , or any
other symmetri df, instead of
T
for the skewing funtion in (5). In pratie, a popular hoie for the
df
H
would b e or the univariate df orresponding to the symmetri p df
f
. Eetively, the following
proposition shows that FSS distributions an approximate skew-symmetri distributions arbitrarily well.
Prop osition 1
Let the lass of exible skew-symmetri (FSS) distributions onsist of distributions with
pdf given in (3) and the lass of skew-symmetri (SS) distributions of distributions with pdf given in (1),
where
f
2
C
0
(
R
p
)
in both lasses and
is ontinuous. Then the lass of FSS distributions is dense in
the lass of SS distributions under the
L
1
norm.
Pro of
: An arbitrary distribution in the SS lass an be written as 2
f
(
x
)
H
(
w
(
x
)), where
f
and
H
are ontinuous,
H
1
exists, and
w
is a ontinuous o dd funtion. Beause
f
2
C
0
(
R
p
), for any arbitrary
>
0, we an nd a ompat set
D
whih is symmetri around
(if
x
2
D
then
x
2
D
), suh that
for any
x
=
2
D
,
f
(
x
)
< =
4. Thus, for any
x
=
2
D
,
j
2
f
(
x
)
(
x
)
2
f
(
x
)
H
(
P
((
x
))
j
<
for any odd p olynomial
P
.
Sine
f
is ontinuous,
f
is bounded on
D
. We denote the bound by
C
, i.e.
f
(
x
)
C
for any
x
2
D
. We use
D
1
to denote the image spae of
w
, i.e.
D
1
=
f
w
(
x
)
j
x
2
D
g
. Beause of the
ontinuity of
w
, whih is a result of the ontinuity of b oth
H
and
,
D
1
is also ompat. The ontinuous
funtion
H
is uniformly ontinuous on the ompat set
D
1
. Hene there exists
>
0 suh that for
any
y
1
,
y
2
2
D
1
and
j
y
1
y
2
j
<
, we get
j
H
(
y
1
)
H
(
y
2
)
j
< =
(2
C
). From the Stone-Weierstrass
theorem (see e.g. Rudin, 1973, p. 115), there exists a polynomial
P
suh that
j
w
(
x
)
P
(
x
)
j
<
for any
x
2
D
. We deomp ose
P
into an even term
P
e
and an odd term
P
o
, i.e.
P
=
P
e
+
P
o
.
Then
j
w
(
x
)
P
e
(
x
)
P
o
(
x
)
j
<
and
j
w
(
x
)
P
e
(
x
)
P
o
(
x
)
j
<
. Beause
w
and
P
o
are odd, and
P
e
is even, we get
j
w
(
x
)
P
e
(
x
) +
P
o
(
x
)
j
<
. Notie that
2
j
w
(
x
)
P
o
(
x
)
j j
w
(
x
)
P
e
(
x
)
P
o
(
x
)
j
+
j
w
(
x
)
P
e
(
x
) +
P
o
(
x
)
j
<
2
,
3

so
j
w
(
x
)
P
o
(
x
)
j
<
. Combining these results, we know that for an arbitrary member
2
f
(
x
)
H
(
w
(
x
)) in SS and an arbitrary
>
0, we an nd a member 2
f
(
x
)
H
(
P
o
(
x
)) in
FSS suh that
j
2
f
(
x
)
H
(
w
(
x
))
2
f
(
x
)
H
(
P
o
(
x
))
j
<
for any
x
2
D
.
Hene FSS is dense in SS with resp et to the
L
1
norm.
Remark 1
The requirement
f
2
C
0
(
R
p
)
in proposition 1 an be relaxed to al low that
f
has a nite
number,
m
say, of poles. In this ase, FSS is dense in SS with respet to almost uniform onvergene
(uniform in a set whose omplement is of measure arbitrarily smal l). Indeed, let
R
p
(
r
)
denote
R
p
minus
the union of
m
open bal ls of radius
r
entered at the
m
poles. Then FSS is dense in SS on
R
p
(
r
)
under
the
L
1
norm. Letting
r
!
0
, the result fol lows.
Proposition 1 shows in partiular that the lass of generalized skew-elliptial, skew
t
, and skew-
normal distributions an b e approximated arbitrarily well by their exible versions.
3 Flexibility and multimodality
In Figure 1, we illustrate the shap e exibility of the FGSN distribution in the univariate ase. Its pdf
for
K
= 3 is dened by:
2
1
(
x
;
;
2
)(
(
x
)
=
+
(
x
)
3
=
3
)
:
(6)
Figure 1 should b e here.
Figure 1(a) depits the p df of the FGSN model for
= 0,
2
= 1,
= 4, and
= 0, i.e. it redues
to Azzalini's (1985) univariate skew-normal distribution. However, when
6
= 0, the p df (6) an exhibit
bimodality as shown in Figure 1(b) with
= 1, and
=
1. In general, as the degree
K
of the o dd
polynomial in the skewing funtion beomes large, the number of mo des allowed in the p df inreases,
thus induing a greater exibility in the available shapes. Unfortunately, the number of modes depends
on the degree
K
of the o dd p olynomial, on the symmetri pdf
f
, and on the df
H
of the skewing
funtion
K
in a omplex fashion. Indeed, even for the univariate situation given by
p
= 1, the mo des
are determined by zeros of the rst derivative of the FSS distribution (3) given by:
2
f
0
(
x
)
H
(
P
K
(
x
)) + 2
f
(
x
)
H
0
(
P
K
(
x
))
P
0
K
(
x
)
;
(7)
for whih the number of zeros annot b e easily omputed. Even with restritions to some sp ei
f
and
H
funtions, a general statement on the relation between the number of mo des and the order of the
polynomial seems not available. However, in the univariate ase, if we onsider a normal pdf
f
=
1
and
a standard normal df
H
= with an o dd p olynomial of order
K
= 3, we have the following proposition.
Prop osition 2
The lass of exible generalized skew-normal (FGSN) distributions with pdf
2
1
(
x
;
;
2
)(
(
x
)
=
+
(
x
)
3
=
3
)
has at most 2 modes.
4

Pro of
: Without loss of generality, we an set
= 0,
= 1, assume
>
0, and only need to prove that
(
x
) = 2
(
x
)(
x
+
x
3
) has at most two modes. We prove this by ontradition. If
(
x
) has more
than two mo des, then
0
(
x
) has at least ve zeros. In the following pro of, we show that this annot b e
the ase. We have
0
(
x
) = 2
(
x
)((
+ 3
x
2
)
(
x
+
x
3
)
x
(
x
+
x
3
)) and need to onsider three
ases:
ase 1:
= 0
We write
0
(
x
) = 2
x
(
x
)
(
x
), where
(
x
) = 3
x
(
x
3
)
(
x
3
). We an verify that
0
(
x
) =
3
(
x
3
)
1
(
y
) where
y
=
x
2
and
1
(
y
) = 1
y
3
2
y
3
. Sine
1
(
y
) is a dereasing funtion on
y
0,
0
(
x
) has at most two zeros. Thus,
(
x
) has at most three zeros, hene
0
(
x
) has at most four
zeros.
ase 2:
>
0
Notie that
0
(
x
)
>
0 for
x
0. For
1
(
x
) =
0
(
x
)
=
(2
x
(
x
)) =
(
x
+
x
3
)(
+ 3
x
2
)
=x
(
x
+
x
3
),
we get
0
1
(
x
) =
(
x
+
x
3
)
=
(
9
x
2
)
2
(
y
), where
y
=
+ 3
x
2
>
0 and
2
(
y
) =
y
4
+
y
3
+ (3
2
2
)
y
2
(3
+ 9
)
y
+ 18

. Sine
00
2
(
y
) = 12
y
2
+ 6
y
+ (6
4
2
) has at most 1 positive zero, and
0
2
(
y
) = 4
y
3
+ 3
y
2
+ (6
4
2
)
y
(3
+ 9
)
<
0 at
y
= 0, we know that
0
2
(
y
) has at most one positive
zero. Thus
2
(
y
) has at most 2 positive zeros. This means
0
1
(
x
) has at most two p ositive zeros, so
0
(
x
)
has at most three (p ositive) zeros.
ase 3:
<
0
Notie that
0
(
x
)
<
0 for
x
2
[0
;
p
=
(3
) and
0
(
x
)
>
0 for
x
2
(
1
;
p
=
(3
) ℄. So we only
look for solutions
x
2
(
p
=
(3
)
;
1
) and
x
2
(
p
=
(3
)
;
0). Let
y
=
+ 3
x
2
, then there is a one
to one mapping b etween the
x
in the ab ove range and
y
2
(
;
1
). Let
1
(
x
) and
2
(
y
) have the same
expressions as in ase 2. We have that
2
(
y
) has at most four zeros sine it is a fourth order p olynomial.
Notie that
2
(
)
<
0
;
2
(
1
)
>
0, so
2
(
y
) has at most three zeros in (
;
1
). This means
0
1
(
x
) has
at most three zeros, hene
0
(
x
) has at most four zeros.
Figure 1 illustrates the result of prop osition 2 by depiting a unimo dal and a bimo dal pdf from the
univariate FGSN with
K
= 3. For
K
= 1, the p df is always unimodal as was already noted by Azzalini
(1985) for the univariate skew-normal distribution.
Next we investigate the exibility of the FGSN distribution in the bivariate ase. Its pdf for
K
= 3,
=
0
, and =
I
2
is given by:
2
2
(
x
1
; x
2
;
0
; I
2
)(
1
x
1
+
2
x
2
+
1
x
3
1
+
2
x
3
2
+
3
x
2
1
x
2
+
4
x
1
x
2
2
)
:
(8)
Figure 2 should b e here.
Figure 2 depits the ontours of four dierent pdfs (8) for various ombinations of values of the
skewness parameters
1
,
2
,
1
,
2
,
3
, and
4
. In partiular, for
1
=
2
=
3
=
4
= 0, the
pdf is exatly the bivariate skew-normal proposed by Azzalini & Dalla Valle (1996), and known to be
unimodal, see Figure 2(a). However, Figures 2(b)-(d) show that many dierent distributional shap es an
be obtained with the parameters
1
; : : : ;
4
, in partiular bimodal and trimo dal distributions. Additional
5

Citations
More filters
Journal ArticleDOI

The Skew-normal Distribution and Related Multivariate Families.

TL;DR: In this paper, the authors provide an introductory overview of a portion of distribution theory which is currently under intense development and illustrate connections with various areas of application, including selective sampling, models for compositional data, robust methods, some problems in econometrics, non-linear time series, especially in connection with financial data, and more.
Book

The Skew-Normal and Related Families

TL;DR: This comprehensive treatment, blending theory and practice, will be the standard resource for statisticians and applied researchers, and Assuming only basic knowledge of (non-measure-theoretic) probability and statistical inference, the book is accessible to the wide range of researchers who use statistical modelling techniques.
Journal ArticleDOI

Robust Likelihood Methods Based on the Skew-t and Related Distributions

TL;DR: In this paper, the robustness problem is tackled by adopting a parametric class of distributions flexible enough to match the behaviour of the observed data, and the skew-t distribution is explored in more detail and reasons to adopt this option as a sensible general-purpose compromise between robustness and simplicity, both of treatment and interpretation of the outcome.
Journal ArticleDOI

Sinh-arcsinh distributions

TL;DR: The sinh-arcsinh transformation as discussed by the authors was introduced to a generating distribution with no parameters other than location and scale, usually the normal, by applying it to a new family of sinh -normal distributions, which allows for tailweights that are both heavier and lighter than those of the generating distribution.
References
More filters
Journal ArticleDOI

Generalized skew-elliptical distributions and their quadratic forms

TL;DR: The authors introduced generalized skew-elliptical distributions (GSE), which include the multivariate skew-normal, skew-t and skew-Cauchy distributions as special cases, and showed that the distribution of any even function in GSE random vectors does not depend on the weight function.
Journal ArticleDOI

The skew-Cauchy distribution

TL;DR: In this article, the conditional distribution of X given Y > y 0, for some fixed value of y 0 ∈ R, is the multivariate skewed Cauchy, in which there is truncation with respect to Y : this is but one of a general class of skewed distributions for which the initial distribution is symmetric.

A skew-symmetric representation of multivariate distributions

TL;DR: In this article, a family of multivariate skew-symmetric distributions is studied and it is shown that any multivariate probability density function admits a skew- symmetric representation.
Posted Content

Survey of developments in the theory of continuous skewed distributions

TL;DR: In this article, the authors trace developments in the theory of skewed continuous distributions (univariate and multivariate) which commenced in the late 19 th century and after some dormant period during the most of the 20 th century, were invigorated in the middle 80's of 20th century and has become in the last 20 years an
Related Papers (5)