scispace - formally typeset
Open AccessJournal ArticleDOI

Flexible Class of Skew‐Symmetric Distributions

Yanyuan Ma, +1 more
- 01 Sep 2004 - 
- Vol. 31, Iss: 3, pp 459-468
Reads0
Chats0
TLDR
In this paper, a flexible class of skew-symmetric distributions for which the probab- ility density function has the form of a product of a symmetric density and a skewing function is proposed.
Abstract
We propose a flexible class of skew-symmetric distributions for which the probab- ility density function has the form of a product of a symmetric density and a skewing function. By constructing an enumerable dense subset of skewing functions on a compact set, we are able to consider a family of distributions, which can capture skewness, heavy tails and multimodality systematically. We present three illustrative examples for the fibreglass data, the simulated data from a mixture of two normal distributions and the Swiss bills dlata.

read more

Content maybe subject to copyright    Report

A Flexible Class of Skew-Symmetri Distributions
(running head: exible skew-symmetri distributions)
YANYUAN MA
North Carolina State University
MARC G. GENTON
North Carolina State University
ABSTRACT. We prop ose a exible lass of skew-symmetri distributions for whih the
probability density funtion has the form of a pro dut of a symmetri density and a skewing
funtion. By onstruting an enumerable dense subset of skewing funtions on a ompat
set, we are able to onsider a family of distributions whih an apture skewness, heavy
tails, and multimo dality systematially. We present three illustrative examples for the
b er-glass data, simulated data from a mixture of two normal distributions, and Swiss
bills data.
Key Words:
dense subset; generalized skew-elliptial; multimodality; skewness; skew-normal.
1 Intro dution
A popular approah to ahieve departures from normality onsists of modifying the probability density
funtion (p df ) of a random vetor in a multipliative fashion. Wang, Boyer, & Genton (2004) showed
that any
p
-dimensional multivariate pdf
g
(
x
) admits, for any xed loation parameter
2
R
p
, a unique
skew-symmetri (SS) representation:
g
(
x
) = 2
f
(
x
)
(
x
)
;
(1)
where
f
:
R
p
!
R
+
is a symmetri p df and
:
R
p
!
[0
;
1℄ is a skewing funtion satisfying
(
x
) =
1
(
x
). Vie-versa, any funtion
g
of the type dened by (1) is a valid pdf. By symmetri, we mean
f
(
x
) =
f
(
x
) and we will use \symmetri pdf " and the prop erty
f
(
x
) =
f
(
x
) interhangeably in
the sequel. Throughout this pap er, we restrit our interest on funtions
f
2
C
0
(
R
p
) and ontinuous
skewing funtions
(
x
), where
C
0
(
R
p
) denotes ontinuous funtions on
R
p
with the prop erty
f
(
x
)
!
0
when
k
x
k
2
! 1
, and
k k
2
denotes the
L
2
norm. Genton & Lop erdo (2002) onsidered the subfamily
of generalized skew-elliptial (GSE) distributions for whih the p df
f
in (1) is elliptially ontoured
rather than only symmetri. Many denitions of skewed distributions found in the literature an be
written in the form of a skew-symmetri distribution (1). For instane, Azzalini & Dalla Valle's (1996)
multivariate skew-normal distribution orresp onds to
f
(
x
) =
p
(
x
;
0
;
) and
(
x
) = (
T
x
), where
p
(
x
;
;
) is the
p
-dimensional multivariate normal pdf with mean vetor
and orrelation matrix ,
1

is the standard normal umulative distribution funtion (df ), and
is a shap e parameter ontrolling
skewness. Similarly, multivariate distributions suh as skew-
t
(Brano & Dey, 2001; Azzalini & Capitanio,
2003; Jones & Faddy, 2003; Sahu, Brano, & Dey, 2003), skew-Cauhy (Arnold & Beaver, 2000) and
other skew-elliptial ones (Azzalini & Capitanio, 1999; Brano & Dey, 2001; Sahu
et al.
, 2003) an be
represented by the skew-symmetri distribution (1) with appropriate hoies of
f
and
.
In this artile, we prop ose a exible lass of distributions (1) by onstruting an enumerable dense
subset of the skewing funtions
on a ompat set. The result is a family of distributions whih
an apture skewness, heavy tails, and multimodality systematially. The onstrution of the subset is
through p olynomials, whih has a similar avor as the seminonparametri (SNP) representation prop osed
by Gallant & Nyhka (1987). The latter is dened as the pro dut of the standard normal p df and the
square of a polynomial. The SNP distribution requires the oeÆients in the polynomial to b e onstrained
in order to yield a valid density. It also relies on rejetion sampling shemes to simulate random samples.
These diÆulties do not o ur with our onstrution.
The ontent of the pap er is organized as follows. In Setion 2, we desribe a subset of skewing
funtions based on o dd p olynomials and prove that it results in a dense subset of the skew-symmetri
distributions. In partiular, we dene exible skew-normal and skew-
t
distributions that an have more
than one mode. This is an essential property for some situations and provides an alternative to modeling
with mixtures of distributions. The exibility and p ossible multimodality of the new lass of distributions
is illustrated in Setion 3. We present three illustrative examples in Setion 4, and a disussion in Setion
5.
2 A dense subset of skew-symmetri distributions
In this setion, we onstrut a dense subset of skew-symmetri distributions through approximating the
skewing funtion
on a ompat set. Any ontinuous skewing funtion
an be written as:
(
x
) =
H
(
w
(
x
))
;
(2)
where
H
:
R
!
[0
;
1℄ is the df of a ontinuous random variable symmetri around 0, and
w
:
R
p
!
R
is an o dd ontinuous funtion, that is
w
(
x
) =
w
(
x
). In fat, for a hosen
H
suh that
H
1
exists,
w
(
x
) =
H
1
(
(
x
)) is a ontinuous odd funtion. This representation has been used by Azzalini &
Capitanio (2003) to dene ertain distributions by p erturbation of symmetry. Note however that the
representation (2) is not unique due to the many possible hoies of
H
.
Let
P
K
(
x
) b e an o dd p olynomial of order
K
. A p olynomial of order
K
in
R
p
is dened as a linear
ombination of terms of the form
Q
p
i
=1
x
r
i
i
, where
k
=
P
p
i
=1
r
i
K
. If eah term has an odd order (all
k
's are o dd), then the polynomial is alled an odd p olynomial, whereas if eah term has an even order
(all
k
's are even), it is alled an even polynomial. We dene exible skew-symmetri (FSS) distributions
2

by restriting (1) to:
2
f
(
x
)
K
(
x
)
;
(3)
where
K
(
x
) =
H
(
P
K
(
x
)) and
H
is any df of a ontinuous random variable symmetri around 0. Note
that there are no onstraints on the oeÆients of the p olynomial
P
K
in order to make (3) a valid
pdf. In partiular, (3) denes exible generalized skew-elliptial (FGSE) distributions when the pdf
f
is
elliptially ontoured. For instane, exible generalized skew-normal (FGSN) distributions are dened
by:
2
p
(
x
;
;
)(
P
K
(
A
(
x
)))
;
(4)
and exible generalized skew-
t
(FGST) distributions are dened by:
2
t
p
(
x
;
;
;
)
T
(
P
K
(
A
(
x
));
)
;
(5)
where we use the Choleski deomposition
1
=
A
T
A
,
t
p
denotes a
p
-dimensional multivariate
t
pdf,
and
T
denotes a univariate
t
df, both with degrees of freedom
. Note that we ould use , or any
other symmetri df, instead of
T
for the skewing funtion in (5). In pratie, a popular hoie for the
df
H
would b e or the univariate df orresponding to the symmetri p df
f
. Eetively, the following
proposition shows that FSS distributions an approximate skew-symmetri distributions arbitrarily well.
Prop osition 1
Let the lass of exible skew-symmetri (FSS) distributions onsist of distributions with
pdf given in (3) and the lass of skew-symmetri (SS) distributions of distributions with pdf given in (1),
where
f
2
C
0
(
R
p
)
in both lasses and
is ontinuous. Then the lass of FSS distributions is dense in
the lass of SS distributions under the
L
1
norm.
Pro of
: An arbitrary distribution in the SS lass an be written as 2
f
(
x
)
H
(
w
(
x
)), where
f
and
H
are ontinuous,
H
1
exists, and
w
is a ontinuous o dd funtion. Beause
f
2
C
0
(
R
p
), for any arbitrary
>
0, we an nd a ompat set
D
whih is symmetri around
(if
x
2
D
then
x
2
D
), suh that
for any
x
=
2
D
,
f
(
x
)
< =
4. Thus, for any
x
=
2
D
,
j
2
f
(
x
)
(
x
)
2
f
(
x
)
H
(
P
((
x
))
j
<
for any odd p olynomial
P
.
Sine
f
is ontinuous,
f
is bounded on
D
. We denote the bound by
C
, i.e.
f
(
x
)
C
for any
x
2
D
. We use
D
1
to denote the image spae of
w
, i.e.
D
1
=
f
w
(
x
)
j
x
2
D
g
. Beause of the
ontinuity of
w
, whih is a result of the ontinuity of b oth
H
and
,
D
1
is also ompat. The ontinuous
funtion
H
is uniformly ontinuous on the ompat set
D
1
. Hene there exists
>
0 suh that for
any
y
1
,
y
2
2
D
1
and
j
y
1
y
2
j
<
, we get
j
H
(
y
1
)
H
(
y
2
)
j
< =
(2
C
). From the Stone-Weierstrass
theorem (see e.g. Rudin, 1973, p. 115), there exists a polynomial
P
suh that
j
w
(
x
)
P
(
x
)
j
<
for any
x
2
D
. We deomp ose
P
into an even term
P
e
and an odd term
P
o
, i.e.
P
=
P
e
+
P
o
.
Then
j
w
(
x
)
P
e
(
x
)
P
o
(
x
)
j
<
and
j
w
(
x
)
P
e
(
x
)
P
o
(
x
)
j
<
. Beause
w
and
P
o
are odd, and
P
e
is even, we get
j
w
(
x
)
P
e
(
x
) +
P
o
(
x
)
j
<
. Notie that
2
j
w
(
x
)
P
o
(
x
)
j j
w
(
x
)
P
e
(
x
)
P
o
(
x
)
j
+
j
w
(
x
)
P
e
(
x
) +
P
o
(
x
)
j
<
2
,
3

so
j
w
(
x
)
P
o
(
x
)
j
<
. Combining these results, we know that for an arbitrary member
2
f
(
x
)
H
(
w
(
x
)) in SS and an arbitrary
>
0, we an nd a member 2
f
(
x
)
H
(
P
o
(
x
)) in
FSS suh that
j
2
f
(
x
)
H
(
w
(
x
))
2
f
(
x
)
H
(
P
o
(
x
))
j
<
for any
x
2
D
.
Hene FSS is dense in SS with resp et to the
L
1
norm.
Remark 1
The requirement
f
2
C
0
(
R
p
)
in proposition 1 an be relaxed to al low that
f
has a nite
number,
m
say, of poles. In this ase, FSS is dense in SS with respet to almost uniform onvergene
(uniform in a set whose omplement is of measure arbitrarily smal l). Indeed, let
R
p
(
r
)
denote
R
p
minus
the union of
m
open bal ls of radius
r
entered at the
m
poles. Then FSS is dense in SS on
R
p
(
r
)
under
the
L
1
norm. Letting
r
!
0
, the result fol lows.
Proposition 1 shows in partiular that the lass of generalized skew-elliptial, skew
t
, and skew-
normal distributions an b e approximated arbitrarily well by their exible versions.
3 Flexibility and multimodality
In Figure 1, we illustrate the shap e exibility of the FGSN distribution in the univariate ase. Its pdf
for
K
= 3 is dened by:
2
1
(
x
;
;
2
)(
(
x
)
=
+
(
x
)
3
=
3
)
:
(6)
Figure 1 should b e here.
Figure 1(a) depits the p df of the FGSN model for
= 0,
2
= 1,
= 4, and
= 0, i.e. it redues
to Azzalini's (1985) univariate skew-normal distribution. However, when
6
= 0, the p df (6) an exhibit
bimodality as shown in Figure 1(b) with
= 1, and
=
1. In general, as the degree
K
of the o dd
polynomial in the skewing funtion beomes large, the number of mo des allowed in the p df inreases,
thus induing a greater exibility in the available shapes. Unfortunately, the number of modes depends
on the degree
K
of the o dd p olynomial, on the symmetri pdf
f
, and on the df
H
of the skewing
funtion
K
in a omplex fashion. Indeed, even for the univariate situation given by
p
= 1, the mo des
are determined by zeros of the rst derivative of the FSS distribution (3) given by:
2
f
0
(
x
)
H
(
P
K
(
x
)) + 2
f
(
x
)
H
0
(
P
K
(
x
))
P
0
K
(
x
)
;
(7)
for whih the number of zeros annot b e easily omputed. Even with restritions to some sp ei
f
and
H
funtions, a general statement on the relation between the number of mo des and the order of the
polynomial seems not available. However, in the univariate ase, if we onsider a normal pdf
f
=
1
and
a standard normal df
H
= with an o dd p olynomial of order
K
= 3, we have the following proposition.
Prop osition 2
The lass of exible generalized skew-normal (FGSN) distributions with pdf
2
1
(
x
;
;
2
)(
(
x
)
=
+
(
x
)
3
=
3
)
has at most 2 modes.
4

Pro of
: Without loss of generality, we an set
= 0,
= 1, assume
>
0, and only need to prove that
(
x
) = 2
(
x
)(
x
+
x
3
) has at most two modes. We prove this by ontradition. If
(
x
) has more
than two mo des, then
0
(
x
) has at least ve zeros. In the following pro of, we show that this annot b e
the ase. We have
0
(
x
) = 2
(
x
)((
+ 3
x
2
)
(
x
+
x
3
)
x
(
x
+
x
3
)) and need to onsider three
ases:
ase 1:
= 0
We write
0
(
x
) = 2
x
(
x
)
(
x
), where
(
x
) = 3
x
(
x
3
)
(
x
3
). We an verify that
0
(
x
) =
3
(
x
3
)
1
(
y
) where
y
=
x
2
and
1
(
y
) = 1
y
3
2
y
3
. Sine
1
(
y
) is a dereasing funtion on
y
0,
0
(
x
) has at most two zeros. Thus,
(
x
) has at most three zeros, hene
0
(
x
) has at most four
zeros.
ase 2:
>
0
Notie that
0
(
x
)
>
0 for
x
0. For
1
(
x
) =
0
(
x
)
=
(2
x
(
x
)) =
(
x
+
x
3
)(
+ 3
x
2
)
=x
(
x
+
x
3
),
we get
0
1
(
x
) =
(
x
+
x
3
)
=
(
9
x
2
)
2
(
y
), where
y
=
+ 3
x
2
>
0 and
2
(
y
) =
y
4
+
y
3
+ (3
2
2
)
y
2
(3
+ 9
)
y
+ 18

. Sine
00
2
(
y
) = 12
y
2
+ 6
y
+ (6
4
2
) has at most 1 positive zero, and
0
2
(
y
) = 4
y
3
+ 3
y
2
+ (6
4
2
)
y
(3
+ 9
)
<
0 at
y
= 0, we know that
0
2
(
y
) has at most one positive
zero. Thus
2
(
y
) has at most 2 positive zeros. This means
0
1
(
x
) has at most two p ositive zeros, so
0
(
x
)
has at most three (p ositive) zeros.
ase 3:
<
0
Notie that
0
(
x
)
<
0 for
x
2
[0
;
p
=
(3
) and
0
(
x
)
>
0 for
x
2
(
1
;
p
=
(3
) ℄. So we only
look for solutions
x
2
(
p
=
(3
)
;
1
) and
x
2
(
p
=
(3
)
;
0). Let
y
=
+ 3
x
2
, then there is a one
to one mapping b etween the
x
in the ab ove range and
y
2
(
;
1
). Let
1
(
x
) and
2
(
y
) have the same
expressions as in ase 2. We have that
2
(
y
) has at most four zeros sine it is a fourth order p olynomial.
Notie that
2
(
)
<
0
;
2
(
1
)
>
0, so
2
(
y
) has at most three zeros in (
;
1
). This means
0
1
(
x
) has
at most three zeros, hene
0
(
x
) has at most four zeros.
Figure 1 illustrates the result of prop osition 2 by depiting a unimo dal and a bimo dal pdf from the
univariate FGSN with
K
= 3. For
K
= 1, the p df is always unimodal as was already noted by Azzalini
(1985) for the univariate skew-normal distribution.
Next we investigate the exibility of the FGSN distribution in the bivariate ase. Its pdf for
K
= 3,
=
0
, and =
I
2
is given by:
2
2
(
x
1
; x
2
;
0
; I
2
)(
1
x
1
+
2
x
2
+
1
x
3
1
+
2
x
3
2
+
3
x
2
1
x
2
+
4
x
1
x
2
2
)
:
(8)
Figure 2 should b e here.
Figure 2 depits the ontours of four dierent pdfs (8) for various ombinations of values of the
skewness parameters
1
,
2
,
1
,
2
,
3
, and
4
. In partiular, for
1
=
2
=
3
=
4
= 0, the
pdf is exatly the bivariate skew-normal proposed by Azzalini & Dalla Valle (1996), and known to be
unimodal, see Figure 2(a). However, Figures 2(b)-(d) show that many dierent distributional shap es an
be obtained with the parameters
1
; : : : ;
4
, in partiular bimodal and trimo dal distributions. Additional
5

Citations
More filters
Posted Content

Finite Mixtures of Multivariate Skew Laplace Distributions

TL;DR: In this paper, finite mixtures of multivariate skew Laplace distributions are proposed to model both skewness and heavy-tailedness in the heterogeneous data sets, and the maximum likelihood estimators for the parameters of interest are obtained by using the EM algorithm.
Proceedings Article

Nonlinear State Estimation Using Skew-Symmetric Representation of Distributions

TL;DR: The proposed skew-Gaussian filter is more general than traditional Gaussian filters and LMMSE-based nonlinear filters, which propagate only the first two moments, and can outperform conventional nonlinear filtering methods.
Journal ArticleDOI

Robust inference in linear mixed model with skew normal-symmetric error

TL;DR: In this paper, the robustness of linear mixed effects models with general skew normal-symmetric (SNS) error is considered and several properties of the SNS distributions are obtained.
Journal Article

Generalizations of the Skew t-Normal Distribution and their Properties

TL;DR: In this paper, several generalizations of the skew t-normal distribution, and some of their properties are considered, and several theorems for constructing each generalized skew tnormal distribution are presented.
Journal ArticleDOI

Modeling skew-symmetric distributions using B-spline and penalties

TL;DR: In this paper, a new flexible procedure for modeling skew-symmetric (SS) distributions via B-splines is proposed, which follows a penalized likelihood estimation method to avoid overfitting.
References
More filters
Book

Functional analysis

Walter Rudin
Journal Article

A class of distributions which includes the normal ones

TL;DR: In this paper, a nouvelle classe de fonctions de densite dependant du parametre de forme λ, telles que λ=0 corresponde a la densite normale standard.
Book

Smoothing Methods in Statistics

TL;DR: In this article, a nonparametric/parametric Compromise is used to improve the kernel density estimator, and the effect of simple Density Estimators is discussed.
Journal ArticleDOI

The multivariate skew-normal distribution

Adelchi Azzalini, +1 more
- 01 Dec 1996 - 
TL;DR: In this article, a multivariate parametric family such that the marginal densities are scalar skew-normal is introduced, and its properties are studied with special emphasis on the bivariate case.
Journal ArticleDOI

Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t‐distribution

TL;DR: In this paper, a fairly general procedure is studied to perturb a multivariate density satisfying a weak form of multivariate symmetry, and to generate a whole set of non-symmetric densities.
Related Papers (5)