How can the HMT model be used to model the neighboring states?

In particular it has been shown that using only a mixture of two Gaussians, the HMT can already achieve satisfactory accuracy in wavelet coefficient modeling [7].
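As a rough illustration of why a two-component mixture can already describe heavy-tailed coefficients, the sketch below evaluates a two-state Gaussian mixture density and its kurtosis. It assumes zero-mean components, and the function names and parameter values are hypothetical choices for illustration, not values from the paper. It covers only the marginal mixture; the HMT additionally links the hidden states of related coefficients, which is not modeled here.

```python
import math

def gaussian_pdf(x, sigma):
    """Zero-mean Gaussian density with standard deviation sigma."""
    return math.exp(-0.5 * (x / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))

def mixture_pdf(x, p_small, sigma_small, sigma_large):
    """Two-state zero-mean Gaussian mixture density.

    p_small is the probability of the hidden low-variance state;
    the high-variance state has probability 1 - p_small.
    """
    return (p_small * gaussian_pdf(x, sigma_small)
            + (1.0 - p_small) * gaussian_pdf(x, sigma_large))

def mixture_kurtosis(p_small, sigma_small, sigma_large):
    """Kurtosis E[x^4] / E[x^2]^2 of the mixture (a single Gaussian gives 3)."""
    second = p_small * sigma_small ** 2 + (1.0 - p_small) * sigma_large ** 2
    fourth = 3.0 * (p_small * sigma_small ** 4
                    + (1.0 - p_small) * sigma_large ** 4)
    return fourth / second ** 2

# Hypothetical parameters, chosen only for illustration.
P_SMALL, SIGMA_SMALL, SIGMA_LARGE = 0.8, 1.0, 5.0
print("kurtosis:", mixture_kurtosis(P_SMALL, SIGMA_SMALL, SIGMA_LARGE))
```

A kurtosis above 3 indicates a sharper peak and heavier tails than a single Gaussian, which is the kind of non-Gaussian marginal the mixture is meant to capture.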

What is the reason for the horizontal distributions of the left sides of the plots?

If the authors assume that the horizontal distributions of the left sides of the plots are due to quantization errors and other sources of uncertainties dominating at small coefficient magnitudes, contourlet coefficients of natural images can be modeled according to some distributions with variances directly related to any linear combination of the magnitudes of their generalized neighborhoods.

What is the significance of joint statistics?

Joint statistics are particularly important because in the wavelet case, image processing algorithms exploiting joint statistics of coefficients [5]–[7],[11],[12] show significant improvements in performance over those that exploit marginal statistics alone [3],[8].

What is the significance of the mutual information estimation results for the three representative images?

Note that all images show significant mutual information across scale, space, and direction, which reinforces the authors' observation in Section III that coefficients are dependent on their generalized neighborhoods.
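A histogram-based mutual information estimate of the kind referred to here can be sketched as follows. This is a plain plug-in estimator with uniform bins; the estimator, bin count, and any bias correction used in the paper may differ. The data are synthetic stand-ins (a "child" value whose spread grows with the magnitude of its "parent"), and all names are hypothetical.

```python
import math
import random
from collections import Counter

def mutual_information_hist(xs, ys, bins=16):
    """Plug-in estimate of I(X; Y) in bits from a 2-D histogram of paired samples."""
    def to_bin(v, lo, hi):
        # Uniform binning over [lo, hi]; the maximum falls in the last bin.
        if hi == lo:
            return 0
        return min(int((v - lo) / (hi - lo) * bins), bins - 1)

    x_lo, x_hi = min(xs), max(xs)
    y_lo, y_hi = min(ys), max(ys)
    pairs = [(to_bin(x, x_lo, x_hi), to_bin(y, y_lo, y_hi))
             for x, y in zip(xs, ys)]
    n = len(pairs)
    joint = Counter(pairs)
    count_x = Counter(i for i, _ in pairs)
    count_y = Counter(j for _, j in pairs)
    # Sum p(i, j) * log2(p(i, j) / (p(i) p(j))) over occupied cells.
    return sum((c / n) * math.log2(c * n / (count_x[i] * count_y[j]))
               for (i, j), c in joint.items())

def synthetic_magnitudes(n=20000, seed=0):
    """Hypothetical stand-in data: a 'child' whose spread grows with |parent|."""
    rng = random.Random(seed)
    parent = [rng.gauss(0.0, 1.0) for _ in range(n)]
    child = [rng.gauss(0.0, 0.2 + abs(p)) for p in parent]
    unrelated = [rng.gauss(0.0, 1.0) for _ in range(n)]
    return ([abs(v) for v in parent], [abs(v) for v in child],
            [abs(v) for v in unrelated])

parent_mag, child_mag, unrelated_mag = synthetic_magnitudes()
mi_dependent = mutual_information_hist(parent_mag, child_mag)
mi_unrelated = mutual_information_hist(parent_mag, unrelated_mag)
print(f"dependent pair: {mi_dependent:.3f} bits, "
      f"unrelated pair: {mi_unrelated:.3f} bits")
```

The plug-in estimate is biased upward for finite samples, so the unrelated pair will not come out exactly zero; what matters in this sketch is the gap between the two values.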

Why are the contourlet coefficients defined over different supports?

The reason is that the basis functions corresponding to the vertical and horizontal subbands are defined over different supports [19].

What is the widely used family of Gaussian mixture models?

The authors consider the hidden Markov model (HMM) family [7], which is one of the most well-known and widely used families of Gaussian mixture models.

What is the expected effect of contourlets on the directional filter?

Again note that as more effective directional filters for contourlets are developed in the future, it is expected that for contourlets, and should further decrease.

What is the definition of a directional filter bank?

The directional filter bank is a critically sampled filter bank that decomposes images into a power-of-two number of directions.

What is the value of contourlets in image processing?

Both results suggest that contourlets can capture directional information very well, which is a highly valuable property in image processing.

(Open Access) Directional multiscale modeling of images using the contourlet transform (2006) | D.D.-Y. Po

Q: What have the authors contributed in "Directional multiscale modeling of images using the contourlet transform"?

The authors begin with a detailed study on the statistics of the contourlet coefficients of natural images, using histogram estimates of the marginal and joint distributions, and mutual information measurements to characterize the dependencies between coefficients. The study reveals the non-Gaussian marginal statistics and strong intra-subband, cross-scale, and cross-orientation dependencies of contourlet coefficients. Based on these statistics, the authors model contourlet coefficients using a hidden Markov tree (HMT) model that can capture all of their inter-scale, inter-orientation, and intra-subband dependencies. The authors experiment with this model in image denoising and texture retrieval applications, where the results are very promising.

IEEE TRANSACTIONS ON IMAGE PROCESSING 1

Directional Multiscale Modeling of Images using the Contourlet Transform

Duncan D.-Y. Po and Minh N. Do

Coordinated Science Lab and Beckman Institute

University of Illinois at Urbana-Champaign

Urbana IL 61801

Email: duncanpo@ifp.uiuc.edu, minhdo@uiuc.edu

Abstract

The contourlet transform is a new extension to the wavelet transform in two dimensions using nonseparable and directional filter banks. The contourlet expansion is composed of basis images oriented at varying directions in multiple scales, with flexible aspect ratios. With this rich set of basis images, the contourlet transform can effectively capture the smooth contours that are the dominant features in natural images with only a small number of coefficients. We begin with a detailed study on the statistics of the contourlet coefficients of natural images, using histogram estimates of the marginal and joint distributions, and mutual information measurements to characterize the dependencies between coefficients. The study reveals the non-Gaussian marginal statistics and strong intra-subband, cross-scale, and cross-orientation dependencies of contourlet coefficients. It is also found that conditioned on the magnitudes of their generalized neighborhood coefficients, contourlet coefficients can approximately be modeled as Gaussian variables. Based on these statistics, we model contourlet coefficients using a hidden Markov tree (HMT) model that can capture all of their inter-scale, inter-orientation, and intra-subband dependencies. We experiment with this model in image denoising and texture retrieval applications, where the results are very promising. In denoising, contourlet HMT outperforms wavelet HMT and other classical methods in terms of visual quality. In particular, it preserves edges and oriented features better than other existing methods. In texture retrieval, it shows improvements in performance over wavelet methods for various oriented textures.

January 22, 2004 DRAFT


I. INTRODUCTION

In image processing, it has been a common practice to use simple statistical models to describe images. Natural images tend to have certain common characteristics that make them look “natural.” The aim of statistical modeling is to capture these defining characteristics in a small number of parameters so that they can be used as prior information in image processing tasks such as compression and denoising. A simple, accurate and tractable model is an essential element in any successful image processing algorithm.

Images can be better modeled in the wavelet transform [1],[2] domain, which shows multiresolution and time-frequency localization properties, and in which energy density has a more local structure than in the image spatial domain. Initially, the wavelet transform was considered to be a good decorrelator for images, and thus wavelet coefficients were assumed to be independent and were simply modeled by marginal statistics [3]. Later it was realized that wavelet coefficients for natural images exhibit strong dependencies both across scales and between neighboring coefficients within a subband, especially around image edges. This gave rise to several successful joint statistical models in the wavelet domain [4]–[10], as well as improved image compression schemes [11]–[13].

The major drawback for wavelets in 2-D is their limited ability in capturing directional information. To counter this deficiency, researchers have most recently shifted their attention to multiscale and directional representations that can capture the intrinsic geometrical structures such as smooth directional contours in natural images. Some examples include the steerable pyramid [14], brushlets [15], complex wavelets [16], and the curvelet transform [17]. In particular, the curvelet transform, pioneered by Candès and Donoho, was shown to be optimal in a certain sense for functions in the continuous domain with curved singularities.

Inspired by curvelets, Do and Vetterli [18]–[20] developed the contourlet representation based on an efficient two-dimensional nonseparable filter bank that can deal effectively with images having smooth contours. Contourlets not only possess the main features of wavelets (namely, multiresolution and time-frequency localization), but also show a high degree of directionality and anisotropy. The main difference between contourlets and other multiscale directional systems is that contourlets allow for a different and flexible number of directions at each scale, while achieving nearly critical sampling. In addition, the contourlet transform employs iterated filter banks, which makes it computationally efficient.

In this work, we focus on image modeling in the contourlet domain. Our primary goal is to provide an extensive study on the statistics of contourlet coefficients in order to gain a thorough understanding of their properties. Then we develop an appropriate model that can capture these properties, which can be useful in future contourlet applications, including compression, denoising, and feature extraction. Similar to wavelet-based models, contourlet-based models need to take into account the coefficients’ dependencies across scale and space. However, as a “true” two-dimensional representation, contourlets allow us to also model the coefficients’ dependencies across directions. In other words, contourlet modeling allows us to jointly model all three fundamental parameters of visual information, namely: scale, space, and direction.

The rest of the paper is organized as follows. Section II introduces the basics of contourlets including their transform algorithm, structure, properties, and coefficient relationships. In Section III, we study the marginal and joint statistics of contourlet coefficients of natural images via histograms. Section IV examines the dependencies between coefficients using mutual information. Inspired by these results, we develop a hidden Markov tree (HMT) model for the contourlet transform in Section V. In Section VI, we apply the contourlet HMT model in denoising and texture retrieval. Finally, a conclusion is presented in Section VII.

II. BACKGROUND

A. Contourlets

Do and Vetterli developed contourlets in [18]–[20]. Their primary aim was to construct a sparse efficient decomposition for two-dimensional signals that are piecewise smooth away from smooth contours. Such signals resemble natural images of ordinary objects and scenes, with the discontinuities as boundaries of objects. These discontinuities, referred to as edges, are gathered along one-dimensional smooth contours. Two-dimensional wavelets, with basis functions shown in Figure 1(a), lack directionality and are only good at catching zero-dimensional or point discontinuities, resulting in largely inefficient decompositions. For example, as shown in Figure 1(c), it would take many wavelet coefficients to accurately represent even one simple one-dimensional curve.

Contourlets were developed as an improvement over wavelets in terms of this inefficiency. The resulting transform has the multiresolution and time-frequency localization properties of wavelets, but also shows a very high degree of directionality and anisotropy. Precisely, the contourlet transform involves basis functions that are oriented at a power-of-two number of directions with flexible aspect ratios, with some examples shown in Figure 1(b). With such richness in the choice of basis functions, contourlets can represent any one-dimensional smooth edge with close to optimal efficiency. For instance, Figure 1(d) shows that compared with wavelets, contourlets can represent a smooth contour with far fewer coefficients.


Fig. 1. Contourlet and wavelet representation for images. (a) Basis functions of 2-D wavelets. (b) Basis functions of contourlets. (c) Wavelets have square supports and can only capture points. (d) Contourlets have elongated supports and can capture line segments. Contourlets thus can effectively represent a smooth contour with fewer coefficients.

Fig. 2. (a) Pyramidal directional filter bank structure that implements the discrete contourlet transform. (b) A typical contourlet frequency partition scheme.

Fig. 3. (a) The “Peppers” image. (b) Contourlet representation of (a). (c) The “Goldhill” image. (d) Contourlet representation of (c). (i)–(v) represent coarse to fine scales respectively. Small coefficients are colored black while large coefficients are colored white.

Contourlets are implemented by the pyramidal directional filter bank (PDFB), which decomposes images into directional subbands at multiple scales [18]–[20]. The PDFB is a cascade of a Laplacian pyramid [21] and a directional filter bank [22] as shown in Figure 2(a). The directional filter bank is a critically sampled filter bank that decomposes images into a power-of-two number of directions. Due to the PDFB’s cascaded structure, the multiscale and directional decompositions are independent of each other. One can decompose each scale into any power-of-two number of orientations, and different scales can be divided into different numbers of orientations. This decomposition property makes contourlets a unique transform that can achieve a high level of flexibility in decomposition while being close to critically sampled (up to 33% overcomplete, which comes from the Laplacian pyramid). Other multiscale directional transforms have either a fixed number of directions, such as complex wavelets [16], or are significantly overcomplete (depending on the number of directions), such as the steerable pyramid [14]. Figure 2(b) shows a typical frequency division of the contourlet transform where the four scales are divided into four, four, eight and eight subbands from coarse to fine scales respectively. The fact that contourlets are close to critically sampled makes them especially promising in image compression.
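The "up to 33% overcomplete" figure can be checked with a short calculation. The sketch below assumes a Laplacian pyramid that downsamples by two in each dimension, so each coarser level holds one quarter as many samples as the previous one, and it relies on the statement above that the directional filter bank is critically sampled and so adds no further redundancy. The function names are hypothetical; the subband counts are the four, four, eight and eight of Figure 2(b).

```python
def laplacian_pyramid_redundancy(num_detail_scales):
    """Ratio of Laplacian pyramid coefficients to image pixels.

    Assumes dyadic downsampling by 2 in each dimension, so every coarser
    level holds 1/4 as many samples as the level before it.
    """
    # Detail (bandpass) images: full size, then 1/4, 1/16, ...
    detail = sum(0.25 ** j for j in range(num_detail_scales))
    # Final lowpass approximation image.
    approximation = 0.25 ** num_detail_scales
    return detail + approximation

def total_directional_subbands(directions_per_scale):
    """Number of directional subbands summed over all detail scales."""
    return sum(directions_per_scale)

# Partition of Figure 2(b): four detail scales, coarse to fine.
directions = [4, 4, 8, 8]
print(laplacian_pyramid_redundancy(len(directions)))  # 1.33203125
print(total_directional_subbands(directions))         # 24
```

The ratio is a partial sum of the geometric series 1 + 1/4 + 1/16 + ..., which stays below 4/3 for any number of scales; this is where the 33% bound comes from.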

Figure 3(a) shows the image “Peppers” and Figure 3(b) shows its contourlet representation. Similarly, Figure 3(c) shows the image “Goldhill” and Figure 3(d) shows its contourlet representation. In this particular decomposition, the image is divided into an approximation image (i) and four detail scales (ii), (iii), (iv), (v) from coarse to fine. Each detail scale is further partitioned into directional subbands according to the scheme in Figure 2(b). The two coarser scales are partitioned into four directional

Recently, a modiﬁed version of the contourlet scheme that is critically sampled was developed [23].



References

Elements of information theory

A theory for multiresolution signal decomposition: the wavelet representation

A wavelet tour of signal processing

Density estimation for statistics and data analysis

The Laplacian Pyramid as a Compact Image Code

