The contourlet transform: an efficient directional multiresolution image representation
Summary (5 min read)
Introduction
- This work was supported in part by the US National Science Foundation under Grant CCR-0237633 and the Swiss National Science Foundation under Grant 20-63664.00.
- As a result of a separable extension from 1-D bases, wavelets in 2-D are good at isolating the discontinuities at edge points, but will not “see” the smoothness along the contours.
- The new style painter, on the other hand, exploits effectively the smoothness of the contour by making brush strokes with different elongated shapes and in a variety of directions following the contour.
- More importantly, this result suggests that for a computational image representation to be efficient, it should based on a local, directional, and multiresolution expansion.
A. Concept
- Comparing the wavelet scheme with the new scheme shown in Figure 1, the authors see that the improvement of the new scheme can be attributed to the grouping of nearby wavelet coefficients, since they are locally correlated due to the smoothness of the contours.
- In essence, the authors first use a wavelet-like transform for edge detection, and then a local directional transform for contour segment detection.
- The authors proposed a double filter bank structure (see Figure 7) [22] for obtaining sparse expansions for typical images having smooth contours.
- The overall result is an image expansion using basic elements like contour segments, and thus are named contourlets.
- In the frequency domain, the contourlet transform provides a multiscale and directional decomposition.
B. Pyramid frames
- One way to obtain a multiscale decomposition is to use the Laplacian pyramid (LP) introduced by Burt and Adelson [23].
- The LP decomposition at each level generates a downsampled lowpass version of the original and the difference between the original and the prediction, resulting in a bandpass image.
- Thus, the key in the DFB is to use an appropriate combination of shearing operators together with two-direction partition of quincunx filter banks at each node in a binary tree-structured filter bank, to obtain the desired 2-D spectrum division as shown in Figure 3(a).
- These basis functions have quasi-linear supports in space and span all directions.
- Furthermore, it can be shown [29] that if the building block filter bank in Figure 4 uses orthogonal filters, then the resulting DFB is orthogonal and (4) becomes an orthogonal basis.
D. Multiscale and directional decomposition: the discrete contourlet transform
- Combining the Laplacian pyramid and the directional filter bank, the authors are now ready to describe their combination into a double filter bank structure that was motivated in Section IIIA.
- That means, the j-th level of the LP decomposes the image aj−1[n] into a coarser image aj [n] and a detail image bj [n].
- The main properties of the discrete contourlet transform are stated in the following theorem.
- (5) For the DFB, its building block two-channel filter banks requires Ld operations per input sample.
- Since the multiscale and directional decomposition stages are decoupled in the discrete contourlet transform, the authors can have a different number of directions at different scales, thus offering a flexible multiscale and directional expansion.
IV. CONTOURLETS AND DIRECTIONAL MULTIRESOLUTION ANALYSIS
- As for the wavelet filter bank, the contourlet filter bank has an associated continuous-domain expansion in L2(R2) using the contourlet functions.
- The connection between the discrete contourlet transform and the continuousdomain contourlet expansion will be made precisely via a new multiresolution analysis framework that is similar to the link between wavelets and filter banks [2].
- The new elements in this framework are multidirection and its combination with multiscale.
- For simplicity, the authors will only consider the case with orthogonal filters, which leads to tight frames.
- The more general case with biorthogonal filters can be treated similarly.
C. Multiscale and multidirection: the contourlet expansion
- Applying the directional decomposition by the family (4) onto the detail subspace Proof:.
- Note that the number of DFB decomposition levels l can be different at different scales j, and in that case will be denoted by lj .
- As a result, the subspace W (l) j,k is defined on a rectangular grid with intervals 2j+l−2×2j or 2j×2j+l−2, depending on whether it is mostly horizontal or vertical (see Figure 9(b)).
- The reason that {λ(l)j,k,n}n∈Z2 is an overcomplete frame for W (l)j,k is because it uses the same sampling grid of the bigger subspace V (l)j−1,k . (22) The discrete filter w(l)k is roughly equal to the summation of convolutions between the directional filter d(l)k and bandpass filters fi’s, and thus it is a bandpass directional filter.
- 1) The contourlet expansions are defined on rectangular grids, and thus offer a seamless translation (as demonstrated in Theorem 3) to the discrete world, where image pixels are sampled on a rectangular grid.
V. CONTOURLET APPROXIMATION AND COMPRESSION
- The proposed contourlet filter bank and its associated continuous-domain frames in previous sections provide a framework for constructing general directional multiresolution image representations.
- Since their goal is to develop efficient or sparse expansions for images having smooth contours, the next important issues are: (1) what conditions should the authors impose on contourlets to obtain a sparse expansion for that class of images; and (2) how can they design filter banks that can lead to contourlet expansions satisfying those conditions.
- The authors consider the first issue in this paper; the second one is addressed in another paper [31].
A. Parabolic scaling
- In the curvelet construction, Candès and Donoho [4] point out that a key to achieving the correct nonlinear approximation behavior by curvelets is to select support sizes obeying the parabolic scaling relation for curves: width ∝ length2.
- The same scaling relation has been used in the study of Fourier integral operators and wave equations; for example, see [32].
- More precisely, with the local coordinate setup as in Figure 10(a), the authors can readily verify that the parametric representation of the discontinuity curve obeys u(v) ≈ κ 2 v2, when v ≈ 0, (27) where κ is the local curvature of the curve.
- As can be seen in the two pyramidal levels shown, as the support size of the basis element of the LP is reduced by four in each dimension, the number of directions of the DFB is doubled.
- Combining these two stages, the support sizes of the contourlet functions evolve in accordance to the desired parabolic scaling.
B. Directional vanishing moment
- For the wavelet case in 1-D, the wavelet approximation theory brought a novel condition into filter bank design, which earlier only focused on designing filters with good frequency selection properties.
- Intuitively, wavelets with vanishing moments are orthogonal to polynomial signals, and thus only a few wavelet basis functions around the discontinuities points would “feel” these discontinuities and lead to significant coefficients [33].
- The key feature of these images is that image edges are localized in both location and direction.
- Thus, it is desirable that only few contourlet functions whose supports intersect with a contour and align with the contour local direction would “feel” this discontinuity.
- The authors refer this requirement as the directional vanishing moment (DVM) condition.
C. Contourlet approximation
- In this subsection the authors will show that a contourlet expansion that satisfies the parabolic scaling and has sufficient DVMs (this will be defined precisely in Lemma 1) achieves the optimal nonlinear approximation rate for 2-D piecewise C2 smooth functions with discontinuities along C2 smooth curves.
- Therefore the authors need to bound the integral of 〈f, λj,k̃,n〉 outside region A to be the same order.
- In addition, since the discontinuity curve S has finite length, the number of type 1 coefficients with these indexes is mj,k̃ ∼ 1/dj,k̃,n ∼ 2−j/2k̃. (39) From (38), for a type 1 coefficient to have magnitude above a threshold ǫ, it is necessary that k̃ .
- Suppose that a compactly supported contourlet frame (24) satisfies the parabolic scaling condition (29), the contourlet functions λj,k satisfy the condition in Lemma 1, and the scaling function φ ∈ Cp has accuracy of order 2, also known as Theorem 4.
- Then for a function f that is C2 away from a C2 discontinuity curve, the M -term approximation by this frame achieves ‖f − f̂ M ‖22 ≤ C(logM)3M−2. (45) Remark 2: The approximation rate for s in (45) is the same as the approximation rate for curvelets, which was derived in [5] and [35].
D. Contourlet compression
- So far, the authors consider the approximation problem of contourlets by keeping the M largest coefficients.
- Specifically, from coarse to fine scales, significant contourlet coefficients are successively localized in both location (contourlets intersect with the discontinuity curve) and direction (intersected contourlets with direction close to the local direction of the discontinuity curve).
- Thus, using embedded tree structures for contourlet coefficients that are similar to the embedded zero-trees for wavelets [36], the authors can efficiently index the retained coefficients using 1 bit per coefficient.
- Instead of using fixed length coding for the quantized coefficients, a slight gain (in the log factor, but not the exponent of the rate-distortion function) can be obtained by variable length coding.
- In particular, the authors use the bit plane coding scheme [8] where coefficients with magnitude in the range (2l−1−L, 2l−L] are encoded with l bits.
VI. NUMERICAL EXPERIMENTS
- All experiments in this section use a wavelet transform with “9-7” biorthogonal filters [37], [38] and 6 decomposition levels.
- Apart from also being linear phase and nearly orthogonal, these fan filters are close to having the ideal frequency response and thus can approximate the directional vanishing moment condition.
- The number of DFB decomposition levels is doubled at every other finer scale and is equal to 5 at the finest scale.
- Note that in this case, both the wavelet and the contourlet transforms share the same detail subspaces.
- The difference is that each detail subspace.
B. Nonlinear approximation
- Next the authors compare the nonlinear approximation (NLA) performances of the wavelet and contourlet transforms.
- In these NLA experiments, for a given value M , the authors select the M - most significant coefficients in each transform domain, and then compare the reconstructed images from these sets of M coefficients.
- The authors expect that most of the refinement happens around the image edges.
- The wavelet scheme is seen to slowly capture contours by isolated “dots”.
- In addition, there is a significant gain of 1.46 dB in peak signalto-noise ratio (PSNR) for contourlets.
VII. CONCLUSION
- The authors constructed a discrete transform that provides a sparse expansion for typical images having smooth contours.
- Based on this observation, the authors developed a new filter bank structure, the contourlet filter bank, that can provide a flexible multiscale and directional decomposition for images.
- This connection is defined via a directional multiresolution analysis that provides successive refinements at both spatial and directional resolution.
- The authors make a change to a new coordinate (x, y) as shown in Figure 18, where λj,k̃ has vanishing moments along the x direction.
- Also, for the same order, the authors can parameterize the discontinuity line as y = αx.
Did you find this useful? Give us your feedback
Citations
2,603 citations
Cites background from "The contourlet transform: an effici..."
...A more recent, very interesting attempt at implementing low-redundancy curvelets was introduced by Do and Vetterli in [16]....
[...]
...Given the significance of such intermediate dimensional phenomena, there has been a vigorous research effort to provide better adapted alternatives by combining ideas from geometry with ideas from traditional multiscale analysis [17, 19, 4, 31, 14, 16]....
[...]
2,372 citations
Cites background from "The contourlet transform: an effici..."
...How can we wisely choose A to perform well on the signals we have in mind? One line of work considered choosing preconstructed dictionaries, such as undecimated wavelets [149], steerable wavelets [145, 37, 136], contourlets [38, 39, 40, 70, 71], curvelets [146, 12], and others [22, 123]....
[...]
1,900 citations
Cites background from "The contourlet transform: an effici..."
...These include adaptive schemes such as wedgelets [9], [10] and bandelets [11], and non-adaptive ones such as curvelets [12 ] and contourlets [13]....
[...]
...In particular, it can satisfy the a nisotropic scaling law — a key property in establishing the expansion nonlinear approxim ation behavior [12], [13]....
[...]
1,345 citations
Cites background from "The contourlet transform: an effici..."
...The dictionaries of this sort are characterized by an analytic formulation, and are usually supported by a set of optimality proofs and error rate bounds....
[...]
1,328 citations
References
17,693 citations
"The contourlet transform: an effici..." refers background or methods in this paper
...The more general case with biorthogonal filters can be treated similarly....
[...]
...Under certain regularity conditions, the lowpass synthesis filter in the iterated LP uniquely defines a unique scaling function that satisfies the following two-scale equation [2], [8] (7) Let (8) Then the family is an orthonormal basis for an approximation subspace at the scale ....
[...]
...In this double filter bank, the Laplacian pyramid (LP) [23] is first used to capture the point discontinuities, and then followed by a directional filter bank (DFB) [24] to link point discontinuities into linear structures....
[...]
12,923 citations
Additional excerpts
...Index Terms—Contourlets, contours, filter banks, geometric image processing, multidirection, multiresolution, sparse representation, wavelets....
[...]
8,153 citations
6,975 citations
"The contourlet transform: an effici..." refers methods in this paper
...One way to obtain a multiscale decomposition is to use the LP introduced by Burt and Adelson [23]....
[...]
...In this double filter bank, the Laplacian pyramid (LP) [23] is first used to capture the point discontinuities, and then followed by a directional filter bank (DFB) [24] to link point discontinuities into linear structures....
[...]
5,947 citations
"The contourlet transform: an effici..." refers background in this paper
...Furthermore, experiments in searching for the sparse components of natural images pro duced basis images that closely resemble the aforementione d characteristics of the visual cortex [7]....
[...]
Related Papers (5)
Frequently Asked Questions (10)
Q2. How can contourlet transforms be generalized to arbitrary tree structures?
the full binary tree decomposition of the DFB in the contourlet transform can be generalized to arbitrary tree structures, similar to the wavelet packets generalization of the wavelet transform [30].
Q3. What other well-known systems provide multiscale and directional image representations?
Several other well-known systems that provide multiscale and directional image representations include: 2-D Gabor wavelets [15], the cortex transform [16], the steerable pyramid [17], 2-D directional wavelets [18], brushlets [19], and complex wavelets [20].
Q4. What is the way to achieve the optimal approximation rate for contourlets?
With parabolic scaling and sufficient directional vanishing moments, the contourlet expansion is shown to achieve the optimal approximation rate for piecewise C2 smooth images with C2 smooth contours.
Q5. What is the M -term approximation error for wavelets?
For this class of functions, the best M -term approximation error (in L2-norm square) ‖f − f̂M‖22 using curvelets has a decay rate of O((logM)3M−2) [5], while for wavelets this rate is O(M−1) and for the Fourier basis it is O(M−1/2) [1], [2].
Q6. What is the way to improve a curvelet-like method?
for typical images with smooth contours, the authors expect a significant improvement of a curvelet-like method over wavelets, which is comparable to the improvement of wavelets over the Fourier basis for one-dimensional piecewise smooth signals.
Q7. what is the sum squared error due to discarding all except M largest type 1 coefficients?
Suppose that the scaling function φ has accuracy of order 2, which is equivalent to requiring the filter G(ejω1 , ejω2) in (7) to have a second-order zero at (π, π) [34], that is for all p1, p2 ∈ Z; 0 ≤ p1 + p2 < 2,∂p1ω1∂ p2 ω2G(e jω1 , ejω2) ∣ ∣ (π,π) = 0. (42)Then for f ∈ C2, the authors have‖f − PVjf‖2 ∼ (2j)2. (43)Thus, the sum squared error due to discarding all except M ∼ 2−2j type 2 contourlet coefficients down to scale 2j satisfiesE2(M) ∼ (2j)4 ∼M−2. (44)Combining (41) and (44), and using (32), the authors obtain the following result that characterizes the approximation power of contourlets.
Q8. What is the reason that (l)j,k is an overcomplete frame?
The reason that {λ(l)j,k,n}n∈Z2 is an overcomplete frame for W (l)j,k is because it uses the same sampling grid of the bigger subspace V (l)j−1,k .
Q9. What is the redundancy ratio of the discrete contourlet transform?
4) Suppose an lj-level DFB is applied at the pyramidal level j of the LP, then the basis images of the discrete contourlet transform (i.e. the equivalent filters of the contourlet filter bank) have an essential support size of width ≈ C2j and length ≈ C2j+lj−2.
Q10. What is the optimal nonlinear approximation rate for contourlet functions?
In this subsection the authors will show that a contourlet expansion that satisfies the parabolic scaling and has sufficient DVMs (this will be defined precisely in Lemma 1) achieves the optimal nonlinear approximation rate for 2-D piecewise C2 smooth functions with discontinuities along C2 smooth curves.