Journal ArticleDOI

A comparison of three total variation based texture extraction models

TL;DR: This paper qualitatively compares three recently proposed models for signal/image texture extraction based on total variation minimization: the Meyer, Vese-Osher (VO), and TV-L1 [12, 38, 2-4, 29-31] models.
About: This article was published in the Journal of Visual Communication and Image Representation on 2007-06-01 and is currently open access. It has received 68 citations to date. The article focuses on the topic of image texture.

Summary (2 min read)

1 Introduction

  • Let f be an observed image that contains texture and/or noise.
  • Texture is characterized as a repeated and meaningful structure of small patterns.
  • Noise is characterized as uncorrelated random patterns.
  • The rest of the image, called the cartoon, contains object hues and sharp edges; the additive setup is recalled below.
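
In symbols (our restatement of the summary's setup, with u the cartoon part and v the texture/noise part):

$$ f = u + v, \qquad u \in BV(\Omega) \ \text{(cartoon)}, \qquad v \ \text{(texture and/or noise)}, $$

and the models compared below differ only in how they measure the oscillating part v.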

1.1 The spaces BV and G

  • In image processing, the space BV and the total variation semi-norm were first used by Rudin, Osher, and Fatemi [33] to remove noise from images.
  • The ROF model is the precursor to a large number of image processing models of a similar form; its statement is recalled below.
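
For reference, the ROF model of Rudin, Osher, and Fatemi [33] (a standard statement, not quoted from the summary) computes the cartoon part as

$$ u = \arg\min_{u} \int_{\Omega} |\nabla u| \, dx + \lambda \, \| f - u \|_{L^2}^2, \qquad v = f - u, $$

where the first term is the total variation of u and λ > 0 trades regularity against fidelity.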

1.3 Second-order cone programming

  • Since a one-dimensional second-order cone corresponds to a semi-infinite ray, SOCPs can accommodate nonnegative variables.
  • In fact, if all cones are one-dimensional, then the above SOCP is just a standard-form linear program.
  • As is the case for linear programs, SOCPs can be solved in polynomial time by interior point methods.
  • This is the approach that the authors take to solve the TV-based cartoon-texture decomposition models in this paper; a small modeling sketch is given after this list.
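
To make the SOCP route concrete, here is a minimal sketch (our illustration using the open CVXPY modeling package, not the authors' solver setup; the image f, grid size, and weight lam are placeholder assumptions). The TV and squared-norm terms are second-order-cone representable, so CVXPY hands the problem to an interior-point conic solver:

    # Minimal sketch (assumption: CVXPY installed with a conic solver such
    # as ECOS or Clarabel); poses a discrete ROF-style TV problem as a cone
    # program, in the spirit of the SOCP approach described above.
    import numpy as np
    import cvxpy as cp

    f = np.random.rand(64, 64)      # placeholder observed image (assumption)
    lam = 10.0                      # placeholder fidelity weight (assumption)

    u = cp.Variable(f.shape)
    problem = cp.Problem(cp.Minimize(cp.tv(u) + lam * cp.sum_squares(u - f)))
    problem.solve()                 # interior-point conic solve

    cartoon = u.value
    texture = f - cartoon           # v = f - u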

2.2.3 The Vese-Osher (VO) model

  • This is equivalent to solving the residual-free version (45) below.
  • The authors chose to solve the latter in their numerical tests because using a large λ in (44) makes it difficult to solve its SOCP accurately; a common statement of the VO model is sketched below.
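
For context, our paraphrase of the Vese-Osher formulation from the literature (the equation numbers (44)/(45) are the paper's): the VO model approximates Meyer's G-norm by writing the texture as a divergence, v = div g, and solving

$$ \min_{u,\,g} \int_{\Omega} |\nabla u| \, dx \;+\; \lambda \, \big\| f - u - \operatorname{div} g \big\|_{L^2}^2 \;+\; \mu \int_{\Omega} \sqrt{g_1^2 + g_2^2} \, dx, $$

whose residual-free variant imposes f = u + div g exactly; the latter is the version (45) that the authors solve.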

3 Numerical results

  • Similar artifacts can also be found in the VO model's results in Figures 2(h)-(j), but the differences are that the VO model generated u's with a block-like structure and thus v's with more complicated patterns.
  • In Figure 2(h), most of the signal in the second and third sections was extracted from u, leaving very little signal near the boundaries of these signal parts.
  • In short, the VO model performed like an approximation of Meyer's model, but with certain features closer to those of the TV-L1 model.

Example 2:

  • This fingerprint has slightly inhomogeneous brightness: the background near the center of the finger is whiter than the rest.
  • The authors believe that such inhomogeneity does not help the recognition and comparison of fingerprints and is better corrected.
  • The authors observe in Figures 4(a) and (b) that the cartoon parts are close to each other, but slightly different from the cartoon in Figure 4(c).
  • The VO and TV-L1 models gave more satisfactory results than Meyer's model.
  • Compared to the parameters used in the three models for decomposing noiseless images in Example 3, the parameters used in the Meyer and VO models in this set of tests were changed because adding noise increased the G-norm of the texture/noise part v.

4 Conclusion

  • The authors have computationally studied three total variation based models with discrete inputs: the Meyer, VO, and TV-L1 models.
  • The authors tested these models on a variety of 1D signals and 2D images to reveal their differences in decomposing inputs into their cartoon and oscillating/small-scale/texture parts.
  • The Meyer model tends to capture the pattern of the oscillations in the input, which makes it well-suited to applications such as fingerprint image processing.
  • The TV-L1 model, on the other hand, decomposes the input into two parts according to the geometric scales of the components in the input, independent of the signal intensities: one part contains large-scale components and the other small-scale ones (see the sketch after this list).
  • These results agree with those in [9], which compares the ROF, Meyer, and TV-L1 models.
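
For reference, the TV-L1 model mentioned above replaces the quadratic fidelity of ROF with an L^1 term (a standard statement from the TV-L1 literature):

$$ \min_{u} \int_{\Omega} |\nabla u| \, dx + \lambda \, \| f - u \|_{L^1}; $$

its contrast invariance is what makes the decomposition depend on geometric scale rather than on signal intensity.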


Citations
Journal ArticleDOI
TL;DR: The authors propose simple and extremely efficient methods, based on Bregman iterative regularization, for solving the basis pursuit problem used in compressed sensing; the methods give a very accurate solution after solving only a small number of instances of an unconstrained subproblem.
Abstract: We propose simple and extremely efficient methods for solving the basis pursuit problem $\min\{\|u\|_1 : Au = f, u\in\mathbb{R}^n\},$ which is used in compressed sensing. Our methods are based on Bregman iterative regularization, and they give a very accurate solution after solving only a very small number of instances of the unconstrained problem $\min_{u\in\mathbb{R}^n} \mu\|u\|_1+\frac{1}{2}\|Au-f^k\|_2^2$ for given matrix $A$ and vector $f^k$. We show analytically that this iterative approach yields exact solutions in a finite number of steps and present numerical results that demonstrate that as few as two to six iterations are sufficient in most cases. Our approach is especially useful for many compressed sensing applications where matrix-vector operations involving $A$ and $A^\top$ can be computed by fast transforms. Utilizing a fast fixed-point continuation solver that is based solely on such operations for solving the above unconstrained subproblem, we were able to quickly solve huge instances of compressed sensing problems on a standard PC.
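
To make the iteration above concrete, here is a minimal sketch (our illustration, not the authors' fast fixed-point continuation solver; the inner solver, iteration counts, and mu are placeholder assumptions). The outer loop adds the constraint residual back into the data, producing the sequence f^k from the abstract:

    # Minimal Bregman-iteration sketch for basis pursuit (illustrative only).
    import numpy as np

    def ista(A, f, mu, u0, n_iter=500):
        # Proximal-gradient (ISTA) solver for the unconstrained subproblem
        #   min_u  mu*||u||_1 + 0.5*||A u - f||_2^2
        L = np.linalg.norm(A, 2) ** 2        # Lipschitz constant of the gradient
        u = u0.copy()
        for _ in range(n_iter):
            z = u - A.T @ (A @ u - f) / L    # gradient step on the smooth term
            u = np.sign(z) * np.maximum(np.abs(z) - mu / L, 0.0)  # soft threshold
        return u

    def bregman_basis_pursuit(A, f, mu=1.0, n_outer=6):
        # Outer Bregman loop: solve the subproblem, then add the residual
        # back to the right-hand side (the f^k update from the abstract).
        u = np.zeros(A.shape[1])
        fk = f.copy()
        for _ in range(n_outer):
            u = ista(A, fk, mu, u)
            fk = fk + (f - A @ u)
        return u

As the abstract reports, a handful of outer iterations typically suffices, because each residual feedback step tightens the constraint Au = f.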

1,510 citations


Cites background from "A comparison of three total variation based texture extraction models"

  • ...It is proved that the recovery is perfect, i.e., the solution u_opt = ū, for any ū whenever k, m, n, and A satisfy certain conditions (e.g., see [13, 30, 37, 42, 78, 95, 96])....

    [...]

Journal ArticleDOI
TL;DR: This letter proposes an enhanced pixel-domain JND model with a new algorithm for contrast masking (CM) estimation; the proposed model shows advantages brought by better edge masking (EM) and texture masking (TM) estimation.
Abstract: In just noticeable difference (JND) models, evaluation of contrast masking (CM) is a crucial step. More specifically, CM due to edge masking (EM) and texture masking (TM) needs to be distinguished due to the entropy masking property of the human visual system. However, TM is not estimated accurately in the existing JND models since they fail to distinguish TM from EM. In this letter, we propose an enhanced pixel domain JND model with a new algorithm for CM estimation. In our model, total-variation based image decomposition is used to decompose an image into structural image (i.e., cartoon like, piecewise smooth regions with sharp edges) and textural image for estimation of EM and TM, respectively. Compared with the existing models, the proposed one shows its advantages brought by the better EM and TM estimation. It has been also applied to noise shaping and visual distortion gauge, and favorable results are demonstrated by experiments on different images.

218 citations


Cites background or methods from "A comparison of three total variation based texture extraction models"

  • ...In [9], different TV-based image decomposition models are considered and the model of minimizing TV with an L1-norm fidelity term is shown to achieve better results; we adopt this (TV-L1) model in our work for image decomposition, and then (1) becomes as follows:...

    [...]

  • ...2 to 2 [8], [9] for most natural images....

    [...]

Journal ArticleDOI
TL;DR: This paper presents new fast algorithms, based on a recent advance in convex optimization proposed by Yurii Nesterov, to minimize the total variation and, more generally, $l^1$-norms under a general convex constraint.
Abstract: This paper presents new fast algorithms to minimize total variation and more generally $l^1$-norms under a general convex constraint. Such problems are standards of image processing. The algorithms are based on a recent advance in convex optimization proposed by Yurii Nesterov. Depending on the regularity of the data fidelity term, we solve either a primal problem or a dual problem. First we show that standard first-order schemes allow one to get solutions of precision $\epsilon$ in $O(\frac{1}{\epsilon^2})$ iterations at worst. We propose a scheme that allows one to obtain a solution of precision $\epsilon$ in $O(\frac{1}{\epsilon})$ iterations for a general convex constraint. For a strongly convex constraint, we solve a dual problem with a scheme that requires $O(\frac{1}{\sqrt{\epsilon}})$ iterations to get a solution of precision $\epsilon$. Finally we perform some numerical experiments which confirm the theoretical results on various problems of image processing.
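
The rates quoted above build on Nesterov's accelerated scheme; the following generic sketch (our illustration, not the paper's primal or dual algorithm; grad, L, and x0 are placeholders) shows the two-sequence update that reaches precision ε in O(1/√ε) iterations on a smooth convex objective with L-Lipschitz gradient:

    # Generic Nesterov accelerated gradient method (illustrative only).
    import numpy as np

    def nesterov(grad, L, x0, n_iter=200):
        x_prev = x0.copy()
        y = x0.copy()
        for k in range(1, n_iter + 1):
            x = y - grad(y) / L                           # step at the look-ahead point
            y = x + (k - 1.0) / (k + 2.0) * (x - x_prev)  # momentum extrapolation
            x_prev = x
        return x_prev

For nonsmooth terms such as the total variation, this machinery is applied (per the abstract) to a primal or a dual problem depending on the regularity of the fidelity term, which is what yields the improved O(1/ε) and O(1/√ε) rates.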

216 citations

Journal ArticleDOI
TL;DR: This paper converts the linear model, which reduces to a low-pass/high-pass filter pair, into a nonlinear filter pair involving the total variation; the new pair retains both the essential features of Meyer's models and the simplicity and rapidity of the linear model.
Abstract: Can images be decomposed into the sum of a geometric part and a textural part? In a theoretical breakthrough, [Y. Meyer, Oscillating Patterns in Image Processing and Nonlinear Evolution Equations. Providence, RI: American Mathematical Society, 2001] proposed variational models that force the geometric part into the space of functions with bounded variation, and the textural part into a space of oscillatory distributions. Meyer's models are simple minimization problems extending the famous total variation model. However, their numerical solution has proved challenging. It is the object of a literature rich in variants and numerical attempts. This paper starts with the linear model, which reduces to a low-pass/high-pass filter pair. A simple conversion of the linear filter pair into a nonlinear filter pair involving the total variation is introduced. This new-proposed nonlinear filter pair retains both the essential features of Meyer's models and the simplicity and rapidity of the linear model. It depends upon only one transparent parameter: the texture scale, measured in pixel mesh. Comparative experiments show a better and faster separation of cartoon from texture. One application is illustrated: edge detection.
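
As a baseline for the construction described above, here is a minimal sketch of the linear low-pass/high-pass filter pair that the abstract starts from (our illustration; the Gaussian kernel and the scale sigma are assumptions standing in for the paper's texture-scale parameter):

    # Linear cartoon/texture filter pair (illustrative baseline only; the
    # paper replaces this with a nonlinear, total-variation-based pair).
    import numpy as np
    from scipy.ndimage import gaussian_filter

    def linear_filter_pair(f, sigma):
        u = gaussian_filter(f, sigma)   # low-pass: cartoon-like part
        v = f - u                       # high-pass residual: texture part
        return u, v

    u, v = linear_filter_pair(np.random.rand(64, 64), sigma=3.0)

The familiar weakness of the linear pair is that sharp edges leak into v; the paper's nonlinear TV-based replacement is designed to keep edges in u.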

203 citations

Journal ArticleDOI
TL;DR: It is shown that the images produced by this model can be formed from the minimizers of a sequence of decoupled geometry subproblems, and that the TV-L1 model is able to separate image features according to their scales.
Abstract: This paper studies the total variation regularization with an $L^1$ fidelity term (TV‐$L^1$) model for decomposing an image into features of different scales. We first show that the images produced by this model can be formed from the minimizers of a sequence of decoupled geometry subproblems. Using this result we show that the TV‐$L^1$ model is able to separate image features according to their scales, where the scale is analytically defined by the G‐value. A number of other properties including the geometric and morphological invariance of the TV‐$L^1$ model are also proved and their applications discussed.
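
A standard worked example of the scale selectivity described above (recalled from the TV-L1 literature, not quoted from this abstract): for an input that is the indicator of a disk of radius r, the TV-L1 minimizer keeps or discards the disk wholesale depending on how r compares with the fidelity weight λ,

$$ f = \mathbf{1}_{B_r}, \qquad u^{\star} = \begin{cases} f, & \lambda > 2/r, \\ 0, & \lambda < 2/r, \end{cases} $$

so 1/λ acts as a geometric scale threshold, independent of the disk's contrast.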

109 citations


Cites methods from "A comparison of three total variation based texture extraction models"

  • ...Since the second-order cone programming (SOCP) approach [27, 45] has proven to give very accurate solutions for solving TV-based image models, we formulated the TV-L1 model (1.1) and the G-value formula (5.1) as SOCPs and solved them using the commercial optimization package Mosek [33]....

    [...]

  • ...decomposition can also be used to filter 1D signals [3], to remove impulsive (salt-and-pepper) noise [35], to extract textures from natural images [45], to remove varying illumination in face images for face recognition [22, 21], to decompose 2D/3D images for multiscale MR image registration [20], to assess damage from satellite imagery [19], and to remove inhomogeneous background from cDNA microarray and digital microscopic images [44]....

    [...]

References
Journal ArticleDOI
TL;DR: A definition of the G-norm as a norm of linear functionals on BV, which seems more feasible for numerical computation, is used; the equivalence between Meyer's original definition and the authors' is established, and it is shown that computing the norm can be expressed as an interface problem.
Abstract: In this paper we apply Meyer's G-norm for image processing problems. We use a definition of the G-norm as norm of linear functionals on BV, which seems to be more feasible for numerical computation. We establish the equivalence between Meyer's original definition and ours and show that computing the norm can be expressed as an interface problem. This allows us to define an algorithm based on the level set method for its solution. Alternatively we propose a fixed point method based on mean curvature type equations. A computation of the G-norm according to our definition additionally gives functions which can be used for denoising of simple structures in images under a high level of noise. We present some numerical computations of this denoising method which support this claim.
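
For reference, Meyer's original G-norm (a standard statement, included because the abstract contrasts it with the authors' functional-analytic definition): a texture candidate v has

$$ \|v\|_{G} = \inf \Big\{ \big\| \sqrt{g_1^2 + g_2^2} \,\big\|_{L^\infty} \;:\; v = \operatorname{div} g, \; g = (g_1, g_2) \Big\}, $$

so strongly oscillating patterns have small G-norm, which is why penalizing it sends texture into v.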

20 citations


"A comparison of three total variati..." refers methods in this paper

  • ...To calculate the G-norm of a function f alone, one can choose to solve an SOCP or use the dual method by Kindermann, Osher and Xu [22]....

    [...]

Book ChapterDOI
21 Oct 2005
TL;DR: A novel viewpoint on the coarse-to-fine registration is proposed, in which coarse and fine images are distinguished by different scales of the objects instead of different resolutions of the images, to achieve higher accuracy and robustness on 3D brain MR images.
Abstract: Registration, that is, the alignment of multiple images, has been one of the most challenging problems in the field of computer vision. It also plays an important role in biomedical image analysis and its applications. Although various methods have been proposed for solving different kinds of registration problems in computer vision, the results are still far from ideal when it comes to real-world biomedical image applications. For instance, in order to register 3D brain MR images, current state-of-the-art registration methods use a multi-resolution coarse-to-fine algorithm, which typically involves starting with low-resolution images and working progressively through to higher resolutions, with the aim of avoiding local-maximum "traps". However, these methods do not always successfully avoid the local maxima. Consequently, various rather sophisticated optimization methods have been developed to attack this problem. In this paper, we propose a novel viewpoint on coarse-to-fine registration, in which coarse and fine images are distinguished by different scales of the objects instead of different resolutions of the images. Based on this new perspective, we develop a new image registration framework by combining the multi-resolution method with a novel multi-scale algorithm, which can achieve higher accuracy and robustness on 3D brain MR images. We believe this work contributes greatly to biomedical image analysis and related applications.

19 citations


"A comparison of three total variati..." refers background in this paper

  • ...These properties are important in various applications in biomedical engineering and computer vision such as background correction [36], face recognition [14,15], and brain MR image registration [13]....

    [...]


Proceedings ArticleDOI
TL;DR: This work partitions an image into piecewise-constant regions using energy minimization and curve evolution approaches and decomposes a natural image into a cartoon or geometric component and an oscillatory or texture component using a variational approach and dual functionals.
Abstract: This work is devoted to new computational models for image segmentation, image restoration and image decomposition. In particular, we partition an image into piecewise-constant regions using energy minimization and curve evolution approaches. Applications of denoising-segmentation in polar coordinates (motivated by impedance tomography) and of segmentation of brain images will be presented. Also, we decompose a natural image into a cartoon or geometric component and an oscillatory or texture component using a variational approach and dual functionals. Thus, new computational methods will be presented for denoising, deblurring and texture modeling.

5 citations


"A comparison of three total variati..." refers methods in this paper

  • ...Other numerical approaches based on the dual representation of the G-norm are introduced in [16] by Chung, Le, Lieu, Tanushev, and Vese, [25] by Lieu, and [23] by Le, Lieu, and Vese....

    [...]