scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Unsupervised texture segmentation using Gabor filters

01 Dec 1991-Pattern Recognition (Pergamon)-Vol. 24, Iss: 12, pp 1167-1186
TL;DR: A texture segmentation algorithm inspired by the multi-channel filtering theory for visual information processing in the early stages of human visual system is presented, which is based on reconstruction of the input image from the filtered images.
About: This article is published in Pattern Recognition.The article was published on 1991-12-01 and is currently open access. It has received 2351 citations till now. The article focuses on the topics: Image texture & Texture filtering.

Summary (2 min read)

Introduction

  • Image segmentation is a difficult yet very important task in many image analysis or computer vision applications.
  • A large number of techniques for analyzing image texture has been proposed in the past two decades [ll, 221.
  • This approach is inspired by a multi-channel filtering theory for processing visual information in the early stages of the human visual system.
  • The psychophysical experiments that suggested such a decomposition used various grating pattems as stimuli and were based on adaptation techniques [4] .
  • 'This work was supported in part by the National Science Foundation infrastructure grant CDA-8806599, and by a grant from E. I.

2 Channel Characterization

  • The use of Gabor filters in texture analysis is not new.
  • Tumer scrated their potential for texture discrimination.
  • Similarly, Perry & Lothe authors 1191 use a fixed set of Gabor filters in their texture segmentation algorithm.
  • Instead of using a fixed set of filters, Bovik er al. apply a simple peak finding algorithm to the power spectrum of the image in order to determine the radial frequencies of the appropriate Gabor filters.

2.1 Choice of Filter Parameters

  • Figure 4 shows examples of filtered images for an image containing 'straw matting' @55) and 'wood grain' (D68) textures from the photographic album of textures by Brodatz [2] .
  • The ability of the filters to exploit differences in spatial-frequency (size) and orientation in the two textures is evident in these images.
  • The differences in the strength of the responses in regions with different textures is the key to the multi-channel approach to texture analysis.
  • To maximize visibility, each filtered image has been scaled to full contrast.

Z,Y

  • Note that s(z,y) has a mean of zero, since the mean gray value of each filtered image is zero.
  • For computational efficiency, the authors determine the "best" subset 01 .he filtered images by the following suboptimal sequential forward selection procedure: Let r;(z,y) be the zth filtered image and R,(u,v) be its Discrete Fourier Transform.
  • The amount of overlap between the MTFs of the Gabor filters in their filter set is small.
  • These energies can be computed in the Fourier domain, hence avoiding unnecessary inverse Fourier transforms.

4 Integrating Feature Images

  • In texture segmentation, neighboring pixels are very likely to belong to the same texture category.
  • The authors propose a simple method that incorporates the spatial adjacency information directly in the clustering process.
  • This is achieved by including the spatial coordinates of the pixels as two additional features.

5 Experimental Results

  • The lack of appropriate quantitative measures of the goodness of a segmentation makes it very difficult to evaluate and compare texture segmentation algorithms.
  • One simple criterion that is often used is the percentage of misclassified pixels.
  • Table 1 gives the percentage of misclassified pixels for the segmentation experiments reported here.

6 Summary and Conclusions

  • In their texture segmentation algorithm, the authors assumed that the number of texture categories is given.
  • The authors believe that an integrated approach that uses both a region-based and an edge-based segmentation can be used to resolve the question of determining the number of texture categories.
  • Mal& 8~ Perona [16], for example, have developed a multi-channel filtering technique that produces edge-based segmentations.
  • The basic idea is to generate an oversegmented solu-19 tion using their region-based texture segmentanorl algorithm.
  • This integrated approach is currently being investigated.

Did you find this useful? Give us your feedback

Citations
More filters
Journal ArticleDOI
TL;DR: An overview of pattern clustering methods from a statistical pattern recognition perspective is presented, with a goal of providing useful advice and references to fundamental concepts accessible to the broad community of clustering practitioners.
Abstract: Clustering is the unsupervised classification of patterns (observations, data items, or feature vectors) into groups (clusters). The clustering problem has been addressed in many contexts and by researchers in many disciplines; this reflects its broad appeal and usefulness as one of the steps in exploratory data analysis. However, clustering is a difficult problem combinatorially, and differences in assumptions and contexts in different communities has made the transfer of useful generic concepts and methodologies slow to occur. This paper presents an overview of pattern clustering methods from a statistical pattern recognition perspective, with a goal of providing useful advice and references to fundamental concepts accessible to the broad community of clustering practitioners. We present a taxonomy of clustering techniques, and identify cross-cutting themes and recent advances. We also describe some important applications of clustering algorithms such as image segmentation, object recognition, and information retrieval.

14,054 citations

Journal ArticleDOI
TL;DR: Almost 300 key theoretical and empirical contributions in the current decade related to image retrieval and automatic image annotation are surveyed, and the spawning of related subfields are discussed, to discuss the adaptation of existing image retrieval techniques to build systems that can be useful in the real world.
Abstract: We have witnessed great interest and a wealth of promise in content-based image retrieval as an emerging technology. While the last decade laid foundation to such promise, it also paved the way for a large number of new techniques and systems, got many new people involved, and triggered stronger association of weakly related fields. In this article, we survey almost 300 key theoretical and empirical contributions in the current decade related to image retrieval and automatic image annotation, and in the process discuss the spawning of related subfields. We also discuss significant challenges involved in the adaptation of existing image retrieval techniques to build systems that can be useful in the real world. In retrospect of what has been achieved so far, we also conjecture what the future may hold for image retrieval research.

3,433 citations


Cites methods from "Unsupervised texture segmentation u..."

  • ...The texture extraction part of this thesaurus building process involves the application of a bank of Gabor filters [Jain and Farrokhnia 1990] to the images, to encode statistics of the filtered outputs as texture features....

    [...]

  • ...The texture extraction part of this thesaurus building process involves the application of a bank of Gabor .lters [Jain and Farrokhnia 1990] to the images, to encode statistics of the .ltered outputs as texture features....

    [...]

Journal ArticleDOI
TL;DR: A method for rapid visual recognition of personal identity is described, based on the failure of a statistical test of independence, which implies a theoretical "cross-over" error rate of one in 131000 when a decision criterion is adopted that would equalize the false accept and false reject error rates.
Abstract: A method for rapid visual recognition of personal identity is described, based on the failure of a statistical test of independence. The most unique phenotypic feature visible in a person's face is the detailed texture of each eye's iris. The visible texture of a person's iris in a real-time video image is encoded into a compact sequence of multi-scale quadrature 2-D Gabor wavelet coefficients, whose most-significant bits comprise a 256-byte "iris code". Statistical decision theory generates identification decisions from Exclusive-OR comparisons of complete iris codes at the rate of 4000 per second, including calculation of decision confidence levels. The distributions observed empirically in such comparisons imply a theoretical "cross-over" error rate of one in 131000 when a decision criterion is adopted that would equalize the false accept and false reject error rates. In the typical recognition case, given the mean observed degree of iris code agreement, the decision confidence levels correspond formally to a conditional false accept probability of one in about 10/sup 31/. >

3,399 citations

Book
01 Dec 1993
TL;DR: The geometric, random field, fractal, and signal processing models of texture are presented and major classes of texture processing such as segmentation, classification, and shape from texture are discussed.
Abstract: This chapter reviews and discusses various aspects of texture analysis. The concentration is o the various methods of extracting textural features from images. The geometric, random field, fractal, and signal processing models of texture are presented. The major classes of texture processing pro lems such as segmentation, classification, and shape from texture are discussed. The possible applic tion areas of texture such as automated inspection, document processing, and remote sensing a summarized. A bibliography is provided at the end for further reading.

2,257 citations

Journal ArticleDOI
TL;DR: This work studies the problem of choosing an optimal feature set for land use classification based on SAR satellite images using four different texture models and shows that pooling features derived from different texture Models, followed by a feature selection results in a substantial improvement in the classification accuracy.
Abstract: A large number of algorithms have been proposed for feature subset selection. Our experimental results show that the sequential forward floating selection algorithm, proposed by Pudil et al. (1994), dominates the other algorithms tested. We study the problem of choosing an optimal feature set for land use classification based on SAR satellite images using four different texture models. Pooling features derived from different texture models, followed by a feature selection results in a substantial improvement in the classification accuracy. We also illustrate the dangers of using feature selection in small sample size situations.

2,238 citations

References
More filters
Journal ArticleDOI
TL;DR: In this paper, it is shown that the difference of information between the approximation of a signal at the resolutions 2/sup j+1/ and 2 /sup j/ (where j is an integer) can be extracted by decomposing this signal on a wavelet orthonormal basis of L/sup 2/(R/sup n/), the vector space of measurable, square-integrable n-dimensional functions.
Abstract: Multiresolution representations are effective for analyzing the information content of images. The properties of the operator which approximates a signal at a given resolution were studied. It is shown that the difference of information between the approximation of a signal at the resolutions 2/sup j+1/ and 2/sup j/ (where j is an integer) can be extracted by decomposing this signal on a wavelet orthonormal basis of L/sup 2/(R/sup n/), the vector space of measurable, square-integrable n-dimensional functions. In L/sup 2/(R), a wavelet orthonormal basis is a family of functions which is built by dilating and translating a unique function psi (x). This decomposition defines an orthogonal multiresolution representation called a wavelet representation. It is computed with a pyramidal algorithm based on convolutions with quadrature mirror filters. Wavelet representation lies between the spatial and Fourier domains. For images, the wavelet representation differentiates several spatial orientations. The application of this representation to data compression in image coding, texture discrimination and fractal analysis is discussed. >

20,028 citations

01 Jan 1988

9,439 citations

Book
01 Jan 1988

8,586 citations

Journal ArticleDOI
Robert M. Haralick1
01 Jan 1979
TL;DR: This survey reviews the image processing literature on the various approaches and models investigators have used for texture, including statistical approaches of autocorrelation function, optical transforms, digital transforms, textural edgeness, structural element, gray tone cooccurrence, run lengths, and autoregressive models.
Abstract: In this survey we review the image processing literature on the various approaches and models investigators have used for texture. These include statistical approaches of autocorrelation function, optical transforms, digital transforms, textural edgeness, structural element, gray tone cooccurrence, run lengths, and autoregressive models. We discuss and generalize some structural approaches to texture based on more complex primitives than gray tone. We conclude with some structural-statistical generalizations which apply the statistical techniques to the structural primitives.

5,112 citations

Journal ArticleDOI
John Daugman1
TL;DR: Evidence is presented that the 2D receptive-field profiles of simple cells in mammalian visual cortex are well described by members of this optimal 2D filter family, and thus such visual neurons could be said to optimize the general uncertainty relations for joint 2D-spatial-2D-spectral information resolution.
Abstract: Two-dimensional spatial linear filters are constrained by general uncertainty relations that limit their attainable information resolution for orientation, spatial frequency, and two-dimensional (2D) spatial position. The theoretical lower limit for the joint entropy, or uncertainty, of these variables is achieved by an optimal 2D filter family whose spatial weighting functions are generated by exponentiated bivariate second-order polynomials with complex coefficients, the elliptic generalization of the one-dimensional elementary functions proposed in Gabor’s famous theory of communication [ J. Inst. Electr. Eng.93, 429 ( 1946)]. The set includes filters with various orientation bandwidths, spatial-frequency bandwidths, and spatial dimensions, favoring the extraction of various kinds of information from an image. Each such filter occupies an irreducible quantal volume (corresponding to an independent datum) in a four-dimensional information hyperspace whose axes are interpretable as 2D visual space, orientation, and spatial frequency, and thus such a filter set could subserve an optimally efficient sampling of these variables. Evidence is presented that the 2D receptive-field profiles of simple cells in mammalian visual cortex are well described by members of this optimal 2D filter family, and thus such visual neurons could be said to optimize the general uncertainty relations for joint 2D-spatial–2D-spectral information resolution. The variety of their receptive-field dimensions and orientation and spatial-frequency bandwidths, and the correlations among these, reveal several underlying constraints, particularly in width/length aspect ratio and principal axis organization, suggesting a polar division of labor in occupying the quantal volumes of information hyperspace. Such an ensemble of 2D neural receptive fields in visual cortex could locally embed coarse polar mappings of the orientation–frequency plane piecewise within the global retinotopic mapping of visual space, thus efficiently representing 2D spatial visual information by localized 2D spectral signatures.

3,392 citations