scispace - formally typeset
Search or ask a question

Showing papers on "Content-based image retrieval published in 2002"


Journal ArticleDOI
TL;DR: A statistical view of the texture retrieval problem is presented by combining the two related tasks, namely feature extraction (FE) and similarity measurement (SM), into a joint modeling and classification scheme that leads to a new wavelet-based texture retrieval method that is based on the accurate modeling of the marginal distribution of wavelet coefficients using generalized Gaussian density (GGD).
Abstract: We present a statistical view of the texture retrieval problem by combining the two related tasks, namely feature extraction (FE) and similarity measurement (SM), into a joint modeling and classification scheme. We show that using a consistent estimator of texture model parameters for the FE step followed by computing the Kullback-Leibler distance (KLD) between estimated models for the SM step is asymptotically optimal in term of retrieval error probability. The statistical scheme leads to a new wavelet-based texture retrieval method that is based on the accurate modeling of the marginal distribution of wavelet coefficients using generalized Gaussian density (GGD) and on the existence a closed form for the KLD between GGDs. The proposed method provides greater accuracy and flexibility in capturing texture information, while its simplified form has a close resemblance with the existing methods which uses energy distribution in the frequency domain to identify textures. Experimental results on a database of 640 texture images indicate that the new method significantly improves retrieval rates, e.g., from 65% to 77%, compared with traditional approaches, while it retains comparable levels of computational complexity.

1,228 citations


Proceedings ArticleDOI
10 Dec 2002
TL;DR: The feature extraction method has been applied for both image segmentation as well as histogram generation applications - two distinct approaches to content based image retrieval (CBIR), showing better identification of objects in an image.
Abstract: We have analyzed the properties of the HSV (hue, saturation and value) color space with emphasis on the visual perception of the variation in hue, saturation and intensity values of an image pixel. We extract pixel features by either choosing the hue or the intensity as the dominant property based on the saturation value of a pixel. The feature extraction method has been applied for both image segmentation as well as histogram generation applications - two distinct approaches to content based image retrieval (CBIR). Segmentation using this method shows better identification of objects in an image. The histogram retains a uniform color transition that enables us to do a window-based smoothing during retrieval. The results have been compared with those generated using the RGB color space.

555 citations


Journal ArticleDOI
TL;DR: A fuzzy logic approach, UFM (unified feature matching), for region-based image retrieval, which greatly reduces the influence of inaccurate segmentation and provides a very intuitive quantification.
Abstract: This paper proposes a fuzzy logic approach, UFM (unified feature matching), for region-based image retrieval. In our retrieval system, an image is represented by a set of segmented regions, each of which is characterized by a fuzzy feature (fuzzy set) reflecting color, texture, and shape properties. As a result, an image is associated with a family of fuzzy features corresponding to regions. Fuzzy features naturally characterize the gradual transition between regions (blurry boundaries) within an image and incorporate the segmentation-related uncertainties into the retrieval algorithm. The resemblance of two images is then defined as the overall similarity between two families of fuzzy features and quantified by a similarity measure, UFM measure, which integrates properties of all the regions in the images. Compared with similarity measures based on individual regions and on all regions with crisp-valued feature representations, the UFM measure greatly reduces the influence of inaccurate segmentation and provides a very intuitive quantification. The UFM has been implemented as a part of our experimental SIMPLIcity image retrieval system. The performance of the system is illustrated using examples from an image database of about 60,000 general-purpose images.

441 citations


Proceedings ArticleDOI
04 Nov 2002
TL;DR: Experimental results show that the BIC approach is consistently more compact, more efficient and more effective than state-of-the-art CBIR approaches based on sophisticated image analysis algorithms and complex distance functions.
Abstract: This paper presents \bic (Border/Interior pixel Classification), a compact and efficient CBIR approach suitable for broad image domains It has three main components: (1) a simple and powerful image analysis algorithm that classifies image pixels as either border or interior, (2) a new logarithmic distance (dLog) for comparing histograms, and (3) a compact representation for the visual features extracted from images Experimental results show that the BIC approach is consistently more compact, more efficient and more effective than state-of-the-art CBIR approaches based on sophisticated image analysis algorithms and complex distance functions It was also observed that the dLog distance function has two main advantages over vectorial distances (eg, L1): (1) it is able to increase substantially the effectiveness of (several) histogram-based CBIR approaches and, at the same time, (2) it reduces by 50% the space requirement to represent a histogram

322 citations


Proceedings Article
08 Jul 2002
TL;DR: The multiple-instance (MI) learning model is applied to use a small number of training images to learn what images from the database are of interest to the user.
Abstract: We explore the application of machine learning techniques to the problem of content-based image retrieval (CBIR). Unlike most existing CBIR systems in which only global information is used or in which a user must explicitly indicate what part of the image is of interest, we apply the multiple-instance (MI) learning model to use a small number of training images to learn what images from the database are of interest to the user.

274 citations


Proceedings ArticleDOI
24 Jun 2002
TL;DR: A local Fourier transform is adopted as a texture representation scheme and eight characteristic maps for describing different aspects of cooccurrence relations of image pixels in each channel of the (SVcosH, SVsinH, V) color space are derived, resulting in a 48-dimensional feature vector.
Abstract: We adopt a local Fourier transform as a texture representation scheme and derive eight characteristic maps for describing different aspects of cooccurrence relations of image pixels in each channel of the (SVcosH, SVsinH, V) color space. Then we calculate the first and second moments of these maps as a representation of the natural color image pixel distribution, resulting in a 48-dimensional feature vector. The novel low-level feature is named color texture moments (CTM), which can also be regarded as a certain extension to color moments in eight aspects through eight orthogonal templates. Experiments show that this new feature can achieve good retrieval performance for CBIR.

259 citations


Proceedings Article
01 Jan 2002
TL;DR: Different FDs are studied and a Java retrieval framework is built to compare shape retrieval performance using different FDs in terms of computation complexity, robustness, convergence speed and retrieval performance.
Abstract: Shape is one of the primary low level image features in Content Based Image Retrieval (CBIR). Many shape representations and retrieval methods exist. However, most of those methods either do not well capture shape features or are difficult to do normalization (making matching difficult). Among them, methods based Fourier descriptors (FDs) achieve both good representation (perceptually meaningful) and easy normalization. Besides, FDs are easy to derive and compact in terms of representation. Design of FDs focuses on how to derive Fourier invariants from Fourier coefficients and how to obtain Fourier coefficients from shape signatures. Different Fourier invariants and shape signatures have been exploited to derive FDs. In this paper, we study different FDs and build a Java retrieval framework to compare shape retrieval performance using different FDs in terms of computation complexity, robustness, convergence speed and retrieval performance. The retrieval performance of the different FDs is compared using a standard shape database.

241 citations


Journal ArticleDOI
TL;DR: Experimental results demonstrate the usefulness and effectiveness of the proposed similarity measure for image retrieval, which not only takes into consideration the perceptual similarity between images, but also significantly improves the retrieval performance of the Euclidean distance measure.
Abstract: A new scheme of learning similarity measure is proposed for content-based image retrieval (CBIR). It learns a boundary that separates the images in the database into two clusters. Images inside the boundary are ranked by their Euclidean distances to the query. The scheme is called constrained similarity measure (CSM), which not only takes into consideration the perceptual similarity between images, but also significantly improves the retrieval performance of the Euclidean distance measure. Two techniques, support vector machine (SVM) and AdaBoost from machine learning, are utilized to learn the boundary. They are compared to see their differences in boundary learning. The positive and negative examples used to learn the boundary are provided by the user with relevance feedback. The CSM metric is evaluated in a large database of 10009 natural images with an accurate ground truth. Experimental results demonstrate the usefulness and effectiveness of the proposed similarity measure for image retrieval.

227 citations


Journal ArticleDOI
TL;DR: The results of the experiments show that the MPEG-7-defined content descriptors can be used as such in thePicSOM system even though Euclidean distance calculation, inherently used in the PicSom system, is not optimal for all of them.
Abstract: Development of content-based image retrieval (CBIR) techniques has suffered from the lack of standardized ways for describing visual image content. Luckily, the MPEG-7 international standard is now emerging as both a general framework for content description and a collection of specific agreed-upon content descriptors. We have developed a neural, self-organizing technique for CBIR. Our system is named PicSOM and it is based on pictorial examples and relevance feedback (RF). The name stems from "picture" and the self-organizing map (SOM). The PicSOM system is implemented by using tree structured SOMs. In this paper, we apply the visual content descriptors provided by MPEG-7 in the PicSOM system and compare our own image indexing technique with a reference system based on vector quantization (VQ). The results of our experiments show that the MPEG-7-defined content descriptors can be used as such in the PicSOM system even though Euclidean distance calculation, inherently used in the PicSOM system, is not optimal for all of them. Also, the results indicate that the PicSOM technique is a bit slower than the reference system in starting to find relevant images. However, when the strong RF mechanism of PicSOM begins to function, its retrieval precision exceeds that of the reference system.

222 citations


Journal ArticleDOI
TL;DR: The survey includes a large number of papers covering the research aspects of system design and applications of CBIR, image feature representation and extraction, Multidimensional indexing, and future research directions are suggested.
Abstract: Retrieving information from the Web is becoming a common practice for internet users. However, the size and heterogeneity of the Web challenge the effectiveness of classical information retrieval techniques. Content-based retrieval of images and video has become a hot research area. The reason for this is the fact that we need effective and efficient techniques that meet user requirements, to access large volumes of digital images and video data. This paper gives a brief survey of current CBIR (Content Based Image Retrieval) methods and technical achievement in this area. The survey includes a large number of papers covering the research aspects of system design and applications of CBIR, image feature representation and extraction, Multidimensional indexing. Furthermore future research directions are suggested.

151 citations


Book ChapterDOI
01 Jan 2002
TL;DR: In this chapter, some technical aspects of current content-based image retrieval systems are surveyed.
Abstract: In this chapter we survey some technical aspects of current content-based image retrieval systems.

Proceedings ArticleDOI
11 Aug 2002
TL;DR: A system that enables the pictorial specification of queries in an image database is described that makes use of an abstraction of the contour of the shape which is invariant against translation, scale, rotation, and starting point that is based on the use of Fourier descriptors.
Abstract: A system that enables the pictorial specification of queries in an image database is described. The queries are comprised of rectangle, polygon, ellipse, and B-spline shapes. The queries specify which shapes should appear in the target image as well as spatial constraints on the distance between them and their relative position. The retrieval process makes use of an abstraction of the contour of the shape which is invariant against translation, scale, rotation, and starting point, that is based on the use of Fourier descriptors. These abstractions are used in a system to locate logos in an image database. The utility of this approach is illustrated using some sample queries.

Book ChapterDOI
01 Jan 2002
TL;DR: There is an urgent need of finding the latent correlation between low-level features and high-level concepts and merging them from a different perspective to retrieve or manage visual data in an effective or efficient way.
Abstract: The emergence of multimedia technology and the rapidly expanding image and video collections on the Internet have attracted significant research efforts in providing tools for effective retrieval and management of visual data. Image retrieval is based on the availability of a representation scheme of image content. Image content descriptors may be visual features such as color, texture, shape, and spatial relationships, or semantic primitives. Conventional information retrieval was based solely on text, and those approaches to textual information retrieval have been transplanted into image retrieval in a variety of ways. However, "a picture is worth a thousand words." Image contents are much more versatile compared with text, and the amount of visual data is already enormous and still expanding very rapidly. Hoping to cope with these special characteristics of visual data, content-based image retrieval methods have been introduced. It has been widely recognized that the family of image retrieval techniques should become an integration of both low-level visual features addressing the more detailed perceptual aspects and high-level semantic features underlying the more general conceptual aspects of visual data. Neither of these two types of features is sufficient to retrieve or manage visual data in an effective or efficient way. Although efforts have been devoted to combining these two aspects of visual data, the gap between them is still a huge barrier in front of researchers. Intuitive and heuristic approaches do not provide us with satisfactory performance. Therefore, there is an urgent need of finding the latent correlation between low-level features and high-level concepts and merging them from a different perspective. How to find this new perspective and bridge the gap between visual features and semantic features has been a major challenge in this research field. This chapter addresses these issues.

Journal ArticleDOI
TL;DR: This paper applies perceptual grouping rules to the retrieval of images containing large manmade objects such as buildings, towers, bridges, and other architectural objects by employing a K -nearest neighbor framework.

Journal ArticleDOI
TL;DR: A keyblock-based approach to content-based image retrieval where each image is encoded as a set of one-dimensional index codes linked to the keyblocks in the codebook, analogous to considering a text document as a linear list of keywords.
Abstract: The success of text-based retrieval motivates us to investigate analogous techniques which can support the querying and browsing of image data. However, images differ significantly from text both syntactically and semantically in their mode of representing and expressing information. Thus, the generalization of information retrieval from the text domain to the image domain is non-trivial. This paper presents a framework for information retrieval in the image domain which supports content-based querying and browsing of images. A critical first step to establishing such a framework is to construct a codebook of "keywords" for images which is analogous to the dictionary for text documents. We refer to such "keywords" in the image domain as "keyblocks." In this paper, we first present various approaches to generating a codebook containing keyblocks at different resolutions. Then we present a keyblock-based approach to content-based image retrieval. In this approach, each image is encoded as a set of one-dimensional index codes linked to the keyblocks in the codebook, analogous to considering a text document as a linear list of keywords. Generalizing upon text-based information retrieval methods, we then offer various techniques for image-based information retrieval. By comparing the performance of this approach with conventional techniques using color and texture features, we demonstrate the effectiveness of the keyblock-based approach to content-based image retrieval.

Book ChapterDOI
TL;DR: In this paper, a color image retrieval method based on the primitives of color moments will be proposed and the experimental results show that the proposed method is better than others.
Abstract: In this paper, a color image retrieval method based on the primitives of color moments will be proposed. First, an image is divided into several blocks. Then, the color moments of all blocks are extracted and clustered into several classes. The mean moments of each class are considered as a primitive of the image. All primitives are used as features. Since two different images may have different numbers of features, a new similarity measure is then proposed. To demonstrate the effectiveness of the proposed method, a test database from Corel is used to compare the performances of the proposed method with other existing ones. The experimental results show that the proposed method is better than others.

Journal ArticleDOI
TL;DR: This paper presents a relevance feedback technique that uses decision trees to learn a common thread among instances marked relevant in a preexisting content-based image retrieval (CBIR) system that is used to access high resolution computed tomographic images of the human lung.

Journal ArticleDOI
TL;DR: Experimental results show robust and accurate performance by the proposed method, as compared with conventional noninteractive content-based image retrieval systems and user controlled interactive systems, when applied to image retrieval in compressed and uncompressed image databases.
Abstract: In this paper, an unsupervised learning network is explored to incorporate a self-learning capability into image retrieval systems. Our proposal is a new attempt to automate recursive content-based image retrieval. The adoption of a self-organizing tree map (SOTM) is introduced, to minimize the user participation in an effort to automate interactive retrieval. The automatic learning mode has been applied to optimize the relevance feedback (RF) method and the single radial basis function-based RF method. In addition, a semiautomatic version is proposed to support retrieval with different user subjectivities. Image similarity is evaluated by a nonlinear model, which performs discrimination based on local analysis. Experimental results show robust and accurate performance by the proposed method, as compared with conventional noninteractive content-based image retrieval (CBIR) systems and user controlled interactive systems, when applied to image retrieval in compressed and uncompressed image databases.

Proceedings ArticleDOI
01 Dec 2002
TL;DR: A statistical modeling approach to automatic linguistic indexing of pictures by focusing on a particular group of stochastic processes for describing images, that is, the two-dimensional multiresolution hidden Markov models (2-D MHMMs).
Abstract: Automatic linguistic indexing of pictures is an important but highly challenging problem for researchers in computer vision and content-based image retrieval. In this paper, we introduce a statistical modeling approach to this problem. Categorized images are used to train a dictionary of hundreds of concepts automatically based on statistical modeling. Images of any given concept category are regarded as instances of a stochastic process that characterizes the category. To measure the extent of association between an image and the textual description of a category of images, the likelihood of the occurrence of the image based on the stochastic process derived from the category is computed. A high likelihood indicates a strong association. In our experimental implementation, the ALIP (Automatic Linguistic Indexing of Pictures) system, we focus on a particular group of stochastic processes for describing images, that is, the two-dimensional multiresolution hidden Markov models (2-D MHMMs). We implemented and tested the system on a photographic image database of 600 different semantic cat- egories, each with about 40 training images. Tested using 3,000 images outside the training database, the system has demonstrated good accuracy and high potential in linguistic indexing of these test images.

Proceedings Article
01 Jan 2002
TL;DR: A integrated shape descriptor, which combines a contour shape descriptor and the region shape descriptors, is presented, which can provide interactivity between users and the retrieval system and can improve retrieval performance.
Abstract: Shape is one of the primary low level image features in the newly emerged Content Based Image Retrieval (CBIR). Many shape representations have been proposed, and they are generally classified into contour-based methods and region-based methods. Contour-based methods capture shape boundary features while ignore shape inner content. Region-based methods capture shape inner features while do not emphasize on boundary features. It is known neither methods can produce ideal retrieval results in all situations. The shortcoming of both types of methods can be overcome by combine the strength of two methods. In this paper, a contour shape descriptor and a region shape descriptor are studied. Then a integrated shape descriptor, which combines a contour shape descriptors and the region shape descriptors, is presented. The integrated shape descriptors provide interactivity between users and the retrieval system and can improve retrieval performance. Retrieval results are given to show the comparison and the improvement.

Journal ArticleDOI
TL;DR: The results show that both local and global shape features are important clues of shapes in an image.
Abstract: In this article the use of statistical, low-level shape features in content-based image retrieval is studied. The emphasis is on such techniques which do not demand object segmentation. PicSOM, the image retrieval system used in the experiments, requires that features are represented by constant-sized feature vectors for which the Euclidean distance can be used as a similarity measure. The shape features suggested here are edge histograms and Fourier-transform-based features computed from the image after edge detection in Cartesian or polar coordinate planes. The results show that both local and global shape features are important clues of shapes in an image.

Proceedings ArticleDOI
10 Dec 2002
TL;DR: This work reports the discovery of a perceptual distance function through mining a large set of visual data and calls the discovered function dynamic the partial distance function (DPF), which performs significantly better in finding similar images.
Abstract: For almost a decade, content-based image retrieval has been an active research area, yet one fundamental problem remains largely unsolved: how to measure perceptual similarity. To measure perceptual similarity, most researchers employ the Minkowski-type metric. Our extensive data-mining experiments on visual data show that, unfortunately, the Minkowski metric is not very effective in modeling perceptual similarity. Our experiments also show that the traditional "static" feature weighting approaches are not sufficient for retrieving various similar images. We report our discovery of a perceptual distance function through mining a large set of visual data. We call the discovered function dynamic the partial distance function (DPF). When we empirically compare the DPF to Minkowski-type distance functions, the DPF performs significantly better in finding similar images. The effectiveness of the DPF can be well explained by similarity theories in cognitive psychology.

Proceedings ArticleDOI
04 Nov 2002
TL;DR: This paper addresses the issue of effective and efficient content based image retrieval by presenting a novel indexing and retrieval methodology that integrates color, texture, and shape information, and applies these features in regions obtained through unsupervised segmentation, as opposed to applying them to the whole image domain.
Abstract: This paper addresses the issue of effective and efficient content based image retrieval by presenting a novel indexing and retrieval methodology that integrates color, texture, and shape information for the indexing and retrieval, and applies these features in regions obtained through unsupervised segmentation, as opposed to applying them to the whole image domain. In order to address the typical color feature "inaccuracy" problem in the literature, fuzzy logic is applied to the traditional color histogram to solve for the problem to a certain degree. The similarity is defined through a balanced combination between global and regional similarity measures incorporating all the features. In order to further improve the retrieval efficiency, a secondary clustering technique is developed and employed to significantly save query processing time without compromising the retrieval precision. An implemented prototype system has demonstrated a promising retrieval performance for a test database containing 2000 general-purpose color images, as compared with its peer systems in the literature.

Journal ArticleDOI
TL;DR: This paper describes an indexing system for use in Content Based Image Retrieval that uses additional features in an attempt to incorporate shape and textual information to the index key.

Proceedings ArticleDOI
07 Apr 2002
TL;DR: This paper presents a new strategy to extract an image feature with high retrieval accuracy and proposes how to reduce the image feature dimension using the reward-punishment algorithm, so any robust indexing methods can be used.
Abstract: This paper introduces a new approach to content based image retrieval by texture. There are three problems to solve: high computational time, handling high dimension data, and comparing images consistent with human perception. To decrease the computational, time, we present a new strategy to extract an image feature with high retrieval accuracy. We also propose how to reduce the image feature dimension using the reward-punishment algorithm, so any robust indexing methods can be used. By weighting the extracted image features, a system may perceive the image consistently with human perception.

Journal ArticleDOI
TL;DR: In this paper, principal component analysis was used to represent and retrieve images on the basis of content, which reduces the dimensionality of the search to a basis set of prototype images that best describes the images.
Abstract: Most picture archiving and communication systems provide image search capabilities that support queries based on patient demographics and study descriptions. In a preliminary study, principal component analysis was used to represent and retrieve images on the basis of content. Principal component analysis reduces the dimensionality of the search to a basis set of prototype images that best describes the images. Each image is described by its projection on the basis set; a match to a query image is determined by comparing its projection vector on the basis set with that of the images in the database. The training image database consisted of 100 axial brain images from a three-dimensional T1-weighted magnetic resonance imaging study. The algorithm was evaluated by using 96 axial images from eight patients. Image retrieval was considered accurate if the automated algorithm returned the match section to within 3 mm of an expert-selected section; the retrieval accuracy was 83% when the images were preprocessed for uniformity in intensity and geometry. Principal component analysis can be applied to content-based retrieval of medical images. The algorithm is designed to be part of an automated image selection module that filters relevant images from an imaging study.

Proceedings ArticleDOI
10 Dec 2002
TL;DR: This paper proposes a whole retrieval strategy based on a new relevance feedback scheme and on a long-term similarity learning algorithm which uses feedback information of previous sessions, without additional need of user interaction.
Abstract: This paper presents a new learning technique for the similarity model refinement in CBIR systems. We propose a whole retrieval strategy based on a new relevance feedback scheme and on a long-term similarity learning algorithm which uses feedback information of previous sessions. We introduce this technique as the simple evolution of the short-term relevance feedback approach into a long-term similarity learning, without additional need of user interaction. Our algorithm is validated via a quality assessment realized on a heterogeneous database of 1,200 color images.

Journal ArticleDOI
TL;DR: This method offers substantial efficiency as images are processed in compressed format, information that was derived during the original compression of the images is reused, and extensive early pruning is possible.

Proceedings ArticleDOI
07 Aug 2002
TL;DR: The linear and kernel-based biased discriminant analysis, or BiasMap, is introduced to fit the unique nature of relevance feedback as a small sample biased classification problem and a WARF (word association via relevance feedback) formula is presented.
Abstract: In this paper we address several aspects of the learning problem in content-based image retrieval (CBIR). First, we introduce the linear and kernel-based biased discriminant analysis, or BiasMap, to fit the unique nature of relevance feedback as a small sample biased classification problem. Secondly, a WARF (word association via relevance feedback) formula is presented for learning keyword relations during the process of relevance feedback. We also introduce our new user interface for CBIR, ImageGrouper, which is designed to support more sophisticated user feedbacks and annotations. Finally, we use the D-EM (Discriminant-EM) algorithm as a way of exploiting unlabeled data in CBIR and offer some insights as to when unlabeled data will help.

Proceedings ArticleDOI
26 Aug 2002
TL;DR: An effective content-based visual image retrieval system that uses a color label histogram with only thirteen bins to extract the color information from an image in the image database and generates the spatial feature of an image automatically.
Abstract: An effective content-based visual image retrieval system is presented. This system consists of two main components: visual content extraction and indexing, and query engine. Each image in the image database is represented by its visual features: color and spatial information. The system uses a color label histogram with only thirteen bins to extract the color information from an image in the image database. A unique unsupervised segmentation algorithm combined with the wavelet technique generates the spatial feature of an image automatically. The resulting feature vectors are relatively low in dimensions compared to those in other systems. The query engine employs a color filter and a spatial filter to dramatically reduce the search range. As a result, queue processing is speeded up. The experimental results demonstrate that our system is capable of retrieving images that belong to the same category.