scispace - formally typeset
Open Access

Efficient image retrieval by exploiting vertical fragmentation

Reads0
Chats0
TLDR
An efficient similarity search method that is robust to dimensionality and has optimal space complexity is proposed, and the implementation of the algorithm in Monet illustrates that core database technology supports image retrieval well, without special extensions.
Abstract
In content-based retrieval systems, the goal of similarity search is to identify the k most similar images to a given example. Images are represented and queried by high-dimensional feature vectors encoding dominant characteristics like color and texture. We propose an efficient similarity search method that is robust to dimensionality and has optimal space complexity. Our approach fragments the feature vectors vertically, and computes the similarity of all images dimension by dimension. The innovation lies in gradually removing images that cannot participate in the response set. We show how to apply this scheme for two common similarity metrics, namely histogram intersection and euclidean distance. The implementation of our algorithm in Monet illustrates that core database technology supports image retrieval well, without special extensions. Finally, we report the effectiveness of our approach on real and synthetic data sets, and show significant improvements in response time yielded.

read more

Citations
More filters

Querying Sparse Matrices for Information Retrieval

TL;DR: This thesis proposes an innovation in the search system engineering process, by introducing a layered approach typical of database systems, which enables more flexibility in the IR system's architecture and is evaluated in terms of flexibility and run-time efficiency.
Journal Article

Cellular DBMS: An Attempt Towards Biologically-Inspired Data Management

TL;DR: An aspect-oriented programming based model to equip Cellular DBMS architecture with autonomy is explained and an extension to decomposed storage model (DSM) for use in cellular DBMS is presented.

Cellular DBMS: Customizable and autonomous data management using a RISC-style architecture

TL;DR: The Cellular DBMS architecture is presented, which is designed according to the RISC-style self-tuning database architecture proposed by Chaudhuri and Weikum in their VLDB 2000 paper and implemented as Evolutionary Column-oriented Storage (ECOS), which supports the storage model customization at table-level using different variations of the decomposed storage model.
Book ChapterDOI

ECOS: evolutionary column-oriented storage

TL;DR: The capability of self-tuning data management with minimal human intervention, which is the main design goal for ECOS, is achieved by dynamically adjusting the storage structures of a column-oriented storage manager according to data size and access characteristics.
References
More filters
Proceedings ArticleDOI

R-trees: a dynamic index structure for spatial searching

TL;DR: A dynamic index structure called an R-tree is described which meets this need, and algorithms for searching and updating it are given and it is concluded that it is useful for current database systems in spatial applications.
Journal ArticleDOI

Color indexing

TL;DR: In this paper, color histograms of multicolored objects provide a robust, efficient cue for indexing into a large database of models, and they can differentiate among a large number of objects.
Proceedings Article

A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces

TL;DR: It is shown formally that partitioning and clustering techniques for similarity search in HDVSs exhibit linear complexity at high dimensionality, and that existing methods are outperformed on average by a simple sequential scan if the number of dimensions exceeds around 10.
Proceedings ArticleDOI

Nearest neighbor queries

TL;DR: This paper presents an efficient branch-and-bound R-tree traversal algorithm to find the nearest neighbor object to a point, and then generalizes it to finding the k nearest neighbors.
Book ChapterDOI

The X-tree: an index structure for high-dimensional data

TL;DR: A new organization of the directory is introduced which uses a split algorithm minimizing overlap and additionally utilizes the concept of supernodes to keep the directory as hierarchical as possible, and at the same time to avoid splits in the directory that would result in high overlap.
Related Papers (5)