Efficient image retrieval by exploiting vertical fragmentation

Open Access

Efficient image retrieval by exploiting vertical fragmentation

Arjen P. de Vries, +3 more

- Iss: 9, pp 1-18

Chats0

TLDR

An efficient similarity search method that is robust to dimensionality and has optimal space complexity is proposed, and the implementation of the algorithm in Monet illustrates that core database technology supports image retrieval well, without special extensions.

Abstract:

In content-based retrieval systems, the goal of similarity search is to identify the k most similar images to a given example. Images are represented and queried by high-dimensional feature vectors encoding dominant characteristics like color and texture. We propose an efficient similarity search method that is robust to dimensionality and has optimal space complexity. Our approach fragments the feature vectors vertically, and computes the similarity of all images dimension by dimension. The innovation lies in gradually removing images that cannot participate in the response set. We show how to apply this scheme for two common similarity metrics, namely histogram intersection and euclidean distance. The implementation of our algorithm in Monet illustrates that core database technology supports image retrieval well, without special extensions. Finally, we report the effectiveness of our approach on real and synthetic data sets, and show significant improvements in response time yielded.

Citations

PDF

Open Access

More filters

Querying Sparse Matrices for Information Retrieval

R. Cornacchia

TL;DR: This thesis proposes an innovation in the search system engineering process, by introducing a layered approach typical of database systems, which enables more flexibility in the IR system's architecture and is evaluated in terms of flexibility and run-time efficiency.

...read moreread less

Journal Article

Cellular DBMS: An Attempt Towards Biologically-Inspired Data Management

Syed Saif ur Rahman, +2 more

- 01 Apr 2010 -

Journal of Digital Information Managemen...

TL;DR: An aspect-oriented programming based model to equip Cellular DBMS architecture with autonomy is explained and an extension to decomposed storage model (DSM) for use in cellular DBMS is presented.

...read moreread less

Cellular DBMS: Customizable and autonomous data management using a RISC-style architecture

Gunter Saake, +2 more

TL;DR: The Cellular DBMS architecture is presented, which is designed according to the RISC-style self-tuning database architecture proposed by Chaudhuri and Weikum in their VLDB 2000 paper and implemented as Evolutionary Column-oriented Storage (ECOS), which supports the storage model customization at table-level using different variations of the decomposed storage model.

...read moreread less

Book ChapterDOI

ECOS: evolutionary column-oriented storage

Syed Saif ur Rahman, +2 more

TL;DR: The capability of self-tuning data management with minimal human intervention, which is the main design goal for ECOS, is achieved by dynamically adjusting the storage structures of a column-oriented storage manager according to data size and access characteristics.

...read moreread less

References

PDF

Open Access

More filters

Proceedings ArticleDOI

R-trees: a dynamic index structure for spatial searching

Antonin Guttman

TL;DR: A dynamic index structure called an R-tree is described which meets this need, and algorithms for searching and updating it are given and it is concluded that it is useful for current database systems in spatial applications.

...read moreread less

Journal ArticleDOI

Color indexing

Michael J. Swain, +1 more

- 01 Nov 1991 -

International Journal of Computer Vision

TL;DR: In this paper, color histograms of multicolored objects provide a robust, efficient cue for indexing into a large database of models, and they can differentiate among a large number of objects.

...read moreread less

Proceedings Article

A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces

Roger Weber, +2 more

TL;DR: It is shown formally that partitioning and clustering techniques for similarity search in HDVSs exhibit linear complexity at high dimensionality, and that existing methods are outperformed on average by a simple sequential scan if the number of dimensions exceeds around 10.

...read moreread less

Proceedings ArticleDOI

Nearest neighbor queries

Nick Roussopoulos, +2 more

TL;DR: This paper presents an efficient branch-and-bound R-tree traversal algorithm to find the nearest neighbor object to a point, and then generalizes it to finding the k nearest neighbors.

...read moreread less

Book ChapterDOI

The X-tree: an index structure for high-dimensional data

Stefan Berchtold, +2 more

TL;DR: A new organization of the directory is introduced which uses a split algorithm minimizing overlap and additionally utilizes the concept of supernodes to keep the directory as hierarchical as possible, and at the same time to avoid splits in the directory that would result in high overlap.

...read moreread less