scispace - formally typeset
Book ChapterDOI

HPAT indexing for fast object/scene recognition based on local appearance

Reads0
Chats0
TLDR
The paper describes a fast system for appearance based image recognition that uses local invariant descriptors and efficient nearest neighbor search to overcomes the drawbacks of most binary tree-like indexing techniques, namely the high complexity in high dimensional data sets and the boundary problem.
Abstract
The paper describes a fast system for appearance based image recognition. It uses local invariant descriptors and efficient nearest neighbor search. First, local affine invariant regions are found nested at multiscale intensity extremas. These regions are characterized by nine generalized color moment invariants. An efficient novel method called HPAT (hyper-polyhedron with adaptive threshold) is introduced for efficient localization of the nearest neighbor in feature space. The invariants make the method robust against changing illumination and viewpoint. The locality helps to resolve occlusions. The proposed indexing method overcomes the drawbacks of most binary tree-like indexing techniques, namely the high complexity in high dimensional data sets and the boundary problem. The database representation is very compact and the retrieval close to realtime on a standard PC. The performance of the proposed method is demonstrated on a public database containing 1005 images of urban scenes. Experiments with an image database containing objects are also presented.

read more

Citations
More filters
Journal ArticleDOI

A survey of content-based image retrieval with high-level semantics

TL;DR: This paper attempts to provide a comprehensive survey of the recent technical achievements in high-level semantic-based image retrieval, identifying five major categories of the state-of-the-art techniques in narrowing down the 'semantic gap'.
Journal ArticleDOI

Content-based multimedia information retrieval: State of the art and challenges

TL;DR: This survey reviews 100+ recent articles on content-based multimedia information retrieval and discusses their role in current research directions which include browsing and search paradigms, user studies, affective computing, learning, semantic queries, new features and media types, high performance indexing, and evaluation techniques.
Proceedings ArticleDOI

Matching with PROSAC - progressive sample consensus

Ondrej Chum, +1 more
TL;DR: A new robust matching method, PROSAC, which exploits the linear ordering defined on the set of correspondences by a similarity function used in establishing tentative correspondences and achieves large computational savings.
Proceedings ArticleDOI

City-Scale Location Recognition

TL;DR: It is shown that by carefully selecting the vocabulary using the most informative features, retrieval performance is significantly improved, allowing us to increase the number of database images by a factor of 10.
Journal ArticleDOI

Features for image retrieval: an experimental comparison

TL;DR: An experimental comparison of a large number of different image descriptors for content-based image retrieval is presented and the often used, but very simple, color histogram performs well in the comparison and thus can be recommended as a simple baseline for many applications.
References
More filters
Journal ArticleDOI

Robust wide-baseline stereo from maximally stable extremal regions

TL;DR: The high utility of MSERs, multiple measurement regions and the robust metric is demonstrated in wide-baseline experiments on image pairs from both indoor and outdoor scenes.
Proceedings ArticleDOI

Robust wide baseline stereo from maximally stable extremal regions

TL;DR: The wide-baseline stereo problem, i.e. the problem of establishing correspondences between a pair of images taken from different viewpoints, is studied and an efficient and practically fast detection algorithm is presented for an affinely-invariant stable subset of extremal regions, the maximally stable extremal region (MSER).
Book ChapterDOI

An Affine Invariant Interest Point Detector

TL;DR: A novel approach for detecting affine invariant interest points that can deal with significant affine transformations including large scale changes and shows an excellent performance in the presence of large perspective transformations including significant scale changes.
Proceedings ArticleDOI

Reliable feature matching across widely separated views

A. Baumberg
TL;DR: A robust method for automatically matching features in images corresponding to the same physical point on an object seen from two arbitrary viewpoints that is optimised for a structure-from-motion application where it wishes to ignore unreliable matches at the expense of reducing the number of feature matches.
Proceedings ArticleDOI

Wide Baseline Stereo Matching based on Local, Affinely Invariant Regions

TL;DR: This work presents an alternative method for extracting invariant regions that does not depend on the presence of edges or corners in the image but is purely intensity-based, and demonstrates the use of such regions for another application, which is wide baseline stereo matching.