scispace - formally typeset
Open AccessJournal ArticleDOI

Robust wide-baseline stereo from maximally stable extremal regions

TLDR
The high utility of MSERs, multiple measurement regions and the robust metric is demonstrated in wide-baseline experiments on image pairs from both indoor and outdoor scenes.
About
This article is published in Image and Vision Computing.The article was published on 2004-09-01 and is currently open access. It has received 3422 citations till now. The article focuses on the topics: Maximally stable extremal regions & Epipolar geometry.

read more

Citations
More filters
Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Journal ArticleDOI

Fiji: an open-source platform for biological-image analysis

TL;DR: Fiji is a distribution of the popular open-source software ImageJ focused on biological-image analysis that facilitates the transformation of new algorithms into ImageJ plugins that can be shared with end users through an integrated update system.

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: The Scale-Invariant Feature Transform (or SIFT) algorithm is a highly robust method to extract and consequently match distinctive invariant features from images that can then be used to reliably match objects in diering images.
Book ChapterDOI

SURF: speeded up robust features

TL;DR: A novel scale- and rotation-invariant interest point detector and descriptor, coined SURF (Speeded Up Robust Features), which approximates or even outperforms previously proposed schemes with respect to repeatability, distinctiveness, and robustness, yet can be computed and compared much faster.
Journal ArticleDOI

Speeded-Up Robust Features (SURF)

TL;DR: A novel scale- and rotation-invariant detector and descriptor, coined SURF (Speeded-Up Robust Features), which approximates or even outperforms previously proposed schemes with respect to repeatability, distinctiveness, and robustness, yet can be computed and compared much faster.
References
More filters
Proceedings ArticleDOI

Object recognition from local scale-invariant features

TL;DR: Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.
Book

Multiple view geometry in computer vision

TL;DR: In this article, the authors provide comprehensive background material and explain how to apply the methods and implement the algorithms directly in a unified framework, including geometric principles and how to represent objects algebraically so they can be computed and applied.
Journal ArticleDOI

Watersheds in digital spaces: an efficient algorithm based on immersion simulations

TL;DR: A fast and flexible algorithm for computing watersheds in digital gray-scale images is introduced, based on an immersion process analogy, which is reported to be faster than any other watershed algorithm.
Journal ArticleDOI

Local grayvalue invariants for image retrieval

TL;DR: This paper addresses the problem of retrieving images from large image databases with a method based on local grayvalue invariants which are computed at automatically detected interest points and allows for efficient retrieval from a database of more than 1,000 images.
Book ChapterDOI

An Affine Invariant Interest Point Detector

TL;DR: A novel approach for detecting affine invariant interest points that can deal with significant affine transformations including large scale changes and shows an excellent performance in the presence of large perspective transformations including significant scale changes.
Frequently Asked Questions (18)
Q1. What is the definition of a MSER?

The MSERs are sets of image elements, closed under the affine transformation of image coordinates and invariant to affine transformation of intensity. 

The wide-baseline stereo problem, i. e. the problem of establishing correspondences between a pair of images taken from different viewpoints is studied. A new set of image elements that are put into correspondence, the so called extremal regions, is introduced. Extremal regions possess highly desirable properties: the set is closed under 1. continuous ( and thus projective ) transformation of image coordinates and 2. monotonic transformation of image intensities. An efficient ( near linear complexity ) and practically fast detection algorithm ( near frame rate ) is presented for an affinely-invariant stable subset of extremal regions, the maximally stable extremal regions ( MSER ). 

In future work, the authors intend to proceed towards fully automatic projective reconstruction of the 3D scene, which requires computing projective reconstruction and dense matching. 

Finding epipolar geometry consistent with the largest number of tentative (local) correspondences is the final step of all wide-baseline algorithms. 

The three main novelties are: the introduction of MSERs, robust matching of local features and the use of multiple scaled measurement regions. 

distinguished regions or their scaled version serve as measurement regions and tentative correspondences are established by comparing invariants using Mahalanobis distance [10, 16, 11]. 

In future work, the authors intend to proceed towards fully automatic projective reconstruction of the 3D scene, which requires computing projective reconstruction and dense matching. 

Important design decisions at this stage include: 1. the choice of measurement regions, i.e. the parts of the image on which invariants are computed, 2. the method of selecting tentative correspondences given the invariant description and 3. 

A merge of two components is viewed as termination of existence of the smaller component and an insertion of all pixels of the smaller component into the larger one. 

an affine transformation between pairs of potentially corresponding DRs, i.e. the DRs consistent with the rough EG, is computed. 

Since matching is accomplished in a robust manner, the authors benefit from the increase of distinctiveness of large regions without being severely affected by clutter or non-planarity of the DR’s pre-image. 

Since the influential paper by Schmid and Mohr [11] many image matching and wide-baseline stereo algorithms have been proposed, most commonly usingHarris interest points as distinguished regions. 

A measurement taken from an almost planar patch of the scene with stable invariant description will be referred to as a ’good measurement’. 

The robustness of the proposed similarity measure allows us to use invariants from a collection of measurement regions, even some that are much larger than the associated distinguished region. 

Due to the robustness, the authors were able to consider invariants from multiple measurement regions, even some that were significantly larger (and hence probably discriminative) than the associated MSER. 

Probabilistic analysis of the likelihood of the success of the procedure is not simple, since the distribution of invariants and their noise is image-dependent. 

Finally the authors remark that MSERs can be defined on any image (even high-dimensional) whose pixel values are from a totally ordered set. 

In the wide-baseline set-up, local image deformations cannot be realistically approximated by translation or translation with rotation and a full affine model is required.