Proceedings ArticleDOI

Robust wide baseline stereo from maximally stable extremal regions

01 Jan 2002 - pp 1-10
TL;DR: The wide-baseline stereo problem, i.e. the problem of establishing correspondences between a pair of images taken from different viewpoints, is studied and an efficient and practically fast detection algorithm is presented for an affinely-invariant stable subset of extremal regions, the maximally stable extremal region (MSER).
Abstract: The wide-baseline stereo problem, i.e. the problem of establishing correspondences between a pair of images taken from different viewpoints, is studied. A new set of image elements that are put into correspondence, the so called extremal regions, is introduced. Extremal regions possess highly desirable properties: the set is closed under (1) continuous (and thus projective) transformation of image coordinates and (2) monotonic transformation of image intensities. An efficient (near linear complexity) and practically fast detection algorithm (near frame rate) is presented for an affinely invariant stable subset of extremal regions, the maximally stable extremal regions (MSER). A new robust similarity measure for establishing tentative correspondences is proposed. The robustness ensures that invariants from multiple measurement regions (regions obtained by invariant constructions from extremal regions), some that are significantly larger (and hence discriminative) than the MSERs, may be used to establish tentative correspondences. The high utility of MSERs, multiple measurement regions and the robust metric is demonstrated in wide-baseline experiments on image pairs from both indoor and outdoor scenes. Significant change of scale (3.5×), illumination conditions, out-of-plane rotation, occlusion, locally anisotropic scale change and 3D translation of the viewpoint are all present in the test problems. Good estimates of epipolar geometry (average distance from corresponding points to the epipolar line below 0.09 of the inter-pixel distance) are obtained.

Summary (2 min read)

1 Introduction

  • Finding reliable correspondences in two images of a scene taken from arbitrary viewpoints viewed with possibly different cameras and in different illumination conditions is a difficult and critical step towards fully automatic reconstruction of 3D scenes [5].
  • Successful wide-baseline experiments on indoor and outdoor datasets presented in Section 4 demonstrate the potential of MSERs.
  • Finding epipolar geometry consistent with the largest number of tentative correspondences is the final step of all wide-baseline algorithms.
  • Baumberg [1] applied an iterative scheme originally proposed by Lindeberg and Garding to associate affine-invariant measurement regions with Harris interest points.
  • Maximally Stable Extremal Regions are defined and their detection algorithm is described in Section 2.

2 Maximally Stable Extremal Regions

  • The authors introduce a new type of image elements useful in wide-baseline matching — the Maximally Stable Extremal Regions.
  • The concept can be explained informally as follows.
  • Finally, intensity levels that are local minima of the rate of change of the area function are selected as thresholds producing maximally stable extremal regions.
  • Every extremal region is a connected component of a thresholded image.
  • The output of the MSER detector is not a binarized image.

3 The proposed robust wide-baseline algorithm

  • As a first step, the DRs are detected - the MSERs computed on the intensity image (MSER+) and on the inverted image (MSER-).
  • Smaller measurement regions are both more likely to satisfy the planarity condition and not to cross a discontinuity in depth or orientation.
  • In all experiments, rotational invariants (based on complex moments) were used after applying a transformation that diagonalises the covariance matrix of the DR.
  • First, an affine transformation between pairs of potentially corresponding DRs, i.e. the DRs consistent with the rough EG, is computed.
  • Next, DR correspondences are pruned and only those with correlation of their transformed images above a threshold are selected.

4 Experiments

  • The following experiments were conducted: Bookshelf (Fig. 1).
  • The part of the scene visible in both views covers a small fraction of the image.
  • The regions matched on the box demonstrate performance on a non-planar surface.
  • The final number of correspondences is given in the penultimate column ’fine EG’.
  • The authors can see that the precision of the estimated epipolar geometry is very high, much higher than the precision of the rough EG.

5 Conclusions

  • In the paper, a new method for wide-baseline matching was proposed.
  • The three main novelties are: the introduction of MSERs, robust matching of local features and the use of multiple scaled measurement regions.
  • Another novelty of the approach is the use of a robust similarity measure for establishing tentative correspondences.
  • The average distance from corresponding points to the epipolar line was below 0.09 of the inter-pixel distance.
  • Test images included both outdoor and indoor scenes, some already used in published work.


Robust Wide Baseline Stereo from Maximally Stable Extremal Regions

J. Matas¹,², O. Chum¹, M. Urban¹, T. Pajdla¹

¹ Center for Machine Perception, Dept. of Cybernetics, CTU Prague, Karlovo nám 13, CZ 121 35
² CVSSP, University of Surrey, Guildford GU2 7XH, UK
[matas, chum]@cmp.felk.cvut.cz

BMVC 2002 doi:10.5244/C.16.36
Abstract

The wide-baseline stereo problem, i.e. the problem of establishing correspondences between a pair of images taken from different viewpoints, is studied. A new set of image elements that are put into correspondence, the so called extremal regions, is introduced. Extremal regions possess highly desirable properties: the set is closed under 1. continuous (and thus projective) transformation of image coordinates and 2. monotonic transformation of image intensities. An efficient (near linear complexity) and practically fast detection algorithm (near frame rate) is presented for an affinely-invariant stable subset of extremal regions, the maximally stable extremal regions (MSER).

A new robust similarity measure for establishing tentative correspondences is proposed. The robustness ensures that invariants from multiple measurement regions (regions obtained by invariant constructions from extremal regions), some that are significantly larger (and hence discriminative) than the MSERs, may be used to establish tentative correspondences.

The high utility of MSERs, multiple measurement regions and the robust metric is demonstrated in wide-baseline experiments on image pairs from both indoor and outdoor scenes. Significant change of scale (3.5×), illumination conditions, out-of-plane rotation, occlusion, locally anisotropic scale change and 3D translation of the viewpoint are all present in the test problems. Good estimates of epipolar geometry (average distance from corresponding points to the epipolar line below 0.09 of the inter-pixel distance) are obtained.
1 Introduction

Finding reliable correspondences in two images of a scene taken from arbitrary viewpoints viewed with possibly different cameras and in different illumination conditions is a difficult and critical step towards fully automatic reconstruction of 3D scenes [5]. A crucial issue is the choice of elements whose correspondence is sought. In the wide-baseline set-up, local image deformations cannot be realistically approximated by translation or translation with rotation and a full affine model is required. Correspondence cannot therefore be established by comparing regions of a fixed (Euclidean) shape like rectangles or circles since their shape is not preserved under affine transformation.

In most images there are regions that can be detected with high repeatability since they possess some distinguishing, invariant and stable property. We argue that such regions of, in general, data-dependent shape, called distinguished regions (DRs) in the paper, may serve as the elements to be put into correspondence either in stereo matching or object recognition.
The first contribution of the paper is the introduction of a new set of distinguished regions, the so called extremal regions. Extremal regions have two desirable properties. The set is closed under continuous (and thus perspective) transformation of image coordinates and, secondly, it is closed under monotonic transformation of image intensities. An efficient (near linear complexity) and practically fast detection algorithm is presented for an affinely-invariant stable subset of extremal regions, the maximally stable extremal regions (MSER). Robustness of a particular type of DR depends on the image data and must be tested experimentally. Successful wide-baseline experiments on indoor and outdoor datasets presented in Section 4 demonstrate the potential of MSERs.

Reliable extraction of a manageable number of potentially corresponding image elements is a necessary but certainly not a sufficient prerequisite for successful wide-baseline matching. With two sets of distinguished regions, the matching problem can be posed as a search in the correspondence space [3]. Forming a complete bipartite graph on the two sets of DRs and searching for a globally consistent subset of correspondences is clearly out of the question for computational reasons. Recently, a whole class of stereo matching and object recognition algorithms with common structure has emerged [9, 15, 1, 16, 2, 13, 7, 6]. These methods exploit local invariant descriptors to limit the number of tentative correspondences. Important design decisions at this stage include: 1. the choice of measurement regions, i.e. the parts of the image on which invariants are computed, 2. the method of selecting tentative correspondences given the invariant description and 3. the choice of invariants.

Typically, distinguished regions or their scaled version serve as measurement regions and tentative correspondences are established by comparing invariants using Mahalanobis distance [10, 16, 11]. As a second novelty of the presented approach, a robust similarity measure for establishing tentative correspondences is proposed to replace the Mahalanobis distance. The robustness of the proposed similarity measure allows us to use invariants from a collection of measurement regions, even some that are much larger than the associated distinguished region. Measurements from large regions are either very discriminative (it is very unlikely that two large parts of the image are identical) or completely wrong (e.g. if orientation or depth discontinuity becomes part of the region). The former helps establish reliable tentative (local) correspondences; the influence of the latter is limited due to the robustness of the approach.
Finding epipolar geometry consistent with the largest number of tentative (local) correspondences is the final step of all wide-baseline algorithms. RANSAC has been by far the most widely adopted method since [14]. The presented algorithm takes novel steps to increase the number of matched regions and the precision of the epipolar geometry. The rough epipolar geometry estimated from tentative correspondences is used to guide the search for further region matches. It restricts location to epipolar lines and provides an estimate of affine mapping between corresponding regions. This mapping allows the use of correlation to filter out mismatches. The process significantly increases precision of the EG estimate; the final average inlier distance-from-epipolar-line is below 0.1 pixel. For details see Section 3.
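To make this final step concrete, the following is a minimal sketch (not the authors' implementation) of RANSAC-based epipolar geometry estimation using OpenCV; the arrays pts1 and pts2 of tentative point correspondences and the inlier threshold are assumed inputs.

```python
# Hypothetical illustration: estimate the epipolar geometry from tentative
# correspondences with RANSAC and report the average distance of inlier
# points from their epipolar lines.
import numpy as np
import cv2

def estimate_epipolar_geometry(pts1, pts2, inlier_thresh=1.0):
    """pts1, pts2: Nx2 float32 arrays of tentative correspondences (assumed)."""
    F, mask = cv2.findFundamentalMat(pts1, pts2, cv2.FM_RANSAC, inlier_thresh, 0.99)
    inl = mask.ravel().astype(bool)
    # Epipolar lines in image 2 for the inlier points of image 1; the returned
    # line coefficients (a, b, c) are normalised so that a^2 + b^2 = 1.
    lines2 = cv2.computeCorrespondEpilines(pts1[inl].reshape(-1, 1, 2), 1, F).reshape(-1, 3)
    pts2_h = np.hstack([pts2[inl], np.ones((inl.sum(), 1), dtype=pts2.dtype)])
    dist = np.abs(np.sum(lines2 * pts2_h, axis=1))       # point-to-line distances
    return F, inl, dist.mean()
```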
Related work. Since the influential paper by Schmid and Mohr [11] many image matching and wide-baseline stereo algorithms have been proposed, most commonly using Harris interest points as distinguished regions.

Image I is a mapping I : D ⊂ Z² → S. Extremal regions are well defined on images if:

1. S is totally ordered, i.e. a reflexive, antisymmetric and transitive binary relation ≤ exists. In this paper only S = {0, 1, ..., 255} is considered, but extremal regions can be defined on e.g. real-valued images (S = R).

2. An adjacency (neighbourhood) relation A ⊂ D × D is defined. In this paper 4-neighbourhoods are used, i.e. p, q ∈ D are adjacent (pAq) iff Σ_{i=1}^{d} |p_i − q_i| ≤ 1.

Region Q is a contiguous subset of D, i.e. for each p, q ∈ Q there is a sequence p, a_1, a_2, ..., a_n, q and pAa_1, a_iAa_{i+1}, a_nAq.

(Outer) Region Boundary ∂Q = {q ∈ D \ Q : ∃ p ∈ Q : qAp}, i.e. the boundary ∂Q of Q is the set of pixels being adjacent to at least one pixel of Q but not belonging to Q.

Extremal Region Q ⊂ D is a region such that for all p ∈ Q, q ∈ ∂Q : I(p) > I(q) (maximum intensity region) or I(p) < I(q) (minimum intensity region).

Maximally Stable Extremal Region (MSER). Let Q_1, ..., Q_{i−1}, Q_i, ... be a sequence of nested extremal regions, i.e. Q_i ⊂ Q_{i+1}. Extremal region Q_i is maximally stable iff q(i) = |Q_{i+∆} \ Q_i| / |Q_i| has a local minimum at i (|·| denotes cardinality). ∆ ∈ S is a parameter of the method.

Table 1: Definitions used in Section 2
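As a small worked illustration of the stability criterion q(i) above (an assumed toy example, not code from the paper), one can take the areas |Q_i| of a nested sequence of extremal regions and mark the thresholds where the relative area change has a local minimum.

```python
# Toy illustration of q(i) = |Q_{i+Delta} \ Q_i| / |Q_i|; for nested regions
# the numerator equals |Q_{i+Delta}| - |Q_i|.
import numpy as np

def mser_stability(areas, delta):
    a = np.asarray(areas, dtype=float)          # a[i] = |Q_i| at threshold i
    q = np.full(a.shape, np.inf)
    q[:-delta] = (a[delta:] - a[:-delta]) / a[:-delta]
    # indices where q(i) has a local minimum -> maximally stable regions
    minima = [i for i in range(1, len(a) - 1) if q[i] <= q[i - 1] and q[i] <= q[i + 1]]
    return q, minima

areas = [10, 11, 11, 12, 40, 90, 200]           # made-up area growth with threshold
print(mser_stability(areas, delta=1))           # prints q and the indices of its local minima
```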
Tell and Carlsson [13] proposed a method where line segments connecting Harris interest points form measurement regions. The measurements are characterised by scale invariant Fourier coefficients. The Harris interest detector is stable over a range of scales, but defines no scale or affine invariant measurement region. Baumberg [1] applied an iterative scheme originally proposed by Lindeberg and Garding to associate affine-invariant measurement regions with Harris interest points. In [7], Mikolajczyk and Schmid show that a scale-invariant MR can be found around Harris interest points. In [9], Pritchett and Zisserman form groups of line segments and estimate local homographies using parallelograms as measurement regions. Tuytelaars and Van Gool introduced two new classes of affine-invariant distinguished regions, one based on local intensity extrema [16], the other using point and curve features [15]. In the latter approach, DRs are characterised by measurements from inside an ellipse, constructed in an affine invariant manner. Lowe [6] describes the 'Scale Invariant Feature Transform' approach which produces a scale and orientation-invariant characterisation of interest points.

The rest of the paper is structured as follows. Maximally Stable Extremal Regions are defined and their detection algorithm is described in Section 2. In Section 3, details of a novel robust matching algorithm are given. Experimental results on outdoor and indoor images taken with an uncalibrated camera are presented in Section 4. Presented experiments are summarized and the contributions of the paper are reviewed in Section 5.
2 Maximally Stable Extremal Regions

In this section, we introduce a new type of image elements useful in wide-baseline matching: the Maximally Stable Extremal Regions. The regions are defined solely by an extremal property of the intensity function in the region and on its outer boundary.

The concept can be explained informally as follows. Imagine all possible thresholdings of a gray-level image I. We will refer to the pixels below a threshold as 'black' and to those above or equal as 'white'. If we were shown a movie of thresholded images I_t, with frame t corresponding to threshold t, we would see first a white image. Subsequently black spots corresponding to local intensity minima will appear and grow. At some point regions corresponding to two local minima will merge. Finally, the last image will be black. The set of all connected components of all frames of the movie is the set of all maximal regions; minimal regions could be obtained by inverting the intensity of I and running the same process. The formal definition of the MSER concept and the necessary auxiliary definitions are given in Table 1.
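A brief sketch of this "movie of thresholdings" (purely illustrative; the efficient detector described below does not build it explicitly), assuming an 8-bit image and SciPy's connected-component labelling with its default 4-neighbourhood:

```python
import numpy as np
from scipy import ndimage

def threshold_movie(img):
    """img: 2-D uint8 array. Yields, per threshold, the labelled 'black' components."""
    for t in range(256):
        black = img < t                       # frame t: pixels below the threshold
        labels, n = ndimage.label(black)      # default structuring element = 4-neighbourhood
        yield t, n, labels                    # components appear, grow and merge as t increases
```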
In many images, local binarization is stable over a large range of thresholds in certain regions. Such regions are of interest since they possess the following properties:

  • Invariance to affine transformation of image intensities.
  • Covariance to adjacency preserving (continuous) transformation T : D → D on the image domain.
  • Stability, since only extremal regions whose support is virtually unchanged over a range of thresholds are selected.
  • Multi-scale detection. Since no smoothing is involved, both very fine and very large structure is detected.
  • The set of all extremal regions can be enumerated in O(n log log n), where n is the number of pixels in the image.
Enumeration of extremal regions proceeds as follows. First, pixels are sorted by intensity. The computational complexity of this step is O(n) if the range of image values S is small, e.g. the typical {0, ..., 255}, since the sort can be implemented as BINSORT [12]. After sorting, pixels are placed in the image (either in decreasing or increasing order) and the list of connected components and their areas is maintained using the efficient union-find algorithm [12]. The complexity of our union-find implementation is O(n log log n), i.e. almost linear¹. Importantly, the algorithm is very fast in practice. The MSER detection takes only 0.14 seconds on a Linux PC with the Athlon XP 1600+ processor for a 530×350 image (n = 185500).

The process produces a data structure storing the area of each connected component as a function of intensity. A merge of two components is viewed as termination of existence of the smaller component and an insertion of all pixels of the smaller component into the larger one. Finally, intensity levels that are local minima of the rate of change of the area function are selected as thresholds producing maximally stable extremal regions. In the output, each MSER is represented by position of a local intensity minimum (or maximum) and a threshold.
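The following is a compact sketch of the sorted-insertion / union-find scheme just described, an illustration under the stated assumptions (8-bit image, 4-neighbourhood) rather than the authors' implementation; it records, for each component, its area as pixels of increasing intensity are inserted, which is the data the MSER selection step operates on. np.argsort merely stands in for the linear-time BINSORT.

```python
import numpy as np

def component_area_evolution(img):
    """img: 2-D uint8 array. Returns {component root: [(intensity, area), ...]}."""
    h, w = img.shape
    flat = img.ravel()
    order = np.argsort(flat, kind='stable')            # stand-in for BINSORT
    parent = np.full(h * w, -1, dtype=np.int64)        # -1 means "not yet inserted"
    area = np.zeros(h * w, dtype=np.int64)
    history = {}                                        # area as a function of intensity

    def find(x):                                        # union-find with path halving
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for p in order:
        parent[p] = p
        area[p] = 1
        y, x = divmod(int(p), w)
        neighbours = [p - 1 if x > 0 else -1, p + 1 if x < w - 1 else -1,
                      p - w if y > 0 else -1, p + w if y < h - 1 else -1]
        for q in neighbours:
            if q >= 0 and parent[q] != -1:              # neighbour already inserted
                rp, rq = find(p), find(q)
                if rp != rq:
                    if area[rp] < area[rq]:             # the smaller component terminates,
                        rp, rq = rq, rp                 # its pixels join the larger one
                    parent[rq] = rp
                    area[rp] += area[rq]
        r = find(p)
        history.setdefault(r, []).append((int(flat[p]), int(area[r])))
    return history
```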
Notes. The structure of the above algorithm and of an efficient watershed algorithm [17] is essentially identical. However, the structure of the output of the two algorithms is different. The watershed is a partitioning of D, i.e. a set of regions R_i : ∪ R_i = D, R_j ∩ R_k = ∅ for j ≠ k. In watershed computation, focus is on the thresholds where regions merge (and two watersheds touch). Such thresholds are of little interest here, since they are highly unstable: after a merge, the region area jumps. In MSER detection, we seek a range of thresholds that leaves the watershed basin effectively unchanged. Detection of MSER is also related to thresholding. Every extremal region is a connected component of a thresholded image. However, no global or 'optimal' threshold is sought, all thresholds are tested and the stability of the connected components evaluated. The output of the MSER detector is not a binarized image. For some parts of the image, multiple stable thresholds exist and a system of nested subsets is output in this case. Finally, we remark that MSERs can be defined on any image (even high-dimensional) whose pixel values are from a totally ordered set.

¹ Even faster (but more complex) connected component algorithms exist with O(nα(n)) complexity, where α is the inverse Ackermann function; α(n) ≤ 4 for all practical n.
3 The proposed robust wide-baseline algorithm

Distinguished region detection. As a first step, the DRs are detected: the MSERs computed on the intensity image (MSER+) and on the inverted image (MSER-).

Measurement regions. A measurement region of arbitrary size may be associated with each DR, if the construction is affine-covariant. Smaller measurement regions are both more likely to satisfy the planarity condition and not to cross a discontinuity in depth or orientation. On the other hand, small regions are less discriminative, i.e. they are much less likely to be unique. Increasing the size of a measurement region carries the risk of including parts of background that are completely different in the two images considered. Clearly, the optimal size of a MR depends on the scene content and it is different for each DR. In [16], Tuytelaars et al. double the elliptical DR to increase discriminability, while keeping the probability of crossing object boundaries at an acceptable level.

In the proposed algorithm, measurement regions are selected at multiple scales: the DR itself, and the 1.5, 2 and 3 times scaled convex hull of the DR. Since matching is accomplished in a robust manner, we benefit from the increase of distinctiveness of large regions without being severely affected by clutter or non-planarity of the DR's pre-image. This is a novelty of our approach. Commonly, Mahalanobis distance has been used in MR matching. However, the non-robustness of this metric means that matching may fail because of a single corrupted measurement (this happened in the experiments reported below).
Invariant description. In all experiments, rotational invariants (based on complex moments) were used after applying a transformation that diagonalises the covariance matrix of the DR. In combination, this is an affinely-invariant procedure. A combination of rotational and affinely invariant generalised colour moments [8] gave a similar result. On their own, these affine invariants failed on problems with a large scale change.
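A short sketch (assumed, not the paper's code) of the normalisation step described above: transform the DR's pixel coordinates so that their covariance matrix becomes the identity; rotational invariants computed after this whitening yield an affinely-invariant description up to the remaining rotation ambiguity.

```python
import numpy as np

def affine_normalise(dr_pixels):
    """dr_pixels: Nx2 array of region coordinates with non-degenerate covariance."""
    pts = np.asarray(dr_pixels, dtype=float)
    c = pts.mean(axis=0)
    cov = np.cov((pts - c).T)                    # 2x2 covariance of the region shape
    w, V = np.linalg.eigh(cov)
    T = V @ np.diag(1.0 / np.sqrt(w)) @ V.T      # inverse square root of the covariance
    return (pts - c) @ T.T, T                    # whitened coordinates and the transform
```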
Robust matching. A measurement taken from an almost planar patch of the scene with stable invariant description will be referred to as a 'good measurement'. Unstable measurements or those computed on non-planar surfaces or at discontinuities in depth or orientation will be referred to as 'corrupted measurements'.

The robust similarity is computed as follows. For each measurement M^i_A on region A, the k regions B_1, ..., B_k from the other image with the corresponding i-th measurements M^i_{B_1}, ..., M^i_{B_k} nearest to M^i_A are found, and a vote is cast suggesting correspondence of A and each of B_1, ..., B_k. Votes are summed over all measurements. In the current implementation 216 invariants at each scale, i.e. a total of 864 measurements, are used (i ∈ [1, 864]). The DRs with the largest number of votes are the candidates for tentative correspondences. Experimentally, we found that k set to 1% of the number of regions gives good results.

Probabilistic analysis of the likelihood of the success of the procedure is not simple, since the distribution of invariants and their noise is image-dependent. We therefore only suppose that corrupted measurements spread their votes randomly, not conspiring to create a high score, and that good measurements are more likely to vote for correct matches.
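An illustrative sketch of this voting scheme (the array shapes and names are assumptions; in the paper each region carries 864 scalar measurements and k is set to roughly 1% of the number of regions):

```python
import numpy as np

def vote_tentative_correspondences(meas_A, meas_B, k):
    """meas_A: (nA, m) and meas_B: (nB, m) arrays of invariant measurements."""
    n_a, m = meas_A.shape
    votes = np.zeros((n_a, meas_B.shape[0]), dtype=int)
    for i in range(m):                                   # each measurement independently
        d = np.abs(meas_A[:, i:i + 1] - meas_B[:, i])    # (nA, nB) distances on measurement i
        nearest = np.argsort(d, axis=1)[:, :k]           # k nearest regions in the other image
        for a in range(n_a):
            votes[a, nearest[a]] += 1                    # one vote per near measurement
    # region pairs with the most votes become the tentative correspondences;
    # corrupted measurements only spread their votes and rarely conspire
    return votes
```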

Citations
Journal ArticleDOI
TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Abstract: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene. The features are invariant to image scale and rotation, and are shown to provide robust matching across a substantial range of affine distortion, change in 3D viewpoint, addition of noise, and change in illumination. The features are highly distinctive, in the sense that a single feature can be correctly matched with high probability against a large database of features from many images. This paper also describes an approach to using these features for object recognition. The recognition proceeds by matching individual features to a database of features from known objects using a fast nearest-neighbor algorithm, followed by a Hough transform to identify clusters belonging to a single object, and finally performing verification through least-squares solution for consistent pose parameters. This approach to recognition can robustly identify objects among clutter and occlusion while achieving near real-time performance.

46,906 citations


Cites background or methods from "Robust wide baseline stereo from ma..."

  • ...In what appears to be the most affine-invariant method, Mikolajczyk (2002) has proposed and run detailed experiments with the Harris-affine detector....


  • ...Matas et al. (2002) have shown that their maximally-stable extremal regions can produce large numbers of matching features with good stability....


Journal ArticleDOI
TL;DR: Fiji is a distribution of the popular open-source software ImageJ focused on biological-image analysis that facilitates the transformation of new algorithms into ImageJ plugins that can be shared with end users through an integrated update system.
Abstract: Fiji is a distribution of the popular open-source software ImageJ focused on biological-image analysis. Fiji uses modern software engineering practices to combine powerful software libraries with a broad range of scripting languages to enable rapid prototyping of image-processing algorithms. Fiji facilitates the transformation of new algorithms into ImageJ plugins that can be shared with end users through an integrated update system. We propose Fiji as a platform for productive collaboration between computer science and biology research communities.

43,540 citations

Book ChapterDOI
07 May 2006
TL;DR: A novel scale- and rotation-invariant interest point detector and descriptor, coined SURF (Speeded Up Robust Features), which approximates or even outperforms previously proposed schemes with respect to repeatability, distinctiveness, and robustness, yet can be computed and compared much faster.
Abstract: In this paper, we present a novel scale- and rotation-invariant interest point detector and descriptor, coined SURF (Speeded Up Robust Features). It approximates or even outperforms previously proposed schemes with respect to repeatability, distinctiveness, and robustness, yet can be computed and compared much faster. This is achieved by relying on integral images for image convolutions; by building on the strengths of the leading existing detectors and descriptors (in casu, using a Hessian matrix-based measure for the detector, and a distribution-based descriptor); and by simplifying these methods to the essential. This leads to a combination of novel detection, description, and matching steps. The paper presents experimental results on a standard evaluation set, as well as on imagery obtained in the context of a real-life object recognition application. Both show SURF's strong performance.

13,011 citations

Journal ArticleDOI
TL;DR: A novel scale- and rotation-invariant detector and descriptor, coined SURF (Speeded-Up Robust Features), which approximates or even outperforms previously proposed schemes with respect to repeatability, distinctiveness, and robustness, yet can be computed and compared much faster.

12,449 citations



Proceedings ArticleDOI
Sivic, Zisserman
13 Oct 2003
TL;DR: An approach to object and scene retrieval which searches for and localizes all the occurrences of a user outlined object in a video, represented by a set of viewpoint invariant region descriptors so that recognition can proceed successfully despite changes in viewpoint, illumination and partial occlusion.
Abstract: We describe an approach to object and scene retrieval which searches for and localizes all the occurrences of a user outlined object in a video. The object is represented by a set of viewpoint invariant region descriptors so that recognition can proceed successfully despite changes in viewpoint, illumination and partial occlusion. The temporal continuity of the video within a shot is used to track the regions in order to reject unstable regions and reduce the effects of noise in the descriptors. The analogy with text retrieval is in the implementation where matches on descriptors are pre-computed (using vector quantization), and inverted file systems and document rankings are used. The result is that retrieval is immediate, returning a ranked list of key frames/shots in the manner of Google. The method is illustrated for matching in two full length feature films.

6,938 citations


Additional excerpts

  • ...The implementation details are given in [7]....


References
Proceedings ArticleDOI
20 Sep 1999
TL;DR: Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.
Abstract: An object recognition system has been developed that uses a new class of local image features. The features are invariant to image scaling, translation, and rotation, and partially invariant to illumination changes and affine or 3D projection. These features share similar properties with neurons in inferior temporal cortex that are used for object recognition in primate vision. Features are efficiently detected through a staged filtering approach that identifies stable points in scale space. Image keys are created that allow for local geometric deformations by representing blurred image gradients in multiple orientation planes and at multiple scales. The keys are used as input to a nearest neighbor indexing method that identifies candidate object matches. Final verification of each match is achieved by finding a low residual least squares solution for the unknown model parameters. Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.

16,989 citations


"Robust wide baseline stereo from ma..." refers background in this paper

  • ...Lowe [7] describes the ‘Scale Invariant Feature Transform’ approach which produces a scale and orientation-invariant characterisation of interest points....


  • ...Recently, a whole class of stereo matching and object recognition algorithms with common structure has emerged [1,3,7,9,10,13,15,18,20,21]....


Book
01 Jan 2000
TL;DR: In this article, the authors provide comprehensive background material and explain how to apply the methods and implement the algorithms directly in a unified framework, including geometric principles and how to represent objects algebraically so they can be computed and applied.
Abstract: From the Publisher: A basic problem in computer vision is to understand the structure of a real world scene given several images of it. Recent major developments in the theory and practice of scene reconstruction are described in detail in a unified framework. The book covers the geometric principles and how to represent objects algebraically so they can be computed and applied. The authors provide comprehensive background material and explain how to apply the methods and implement the algorithms directly.

15,558 citations



"Robust wide baseline stereo from ma..." refers background or methods in this paper

  • ...Finding reliable correspondences in two images of a scene taken from arbitrary viewpoints viewed with possibly different cameras and in different illumination conditions is a difficult and critical step towards fully automatic reconstruction of 3D scenes [5]....


  • ...After establishing the ‘rough EG’ the so-called ‘guided matching’ step is applied [2,5]....


Journal ArticleDOI
TL;DR: A fast and flexible algorithm for computing watersheds in digital gray-scale images is introduced, based on an immersion process analogy, which is reported to be faster than any other watershed algorithm.
Abstract: A fast and flexible algorithm for computing watersheds in digital gray-scale images is introduced. A review of watersheds and related notions is first presented, and the major methods to determine watersheds are discussed. The algorithm is based on an immersion process analogy, in which the flooding of the water in the picture is efficiently simulated using a queue of pixels. It is described in detail, provided in a pseudo C language. The accuracy of this algorithm is proven to be superior to that of the existing implementations, and it is shown that its adaptation to any kind of digital grid and its generalization to n-dimensional images (and even to graphs) are straightforward. The algorithm is reported to be faster than any other watershed algorithm. Applications of this algorithm with regard to picture segmentation are presented for magnetic resonance (MR) imagery and for digital elevation models. An example of 3-D watershed is also provided.

4,983 citations


"Robust wide baseline stereo from ma..." refers background in this paper

  • ...The structure of the above algorithm and of an efficient watershed algorithm [22] is essentially identical....


Journal ArticleDOI
TL;DR: This paper addresses the problem of retrieving images from large image databases with a method based on local grayvalue invariants which are computed at automatically detected interest points and allows for efficient retrieval from a database of more than 1,000 images.
Abstract: This paper addresses the problem of retrieving images from large image databases. The method is based on local grayvalue invariants which are computed at automatically detected interest points. A voting algorithm and semilocal constraints make retrieval possible. Indexing allows for efficient retrieval from a database of more than 1,000 images. Experimental results show correct retrieval in the case of partial visibility, similarity transformations, extraneous features, and small perspective deformations.

1,756 citations


"Robust wide baseline stereo from ma..." refers methods in this paper

  • ...Since the influential paper by Schmid and Mohr [11] many image matching and wide-baseline stereo algorithms have been proposed, most commonly using Harris interest points as distinguished regions....


  • ...Since the influential paper by Schmid and Mohr [16] many image matching and wide-baseline stereo algorithms have been proposed, most commonly using Harris interest points as DRs....


  • ...Typically, DRs or their scaled version serve as measurement regions and tentative correspondences are established by comparing invariants using Mahalanobis distance [14,16,21]....


Frequently Asked Questions

Q1. What is the definition of a MSER?

The MSERs are sets of image elements, closed under the affine transformation of image coordinates and invariant to affine transformation of intensity. They are an affinely-invariant stable subset of the extremal regions, for which the paper presents an efficient (near linear complexity) and practically fast (near frame rate) detection algorithm.

Q2. What future work do the authors propose?

In future work, the authors intend to proceed towards fully automatic projective reconstruction of the 3D scene, which requires computing projective reconstruction and dense matching.