
Proceedings ArticleDOI

Using Sparse Elimination for Solving Minimal Problems in Computer Vision

01 Oct 2017-pp 76-84

TL;DR: A new algorithm is proposed for selecting the basis that is in general more compact than the basis obtained with a state-of-the-art algorithm, making PEP a more viable option for solving polynomial equations.
Abstract: Finding a closed form solution to a system of polynomial equations is a common problem in computer vision as well as in many other areas of engineering and science. Gröbner basis techniques are often employed to provide the solution, but implementing an efficient Gröbner basis solver for a given problem requires strong expertise in algebraic geometry. One can also convert the equations to a polynomial eigenvalue problem (PEP) and solve it using linear algebra, which is a more accessible approach for those who are not so familiar with algebraic geometry. In previous works PEP has been successfully applied for solving some relative pose problems in computer vision, but its wider exploitation is limited by the problem of finding a compact monomial basis. In this paper, we propose a new algorithm for selecting the basis that is in general more compact than the basis obtained with a state-of-the-art algorithm, making PEP a more viable option for solving polynomial equations. Another contribution is that we present two minimal problems for camera self-calibration based on homography, and demonstrate experimentally using synthetic and real data that our algorithm can provide a numerically stable solution for the camera focal length from two homographies of an unknown planar scene.


Using Sparse Elimination for Solving Minimal Problems in Computer Vision
Janne Heikkilä
Center for Machine Vision and Signal Analysis
University of Oulu, Finland
janne.heikkila@oulu.fi
Abstract
Finding a closed form solution to a system of polynomial equations is a common problem in computer vision as well as in many other areas of engineering and science. Gröbner basis techniques are often employed to provide the solution, but implementing an efficient Gröbner basis solver for a given problem requires strong expertise in algebraic geometry. One can also convert the equations to a polynomial eigenvalue problem (PEP) and solve it using linear algebra, which is a more accessible approach for those who are not so familiar with algebraic geometry. In previous works PEP has been successfully applied for solving some relative pose problems in computer vision, but its wider exploitation is limited by the problem of finding a compact monomial basis. In this paper, we propose a new algorithm for selecting the basis that is in general more compact than the basis obtained with a state-of-the-art algorithm, making PEP a more viable option for solving polynomial equations. Another contribution is that we present two minimal problems for camera self-calibration based on homography, and demonstrate experimentally using synthetic and real data that our algorithm can provide a numerically stable solution for the camera focal length from two homographies of an unknown planar scene.
1. Introduction
Many camera pose estimation and calibration problems boil down to solving a system of polynomial equations. These are often so-called minimal problems, where the camera parameters are computed from a minimal number of constraints so that there are essentially as many unknowns as equations, but the relationship between the unknown variables and the measurements follows a polynomial model that makes the dependence nonlinear and difficult to solve by means of linear algebra. Such minimal problems include, for example, the classical P3P (Perspective-Three-Point) problem for a calibrated camera, where an image of three points with known distances is sufficient to compute the camera pose, but it requires solving a system of three quadratic equations in three variables [9]. Another classical example is the five-point problem, which allows finding the relative pose between two views of an unknown scene using five point correspondences. Nistér [17] converted the resulting system of polynomial equations to a tenth degree univariate polynomial that can be efficiently solved using standard numerical techniques. The relative pose problem has been modified in various studies to incorporate unknown camera parameters as well, which enables the use of uncalibrated cameras and makes it a self-calibration problem. To mention a few of these works, Stewenius et al. [20] used six point correspondences to solve the relative pose together with the focal length. Fitzgibbon [8] augmented fundamental matrix estimation to include one term of radial lens distortion, and solved both from 9 point correspondences. Kukelova and Pajdla [15] used an additional constraint to solve the same problem from 8 point correspondences, and Jiang et al. [11] added one more constraint and were able to solve the problem from 7 point correspondences. A comprehensive list of minimal problems in computer vision and related papers can be found in [18].
Planar objects are commonly used for estimating the camera pose and intrinsic parameters. Zhang's well-known calibration method [23] provides a closed form solution to the calibration problem from images of a known planar target. OpenCV and Matlab also include tools for performing calibration with a similar setup. Despite the extensive number of minimal problems introduced in recent years, it is surprising that homography has not received much attention in this context, and there are only a few related works. Minimal solutions to panorama stitching in [1] and [2] assume that the camera centers coincide, which reduces the motion to pure rotation. Methods for decomposing a homography into rotation, translation, and surface normal parameters have been proposed, e.g., in [7] and [24]. Saurer et al. [19] consider a minimal solution to a 3-point plus common direction relative pose problem using homography. Recently, Kukelova et al. [14] have presented two algorithms for estimating the homography between two cameras with different radial distortions. However, none of these works address the problem of solving the camera focal length from images of an unknown planar target, which is the homography-based minimal problem presented in this paper.
The most common approach for solving minimal problems in computer vision, and the corresponding systems of polynomial equations, is to use Gröbner basis techniques. One drawback of this approach is that when the polynomial degrees are high, it often suffers from numerical inaccuracies. To address this problem, Byröd et al. [3], for example, have proposed a generalization of the Gröbner basis method that improves the numerical stability. Another limitation of this approach is that implementing a Gröbner basis solver for a given problem requires expertise in algebraic geometry, because in practice the solver needs to be handcrafted to make it efficient. Because of the complicated theory, this approach is often beyond the reach of non-experts. An alternative approach that is also commonly used for solving polynomial equations is the multipolynomial resultant, which provides an efficient tool for eliminating variables from multiple equations and solving the remaining variable as a root of a univariate polynomial [4]. However, some limitations make resultants less useful for engineering applications. For classical multipolynomial resultants such as the Macaulay resultant, most of the polynomial coefficients need to be non-zero, the roots must be distinct, and there should be no solutions at infinity. This problem can often be avoided by using a sparse resultant [6], which also works for polynomials with several zero coefficients. Another disadvantage is that after elimination the remaining univariate polynomial is the determinant of a matrix that often has high dimensions. Because the determinant of an N × N matrix has N! terms, finding the roots of the remaining polynomial can easily become computationally infeasible or unstable.
In addition to Gröbner basis techniques and resultants, systems of polynomial equations can often be solved using eigenvalues and eigenvectors. One approach is to convert the classical multipolynomial resultant to a standard eigenvalue problem [5], [4], which however works only with dense polynomials. In [8] the minimal problem of computing the radial distortion coefficient was expressed as a quadratic polynomial eigenvalue problem, and this was later extended in [16] to include an additional constraint. A resultant-based algorithm for transforming a system of polynomial equations to a polynomial eigenvalue problem (PEP) was proposed in [13], which enabled solving several minimal relative pose problems using linear algebra. However, this algorithm has an inherent tendency to lead to unnecessarily high dimensional vector spaces and spurious roots, which make it numerically unstable when solving sparse systems of polynomials with high degrees.

To overcome the problems related to the algorithm presented in [13], we propose a new algorithm based on sparse elimination theory that provides more stable solutions to sparse systems, which are the most typical cases in practical applications. In addition, we demonstrate the applicability of our algorithm on two new minimal problems that have a higher number of solutions than the relative pose problems previously presented in the literature.
2. Polynomial eigenvalue problems
The polynomial eigenvalue problem (PEP) is an extension of the standard eigenvalue problem (C − λI)v = 0 to a system of polynomials represented by the matrix equation

    (C_0 + C_1 λ + C_2 λ^2 + · · · + C_l λ^l) v = 0,    (1)

where l is the highest degree of the polynomials in the variable λ that we want to solve for, v is a vector of monomials in the variables other than λ, and C_0, . . . , C_l are m × m square matrices containing the coefficients of the polynomials. This equation can be converted to the generalized eigenvalue problem

    A u = λ B u,    (2)

where

    A = [   0      I      0    · · ·     0
            0      0      I    · · ·     0
            ⋮      ⋮      ⋮     ⋱        ⋮
          −C_0   −C_1   −C_2   · · ·  −C_{l−1} ],

    B = [   I      0      0    · · ·     0
            0      I      0    · · ·     0
            ⋮      ⋮      ⋮     ⋱        ⋮
            0      0      0    · · ·    C_l   ],

    u = [ v,  λv,  . . . ,  λ^{l−1} v ]^T.

Since most mathematical software libraries and packages can solve this problem, it becomes easy to find all the roots for λ. The eigenvector u contains the values of the monomials that appear in the polynomials, and therefore one can extract the roots of the remaining variables by computing suitable ratios between individual elements of u.
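The linearization (1)–(2) above translates directly into code. The sketch below is a minimal illustration (hypothetical helper `polyeig`, NumPy only); it assumes C_l is invertible so the generalized problem can be folded into a standard eigenproblem on B⁻¹A, and it sanity-checks the construction on a scalar quadratic whose roots are known:

```python
import numpy as np

def polyeig(C):
    """Solve (C[0] + C[1]*lam + ... + C[l]*lam**l) v = 0 via the
    companion-style linearization A u = lam * B u from Eq. (2).
    Assumes C[l] is invertible so B^{-1}A can be formed."""
    l = len(C) - 1
    m = C[0].shape[0]
    A = np.zeros((l * m, l * m))
    B = np.eye(l * m)
    for k in range(l - 1):                 # identity blocks above the last block row
        A[k * m:(k + 1) * m, (k + 1) * m:(k + 2) * m] = np.eye(m)
    for k in range(l):                     # last block row: -C_0 ... -C_{l-1}
        A[(l - 1) * m:, k * m:(k + 1) * m] = -C[k]
    B[(l - 1) * m:, (l - 1) * m:] = C[l]
    lam, U = np.linalg.eig(np.linalg.solve(B, A))
    return lam, U[:m, :]                   # v is the top block of u

# sanity check: the scalar quadratic lam^2 - 3*lam + 2 = 0 has roots 1 and 2
C = [np.array([[2.0]]), np.array([[-3.0]]), np.array([[1.0]])]
lam, V = polyeig(C)
print(sorted(lam.real))                    # roots 1 and 2, up to floating point
```

For rank-deficient C_l one would instead hand A and B to a generalized eigenvalue routine directly (e.g. `scipy.linalg.eig(A, b=B)`), which is the setting the paper actually relies on.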
The most difficult part of converting the system of polynomials to a PEP is determining the monomials in v and, consequently, the matrices C_0, . . . , C_l. Given n polynomials one can easily construct n × m matrices and the corresponding v satisfying (1), but usually n < m, which results in an underdetermined system of linear equations that cannot be solved. The trick employed in [13] is to generate new equations by multiplying the initial equations with monomials produced by their algorithm. Some of these new equations may be linearly independent of the initial equations, which then enables constructing a fully determined system. Notice that this procedure also increases the number of monomials in v and hence the dimensions of the coefficient matrices. In [13] the classical Macaulay resultant formulation was used for creating the set of basis monomials
in v. Because the Macaulay resultant is designed for dense homogeneous polynomials, it is not guaranteed to produce a basis that is linearly independent. Therefore, they proposed a small modification to the resultant-based approach that gives a higher number of polynomial equations, which increases the chances of getting a linearly independent set of basis monomials. However, for larger systems of polynomials the basis generated by this method becomes huge, because the Macaulay resultant is based on the assumption that the number of solutions is maximal for the given degrees of the polynomials. According to Bézout's theorem [4] the maximum number of solutions is d_1 · d_2 · · · d_n, where d_i is the degree of the polynomial f_i. The set of basis monomials contains all the monomials with total degree up to d = Σ_i (d_i − 1) + 1. One can easily see that the number of monomials increases exponentially. Therefore, this approach is feasible only for small systems of equations and low polynomial degrees. Next, we present a method based on sparse elimination that exploits the sparsity of the polynomials and produces smaller monomial bases and coefficient matrices, enabling solutions to problems that have previously been intractable.
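The exponential growth is easy to make concrete: the dense Macaulay-style basis contains all C(n + d, n) monomials of total degree at most d = Σ(d_i − 1) + 1. A few lines of Python (illustrative helper `bezout_basis_size`, not from the paper) show the blow-up:

```python
from math import comb

def bezout_basis_size(degrees):
    """Number of monomials of total degree <= d in n variables,
    with d = sum(d_i - 1) + 1 as in the dense Macaulay construction."""
    n = len(degrees)
    d = sum(di - 1 for di in degrees) + 1
    return comb(n + d, n)

# three quadrics vs. four quartics: the dense basis explodes quickly
print(bezout_basis_size([2, 2, 2]))     # d = 4,  C(7, 3)  = 35
print(bezout_basis_size([4, 4, 4, 4]))  # d = 13, C(17, 4) = 2380
```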
3. Determining basis monomials
Most of the polynomial equations encountered in computer vision are sparse, and therefore classical multivariate resultants are not well suited for generating the basis monomials. Sparse elimination theory [21], [5] has been developed to deal with general polynomials that have many zero coefficients. The benefit of the sparsity is that the resultants obtained have much smaller dimensions than the classical resultants. Therefore, instead of selecting all the monomials of a certain total degree, we can get a significantly smaller set of monomials by using the tools provided by sparse elimination theory.

Let x = {x_1, x_2, . . . , x_n} be a set of unknown variables that we want to solve from n multivariate polynomials

    f_1(x) = f_2(x) = · · · = f_n(x) = 0    (3)

defined by

    f_i(x) = Σ_{j=1}^{s_i} c_ij x^{a_ij},    (4)

where x^{a_ij} = x_1^{α_ij1} x_2^{α_ij2} · · · x_n^{α_ijn} are the monomials corresponding to the non-zero coefficients c_ij. Let A_i = {a_i1, . . . , a_is_i} ⊂ Z^n_+ denote the set of exponent vectors of all the monomials in f_i, which is also called the support of f_i. Next we introduce a few concepts from algebraic geometry [4] that are needed to formulate our method.
Definition 1: The Newton polytope of f_i is the convex hull of the support A_i, denoted by P_i = Conv(A_i) ⊂ R^n. The volume of P_i is denoted by Vol_n(P_i).

Notice that in the low dimensional cases n = 1, 2 or 3, the Newton polytope is a line segment, a polygon, or a polyhedron, respectively. Clearly, how the volume Vol_n(P_i) is computed depends on n. For example, Vol_1(P_i) is the length of the line segment, and Vol_2(P_i) is the area of the polygon.

Definition 2: The Minkowski sum of two convex polytopes P_i and P_j is the convex polytope

    P_ij = P_i + P_j = {p_i + p_j | p_i ∈ P_i, p_j ∈ P_j} ⊂ R^n.

Using the Minkowski sum (also known as dilation) we can aggregate the Newton polytopes of the individual polynomials f_i to form combined supports. It is also needed for defining the mixed volume.
Definition 3: Given convex polytopes P_1, . . . , P_n ⊂ R^n, there is a real-valued function called the mixed volume that can be computed as

    MV_n(P_1, . . . , P_n) = Σ_{k=1}^{n} (−1)^{n−k}  Σ_{I ⊂ {1,...,n}, |I| = k}  Vol_n( Σ_{i∈I} P_i ).    (5)

In high-dimensional cases computing the mixed volume using (5) can be time consuming. There are faster algorithms that use a so-called mixed subdivision of the Minkowski sum, and their software implementations can be found on the Internet, but in the cases discussed in this paper n ≤ 4 and using (5) is still tractable. The following theorem is the reason why we introduced the mixed volume.
Theorem 1 (Bernstein's Theorem): Given polynomials f_1, . . . , f_n over C with finitely many common zeroes in (C*)^n, where C* = C \ {0}, let P_i be the Newton polytope of f_i in R^n. Then the number of solutions of the f_i in (C*)^n is bounded above by the mixed volume MV_n(P_1, . . . , P_n). For generic choices of the coefficients c_ij the number of common solutions is exactly MV_n(P_1, . . . , P_n).

The proof of the theorem can be found in [4]. Bernstein's theorem is an important result of sparse elimination theory, because it gives us a tool for calculating the maximum number of roots in advance, without knowing the numerical values of the coefficients. All we need are the exponent vectors a_ij of the monomials, i.e., the supports A_i. This also determines the minimum size of the monomial basis, as we will see later.
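For n = 2 the inclusion–exclusion formula (5) reduces to MV_2(P_1, P_2) = Vol_2(P_1 + P_2) − Vol_2(P_1) − Vol_2(P_2), which is easy to evaluate directly. The sketch below (illustrative helpers, not from the paper: convex hull via Andrew's monotone chain, areas via the shoelace formula, Minkowski sum via pairwise vertex sums) reproduces the Bernstein count of 2 roots for two generic bilinear polynomials, whose common Newton polytope is the unit square:

```python
from itertools import product

def cross(o, a, b):
    return (a[0]-o[0])*(b[1]-o[1]) - (a[1]-o[1])*(b[0]-o[0])

def hull(pts):
    """Andrew's monotone chain convex hull in 2-D."""
    pts = sorted(set(pts))
    if len(pts) <= 2:
        return pts
    def half(points):
        h = []
        for p in points:
            while len(h) >= 2 and cross(h[-2], h[-1], p) <= 0:
                h.pop()
            h.append(p)
        return h
    lower, upper = half(pts), half(pts[::-1])
    return lower[:-1] + upper[:-1]

def area(pts):
    """Shoelace area of the convex hull of pts (Vol_2 in the text)."""
    h = hull(pts)
    return abs(sum(h[i][0]*h[(i+1) % len(h)][1] - h[(i+1) % len(h)][0]*h[i][1]
                   for i in range(len(h)))) / 2

def minkowski(P, Q):
    return [(p[0]+q[0], p[1]+q[1]) for p, q in product(P, Q)]

def mixed_volume2(P1, P2):
    # specialization of Eq. (5) for n = 2
    return area(minkowski(P1, P2)) - area(P1) - area(P2)

# supports of two generic bilinear polynomials:
# c1 + c2*x + c3*y + c4*x*y  ->  Newton polytope = unit square
S = [(0, 0), (1, 0), (0, 1), (1, 1)]
print(mixed_volume2(S, S))  # -> 2.0, the Bernstein bound on common roots
```

Two generic bilinear equations indeed have two common solutions, so the bound is tight here, as the theorem promises for generic coefficients.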
Next we discuss finding the basis monomials for the polynomial eigenvalue problem, i.e., the elements of v in (1). Sparse elimination provides the tools for constructing sparse resultants that generalize the classical multivariate resultant. While the degree of the classical multivariate resultant is determined by Bézout's theorem (i.e., d_1 · d_2 · · · d_n), the degree of the sparse resultant comes from Bernstein's theorem, i.e., the mixed volume. These two types of resultants coincide only when all Newton polytopes are n-simplices scaled by the total degrees of the respective polynomials [4]. Otherwise the degree of the sparse resultant is smaller, which also means that the matrix constructed from the coefficients c_ij has smaller dimensions. Furthermore, it is necessary that the matrix has full rank and that its determinant vanishes only when the equations have a common solution. It often happens that the multivariate resultant is rank-deficient if the polynomials have zero coefficients, and thus it fails to provide a solution. In order to achieve full rank, the basis monomials of the sparse resultant must be selected carefully using, e.g., the Lift-Prune algorithm proposed by Emiris & Canny [6]. The main disadvantage of the resultant-based approach for solving polynomial equations is that it requires computing the determinant of a matrix which often has high dimensions. Because the determinant of an N × N matrix has N! terms, solving the unknowns from the resultant often becomes computationally infeasible even for relatively small problems. For example, if the coefficient matrix of the sparse resultant is 10 × 10, the resultant is a factor of an expression that has more than 3.6 million terms. In such cases finding the solution via a PEP is much more efficient.
A sparse resultant for a system of n equations is computed in n − 1 variables, which means that one of the variables of our original problem (3) needs to be hidden in the coefficient field. The resultant obtained is then a univariate polynomial in the hidden variable, which can be solved by finding the roots of this polynomial. Without loss of generality we can decide to solve for the first variable x_1, which is then treated as a coefficient in the polynomials

    f_i(x̃) = Σ_{j=1}^{s̃_i} c̃_ij x̃^{ã_ij},    (6)

where c̃_ij = Σ_{α_ij1} c_ij x_1^{α_ij1}, x̃ = {x_2, . . . , x_n}, and ã_ij = (α_ij2, . . . , α_ijn) ∈ Ã_i are (n − 1)-dimensional support vectors with |Ã_i| = s̃_i.
The first step is to create the Newton polytopes P_1, . . . , P_n ⊂ R^{n−1} corresponding to the modified system (6), and compute the Minkowski sum P = P_1 + · · · + P_n. The set of basis monomials S for the sparse resultant is then obtained from

    S = Z^{n−1} ∩ (P + d),    (7)

where Z^{n−1} is the square lattice of integer points, and d ∈ R^{n−1} is a small translation vector that displaces P slightly so that the lattice points lie in the interior of the convex polytope [6, 4]. In practice, the elements of d can be randomly selected from {−ε, 0, ε}, where ε ∈ Q is a small rational number.
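For the two-variable case (n − 1 = 2), the lattice-point enumeration in (7) can be sketched as follows (illustrative helpers, not the paper's implementation; exact arithmetic via `fractions.Fraction` mirrors the rational displacement). Shifting a unit-square Minkowski sum by a small negative displacement leaves only the origin inside:

```python
from fractions import Fraction

def inside(verts, p):
    """Point p lies inside (or on) the CCW convex polygon verts."""
    n = len(verts)
    for k in range(n):
        a, b = verts[k], verts[(k + 1) % n]
        if (b[0]-a[0])*(p[1]-a[1]) - (b[1]-a[1])*(p[0]-a[0]) < 0:
            return False
    return True

def lattice_points(poly, delta):
    """S = Z^2 intersected with (P + delta), cf. Eq. (7).
    poly: CCW vertex list of P; delta: small rational 2-vector."""
    verts = [(Fraction(x) + delta[0], Fraction(y) + delta[1]) for x, y in poly]
    xs = [v[0] for v in verts]
    ys = [v[1] for v in verts]
    pts = []
    for i in range(int(min(xs)) - 1, int(max(xs)) + 2):
        for j in range(int(min(ys)) - 1, int(max(ys)) + 2):
            if inside(verts, (i, j)):
                pts.append((i, j))
    return pts

# unit square displaced by a small negative vector: only (0, 0) stays inside
eps = Fraction(1, 100)
P = [(0, 0), (1, 0), (1, 1), (0, 1)]      # CCW vertices of the Minkowski sum
print(lattice_points(P, (-eps, -eps)))    # -> [(0, 0)]
```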
Sparse resultants need to be of full rank in order to have a non-zero determinant. In our case, the only strict requirement is that C_0, . . . , C_l in (1) must be square matrices. Notice that this is a looser condition than in [13], where it was assumed that either C_l or C_0 must be of full rank and invertible. Here this is not necessary, but in some cases rank-deficiency may lead to numerical instability in the eigenvalue solver. Because the PEP in (1) is defined for one unknown variable λ, which is then computed as an eigenvalue of (2), we need to choose this variable from x_1, . . . , x_n. This is exactly the same situation as with the sparse resultant, and therefore we decide again, without loss of generality, that λ ≡ x_1, and we hide x_1 in the coefficient field, which results in the modified system (6).
Due to the relaxed requirements, we can try to find a smaller set of basis monomials than the set (7) defined for the sparse resultant. The lower bound is determined by Bernstein's theorem, which gives the maximum number of common roots of the polynomials, denoted by r = MV_n(P_1, . . . , P_n). It should be noticed that the mixed volume is computed for the original system (3). The eigenvector u in (2) has the same dimension as the maximum number of unique eigenvalues, i.e., possible roots of the system. The length of u is clearly lm, which gives us the bound

    m ≥ r / l.    (8)

Hence, it is sufficient to find a set of support vectors B for the basis monomials with |B| ≥ r/l.
Algorithm 1 summarizes the procedure for constructing B based on the previous discussion. It generates several putative sets of support vectors for the basis and selects the smallest set among these candidates. It also returns a set T = {T_1, . . . , T_n}, where the T_i ≠ ∅ are subsets of vectors that can be used to construct the coefficient matrices C_0, . . . , C_l. These vectors are first converted to n sets of monomials M_i = {x̃^t | t ∈ T_i}, and the monomials are multiplied with the original equations f_i, which results in n sets of new equations E_i = {x̃^t f_i(x) | x̃^t ∈ M_i}. These equations are converted to the matrix form (1), which then directly gives us the coefficient matrices C_0, . . . , C_l. The total number of new equations Σ_i |E_i| is greater than or equal to the number of basis monomials, which means that the coefficient matrices have at least as many rows as columns. If there are more rows than columns, one can choose the m rows that minimize the condition number and discard the remaining rows. It may also happen that the most compact basis does not work, and in that case one can try the next candidate produced by the algorithm.
Algorithm 1 Generate basis monomials
Input: Ã_1, . . . , Ã_n, r, l
Output: B, T
 1: Create Newton polytopes P_i ← Conv(Ã_i) ⊂ R^{n−1} for i = 1, . . . , n, and a unit (n−1)-simplex P_0 ⊂ R^{n−1}.
 2: Create a list of index sets:
    K ← [{k_0, . . . , k_i} | i = 0, . . . , n;  k_0, . . . , k_i ∈ {0, . . . , n};  k_{j+1} > k_j].
 3: Create a list of displacement vectors:
    ∆ ← [(δ_1, . . . , δ_{n−1}) | δ_1, . . . , δ_{n−1} ∈ {−ε, 0, ε}].
 4: Initialize B ← ∅, T ← ∅ and N ← ∞.
 5: for I in K do
 6:     Compute the Minkowski sum Q ← Σ_{k∈I} P_k.
 7:     for d in ∆ do
 8:         Create a putative basis B̃ ← Z^{n−1} ∩ (Q + d).
 9:         if |B̃| ≥ r/l AND |B̃| < N then
10:             Find the sets of vectors:
                T_i ← {t | t ∈ Z^{n−1}_+, Ã_i + t ⊂ B̃} for i = 1, . . . , n.
11:             if Σ_i |T_i| ≥ |B̃| AND min_i |T_i| > 0 then
12:                 B ← B̃, T ← {T_i}_{i=1,...,n}, and N ← |B̃|.
13:             end if
14:         end if
15:     end for
16: end for

There are often monomials (or vectors) in B that do not contribute to the equations. Such monomials may cause instability in the eigenvalue solver, and they need to be removed. In [13] these are called "parasitic" zero eigenvalues, and the proposed remedy is to convert the generalized eigenvalue problem to a standard eigenvalue problem, so that the offending monomials can be easily identified and removed, as they correspond to zero columns of the matrix to be decomposed. The drawback is that either C_0 or C_l then needs to be of full rank, which places extra constraints on the selection of the basis monomials. Hence, we propose here a simple strategy for finding these zero monomials. First, we specialize the coefficient matrices with some random numerical values c_ij. Using these values we construct the matrices A and B in (2), and compute the singular value decomposition B = USV^*. Next we perform the unitary transformation

    A′ = U^* A V,    (9)

and find the zero columns of A′. These columns correspond to the zero monomials, and they can be removed from A and B. The rows with the same indices are also removed so that the matrices remain square. This procedure might need to be repeated a few times to find all zero monomials. However, one should notice that the elimination is performed offline when designing the solver, and there is no need to do it at runtime once the zero monomials have been identified.
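The zero-monomial elimination described above can be sketched numerically. This is a toy 4 × 4 specialization, not the paper's actual solver: a basis monomial that appears in no equation produces a zero column in both A and B, and the unitary transform (9) exposes it through the right singular vectors of B:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy pencil in which basis monomial #2 is "parasitic": it appears in
# no equation, so column 2 of both A and B is identically zero.
A = rng.standard_normal((4, 4)); A[:, 2] = 0.0
B = rng.standard_normal((4, 4)); B[:, 2] = 0.0

U, s, Vh = np.linalg.svd(B)
A2 = U.conj().T @ A @ Vh.conj().T          # A' = U* A V, Eq. (9)
dead = np.where(np.linalg.norm(A2, axis=0) < 1e-10)[0]
# map each zero column of A' back to the original monomial through V
monomials = [int(np.argmax(np.abs(Vh[k]))) for k in dead]
print(monomials)                           # -> [2]

# drop the matching rows and columns so the matrices stay square
keep = [i for i in range(A.shape[0]) if i not in monomials]
A_red, B_red = A[np.ix_(keep, keep)], B[np.ix_(keep, keep)]
```

The zero singular values of B come last in NumPy's ordering, so the corresponding columns of A′ are the ones to inspect; with several parasitic monomials the null-space columns may mix, which is why the procedure may need a few repetitions, as noted above.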
4. Planar self-calibration
A standard approach to geometric camera calibration is to use a known checkerboard pattern printed on a planar surface. To demonstrate the applicability of our algorithm, we present two minimal problems for solving the camera focal length from two homographies corresponding to three images where the patterns are unknown, which makes this a self-calibration problem. We consider the following cases: 1) a constant focal length, and 2) two different focal lengths. The resulting polynomials can be converted to PEPs using Algorithm 1 and solved efficiently, for example, with Matlab or some other software package or library capable of computing generalized eigenvalues.

Let two 3D vectors a and b span a plane so that they are orthogonal and of equal length, fulfilling the constraints

    a^T b = 0  and  a^T a − b^T b = 0.    (10)
If |a| = |b| = 1, the normal vector of the plane is defined by n = a × b. In order to express the vectors a and b in terms of the normal vector n we can choose

    a = n × e  and  b = n × a,    (11)

where e is a unit vector not parallel to n. For simplicity, we select e = [1, 0, 0]^T. We use the parametrization n = [n_x, n_y, 1]^T / √(n_x^2 + n_y^2 + 1), and we can now express a and b in the variables n_x and n_y. Further assuming that a and b are represented in the camera coordinate frame of the reference view, we can convert them to image coordinates using

    â_0 = K_0 a  and  b̂_0 = K_0 b,    (12)

where K_0 is the intrinsic camera matrix of the reference camera. Because one can often assume with reasonable accuracy that the principal point of the camera is at the center of the image, the pixel aspect ratio is 1, and lens distortion is negligible, we limit ourselves to the case where we have only one intrinsic parameter, the focal length λ_0, which leads to the camera matrix K_0 = diag(λ_0, λ_0, 1).
The mapping from the reference image to the i-th image is described by the homography H_i. After back-projecting to 3D space, the corresponding vectors are obtained from

    a_i = K_i^{−1} H_i K_0 a  and  b_i = K_i^{−1} H_i K_0 b,    (13)

where K_i = diag(λ_i, λ_i, 1) and λ_i is the focal length of the i-th camera. Because orthogonality and equality of length should hold in each frame, we have the following self-calibration constraints expressed by two polynomials:

    f_{1,i}(λ_0, λ_i, n_x, n_y) = a_i^T b_i = 0    (14)
    f_{2,i}(λ_0, λ_i, n_x, n_y) = a_i^T a_i − b_i^T b_i = 0.    (15)
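The constraints (14)–(15) can be evaluated directly for candidate focal lengths. The sketch below (hypothetical helper `selfcal_residuals`) checks that both residuals vanish at the true focal lengths for a synthetic pure-rotation homography H = K_i R K_0^{-1}; this degenerate case only sanity-checks the algebra (rotations preserve angles and lengths), it is not the minimal solver itself:

```python
import numpy as np

def selfcal_residuals(H, f0, fi, nx, ny):
    """Residuals of constraints (14)-(15) for one homography H and
    candidate focal lengths f0 (reference view) and fi (view i)."""
    n = np.array([nx, ny, 1.0])
    n /= np.linalg.norm(n)
    a = np.cross(n, [1.0, 0.0, 0.0])       # Eq. (11) with e = [1, 0, 0]
    b = np.cross(n, a)
    K0, Ki = np.diag([f0, f0, 1.0]), np.diag([fi, fi, 1.0])
    ai = np.linalg.inv(Ki) @ H @ K0 @ a    # Eq. (13)
    bi = np.linalg.inv(Ki) @ H @ K0 @ b
    return ai @ bi, ai @ ai - bi @ bi      # f_{1,i}, f_{2,i}

# synthetic pure-rotation homography at the true focal lengths
f0, fi = 800.0, 1000.0
c, s = np.cos(0.2), np.sin(0.2)
R = np.array([[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]])
H = np.diag([fi, fi, 1.0]) @ R @ np.linalg.inv(np.diag([f0, f0, 1.0]))
print(selfcal_residuals(H, f0, fi, 0.3, -0.7))   # both residuals ~ 0
```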
Herrera et al. [10] used similar constraints for planar self-calibration, but they solved the camera parameters with non-linear minimization. Their solver was initialized by assuming that the reference view is fronto-parallel, in which case it becomes easy to compute an initial value for the focal length. In this paper, we do not make such assumptions, and no initialization is needed. There are now 3 + k unknowns λ_0, . . . , λ_k, n_x and n_y, and two equations per homography, which means that we cannot solve the problem from a single homography; we need at least two homographies (i = 1, 2), leading to four equations with four unknowns. This is true also in general, because one homography provides only 8 constraints for calibration. Five constraints are needed for the camera pose (3 for rotation and 2 for translation up to scale), the normal of the plane n requires two constraints and one

Citations

Journal ArticleDOI
Ji Zhao, Laurent Kneip, Yijia He, Jiayi Ma
TL;DR: It is shown that knowing the value of the inscribed angle between the two 3D rays poses additional constraints on the relative orientation, and using the latter enables the solution of the relative pose problem with as few as 3 correspondences across the two images.
Abstract: Corners are popular features for relative pose computation with 2D-2D point correspondences. Stable corners may be formed by two 3D rays sharing a common starting point. We call such elements ray-point-ray (RPR) structures. Besides a local invariant keypoint given by the lines’ intersection, their reprojection also defines a corner orientation and an inscribed angle in the image plane. The present paper investigates such RPR features, and aims at answering the fundamental question of what additional constraints can be formed from correspondences between RPR features in two views. In particular, we show that knowing the value of the inscribed angle between the two 3D rays poses additional constraints on the relative orientation. Using the latter enables the solution of the relative pose problem with as few as 3 correspondences across the two images. We provide a detailed analysis of all minimal cases distinguishing between 90-degree RPR-structures and structures with an arbitrary, known inscribed angle. We furthermore investigate the special cases of a known directional correspondence and planar motion, the latter being solvable with only a single RPR correspondence. We complete the exposition by outlining an image processing technique for robust RPR-feature extraction. Our results suggest high practicality in man-made environments, where 90-degree RPR-structures naturally occur.

15 citations


Cites methods from "Using Sparse Elimination for Solvin..."

  • ...Successful applications of the resultant method in computer vision can be found in [36], [37]....



Proceedings ArticleDOI
Yaqing Ding, Jian Yang, Jean Ponce, Hui Kong
14 Jun 2020
TL;DR: The proposed algorithms can cope with coplanar points, which is a degenerate configuration for these 6- and 7-point counterparts, and derive new 4- and 5-point algorithms for these two cases, respectively.
Abstract: We propose minimal solutions to relative pose estimation problem from two views sharing a common direction with unknown focal length. This is relevant for cameras equipped with an IMU (inertial measurement unit), e.g., smart phones, tablets. Similar to the 6-point algorithm for two cameras with unknown but equal focal lengths and 7-point algorithm for two cameras with different and unknown focal lengths, we derive new 4- and 5-point algorithms for these two cases, respectively. The proposed algorithms can cope with coplanar points, which is a degenerate configuration for these 6- and 7-point counterparts. We present a detailed analysis and comparisons with the state of the art. Experimental results on both synthetic data and real images from a smart phone demonstrate the usefulness of the proposed algorithms.

7 citations


Cites methods from "Using Sparse Elimination for Solvin..."

  • ...Polynomial eigenvalue methods have been successfully used for many minimal problems in computer vision, such as the 9-point one-parameter radial distortion problem [11], the 5- and 6-point relative pose problems [21], the 6point one unknown focal length problem [4], and the selfcalibration problems [17]....



Dissertation
01 Jan 2018
TL;DR: New convex relaxations for rank-based optimization which avoid drawbacks of previous approaches and provide tighter relaxations are presented.
Abstract: Robust fitting of geometric models is a core problem in computer vision. The most common approach is to use a hypothesize-and-test framework, such as RANSAC. In these frameworks the model is estimated from as few measurements as possible, which minimizes the risk of selecting corrupted measurements. These estimation problems are called minimal problems, and they can often be formulated as systems of polynomial equations. In this thesis we present new methods for building so-called minimal solvers or polynomial solvers, which are specialized code for solving such systems. On several minimal problems we improve on the state-of-the-art both with respect to numerical stability and execution time. In many computer vision problems low rank matrices naturally occur. The rank can serve as a measure of model complexity and typically a low rank is desired. Optimization problems containing rank penalties or constraints are in general difficult. Recently convex relaxations, such as the nuclear norm, have been used to make these problems tractable. In this thesis we present new convex relaxations for rank-based optimization which avoid drawbacks of previous approaches and provide tighter relaxations. We evaluate our methods on a number of real and synthetic datasets and show state-of-the-art results.

6 citations


Cites methods from "Using Sparse Elimination for Solvin..."

  • ...This was recently extended by Heikkilä [86] using techniques for constructing sparse resultants [205, 56]....

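The nuclear-norm relaxation mentioned in the abstract above can be made concrete with a minimal numpy sketch (my own naming, not code from the thesis): the nuclear norm, the sum of singular values, is the standard convex surrogate for matrix rank, and its proximal operator is singular value thresholding (SVT), the core step of many low-rank solvers.

```python
import numpy as np

def svt(X, tau):
    """Proximal operator of tau*||.||_*: shrink all singular values by tau."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    s_shrunk = np.maximum(s - tau, 0.0)  # small singular values drop to zero
    return U @ np.diag(s_shrunk) @ Vt

# A rank-2 matrix whose second singular value is tiny: shrinking with a
# large enough tau removes it, producing a rank-1 approximation.
X = np.outer([1.0, 2.0], [3.0, 4.0]) + 0.1 * np.outer([1.0, -1.0], [1.0, 1.0])
Y = svt(X, tau=0.5)
rank_Y = np.linalg.matrix_rank(Y)
```

This illustrates why the relaxation is attractive: the nonconvex rank penalty is replaced by a convex term whose minimization step is a single SVD.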

Proceedings ArticleDOI
14 Jun 2020
TL;DR: This paper studies an alternative algebraic method for solving systems of polynomial equations, i.e., the sparse resultant-based method, and proposes a novel approach to convert the resultant constraint to an eigenvalue problem, which can significantly improve the efficiency and stability of existing resultant-based solvers.
Abstract: Many computer vision applications require robust and efficient estimation of camera geometry. The robust estimation is usually based on solving camera geometry problems from a minimal number of input data measurements, i.e. solving minimal problems in a RANSAC framework. Minimal problems often result in complex systems of polynomial equations. Many state-of-the-art efficient polynomial solvers for these problems are based on Gröbner bases and the action-matrix method, which has been automated and highly optimized in recent years. In this paper we study an alternative algebraic method for solving systems of polynomial equations, i.e., the sparse resultant-based method, and propose a novel approach to convert the resultant constraint to an eigenvalue problem. This technique can significantly improve the efficiency and stability of existing resultant-based solvers. We applied our new resultant-based method to a large variety of computer vision problems and show that for most of the considered problems, the new method leads to solvers that are the same size as the best available Gröbner basis solvers and of similar accuracy. For some problems the new sparse resultant-based method leads to even smaller and more stable solvers than the state-of-the-art Gröbner basis solvers. Our new method can be fully automated and incorporated into existing tools for automatic generation of efficient polynomial solvers, and as such it represents a competitive alternative to popular Gröbner basis methods for minimal problems in computer vision.

5 citations


Cites background or methods from "Using Sparse Elimination for Solvin..."

  • ...The most promising results in this direction were proposed by Emiris [12] and Heikkilä [18], where methods based on sparse resultants were proposed and applied to camera geometry problems....


  • ...The augmented polynomial system is solved by hiding λ and reducing a constraint similar to (2) into a regular eigenvalue problem that leads to smaller solvers than [12, 18]....

  • ...Our algorithm is inspired by the ideas explored in [18, 12], but thanks to the special form of added equation and by solving the resultant as a small eigenvalue problem, in contrast to a polynomial eigenvalue problem in [18], the new approach achieves significant improvements over [18, 12] in terms of efficiency of the generated solvers....

  • ...The algorithm by Heikkilä [18] basically computes the Minkowski sum of the Newton polytopes of a subset of the input polynomials, Q = Σ_i NP(f_i(x))....

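The Minkowski-sum construction quoted above can be sketched in a few lines (a toy sketch, not the cited implementation; the representation and the `minkowski_sum` helper are my own): each polynomial is represented by the exponent vectors of its monomials, whose convex hull is the Newton polytope NP(f), and the Minkowski sum of two polytopes is generated by the pairwise sums of their generating point sets.

```python
from itertools import product

def minkowski_sum(points_a, points_b):
    """Pairwise sums of two exponent-vector sets; these points generate the
    Minkowski sum NP(f1) + NP(f2) of the corresponding Newton polytopes."""
    return {tuple(x + y for x, y in zip(a, b)) for a, b in product(points_a, points_b)}

# Example: f1 = x^2 + y has exponents {(2,0), (0,1)}; f2 = x*y + 1 has
# exponents {(1,1), (0,0)}.
np_f1 = {(2, 0), (0, 1)}
np_f2 = {(1, 1), (0, 0)}
q = minkowski_sum(np_f1, np_f2)  # generators of Q = NP(f1) + NP(f2)
```

In the resultant construction, the lattice points of (a shifted copy of) Q then determine the candidate monomial basis for the solver matrix.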

Book ChapterDOI
Danda Pani Paudel, Luc Van Gool
08 Sep 2018
TL;DR: This paper addresses the problem of robustly autocalibrating a moving camera with constant intrinsics by using the Branch-and-Bound (BnB) search paradigm to maximize the consensus of the polynomials.
Abstract: This paper addresses the problem of robustly autocalibrating a moving camera with constant intrinsics. The proposed calibration method uses the Branch-and-Bound (BnB) search paradigm to maximize the consensus of the polynomials. These polynomials are parameterized by the entries of either the Dual Image of the Absolute Conic (DIAC) or the Plane-at-Infinity (PaI). During the BnB search, we exploit the theory of sampling algebraic varieties to test the positivity of any polynomial within a parameter's interval, i.e. to identify outliers with certainty. The search process explores the space of exact parameters (i.e. the entries of the DIAC or PaI), benefits from the solution of a local method, and converges to the solution satisfied by the largest number of polynomials. Given many polynomials on the sought parameters (with possibly overwhelmingly many from outlier measurements), their consensus for calibration is searched for two cases: simplified Kruppa's equations and modulus constraints, expressed in the DIAC and PaI, respectively. Our approach yields outstanding results in terms of robustness and optimality.

5 citations


References

Journal ArticleDOI
Zhengyou Zhang
TL;DR: A flexible technique to easily calibrate a camera is proposed that only requires the camera to observe a planar pattern shown at a few (at least two) different orientations; it advances 3D computer vision one more step from laboratory environments to real-world use.
Abstract: We propose a flexible technique to easily calibrate a camera. It only requires the camera to observe a planar pattern shown at a few (at least two) different orientations. Either the camera or the planar pattern can be freely moved. The motion need not be known. Radial lens distortion is modeled. The proposed procedure consists of a closed-form solution, followed by a nonlinear refinement based on the maximum likelihood criterion. Both computer simulation and real data have been used to test the proposed technique and very good results have been obtained. Compared with classical techniques which use expensive equipment such as two or three orthogonal planes, the proposed technique is easy to use and flexible. It advances 3D computer vision one more step from laboratory environments to real world use.

11,465 citations


"Using Sparse Elimination for Solvin..." refers methods in this paper

  • ...Well-known Zhang’s calibration method [23] provides a closed form solution to the calibration problem from images of a known planar target....

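The closed-form step of this calibration technique can be illustrated with a minimal numpy sketch on synthetic, noise-free homographies (the helper names `v`, `rot_x`, `rot_y` and the numeric values are mine, not from the paper): each homography H = [h1 h2 h3] of a planar target yields two linear constraints on the image of the absolute conic w = K^-T K^-1, namely h1^T w h2 = 0 and h1^T w h1 = h2^T w h2, and stacking them gives a null-space problem for the six entries of w.

```python
import numpy as np

def rot_x(a):
    c, s = np.cos(a), np.sin(a)
    return np.array([[1, 0, 0], [0, c, -s], [0, s, c]])

def rot_y(a):
    c, s = np.cos(a), np.sin(a)
    return np.array([[c, 0, s], [0, 1, 0], [-s, 0, c]])

def v(H, i, j):
    """Row vector such that v(H, i, j) @ b == h_i^T w h_j for symmetric w
    parameterized by b = (w11, w12, w22, w13, w23, w33)."""
    return np.array([H[0, i] * H[0, j],
                     H[0, i] * H[1, j] + H[1, i] * H[0, j],
                     H[1, i] * H[1, j],
                     H[2, i] * H[0, j] + H[0, i] * H[2, j],
                     H[2, i] * H[1, j] + H[1, i] * H[2, j],
                     H[2, i] * H[2, j]])

K = np.array([[800.0, 0, 320], [0, 780, 240], [0, 0, 1]])
rows = []
for R, t in [(rot_x(0.3) @ rot_y(0.2), [0.1, 0.0, 4.0]),
             (rot_x(-0.4) @ rot_y(0.5), [-0.2, 0.1, 5.0]),
             (rot_y(-0.3) @ rot_x(0.1), [0.0, -0.1, 6.0])]:
    H = K @ np.column_stack([R[:, 0], R[:, 1], t])  # plane-to-image homography
    rows += [v(H, 0, 1), v(H, 0, 0) - v(H, 1, 1)]   # two constraints per view

# The null vector of the stacked system recovers w up to scale.
b = np.linalg.svd(np.array(rows))[2][-1]
w = np.array([[b[0], b[1], b[3]], [b[1], b[2], b[4]], [b[3], b[4], b[5]]])
w_true = np.linalg.inv(K).T @ np.linalg.inv(K)  # w is proportional to this
```

With three views the system is overdetermined, so in practice the same SVD step absorbs measurement noise in a least-squares sense before the nonlinear refinement.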

Journal ArticleDOI
David Nistér
TL;DR: The algorithm is used in a robust hypothesize-and-test framework to estimate structure and motion in real-time with low delay and is the first algorithm well-suited for numerical implementation that also corresponds to the inherent complexity of the problem.
Abstract: An efficient algorithmic solution to the classical five-point relative pose problem is presented. The problem is to find the possible solutions for relative camera pose between two calibrated views given five corresponding points. The algorithm consists of computing the coefficients of a tenth degree polynomial in closed form and, subsequently, finding its roots. It is the first algorithm well-suited for numerical implementation that also corresponds to the inherent complexity of the problem. We investigate the numerical precision of the algorithm. We also study its performance under noise in minimal as well as overdetermined cases. The performance is compared to that of the well-known 8 and 7-point methods and a 6-point scheme. The algorithm is used in a robust hypothesize-and-test framework to estimate structure and motion in real-time with low delay. The real-time system uses solely visual input and has been demonstrated at major conferences.

1,834 citations


"Using Sparse Elimination for Solvin..." refers methods in this paper

  • ...Nistér [17] converted the resulting system of polynomial equations to a tenth degree univariate polynomial that can be efficiently solved using standard numerical techniques....

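The linear step behind the five-point problem can be sketched as follows (a hedged sketch on synthetic, noise-free data; `epipolar_rows` and the numeric setup are mine): each calibrated correspondence (q1, q2) gives one linear equation q2^T E q1 = 0 in the nine entries of the essential matrix E, so five points leave a four-dimensional null space, inside which the solver then enforces the cubic essential-matrix constraints to reach the tenth-degree polynomial.

```python
import numpy as np

def epipolar_rows(q1s, q2s):
    """One row per correspondence: kron(q2, q1) dotted with the row-major
    flattening of E evaluates q2^T E q1."""
    return np.array([np.kron(q2, q1) for q1, q2 in zip(q1s, q2s)])

# Synthetic two-view geometry: pure translation for simplicity.
rng = np.random.default_rng(0)
pts = rng.uniform(-1, 1, (5, 3)) + np.array([0, 0, 5.0])  # points in front
t = np.array([0.5, 0.1, 0.0])
R = np.eye(3)
tx = np.array([[0, -t[2], t[1]], [t[2], 0, -t[0]], [-t[1], t[0], 0]])
E = tx @ R  # essential matrix E = [t]_x R

q1s = [p / p[2] for p in pts]
q2s = [(R @ p + t) / (R @ p + t)[2] for p in pts]
A = epipolar_rows(q1s, q2s)
# vec(E) lies in the null space of the 5x9 matrix A.
residual = A @ E.reshape(-1)
```

The residual being zero confirms that the true essential matrix satisfies all five linear constraints; with noisy data one would instead take the four smallest right singular vectors of A as the null-space basis.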

Journal ArticleDOI
Olivier Faugeras, Francis Lustman
TL;DR: It is shown that when the environment is piecewise linear, it provides a powerful constraint on the kind of matches that exist between two images of the scene when the camera motion is unknown, and that this constraint can be recovered from an estimate of the matrix of this collineation.
Abstract: We show in this article that when the environment is piecewise linear, it provides a powerful constraint on the kind of matches that exist between two images of the scene when the camera motion is unknown. For points and lines located in the same plane, the correspondence between the two cameras is a collineation. We show that the unknowns (the camera motion and the plane equation) can be recovered, in general, from an estimate of the matrix of this collineation. The two-fold ambiguity that remains can be removed by looking at a second plane, by taking a third view of the same plane, or by using a priori knowledge about the geometry of the plane being looked at. We then show how to combine the estimation of the matrix of collineation and the obtaining of point and line matches between the two images, by a strategy of Hypothesis Prediction and Testing guided by a Kalman filter. We finally show how our approach can be used to calibrate a system of cameras.

545 citations


Proceedings ArticleDOI
Andrew Fitzgibbon
01 Dec 2001
TL;DR: This paper shows how linear estimation of the fundamental matrix from two-view point correspondences may be augmented to include one term of radial lens distortion, by expressing fundamental matrix estimation as a quadratic eigenvalue problem (QEP), for which efficient algorithms are well known.
Abstract: A problem in uncalibrated stereo reconstruction is that cameras which deviate from the pinhole model have to be pre-calibrated in order to correct for nonlinear lens distortion. If they are not, and point correspondence is attempted using the uncorrected images, the matching constraints provided by the fundamental matrix must be set so loose that point matching is significantly hampered. This paper shows how linear estimation of the fundamental matrix from two-view point correspondences may be augmented to include one term of radial lens distortion. This is achieved by (1) changing from the standard radial-lens model to another which (as we show) has equivalent power, but which takes a simpler form in homogeneous coordinates, and (2) expressing fundamental matrix estimation as a quadratic eigenvalue problem (QEP), for which efficient algorithms are well known. I derive the new estimator, and compare its performance against bundle-adjusted calibration-grid data. The new estimator is fast enough to be included in a RANSAC-based matching loop, and we show cases of matching being rendered possible by its use. I show how the same lens can be calibrated in a natural scene where the lack of straight lines precludes most previous techniques. The modification when the multi-view relation is a planar homography or trifocal tensor is described.

535 citations


"Using Sparse Elimination for Solvin..." refers background or methods in this paper

  • ...In [8] the minimal problem of computing the radial distortion coefficient was expressed as a quadratic polynomial eigenvalue problem and later it was extended in [16] to include an additional constraint....

  • ...Fitzgibbon [8] augmented the fundamental matrix estimation to include one term of radial lens distortion, and solved them from 9 point correspondences....

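The QEP machinery this approach relies on can be sketched generically (a minimal numpy sketch; the first companion linearization shown here is the textbook construction, not code from the paper, and `solve_qep` is my own naming): a quadratic eigenvalue problem (λ²A + λB + C)z = 0 is linearized into a generalized eigenproblem of twice the size and handed to an ordinary eigensolver.

```python
import numpy as np

def solve_qep(A, B, C):
    """Eigenvalues of (lambda^2 A + lambda B + C) z = 0 via the first
    companion linearization; assumes A is invertible."""
    n = A.shape[0]
    I, Z = np.eye(n), np.zeros((n, n))
    # [ 0   I ] [ z  ]            [ I  0 ] [ z  ]
    # [-C  -B ] [ lz ]  = lambda  [ 0  A ] [ lz ]
    L = np.block([[Z, I], [-C, -B]])
    M = np.block([[I, Z], [Z, A]])
    return np.linalg.eigvals(np.linalg.solve(M, L))

# Scalar sanity check: lambda^2 - 3*lambda + 2 has roots 1 and 2.
vals = solve_qep(np.array([[1.0]]), np.array([[-3.0]]), np.array([[2.0]]))
```

This is what makes QEP formulations attractive for minimal problems: once the constraints are arranged into the three coefficient matrices, the solution reduces to standard, well-conditioned linear algebra.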

Proceedings ArticleDOI
01 Jun 1991
TL;DR: It is shown that even in cases where the solution is not near the geometrically unstable region, considerable care must be exercised in the calculation, and an analytical method is presented which produces a numerically stable calculation.
Abstract: The major direct solutions to the three-point perspective pose estimation problem are reviewed from a unified perspective. The numerical stability of these three-point perspective solutions is discussed. It is shown that even in cases where the solution is not near the geometrically unstable region, considerable care must be exercised in the calculation. Depending on the order of the substitutions utilized, the relative error can change over a thousand to one. This difference is due entirely to the way the calculations are performed and not to any geometric structural instability of any problem instance. An analytical method is presented which produces a numerically stable calculation.

335 citations


"Using Sparse Elimination for Solvin..." refers background in this paper

  • ...Such minimal problems include, for example, the classical P3P (Perspective-Three-Point) problem for a calibrated camera, where an image of three points with known distances is sufficient to compute the camera pose, but it requires solving a system of three quadratic equations in three variables [9]....

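The three quadratics referred to above are the law-of-cosines constraints on the unknown point depths; a hedged numeric sketch (synthetic data, my own helper name `p3p_residuals`) shows what a P3P solver must zero out: for depths d_i along unit viewing rays u_i and known inter-point distances D_ij, each pair gives d_i² + d_j² − 2 d_i d_j (u_i·u_j) = D_ij².

```python
import numpy as np

def p3p_residuals(d, rays, D2):
    """Residuals of the three P3P quadratics.
    d: depths (3,), rays: unit viewing rays (3,3), D2: squared distances."""
    res = []
    for (i, j), dij2 in D2.items():
        c = rays[i] @ rays[j]  # cosine of the angle between the two rays
        res.append(d[i] ** 2 + d[j] ** 2 - 2 * d[i] * d[j] * c - dij2)
    return np.array(res)

# Synthetic check: from known 3D points, the true depths and distances
# must satisfy all three constraints exactly.
pts = np.array([[0.0, 0.0, 5.0], [1.0, 0.0, 6.0], [0.0, 1.0, 4.0]])
rays = pts / np.linalg.norm(pts, axis=1, keepdims=True)
d_true = np.linalg.norm(pts, axis=1)
D2 = {(0, 1): float(np.sum((pts[0] - pts[1]) ** 2)),
      (0, 2): float(np.sum((pts[0] - pts[2]) ** 2)),
      (1, 2): float(np.sum((pts[1] - pts[2]) ** 2))}
r = p3p_residuals(d_true, rays, D2)
```

A minimal P3P solver finds all depth triples that zero these residuals simultaneously (up to four physically valid solutions), after which the camera pose follows by absolute orientation.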

Performance Metrics

No. of citations received by the paper in previous years:

Year | Citations
2021 | 1
2020 | 4
2019 | 1
2018 | 2