On Symmetry and Multiple-View Geometry: Structure, Pose, and Calibration from a Single Image

01 Dec 2004-International Journal of Computer Vision (Kluwer Academic Publishers)-Vol. 60, Iss: 3, pp 241-265


International Journal of Computer Vision 60(3), 241–265, 2004
© 2004 Kluwer Academic Publishers. Manufactured in The Netherlands.
On Symmetry and Multiple-View Geometry:
Structure, Pose, and Calibration from a Single Image
WEI HONG, ALLEN YANG YANG, KUN HUANG AND YI MA
Department of Electrical & Computer Engineering, University of Illinois at Urbana-Champaign,
1308 West Main St., Urbana, IL 61801, USA
weihong@uiuc.edu
yangyang@uiuc.edu
kunhuang@uiuc.edu
yima@uiuc.edu
Received October 16, 2002; Revised March 17, 2004; Accepted March 17, 2004
Abstract. In this paper, we provide a principled explanation of how knowledge in global 3-D structural invariants,
typically captured by a group action on a symmetric structure, can dramatically facilitate the task of reconstructing
a 3-D scene from one or more images. More importantly, since every symmetric structure admits a “canonical”
coordinate frame with respect to which the group action can be naturally represented, the canonical pose between
the viewer and this canonical frame can be recovered too, which explains why symmetric objects (e.g., buildings)
provide us overwhelming clues to their orientation and position. We give the necessary and sufficient conditions
in terms of the symmetry (group) admitted by a structure under which this pose can be uniquely determined. We
also characterize, when such conditions are not satisfied, to what extent this pose can be recovered. We show how
algorithms from conventional multiple-view geometry, after properly modified and extended, can be directly applied
to perform such recovery, from all “hidden images” of one image of the symmetric structure. We also apply our
results to a wide range of applications in computer vision and image processing such as camera self-calibration,
image segmentation and global orientation, large baseline feature matching, image rendering and photo editing, as
well as visual illusions (caused by symmetry if incorrectly assumed).
Keywords: structure from symmetry, multiple-view geometry, symmetry group, reflective symmetry, rotational
symmetry, and translational symmetry
1. Introduction
One of the main goals of computer vision is the study
of how to infer three-dimensional (3-D) information
(e.g., shape, layout and motion) of a scene from its
two-dimensional (2-D) image(s). A particular thrust of
effort is to extract 3-D geometric information from 2-
D images by exploiting geometric relationships among
multiple images of the same set of features on a 3-D
object. This gives rise to the subject of multiple-view
geometry, a primary focus of study in the computer
vision community for the past two decades or so.
Unfortunately, certain relationships among features
themselves have been, to a large extent, ignored or at
least under-studied. Some of those relationships, as we
will see from this paper, have significant impact on the
way that 3-D information can be (and should be)
inferred from images.

∗This work is supported by UIUC ECE/CSL startup fund and NSF
Career Award IIS-0347456.
Before we proceed further, let us pause and examine
the images given in Fig. 1 below. What do they have
in common? Notice that these images are just a few

242 Hong et al.
Figure 1. Symmetry is in: architecture, machines, textures, crystals, molecules, ornaments, and nature, etc.
representatives of a common phenomenon exhibited in
nature and man-made environments: symmetry. It is not
so hard to convince ourselves that even from only a
single image, we are able to perceive clearly the 3-D
structure and relative pose (orientation and location)
of the object being seen, even though in the image the
shape of the object is distorted by the perspective
projection. The reason is, simply put, that there is
symmetry at play.¹
The goals of this paper are to provide a principled
explanation why symmetry could encode 3-D informa-
tion within a single perspective image and to develop
algorithms based on multiple-view geometry that ef-
ficiently extract the 3-D information from single im-
ages. There are two things which we want to point out
already:
1. Symmetry is not the only cue which encodes 3-D in-
formation through relationships among a set of fea-
tures (in one image or more images). For instance,
incidence relations among points, lines, and planes
may as well provide 3-D information to the viewer;
2. The concept of symmetry that we consider here is
not just the (bilateral) reflective symmetry or the
(statistical) isotropic symmetry which has been
studied to a certain extent in the computer vision
literature. Instead it is a more general notion describing
global structural invariants of an object under the
action of any group of transformations. To clarify
this notion is one of the goals of this paper.
Symmetry, as a useful geometric cue to 3-D informa-
tion, has been extensively discussed in psychological
vision literature (Marr, 1982; Palmer, 1999). Nevertheless,
its contribution to computational vision so far has
been explored often through statistical methods, such as
the study of isotropic textures (e.g., for the 4th image of
Fig. 1) (Gibson, 1950; Witkin, 1988; Zabrodsky et al.,
1995; Mukherjee et al., 1995; Malik and Rosenholtz,
1997; Rosenholtz and Malik, 1997; Leung and Malik,
1997). It is the works of Garding (1992, 1993) and
Malik and Rosenholtz (1997) that have provided peo-
ple a wide range of efficient algorithms for recov-
ering the shape (i.e. the slant and tilt) of a textured
plane based on the assumption of isotropy (or weak
isotropy). These methods are mainly based on collect-
ing statistical characteristics (e.g., the distribution of
edge directions) from sample patches of the texture
and comparing them with those of adjacent patches
against the isotropic hypothesis. Information about the

On Symmetry and Multiple-View Geometry 243
surface shape is then often conveniently encoded in the
discrepancy or variation of these characteristics.
But symmetry is by nature a geometric property!
Although in many cases the result of symmetry indeed
causes certain statistical homogeneity (like the 4th im-
age of Fig. 1), there are reasons to believe that more
accurate and reliable 3-D geometric information can
be retrieved if we can directly exploit this property
through geometric means. For example, for the texture
shown in the 4th image of Fig. 1, shouldn’t we directly
exploit the fact that the tiling is invariant under certain
proper translations parallel to the plane? To a large ex-
tent, such a geometric approach is complementary to
extant statistical approaches: if statistical homogeneity
can be exploited for shape recovery, so can geometric
homogeneity, especially in cases where symmetry is the
underlying cause for such homogeneity. Of course, for
cases where statistical methods no longer apply (e.g.,
the 5th image of Fig. 1), geometric methods remain
the only option. One may call this approach structure
from symmetry.
We are by no means the first to notice that symme-
try, especially reflective symmetry, can be exploited
by geometric means for retrieving 3-D geometric in-
formation. Mitsumoto et al. (1992) studied how to re-
construct a 3-D object using the mirror image based
on planar symmetry, Vetter and Poggio (1994) proved
that for any reflective symmetric 3-D object one non-
accidental 2-D model view is sufficient for recognition,
Zabrodsky and Weinshall (1997) used bilateral symme-
try assumption to improve 3-D reconstruction from im-
age sequences, and Zabrodsky et al. (1995) provided
a good survey on studies of reflective symmetry and
rotational symmetry in computer vision at the time.
In 3-D object and pose recognition, Rothwell et al.
(1993) pointed out that the assumption of reflective
symmetry can be used in the construction of projective
invariants and is able to eliminate certain restrictions
on the corresponding points. Cham and Cipolla (1996)
built the correspondences of contours from reflective
symmetry. For translational symmetry, Schaffalitzky
and Zisserman (2000) used it to detect the vanish-
ing lines and points. Liu et al. (1995) analyzed the
error of obtaining 3-D invariants derived from trans-
lational symmetry. In addition to isometric symme-
try, Liebowitz and Zisserman (1998), Criminisi and
Zisserman (1999) and Criminisi and Zisserman (2000)
showed that other knowledge (e.g., length ratio, vanish-
ing line, etc.) in 3-D also allows accurate reconstruction
of structural metric and camera pose.
For the detection of symmetry from images, Marola
(1989), Kiryati and Gofman (1998) and Mukherjee
et al. (1995) presented efficient algorithms to find axes
of reflective symmetry in 2-D images, Sun and Sherrah
(1997) discussed reflective symmetry detection in 3-D
space, and Zabrodsky et al. (1995) introduced a symme-
try distance to classify reflective and rotational symme-
try in 2-D and 3-D spaces (with some related comments
given in Kanatani (1997)). Carlsson (1998) and Gool
et al. (1996) derived methods to find 3-D symmetry
from invariants in the 2-D projection. Liu and Collins
(1998) proposed a method to classify any images with
translational symmetry into the 7 Frieze groups and 17
wallpaper groups.
However, there is still a lack of formal and unified
analysis as well as efficient algorithms which would
allow people to easily make use of numerous and dif-
ferent types of symmetry that nature offers. Is there a
unified approach to study 3-D information encoded in a
2-D perspective image of an object that exhibits certain
symmetry? This paper will try to provide a definite an-
swer to this question. Our work differs from previous
results in at least the following three aspects:
1. We study symmetry under perspective projection
based on existing theory of multiple-view geometry.²
We claim that in order to fully understand
such 3-D information encoded in a single image,
one must understand geometry among multiple
images.
2. In addition to recovering the 3-D structure of a
symmetric object from its image, we show that any type of
symmetry is naturally equipped with a canonical
(world) coordinate frame, from which the viewer’s
relative pose to the object can be recovered.
3. We give the necessary and sufficient conditions in
terms of the symmetry group of the object under
which the canonical pose can be uniquely recov-
ered, and we characterize the inherent ambiguity for
each fundamental type of symmetry. Thus, for the
first time, geometric group theory and (perspective)
multiple-view geometry are elegantly and tightly
integrated.
During the development, an important principle asso-
ciated with images of symmetric objects will be ex-
amined with care: One image of a symmetric object is
equivalent to multiple images. This principle is how-
ever not entirely correct since, as we will see, often
relationships among such “images” will not be the

same as those among conventional images. It in fact
requires careful modifications to existing theories and
algorithms in multiple-view geometry if they are to be
correctly applied to images of symmetric objects.
2. Problem Formulation
Before we formulate the problem in a more abstract
form, let us take a look at a simple example: a planar
board with a symmetric pattern as shown in Fig. 2. It is
easy to see that, from any generic viewpoint, there are
at least four equivalent vantage points (with only the
rotational symmetry considered, for now) which give
rise to an identical image. The only question is which
corners in the image correspond to the ones on the
board. In this sense, these images are in fact different
from the original one. We may call these images
“hidden.”³ For instance, in Fig. 2, we have labeled in
brackets the corresponding corner numbers for such a hidden image.
Figure 2. Left: a checker board whose symmetry includes reflection along the x and y axes and rotation about o by 90°. Right: an image taken
at location o_1. Notice that the image would appear to be exactly the same if it were taken at o_2 instead. g_0 is the relative pose of the board we
perceive from the image on the right.
Figure 3. I_x: Corner correspondence between the original image of the board and an “image” with the board reflected in the x-axis by 180°;
I_y: Corner correspondence between the original image of the board and an “image” with the board reflected in the y-axis by 180°.
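The four equivalent vantage points can be checked directly. The following sketch (in Python; the corner labels and coordinates are illustrative choices of ours, not taken from the paper) verifies that each rotation by a multiple of 90° maps the corner set of a square board onto itself, merely permuting the labels, which is why all four images are pixel-identical while their point-to-point correspondences differ:

```python
import math

# Hypothetical corners of the board in its canonical frame, labeled 1..4
# (labels and coordinates are illustrative, not from the paper).
corners = {1: (1, 1), 2: (-1, 1), 3: (-1, -1), 4: (1, -1)}

def rotate(p, theta):
    """Rotate a 2-D point about the center o (the origin) by angle theta."""
    c, s = math.cos(theta), math.sin(theta)
    x, y = p
    return (round(c * x - s * y, 9), round(s * x + c * y, 9))

# The rotational symmetry group: rotations by 0, 90, 180, and 270 degrees.
for k in range(4):
    theta = k * math.pi / 2
    rotated = {rotate(p, theta) for p in corners.values()}
    # The corner SET is invariant; the group action only permutes labels,
    # so all four vantage points yield an identical image.
    assert rotated == set(corners.values())

# The label permutation induced by a single 90-degree rotation:
perm = {a: b for a, p in corners.items()
        for b, q in corners.items() if rotate(p, math.pi / 2) == q}
print(perm)  # {1: 2, 2: 3, 3: 4, 4: 1}
```

Each hidden image thus carries the same pixels but a different, and informative, corner correspondence.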
In addition to the rotational symmetry, another kind
of symmetry, the reflective symmetry, can give rise to
a not so conventional type of hidden images, as shown
in Fig. 3. Notice that, in the figure, the two “hidden
images” with the four corners labeled by numbers in
brackets cannot be an image of the same board from any
(physically viable) vantage point!⁴ Nevertheless, as we
will see below, just like the rotational symmetry, this
type of hidden images also encodes rich 3-D geometric
information about the object.
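Why no physical vantage point can produce these reflective hidden images follows from a determinant argument: a reflection is orientation-reversing (determinant −1), and composing it with any rotation (determinant +1) still gives determinant −1, which no rigid motion achieves. A small sketch (our own illustration; the matrices are assumed examples):

```python
import math

# A reflection of 3-D space in the yz-plane: orthogonal, but with
# determinant -1, so it lies in O(3) and not SO(3) (illustrative example).
reflect = [[-1, 0, 0],
           [0, 1, 0],
           [0, 0, 1]]

def det3(m):
    """Determinant of a 3x3 matrix by cofactor expansion."""
    a, b, c = m[0]
    d, e, f = m[1]
    g, h, i = m[2]
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def matmul3(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

theta = 0.7  # an arbitrary rotation angle for a candidate vantage point
rot = [[math.cos(theta), -math.sin(theta), 0],
       [math.sin(theta), math.cos(theta), 0],
       [0, 0, 1]]

print(det3(reflect))                       # -1: orientation-reversing
print(round(det3(matmul3(rot, reflect))))  # still -1 after any rotation
```

Since the rotational part of every rigid motion has determinant +1, the reflected “hidden image” can never be realized by physically moving the camera; yet, as the text notes, it still encodes rich 3-D information.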
There is yet another type of symmetry “hidden” in
a pattern like a checker board. As shown in Fig. 4 be-
low, for a pattern that repeats a fundamental region
indefinitely along one or more directions, the so-called
“infinite rapport,” one would obtain exactly “the same”
image had the images been taken at vantage points that
differ from each other by multiples nT of one basic
translation T . Although all images would appear to be
the same, features (e.g., points, lines) in these images
correspond to different physical features in the world.

Figure 4. The checker pattern is repeated indefinitely along the
x-axis. Images taken at o_1, o_2, and o_3 will be the same.
Therefore, for an image like the 4th one in Fig. 1, it in
fact may give rise to many (in theory, possibly infinitely
many) “hidden images.” There is clearly a reason to
believe that it is these (many) hidden images that give
away the geometry of the plane (e.g., tilt, slant) to the
viewer’s eyes.
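This can be made concrete with a toy perspective camera. In the sketch below (a pinhole model with an assumed focal length; the pattern and spacing are our own illustrative choices), a pattern repeating along the x-axis with basic translation T yields, up to the truncation boundary, the same image from vantage points o_1 and o_2 = o_1 + T, even though each image point now corresponds to a different physical feature:

```python
T = 2.0   # basic translation of the repeating pattern (assumed)
f = 1.0   # focal length of a toy pinhole camera (assumed)

def project(point, camera_x):
    """Perspective projection of a point (x, y, z), with the camera
    translated along the x-axis to position camera_x."""
    x, y, z = point
    return (round(f * (x - camera_x) / z, 9), round(f * y / z, 9))

# A (truncated) periodic pattern: one feature repeated along x at spacing T,
# lying on a plane at depth z = 4.
pattern = [(n * T, 0.5, 4.0) for n in range(-50, 51)]

img1 = {project(p, 0.0) for p in pattern}   # image taken at o_1
img2 = {project(p, T) for p in pattern}     # image taken at o_2 = o_1 + T

# Away from the ends of the truncated pattern the two images coincide,
# even though matching image points come from different physical features.
overlap = img1 & img2
print(len(img1), len(overlap))  # 101 100
```

For a truly infinite pattern the overlap would be total, which is exactly the “infinite rapport” of hidden images described above.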
It is then not hard to imagine that the combination of
the rotational, reflective and translational symmetries
will give rise to all sorts of symmetric objects in 2-D
or 3-D space, many of which could be rather compli-
cated. In our man-made world, symmetric objects are
ubiquitous, under the names of “ornament,” “mosaic,”
“pattern,” or “tiling,” etc. Fascination with symmetric
objects can be traced back to the ancient Egyptians and
Greeks.⁵ Nevertheless, a formal mathematical inquiry
into symmetry is known as Hilbert’s 18th problem, and
a complete answer to it was not found until 1910 by
Bieberbach (1910). While in the appendix we briefly
review results of a complete list for 2-D and 3-D sym-
metric structures and groups, this paper will focus on
how to combine this knowledge about symmetry with
multiple-view geometry so as to infer 3-D information
of a symmetric object from its image(s).
In order to explain why symmetry gives away accu-
rate information about structure and location of a sym-
metric 3-D object from a single 2-D perspective image,
we will need a mathematical framework within which
all types of symmetries (that we have mentioned or not
mentioned in the above examples) can be uniformly
taken into account. Only if we can do that, will the in-
troduction of symmetry into multiple-view geometry
become natural and convenient.
Definition 1 (Symmetric structure and its group action).
A set of points S ⊂ R^3 is called a symmetric structure if
there exists a non-trivial subgroup G of the Euclidean
group E(3) that acts on it. That is, for any element
g ∈ G, it defines a bijection (i.e., a one-to-one, onto)
map from S to itself:

    g ∈ G : S → S.

Sometimes we say that S has a symmetry group G, or
that G is a group of symmetries of S.

In particular, we have g(S) = g^(-1)(S) = S for any
g ∈ G. Mathematically, symmetric structures and
groups are equivalent ways to capture symmetry: any
symmetric structure is invariant under the action of its
symmetry group; and any group (here as a subgroup
of E(3)) defines a class of (3-D) structures that are
invariant under this group action (see Appendix A). Here
we emphasize that G is in general a subgroup of the
Euclidean group E(3) but not of the special one SE(3).
This is because many symmetric structures that we are
going to consider are invariant under reflection, which
is an element of O(3) but not of SO(3).⁶ For simplicity,
in this paper we consider G to be a discontinuous (or
discrete) group.⁷
Using the homogeneous representation of E(3), any
element g = (R, T) in G can be represented as a 4 × 4
matrix of the form

    g = [ R  T ]
        [ 0  1 ]  ∈ R^(4×4),        (1)

where R ∈ R^(3×3) is an orthogonal matrix (“R” for both
rotation and reflection) and T ∈ R^3 is a vector (“T” for
translation). Note that in order to represent G in this
way, a world coordinate frame must have been chosen.
It is conventional to choose the origin of the world
coordinate frame to be the center of rotation and its
axes to line up with the axes of rotation and reflection
or the direction of translation. Often the canonical world
coordinate frame results in the simplest representation
of the symmetry (Ma et al., 2003).
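The representation in Eq. (1) is straightforward to manipulate. The following sketch (plain Python; the particular rotation and reflection are illustrative choices of ours, not taken from the paper) builds two elements of the form [R T; 0 1], checks that their product keeps the same homogeneous form, and applies one to a point in homogeneous coordinates:

```python
def make_g(R, T):
    """Homogeneous 4x4 representation g = [R T; 0 1] of an element of E(3)."""
    return [R[0] + [T[0]], R[1] + [T[1]], R[2] + [T[2]], [0, 0, 0, 1]]

def matmul4(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

# A rotation by 90 degrees about the z-axis ("R" as a rotation) ...
Rz = [[0, -1, 0], [1, 0, 0], [0, 0, 1]]
# ... and a reflection in the yz-plane ("R" as a reflection, in O(3) \ SO(3)).
Rf = [[-1, 0, 0], [0, 1, 0], [0, 0, 1]]

g1 = make_g(Rz, [0, 0, 0])
g2 = make_g(Rf, [0, 0, 0])

# Closure: the product again has the form [R T; 0 1].
g12 = matmul4(g1, g2)
print(g12[3])  # [0, 0, 0, 1]

# The action on a point, written in homogeneous coordinates:
p = [1, 2, 3, 1]
q = [sum(g1[i][j] * p[j] for j in range(4)) for i in range(4)]
print(q)  # [-2, 1, 3, 1]
```

The bottom row [0, 0, 0, 1] is what keeps rotations, reflections, and translations composable by ordinary matrix multiplication.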
Now suppose that an image of a symmetric structure
S is taken at a vantage point g_0 = (R_0, T_0) ∈ SE(3),
denoting the pose of the structure relative to the viewer
or the camera. Here g_0 is assumed to be represented
with respect to the canonical world coordinate frame
for the symmetry. If so, we call g_0 the canonical pose.
As we will see shortly, the canonical pose g_0 from the
viewer to the object can be uniquely determined from
a single image as long as the symmetry admitted by the
object (or the scene) is “rich” enough.
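The role of the canonical pose can be sketched numerically. If S is invariant under a symmetry g, i.e. g(S) = S, then the image taken at pose g_0 coincides, as a set of image points, with the image that would be taken at the “hidden” pose g_0 g. The example below is our own illustration (a square structure, a 90° rotational symmetry, and an arbitrarily assumed g_0):

```python
import math

# A symmetric structure S: the four corners of a square in the z = 0 plane.
S = [(1, 1, 0), (-1, 1, 0), (-1, -1, 0), (1, -1, 0)]

def apply(R, T, p):
    """Apply the Euclidean motion (R, T) to the point p."""
    return tuple(sum(R[i][j] * p[j] for j in range(3)) + T[i] for i in range(3))

def project(p):
    """Pinhole projection with unit focal length."""
    x, y, z = p
    return (round(x / z, 9), round(y / z, 9))

# g: rotation by 90 degrees about the z-axis, a symmetry of S (g(S) = S).
Rg = [[0, -1, 0], [1, 0, 0], [0, 0, 1]]

# g0 = (R0, T0): an assumed canonical pose of the structure w.r.t. the camera.
c, s = math.cos(0.3), math.sin(0.3)
R0 = [[1, 0, 0], [0, c, -s], [0, s, c]]
T0 = [0.2, 0.1, 5.0]

img_at_g0 = {project(apply(R0, T0, p)) for p in S}
# Pose g0*g: first move the points by g, then by g0.
img_at_g0g = {project(apply(R0, T0, apply(Rg, [0, 0, 0], p))) for p in S}

print(img_at_g0 == img_at_g0g)  # True: the hidden image equals the real one
```

Each element of the symmetry group thus contributes one hidden image at pose g_0 g, and it is from these hidden images that g_0 itself can be recovered.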

Citations
Proceedings ArticleDOI
17 Oct 2005
TL;DR: This work shows that it can estimate the coarse geometric properties of a scene by learning appearance-based models of geometric classes, even in cluttered natural scenes, and provides a multiple-hypothesis framework for robustly estimating scene structure from a single image and obtaining confidences for each geometric label.
Abstract: Many computer vision algorithms limit their performance by ignoring the underlying 3D geometric structure in the image. We show that we can estimate the coarse geometric properties of a scene by learning appearance-based models of geometric classes, even in cluttered natural scenes. Geometric classes describe the 3D orientation of an image region with respect to the camera. We provide a multiple-hypothesis framework for robustly estimating scene structure from a single image and obtaining confidences for each geometric label. These confidences can then be used to improve the performance of many other applications. We provide a thorough quantitative evaluation of our algorithm on a set of outdoor images and demonstrate its usefulness in two applications: object detection and automatic single-view reconstruction.

792 citations

Book ChapterDOI
07 May 2006
TL;DR: It is shown how symmetric pairs of features can be efficiently detected, how the symmetry bonding each pair is extracted and evaluated, and how these can be grouped into symmetric constellations that specify the dominant symmetries present in the image.
Abstract: A novel and efficient method is presented for grouping feature points on the basis of their underlying symmetry and characterising the symmetries present in an image. We show how symmetric pairs of features can be efficiently detected, how the symmetry bonding each pair is extracted and evaluated, and how these can be grouped into symmetric constellations that specify the dominant symmetries present in the image. Symmetries over all orientations and radii are considered simultaneously, and the method is able to detect local or global symmetries, locate symmetric figures in complex backgrounds, detect bilateral or rotational symmetry, and detect multiple incidences of symmetry.

387 citations

Book
17 Jun 2010
TL;DR: A computational treatment of symmetry and group theory, the ultimate mathematical formalization of symmetry, has the potential to play an important role in the computational sciences.
Abstract: In the arts and sciences, as well as in our daily lives, symmetry has made a profound and lasting impact. Likewise, a computational treatment of symmetry and group theory (the ultimate mathematical formalization of symmetry) has the potential to play an important role in computational sciences. Though the term Computational Symmetry was formally defined a decade ago by the first author, referring to algorithmic treatment of symmetries, seeking symmetry from digital data has been attempted for over four decades. Computational symmetry on real world data turns out to be challenging enough that, after decades of effort, a fully automated symmetry-savvy system remains elusive for real world applications. The recent resurging interests in computational symmetry for computer vision and computer graphics applications have shown promising results. Recognizing the fundamental relevance and potential power that computational symmetry affords, we offer this survey to the computer vision and computer graphics communities. This survey provides a succinct summary of the relevant mathematical theory, a historic perspective of some important symmetry-related ideas, a partial yet timely report on the state of the arts symmetry detection algorithms along with its first quantitative benchmark, a diverse set of real world applications, suggestions for future directions and a comprehensive reference list.

235 citations


Cites background from "On Symmetry and Multiple-View Geome..."

  • ...The key observation there is that the symmetry group actions associated with any symmetric structure allows us to interpret a single perspective image of the structure as multiple images, called “hidden images” in [102], taken from viewpoints related by the same group actions....


  • ...In particular, [102] has provided a clear characterization of the relationship between 3D symmetric structures and their 2D perspective images....


  • ...Based on these constraints, [102] has derived the...


  • ...Interested readers may refer to [102] for a more complete survey on that subject....


Journal ArticleDOI
27 Jul 2014
TL;DR: This work presents a method that enables users to perform the full range of 3D manipulations, including scaling, rotation, translation, and nonrigid deformations, to an object in a photograph.
Abstract: Photo-editing software restricts the control of objects in a photograph to the 2D image plane. We present a method that enables users to perform the full range of 3D manipulations, including scaling, rotation, translation, and nonrigid deformations, to an object in a photograph. As 3D manipulations often reveal parts of the object that are hidden in the original photograph, our approach uses publicly available 3D models to guide the completion of the geometry and appearance of the revealed areas of the object. The completion process leverages the structure and symmetry in the stock 3D model to factor out the effects of illumination, and to complete the appearance of the object. We demonstrate our system by producing object manipulations that would be impossible in traditional 2D photo-editing programs, such as turning a car over, making a paper-crane flap its wings, or manipulating airplanes in a historical photograph to change its story.

187 citations


Cites methods from "On Symmetry and Multiple-View Geome..."

  • ...In using symmetries to complete appearance, our work is related to approaches that extract symmetries from images and 3D models [Hong et al. 2004; Gal and Cohen-Or 2006; Pauly et al. 2005], and that use symmetries to complete geometry [Terzopoulos et al....


  • ...In using symmetries to complete appearance, our work is related to approaches that extract symmetries from images and 3D models [Hong et al. 2004; Gal and Cohen-Or 2006; Pauly et al. 2005], and that use symmetries to complete geometry [Terzopoulos et al. 1987; Mitra et al. 2006; Mitra and Pauly…...


Proceedings ArticleDOI
14 Jun 2020
TL;DR: HybridPose as discussed by the authors utilizes a hybrid intermediate representation to express different geometric information in the input image, including keypoints, edge vectors, and symmetry correspondences, which allows pose regression to exploit more and diverse features when one type of predicted representation is inaccurate.
Abstract: We introduce HybridPose, a novel 6D object pose estimation approach. HybridPose utilizes a hybrid intermediate representation to express different geometric information in the input image, including keypoints, edge vectors, and symmetry correspondences. Compared to a unitary representation, our hybrid representation allows pose regression to exploit more and diverse features when one type of predicted representation is inaccurate (e.g., because of occlusion). Different intermediate representations used by HybridPose can all be predicted by the same simple neural network, and outliers in predicted intermediate representations are filtered by a robust regression module. Compared to state-of-the-art pose estimation approaches, HybridPose is comparable in running time and is significantly more accurate. For example, on Occlusion Linemod dataset, our method achieves a prediction speed of 30 fps with a mean ADD(-S) accuracy of 79.2%, representing a 67.4% improvement from the current state-of-the-art approach.

126 citations


Cites background from "On Symmetry and Multiple-View Geome..."

  • ...Traditional applications of symmetry detection include face recognition [31], depth estimation [21], and 3D reconstruction [13, 43]....


References
Journal ArticleDOI
TL;DR: New results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form that provide the basis for an automatic system that can solve the Location Determination Problem under difficult viewing.
Abstract: A new paradigm, Random Sample Consensus (RANSAC), for fitting a model to experimental data is introduced. RANSAC is capable of interpreting/smoothing data containing a significant percentage of gross errors, and is thus ideally suited for applications in automated image analysis where interpretation is based on the data provided by error-prone feature detectors. A major portion of this paper describes the application of RANSAC to the Location Determination Problem (LDP): Given an image depicting a set of landmarks with known locations, determine that point in space from which the image was obtained. In response to a RANSAC requirement, new results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form. These results provide the basis for an automatic system that can solve the LDP under difficult viewing

23,396 citations

Journal ArticleDOI
TL;DR: It is proved the convergence of a recursive mean shift procedure to the nearest stationary point of the underlying density function and, thus, its utility in detecting the modes of the density.
Abstract: A general non-parametric technique is proposed for the analysis of a complex multimodal feature space and to delineate arbitrarily shaped clusters in it. The basic computational module of the technique is an old pattern recognition procedure: the mean shift. For discrete data, we prove the convergence of a recursive mean shift procedure to the nearest stationary point of the underlying density function and, thus, its utility in detecting the modes of the density. The relation of the mean shift procedure to the Nadaraya-Watson estimator from kernel regression and the robust M-estimators of location is also established. Algorithms for two low-level vision tasks, discontinuity-preserving smoothing and image segmentation, are described as applications. In these algorithms, the only user-set parameter is the resolution of the analysis, and either gray-level or color images are accepted as input. Extensive experimental results illustrate their excellent performance.

11,727 citations


"On Symmetry and Multiple-View Geome..." refers methods in this paper

  • ...Using the techniques introduced earlier in this paper, we can test whether certain image segments, obtained by other low-level segmentation algorithms such as mean shift (Comanicu and Meer, 2002), can be the perspective projection of symmetric objects in 3-D....


Book
01 Jan 1950

3,843 citations


"On Symmetry and Multiple-View Geome..." refers background in this paper

  • ...Nevertheless, its contribution to computational vision so far has been explored often through statistical methods, such as the study of isotropic textures (e.g., for the 4th image of Fig. 1) ( Gibson, 1950; Witkin, 1988; Zabrodsky et al., 1995; Mukherjee et al., 1995; Malik and Rosenholtz, 1997; Rosenholtz and Malik, 1997; Leung and Malik, 1997)....

    [...]

Journal ArticleDOI
TL;DR: The authors present a comprehensive reference on tiling theory, bringing together older results that had not been collected before and containing a wealth of new material.
Abstract: "Remarkable...It will surely remain the unique reference in this area for many years to come." Roger Penrose , Nature "...an outstanding achievement in mathematical education." Bulletin of The London Mathematical Society "I am enormously impressed...Will be the definitive reference on tiling theory for many decades. Not only does the book bring together older results that have not been brought together before, but it contains a wealth of new material...I know of no comparable book." Martin Gardner

1,894 citations


"On Symmetry and Multiple-View Geome..." refers background in this paper

  • ...Interested readers can find a full description of all the 17 patterns in (Grünbaum and Shephard, 1987)....

    [...]

  • ...All facts and statements will be given without proofs, and interested readers may refer to (Weyl, 1952; Grünbaum and Shephard, 1987; Martin, 1975)....

    [...]

Frequently Asked Questions (14)
Q1. What have the authors contributed in "On symmetry and multiple-view geometry: structure, pose, and calibration from a single image"?

In this paper, the authors provide a principled explanation of how knowledge in global 3-D structural invariants, typically captured by a group action on a symmetric structure, can dramatically facilitate the task of reconstructing a 3-D scene from one or more images. More importantly, since every symmetric structure admits a “canonical” coordinate frame with respect to which the group action can be naturally represented, the canonical pose between the viewer and this canonical frame can be recovered too, which explains why symmetric objects (e.g., buildings) provide us with overwhelming clues to their orientation and position. The authors show how algorithms from conventional multiple-view geometry, after being properly modified and extended, can be directly applied to perform such recovery from all “hidden images” of one image of the symmetric structure. The authors give the necessary and sufficient conditions, in terms of the symmetry (group) admitted by a structure, under which this pose can be uniquely determined. The authors also characterize, when such conditions are not satisfied, to what extent this pose can be recovered.

Obviously this is a more principled way to study assumptions about 3-D structure that people have exploited before in multiple-view geometry, such as orthogonality and parallelism (hence vanishing points), etc. Furthermore, such information can be readily utilized to establish correspondence across images taken with a large baseline or change of view angle: as long as one common (local) symmetry can be recognized and aligned properly, the rest of the structures in the scene can then be correctly registered and reconstructed. The authors believe that, together with conventional geometric constraints among multiple images, symmetry is indeed an important cue which eventually makes 3-D reconstruction a more well-conditioned problem.

Points, lines, and planes are special symmetric objects which have been extensively studied as primitive geometric features for reconstructing a 3-D scene from 2-D images.

Probably the most important observation from this paper is that, in addition to the 3-D structure, the “canonical” pose between the canonical world coordinate frame of a symmetric object and the camera can also be recovered. 

As the authors have suggested before, although symmetry is a phenomenon associated with a single image, a full understanding of its effect on 3-D reconstruction depends on the theory of multiple-view geometry. 

Given an image of a structure S with a reflective symmetry with respect to a plane in 3-D, the canonical pose g0 can be determined up to an arbitrary choice of an orthonormal frame in this plane, which is a 3-parameter family of ambiguity (i.e., SE(2)).
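The SE(2) ambiguity can be checked numerically: conjugating a reflection by any in-plane motion (a rotation about the plane's normal plus a translation within the plane) gives back the same reflection, so no single-reflection observation can distinguish between canonical frames related by such a motion. The following numpy sketch is illustrative only (not the paper's algorithm) and fixes the symmetry plane to be the yz-plane:

```python
import numpy as np

rng = np.random.default_rng(0)

# Reflection across the yz-plane (unit normal n = e_x): F = I - 2 n n^T.
n = np.array([1.0, 0.0, 0.0])
F = np.eye(3) - 2.0 * np.outer(n, n)

for _ in range(5):
    # An in-plane motion g = (R, t): rotation about the plane's normal
    # plus a translation within the plane -- a 3-parameter SE(2) family.
    theta = rng.uniform(0.0, 2.0 * np.pi)
    c, s = np.cos(theta), np.sin(theta)
    R = np.array([[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]])
    t = np.array([0.0, *rng.standard_normal(2)])  # first component 0: t lies in the plane

    x = rng.standard_normal(3)
    # g o F o g^{-1} applied to x equals F applied to x, so the choice of
    # orthonormal frame within the symmetry plane is unobservable.
    conj = R @ (F @ (R.T @ (x - t))) + t
    assert np.allclose(conj, F @ x)
```

The key facts the sketch relies on are that R commutes with F (both are block-diagonal with respect to the plane/normal split) and that F fixes every translation lying in the plane.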

Then the real kernel of L is a 3-dimensional space with the basis {[0, 0, v1], [Im(v2), Re(v2), 0], [−Re(v2), Im(v2), 0]} ⊂ R^{3×3}.

The ground truth for the length ratios of the white board and the table are 1.51 and 1.00, and the recovered length ratios are 1.506 and 1.003, respectively.
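As a quick arithmetic check on the figures quoted above, both recovered ratios lie well within 1% of ground truth:

```python
# Reported figures: ground-truth vs. recovered length ratios.
ratios = {"white board": (1.51, 1.506), "table": (1.00, 1.003)}
for name, (truth, recovered) in ratios.items():
    rel_err = abs(recovered - truth) / truth
    assert rel_err < 0.01  # both reconstructions are within 1%
```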

Using the techniques introduced earlier in this paper, the authors can test whether certain image segments, obtained by other low-level segmentation algorithms such as mean shift (Comaniciu and Meer, 2002), can be the perspective projection of symmetric objects in 3-D.

Using their methods, the camera poses can be easily obtained as a “by-product” when the authors align the symmetric objects in different images. 

As the authors will see shortly, the canonical pose g0 from the viewer to the object can be uniquely determined from a single image as long as the symmetry admitted by the object (or the scene) is “rich” enough.

As the authors have seen in the sections above, there is always some ambiguity in determining the relative pose (g0) from the vantage point to the canonical world coordinate frame (where the symmetry group G was represented in the first place) if only one type of symmetry is considered.
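One way to see why richer symmetry reduces this ambiguity: two reflective symmetries about non-parallel planes compose to a proper rotation about the planes' intersection line, pinning down rotational freedom that either reflection alone leaves undetermined. An illustrative numpy check (an assumption for the sketch: both planes pass through the origin, so each reflection is a Householder matrix):

```python
import numpy as np

def reflection(normal):
    # Householder form of a reflection across the plane through the
    # origin with the given unit normal: F = I - 2 n n^T.
    n = np.asarray(normal, dtype=float)
    n = n / np.linalg.norm(n)
    return np.eye(3) - 2.0 * np.outer(n, n)

# Two reflective symmetries whose planes meet at dihedral angle theta.
theta = np.deg2rad(30.0)
F1 = reflection([1.0, 0.0, 0.0])
F2 = reflection([np.cos(theta), np.sin(theta), 0.0])

R = F2 @ F1  # composition of the two reflections
# R is a proper rotation (det = +1) by 2*theta about the planes'
# intersection line; its trace matches 1 + 2*cos(2*theta).
assert np.isclose(np.linalg.det(R), 1.0)
assert np.isclose(np.trace(R), 1.0 + 2.0 * np.cos(2.0 * theta))
```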

Figure 13 shows a comprehensive example of symmetry-based photo editing, which includes removing occlusions, copying and replacing objects in the scene, and adding new objects.

The checkerboard is a planar structure that is symmetric with respect to its central line (in fact, there are many more local reflective symmetries on parts of the board).
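The reflective symmetry of the checkerboard's coloring can be illustrated with a small numpy sketch (a caveat not from the paper: an odd-width board preserves its colors under the flip about the central line, while an even-width board swaps the two colors):

```python
import numpy as np

# Coloring of an n-by-n checkerboard: square (i, j) has color (i + j) mod 2.
def board(n):
    return np.add.outer(np.arange(n), np.arange(n)) % 2

# With an odd width the central column is its own mirror image, so a
# left-right flip about the central line reproduces the same coloring.
assert np.array_equal(board(9), np.fliplr(board(9)))

# With an even width (e.g. the standard 8x8 board) the shape is still
# symmetric, but the flip exchanges the two colors.
assert np.array_equal(1 - board(8), np.fliplr(board(8)))
```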