scispace - formally typeset
Search or ask a question
Proceedings ArticleDOI

Complete visual metrology using relative affine structure

TL;DR: A framework for retrieving metric information for repeated objects from single perspective image based on relative affine structure along X, Y and Z axes and the possible extension of this framework for motion analysis - structure from motion and motion segmentation is proposed.
Abstract: We propose a framework for retrieving metric information for repeated objects from single perspective image. Relative affine structure, which is an invariant, is directly proportional to the Euclidean distance of a three dimensional point from a reference plane. The proposed method is based on this fundamental concept. The first object undergoes 4 × 4 transformation and results in a repeated object. We represent this transformation in terms of three relative affine structures along X, Y and Z axes. Additionally, we propose the possible extension of this framework for motion analysis - structure from motion and motion segmentation.

Summary (2 min read)

Introduction

  • In computer vision, invariants are widely used for recognition and classification of objects and three dimensional reconstruction of a scene from one or more uncalibrated images [1][2][3][4].
  • Broadly interpreted, all these vision tasks use invariants for retrieving geometric properties of objects from images.
  • The camera model used in this work is the central projection.
  • The relative affine structure is one of the widely used tools in the context of repeated objects [4].
  • The method to compute measurements of repeated objects, individually, without using relative affine structures for corresponding points is developed in section III.

B. Apparatus for Proposed Framework

  • The chosen world coordinate system for repeated objects is shown in figure (2).
  • The three orthogonal planes πY Z , πZX and πXY are reference planes.
  • Second constant (say µx0) is fixed for every point with respect to the reference plane (say πY Z).
  • The proposed framework use this fundamental concept behind relative affine structure to determine 3D measurements of translaionally and affinely repeated objects.
  • Any one cuboid is considered as principal object and rest as auxiliary objects.

III. MEASUREMENTS OF INDIVIDUAL OBJECT

  • As described in section II-B, for a point Mi, a relative affine structure kx has a fixed constant µx0 and a variable constant 1 λi .
  • The fixed constant will be eliminated by taking ratio of two relative affine structures for two different points with respect to same reference plane, πY Z .
  • Thus, for any arbitrary point’s X coordinate can be computed using relative affine structure and projective depth.
  • Similarly, the authors can compute the Y and Z coordinates of every point, given reference metric measurements Yref and Zref along Y and Z direction, respectively.

IV. PRINCIPAL OBJECT AS REFERENCE

  • Alternately, the authors can arbitrarily choose a pair of points on two repeated objects.
  • The respective relative affine structures kxi and k′xi for mi and m ′ i can be computed by Eq. (5) and Eq. (7).
  • The ratio of kxi and k′xi can be expressed as follows kxi k′xi =.

V. TRANSFORMATION OF REPEATED OBJECT

  • Under specific configurations, relative affine structure, which is projective structure, turns into affine structure.
  • If the reference plane is at infinity or in case of parallel projection, relative affine structure approaches to affine structure [2].
  • Thus, that ratio is a projective structure.
  • Suppose S and S′ are repeated objects and are related by S′ = T (S), where T is a 4×4 general transformation matrix.

C. Affine Repetition

  • The most general case is affine repetition that encapsulates rotation, translation, scaling and shearing [10].
  • Once constants ψx0, ψy0 and ψz0 along X , Y and Z directions are computed from image (coordinates of M0), affine repetition can be computed uniquely.

VI. RESULTS

  • In their experiments, the authors consider a real image, as shown in figure (3).
  • Table I and II display the measurements of objects computed using methods discussed in sections III and IV, respectively.
  • Based on the precision required for an application, the errors can be further reduced by employing efficient techniques for computing point correspondences and vanishing points from image.
  • Additionally, proper uncertainly analysis will also improve the results [6].

VII. CONCLUSION

  • The authors extended prior work on relative affine structure for computing three dimensional measurement from a single perspective image of repeated objects.
  • The transformation between repeated objects can be represented in terms of relative affine structures along three orthogonal directions.
  • Therefore, one invariant is used to analyze projective, affine and Euclidean space for vision tasks.
  • Furthermore, three dimensional motion of an object or a camera can be parameterized in terms of relative affine structure.
  • So, motion analysis related tasks such as motion segmentation and tracking can use relative affine structure, an invariant.

Did you find this useful? Give us your feedback

Content maybe subject to copyright    Report

Complete Visual Metrology using Relative Affine
Structure
Adersh Miglani
Adersh.Miglani@gmail.com
Sumantra Dutta Roy
sumantra@ee.iitd.ac.in
Santanu Chaudhury
santanuc@ee.iitd.ac.in
J B Srivastava
jbsrivas@gmail.com
Abstract—We propose a framework for retrieving metric
information for repeated objects from single perspective image.
Relative affine structure, which is an invariant, is directly
proportional to the Euclidean distance of a three dimensional
point from a reference plane. The proposed method is based
on this fundamental concept. The first object undergoes 4 × 4
transformation and results in a repeated object. We represent
this transformation in terms of three relative affine structures
along X , Y and Z axes. Additionally, we propose the possible
extension of this framework for motion analysis - structure from
motion and motion segmentation.
I. INTRODUCTION
In computer vision, invariants are widely used for recog-
nition and classification of objects and three dimensional
reconstruction of a scene from one or more uncalibrated
images [1][2][3][4]. Broadly interpreted, all these vision tasks
use invariants for retrieving geometric properties of objects
from images. Here, we use a view-point invariant for retrieving
metric measurements of multiple objects with translational
and affine repetition from single perspective image. No prior
knowledge of camera’s internal and external parameters is
required in this setup. The camera model used in this work is
the central projection.
Repetition of two and three dimensional objects are fre-
quently used in multiple vision tasks. The relative affine struc-
ture is one of the widely used tools in the context of repeated
objects [4]. This is a projective invariant for repeated objects
and turns into an affine invariant under special cases such as
parallel projection. Prior work used relative affine structure for
reconstruction and recognition of three dimensional objects,
retrieving structure from motion and synthesizing new views
from multiple prespective images [2][3][5]. Here, we explored
its fundamental property that it is directly proportional to the
Euclidean distance of a point from the reference plane. This
led us to create another visual metrology framework beyond
traditional usages of relative affine structure.
In this paper, we blend and develop previous results on rela-
tive affine structure and single view metrology [4][6][7][8][9].
Earlier, relative affine structure was used for retrieving three
dimensional projective structure of one object from multiple
images taken from different view points [4][2]. Later, it
was used for projective reconstruction of repeated objects
with translational and affine repetition from single perspective
image [3]. And, homology, a plane projective transforma-
tion, was used for vanishing points based visual metrology
techniques and camera calibration [6][7][10][11][12]. Here,
we use relative affine structure and homology, together in
a different manner, for computing metric measurements of
repeated objects with minimal scene information and reference
metric measurements.
In single view metrology, we can compute the distance
between two parallel planes when the corresponding points
on the planes are along the direction normal to the planes
[7]. This is a requirement to establish homology between two
parallel planes [10]. But, our framework considers the general
configuration of corresponding points on the repeated objects
and is not restricted to such point correspondences.
Given reference measurements of first object, measurements
of repeated objects, irrespective of translation and affine
transformation, are computed. In case of affine repetition,
computed measurements are upto a uniform scale along the
reference direction that is normal to reference plane. Further-
more, proposed framework can be extended for vision tasks
involving multiple views such as motion analysis - structure
from motion, motion segmentation and tracking.
Section II describes relative affine structure and its proper-
ties suited for the proposed framework. The method to com-
pute measurements of repeated objects, individually, without
using relative affine structures for corresponding points is
developed in section III. Section IV has details on retrieving
metric information for affinely repeated objects up to uniform
scale along X, Y and Z directions, respectively. Section V
describes relationships between different object transformation
matrices and relative affine structures. The results and conclu-
sion with future work are discussed in sections VI and VII,
respectively.
II. OVERVIEW
A. Relative Affine Strcture
In three dimensional space, a plane π and a point M
1
/ π
are chosen. Given two views ψ and ψ
0
with projection centers
O and O
0
, H
π
is a homography that transfers image point m
to m
0
, where m and m
0
are image of point M π [10]. Image
points of M
1
in two views ψ and ψ
0
are m
1
and m
0
1
. These
are related by the following relationship which is derived in
[2]:
m
0
1
=
H
π
m
1
+ ke
0
(1)
where, e
0
is epipole and k is relative affine structure. Above
relation has a scale factor which can be resolved by appropri-

ately scaling H
π
or e
0
such that [2],
m
0
0
=
H
π
m
0
+ e
0
(2)
where m
0
0
and m
0
are images of a fixed point M
0
/ π. This
configuration is shown in figure (1). Geometrically, the relative
affine structure is defined as [2],
k =
X
1
λ
1
λ
0
X
0
(3)
where λ
1
and λ
0
are depth of point M
1
and M
0
and X
1
and
X
0
are perpendicular distance of these points from reference
plane π which is Y Z plane in this case. Thus, it is concluded
that:
The relative affine structure is proportional to the perpen-
dicular distance from a reference plane.
The scale factor for Eq. (1) is computed in [3]. It is ratio
of depths of M
1
with respect to projection centers O and O
0
.
λ
0
m
0
1
= λ(H
π
m
1
+ ke
0
N
) (4)
where e
0
N
is normalized epipole. After simplifying it further,
the expression for k is written as [3],
k =
(m
0
1
× e
0
N
)
T
((H
π
m
1
) × m
0
1
)
||(m
0
1
× e
0
N
)||
2
(5)
Let us consider the dual configuration of what is shown in
figure (1). An object S undergoes a transformation T , affine
or translation, and results in object S
0
. In three dimensional
space, a point M S is related by its corresponding point
M
0
S
0
such as M
0
= T M. The image of M and M
0
in view
ψ with projection center O are m and m
0
. This configuration
can be considered as single image of two repeated objects
with transformation T or two different images of single object
when two cameras undergo same transformation T . This is
called isometry property. The relative affine structures for
corresponding points m and m
0
are k and k
0
. The expression
for k is given by Eq. (5). The expression for k
0
under
translational and affine repetition is given by Eq. (6) and (7),
respectively.
k
0
=
(m × e
0
N
)
T
((H
π
m
0
2m
0
) × m)
||(m × e
0
N
)||
2
(6)
k
0
=
(H
m × e
0
N
)
T
((e
0
N
ν
T
πN
m
0
m
0
) × m)
||(H
m × e
0
N
)||
2
(7)
where ν
T
πN
= e
0
T
N
(H
π
H
) and H
is infinite homography
between two views [3].
B. Apparatus for Proposed Framework
The chosen world coordinate system for repeated objects is
shown in figure (2). The three orthogonal planes π
Y Z
, π
ZX
and π
XY
are reference planes. For every point M
1
/
{π
Y Z
, π
ZX
, π
XY
} will have three relative affine structures,
k
x
, k
y
and k
z
, respectively.
k
x
= X
1
1
λ
λ
0
X
0
, k
y
= Y
1
1
λ
λ
0
X
0
, and k
z
= Z
1
1
λ
λ
0
X
0
(8)
Fig. 1. Geometry: Relative Affine Structure
Fig. 2. Geometry: Three Orthogonal Relative Affine Structures
where, X
0
and λ
0
are X coordinate and depth of a fixed
point M
0
/ {π
XY
, π
Y Z
, π
ZX
}. The ratio
λ
0
X
0
is a constant
and denoted by µ
x0
. Similarly, we can write,
k
x
= X
1
1
λ
µ
x0
, k
y
= Y
1
1
λ
µ
y0
, and k
z
= Z
1
1
λ
µ
z0
(9)
Therefore, each expression for k is proportional to the per-
pendicular distance from the chosen reference plane, e.g.
k
x
X
1
, k
y
Y
1
and k
z
Z
1
. There are two constants of
proportionality. First constant is inverse of depth of the point
1
λ
which will vary for every point. Second constant (say µ
x0
)
is fixed for every point with respect to the reference plane (say
π
Y Z
).
The proposed framework use this fundamental concept
behind relative affine structure to determine 3D measurements
of translaionally and affinely repeated objects. We experiment
our framework on a perspective image of repeated cuboids, as
shown in figure (3). Any one cuboid is considered as principal
object and rest as auxiliary objects.
III. MEASUREMENTS OF INDIVIDUAL OBJECT
As described in section II-B, for a point M
i
, a relative affine
structure k
x
has a fixed constant µ
x0
and a variable constant
1
λ
i
. The fixed constant will be eliminated by taking ratio of two
relative affine structures for two different points with respect
to same reference plane, π
Y Z
.
Considering points M
2
= (X
2
0 0) and M
5
= (X
5
0 Z
5
)
as shown in figure (2), ratio of their relative affine structures
is reduced to the following expression,
k
x2
k
x5
=
X
2
X
5
λ
5
λ
2
λ
0
X
0
X
0
λ
0
=
X
2
X
5
λ
5
λ
2
(10)

The values of k
x2
and k
x5
are computed by Eq. (5). The
expression for X
5
can be written as
X
5
= X
2
k
x5
k
x2
λ
5
λ
2
(11)
The depth of points M
2
and M
5
, λ
2
and λ
5
, are computed
using vanishing points based method given in [6]. Given metric
value of X
2
(X
ref
) and metric value of X
5
is computed.
Thus, for any arbitrary point’s X coordinate can be computed
using relative affine structure and projective depth. Similarly,
we can compute the Y and Z coordinates of every point, given
reference metric measurements Y
ref
and Z
ref
along Y and Z
direction, respectively.
Y = Y
ref
k
y
k
yref
λ
λ
ref
and Z = Z
ref
k
z
k
zref
λ
λ
ref
(12)
IV. PRINCIPAL OBJECT AS REFERENCE
Consider a pair of corresponding points M
i
and M
0
i
on
affinely repeated objects S and S
0
. Alternately, we can arbi-
trarily choose a pair of points on two repeated objects. The
respective relative affine structures k
xi
and k
0
xi
for m
i
and m
0
i
can be computed by Eq. (5) and Eq. (7). The ratio of k
xi
and
k
0
xi
can be expressed as follows
k
xi
k
0
xi
=
X
i
X
0
i
λ
0
i
λ
i
λ
i0
λ
0
i0
X
0
i0
X
i0
(13)
k
xi
k
0
xi
=
X
i
X
0
i
λ
0
i
λ
i
ψ
x0
=
X
i
X
0
i
λ
0
i
λ
i
(14)
Since ψ
x0
=
λ
i0
λ
0
i0
X
0
i0
X
i0
is fixed for all points, the ratio
k
xi
k
0
xi
can
be computed up to uniform scale along X-axis. The ratio
λ
0
i
λ
i
can be computed by solving equation (4).
λ
0
i
λ
i
=
||((H
π
m
i
) × e
0
N
)||
||m
0
i
× m
i
||
(15)
Equation (14) can be written as
X
0
i
=
X
i
k
0
xi
k
xi
λ
0
i
λ
i
= X
i
α
xi
(16)
Similarly, we can write expressions for Y and Z directions as
below,
Y
0
i
=
Y
i
k
0
yi
k
yi
λ
0
i
λ
i
= Y
i
α
yi
, Z
0
i
=
Z
i
k
0
zi
k
zi
λ
0
i
λ
i
= Z
i
α
zi
(17)
Given reference measurements on the principal object along
X, Y and Z axes, measurements of affinely repeated object
can be computed up to a respective scale by Eq. (16) and (17).
V. TRANSFORMATION OF REPEATED OBJECT
Under specific configurations, relative affine structure,
which is projective structure, turns into affine structure. If the
reference plane is at infinity or in case of parallel projection,
relative affine structure approaches to affine structure [2]. Ratio
of two relative affine structures of a point with respect to
different reference planes does not depends on the depth. Thus,
that ratio is a projective structure. We have seen that relative
affine structure is proportional to Euclidean distance of a point
from the reference plane. Therefore, relative affine structure
subsumes projective, affine and Euclidean structures [2]. Here,
this statement is analyzed mathematically.
Suppose S and S
0
are repeated objects and are related by
S
0
= T (S), where T is a 4 × 4 general transformation matrix.
A point M
0
S
0
is corresponding to M S. By using Eq.
(16) and (17), the relation between corresponding points can
be written as follows,
X
0
Y
0
Z
0
1
=
α
x
0 0 0
0 α
y
0 0
0 0 α
z
0
0 0 0 1
X
Y
Z
1
= K
X
Y
Z
1
(18)
where K is the transformation matrix in 3D Euclidean space.
A. Translational Repetition
If object S and S
0
are related by pure translation, the
transformation is represented as,
X
0
Y
0
Z
0
1
=
1 0 0 T
x
0 1 0 T
y
0 0 1 T
z
0 0 0 1
X
Y
Z
1
(19)
From Eq. (18) and (19),
X
0
= X + T
x
= Xα
x
T
x
= X(α
x
1) (20)
Similarly,
T
y
= Y (α
y
1), T
z
= Z(α
z
1) (21)
Therefore, translation vector can be represented in terms of
three relative affine structures with respect to uniform scale
along X, Y and Z axes, respectively.
B. Pure Rotational Repetition
If object S undergoes pure rotation and results in S
0
, the
transformation is represented as,
X
0
Y
0
Z
0
=
r
1
r
2
r
3
r
4
r
5
r
6
r
7
r
8
r
9
X
Y
Z
=
α
x
0 0
0 α
y
0
0 0 α
z
X
Y
Z
Therefore, the rotation between two points is equivalent to the
following diagonal matrix
R =
α
x
0 0
0 α
y
0
0 0 α
z
where three columns are scaled uniformly along X, Y and Z
directions, respectively.
C. Affine Repetition
The most general case is affine repetition that encapsulates
rotation, translation, scaling and shearing [10]. This trans-
formation in 3D Euclidean space is equivalent to K. Once
constants ψ
x0
, ψ
y0
and ψ
z0
along X, Y and Z directions are
computed from image (coordinates of M
0
), affine repetition
can be computed uniquely.

Fig. 3. Repeated Objects
VI. RESULTS
In our experiments, we consider a real image, as shown in
figure (3). It has objects with affine repetition. Table I and
II display the measurements (centimeter) of objects computed
using methods discussed in sections III and IV, respectively.
Based on the precision required for an application, the errors
can be further reduced by employing efficient techniques
for computing point correspondences and vanishing points
from image. Additionally, proper uncertainly analysis will also
improve the results [6].
TABLE I
MEASUREMENTS - INDIVIDUAL OBJECT
ID Source X5 Z5 Y6 Z6 X7 Y7 Z7
Scene 3.1 5.1 3.1 5.1 3.1 3.1 5.1
1
Image 3.3 5.0 3.3 4.8 3.4 3.5 4.6
Error -0.2 0.1 -0.2 0.3 -0.3 -0.4 0.5
Scene 3.1 5.1 3.1 5.1 3.1 3.1 5.1
2
Image 2.3 5.3 2.1 4.2 2.1 2.1 4.3
Error 0.8 -0.2 1.0 0.9 1.0 1.0 0.8
Scene 2.6 6.0 2.6 6.0 2.6 2.6 6.0
3
Image 2.4 6.1 1.5 5.7 2.5 1.3 5.8
Error 0.2 -0.1 1.1 0.3 0.2 1.3 0.2
TABLE II
MEASUREMENTS - PRINCIPAL (I
st
) OBJECT AS REFERENCE
ID Source X5 Z5 Y6 Z6 X7 Y7 Z7
Scene 3.1 5.1 3.1 5.1 3.1 3.1 5.1
2
Image 2.3 5.3 3.5 4.2 2.1 3.5 4.3
Error 0.8 -0.2 -0.4 0.9 1.0 -0.4 0.8
Scene 2.6 6.0 2.6 6.0 2.6 2.6 6.0
3
Image 2.4 5.9 3.2 5.2 2.6 2.9 5.1
Error 0.2 -0.1 -0.8 0.8 0.0 -0.3 0.9
VII. CONCLUSION
We extended prior work on relative affine structure for
computing three dimensional measurement from a single
perspective image of repeated objects. The transformation
between repeated objects can be represented in terms of
relative affine structures along three orthogonal directions.
Therefore, one invariant is used to analyze projective, affine
and Euclidean space for vision tasks. Camera transformation
for repeated object can also be expressed in terms of relative
affine structures.
Furthermore, three dimensional motion of an object or
a camera can be parameterized in terms of relative affine
structure. So, motion analysis related tasks such as motion
segmentation and tracking can use relative affine structure, an
invariant.
ACKNOWLEDGMENT
This work has been supported under the research grant,
towards the setting up of the Program on Autonomous
Robotics’, from the Board of Research in Nuclear Sciences
(BRNS), India.
REFERENCES
[1] Y. Cao and J. McDonald, “Viewpoint invariant features from single
images using 3d geometry, in Applications of Computer Vision (WACV),
2009 Workshop on, 2009, pp. 1–6.
[2] A. Shashua and N. Navab, “Relative affine structure: canonical model for
3d from 2d geometry and applications, Pattern Analysis and Machine
Intelligence, IEEE Transactions on, vol. 18, no. 9, pp. 873–883, 1996.
[3] R. Choudhury, J. B. Srivastava, and S. Chaudhury, “Reconstruction-
based recognition of scenes with translationally repeated quadrics,
Pattern Analysis and Machine Intelligence, IEEE Transactions on,
vol. 23, no. 6, pp. 617–632, 2001.
[4] A. Shashua and N. Navab, “Relative affine structure: theory and ap-
plication to 3d reconstruction from perspective views, in Computer
Vision and Pattern Recognition, 1994. Proceedings CVPR ’94., 1994
IEEE Computer Society Conference on, 1994, pp. 483–489.
[5] S. Tebaldini, M. Marcon, A. Sarti, and S. Tubaro, “Uncalibrated view
synthesis from relative affine structure based on planes parallelism, in
Proceedings of the International Conference on Image Processing, ICIP
2008, October 12-15, 2008, San Diego, California, USA. IEEE, 2008,
pp. 317–320.
[6] A. Criminisi, Accurate visual metrology from single and multiple
uncalibrated images, Ph.D. dissertation, University of Oxford, Dept.
Engineering Science, 1999, d.Phil. thesis.
[7] A. Criminisi, I. Reid, and A. Zisserman, “Single view metrology, Int.
J. Comput. Vision, vol. 40, no. 2, pp. 123–148, Nov. 2000.
[8] K. Peng, L. Hou, R. Ren, X. Ying, and H. Zha, “Single view
metrology along orthogonal directions, in Proceedings of the 2010
20th International Conference on Pattern Recognition, ser. ICPR ’10.
Washington, DC, USA: IEEE Computer Society, 2010, pp. 1658–1661.
[Online]. Available: http://dx.doi.org/10.1109/ICPR.2010.410
[9] G. Wang, Z. Hu, F. Wu, and H.-T. Tsui, “Single view
metrology from scene constraints, Image Vision Comput.,
vol. 23, no. 9, pp. 831–840, Sep. 2005. [Online]. Available:
http://dx.doi.org/10.1016/j.imavis.2005.04.002
[10] R. I. Hartley and A. Zisserman, Multiple View Geometry in Computer
Vision, 2nd ed. Cambridge University Press, ISBN: 0521540518, 2004.
[11] R. Cipolla, T. Drummond, and D. Robertson, “Camera Calibration from
Vanishing Points in Images of Architectural Scenes, in Proc. British
Machine Vision Conference. Nottingham, UK: British Machine Vision
Association, 1999, pp. 382–391.
[12] E. Guillou, D. Meneveaux, E. Maisel, and K. Bouatouch, “Using
vanishing points for camera calibration and coarse 3d reconstruction
from a single image, The Visual Computer, vol. 16, no. 7, pp. 396–410,
2000. [Online]. Available: http://dx.doi.org/10.1007/PL00013394
Citations
More filters
Proceedings ArticleDOI
01 Dec 2015
TL;DR: A framework for retrieving metric information of featureless cylindrical pellet imaged through stereo vision by considering the isometry property, relative affine structure which is an invariant that depends on the depth of the point is employed.
Abstract: In the present work, we propose a framework for retrieving metric information of featureless cylindrical pellet imaged through stereo vision. Considering the isometry property, relative affine structure which is an invariant that depends on the depth of the point is employed for obtaining the information. This method skips considering vanishing points for depth estimation, and hence the errors are reduced remarkably as no points are assumed to be at infinity. Real world measurements are taken and the metrics of a selected reference point is assumed to be known. With these minimal information the mapping between the images using the relative affine structure is carried out that enables the 3D reconstruction. This work also blends the concepts of 360 degree rotational symmetry and orthogonal planes with the stereo vision that has minimized the error percentage. The results presented by the theory will have an impact on the design of 3D reconstruction systems for computer vision and its applications.

1 citations


Cites methods or result from "Complete visual metrology using rel..."

  • ...Table I displays the measurements (in centimeters) computed using methods discussed in section II and III and compares the error percentage results with previous works [4][7]....

    [...]

  • ...In single view metrology, the uncertainty is added to the calculations and to metric values as the method considers vanishing point and vanishing line, hence the errors are inevitable because these parameters are assumed to be at infinity [4][5][7]....

    [...]

References
More filters
01 Jan 2001
TL;DR: This book is referred to read because it is an inspiring book to give you more chance to get experiences and also thoughts and it will show the best book collections and completed collections.
Abstract: Downloading the book in this website lists can give you more advantages. It will show you the best book collections and completed collections. So many books can be found in this website. So, this is not only this multiple view geometry in computer vision. However, this book is referred to read because it is an inspiring book to give you more chance to get experiences and also thoughts. This is simple, read the soft file of the book and you get it.

14,282 citations


"Complete visual metrology using rel..." refers background or methods in this paper

  • ...And, homology, a plane projective transformation, was used for vanishing points based visual metrology techniques and camera calibration [6][7][10][11][12]....

    [...]

  • ...This is a requirement to establish homology between two parallel planes [10]....

    [...]

Journal ArticleDOI
TL;DR: An algebraic representation is developed which unifies the three types of measurement and permits a first order error propagation analysis to be performed, associating an uncertainty with each measurement.
Abstract: We describe how 3D affine measurements may be computed from a single perspective view of a scene given only minimal geometric information determined from the image This minimal information is typically the vanishing line of a reference plane, and a vanishing point for a direction not parallel to the plane It is shown that affine scene structure may then be determined from the image, without knowledge of the camera's internal calibration (eg focal length), nor of the explicit relation between camera and world (pose) In particular, we show how to (i) compute the distance between planes parallel to the reference plane (up to a common scale factor)s (ii) compute area and length ratios on any plane parallel to the reference planes (iii) determine the camera's location Simple geometric derivations are given for these results We also develop an algebraic representation which unifies the three types of measurement and, amongst other advantages, permits a first order error propagation analysis to be performed, associating an uncertainty with each measurement We demonstrate the technique for a variety of applications, including height measurements in forensic images and 3D graphical modelling from single images

760 citations


"Complete visual metrology using rel..." refers background or methods in this paper

  • ...And, homology, a plane projective transformation, was used for vanishing points based visual metrology techniques and camera calibration [6][7][10][11][12]....

    [...]

  • ...In this paper, we blend and develop previous results on relative affine structure and single view metrology [4][6][7][8][9]....

    [...]

  • ...In single view metrology, we can compute the distance between two parallel planes when the corresponding points on the planes are along the direction normal to the planes [7]....

    [...]

Proceedings ArticleDOI
16 Sep 1999
TL;DR: A simple, geometrically intuitive method which exploits the strong rigidity constraints of paral-lelism and orthogonality present in indoor and outdoor architectural scenes to recover the projection matrices for each viewpoint is proposed.
Abstract: We address the problem of recovering 3D models from uncalibrated images of architectural scenes. We propose a simple, geometrically intuitive method which exploits the strong rigidity constraints of paral-lelism and orthogonality present in indoor and outdoor architectural scenes. We present a n o vel algorithm that uses these simple constraints to recover the projection matrices for each viewpoint and relate our method to the algorithm of Caprile and Torre 2]. The projection matrices are used to recover partial 3D models of the scene and these can be used to visualise new viewpoints. Our approach d o e s not need any a priori information about the cameras being used. A w orking system called PhotoBuilder has been designed and implemented to allow a user to interactively build a VRML model of a building from uncalibrated images from arbitrary viewpoints 3, 4 ].

249 citations


"Complete visual metrology using rel..." refers methods in this paper

  • ...And, homology, a plane projective transformation, was used for vanishing points based visual metrology techniques and camera calibration [6][7][10][11][12]....

    [...]

Journal ArticleDOI
TL;DR: The aim of this work is to make it possible walkthrough and augment reality in a 3D model reconstructed from a single image to calibrate a camera and to recover the geometry and the photometry (textures) of objects from asingle image.
Abstract: In this paper, we show how to calibrate a camera and to recover the geometry and the photometry (textures) of objects from a single image. The aim of this work is to make it possible walkthrough and augment reality in a 3D model reconstructed from a single image. The calibration step does not need any calibration target and makes only four assumptions: (1) the single image contains at least two vanishing points, (2) the length (in 3D space) of one line segment (for determining the translation vector) in the image is known, (3) the principle point is the center of the image, and (4) the aspect ratio is fixed by the user. Each vanishing point is determined from a set of parallel lines. These vanishing points help determine a 3D world coordinate system R o. After having computed the focal length, the rotation matrix and the translation vector are evaluated in turn for describing the rigid motion between R o and the camera coordinate system R c. Next, the reconstruction step consists in placing, rotating, scaling, and translating a rectangular 3D box that must fit at best with the potential objects within the scene as seen through the single image. With each face of a rectangular box, a texture that may contain holes due to invisible parts of certain objects is assigned. We show how the textures are extracted and how these holes are located and filled. Our method has been applied to various real images (pictures scanned from books, photographs) and synthetic images.

170 citations


"Complete visual metrology using rel..." refers methods in this paper

  • ...And, homology, a plane projective transformation, was used for vanishing points based visual metrology techniques and camera calibration [6][7][10][11][12]....

    [...]

Book
01 Sep 2001
TL;DR: In this article, a hierarchy of novel, accurate and flexible techniques are developed to address a number of different situations ranging from the absence of scene metric information to cases where some world distances are known but there is not sufficient information for a complete camera calibration.
Abstract: This monograph presents some research carried out by the author into three- dimensional visual reconstruction during his studies for the achievement of a Doctor of Philosophy Degree at the University of Oxford. This book constitutes the author’s D.Phil. dissertation which, having been awarded the British Computer Society Distinguished Dissertation Award for the year 2000, has kindly been published by Springer-Verlag London Ltd. The work described in this book develops the theory of computing world measurements (e.g. distances, areas etc.) from photographs of scenes and reconstructing three-dimensional models of the scene. The main tool used is projective geometry which forms the basis for accurate estimation algorithms. Novel methods are described for computing virtual reality-like environments from any kind of perspective image. The techniques presented employ uncalibrated images; no knowledge of the internal parameters of the camera (such as focal length and aspect ratio) nor its pose (position and orientation with respect to the viewed scene) are required at any time. Extensive use is made of geometric characteristics of the scene. Thus there is no need for specialized calibration devices. A hierarchy of novel, accurate and flexible techniques is developed to address a number of different situations ranging from the absence of scene metric information to cases where some world distances are known but there is not sufficient information for a complete camera calibration. The geometry of single views is explored and monocular vision shown to be sufficient to obtain a partial or complete three-dimensional reconstruction of a scene. To achieve this, the properties of planar homographies and planar homologies are extensively exploited. The geometry of multiple views is also investigated, particularly the use of a parallax-based approach for structure and camera recovery. The duality between two-view and three-view configurations is described in detail. In order to pro

167 citations

Frequently Asked Questions (10)
Q1. What are the contributions in "Complete visual metrology using relative affine structure" ?

The authors propose a framework for retrieving metric information for repeated objects from single perspective image. The authors represent this transformation in terms of three relative affine structures along X, Y and Z axes. Additionally, the authors propose the possible extension of this framework for motion analysis structure from motion and motion segmentation. 

Once constants ψx0, ψy0 and ψz0 along X , Y and Z directions are computed from image (coordinates of M0), affine repetition can be computed uniquely. 

Xi X ′i λ′i λi ψx0 ∼= Xi X ′i λ′i λi(14)Since ψx0 = λi0λ′ i0 X′i0 Xi0 is fixed for all points, the ratio kxik′ xi can be computed up to uniform scale along X-axis. 

If the reference plane is at infinity or in case of parallel projection, relative affine structure approaches to affine structure [2]. 

the relative affine structure is defined as [2],k = X1 λ1 λ0 X0(3)where λ1 and λ0 are depth of point M1 and M0 and X1 and X0 are perpendicular distance of these points from reference plane π which is Y Z plane in this case. 

For every point M1 /∈ {πY Z , πZX , πXY } will have three relative affine structures, kx, ky and kz , respectively.kx = X1 1λ λ0 X0 , ky = Y1 1 λ λ0 X0 , and kz = Z1 1 λ λ0 X0(8)where, X0 and λ0 are X coordinate and depth of a fixed point M0 /∈ {πXY , πY Z , πZX}. 

The transformationbetween repeated objects can be represented in terms of relative affine structures along three orthogonal directions. 

If object S undergoes pure rotation and results in S′, the transformation is represented as,X ′Y ′ Z ′ = r1 r2 r3r4 r5 r6 r7 r8 r9 XY Z = αx 0 00 αy 0 0 0 αz XY Z Therefore, the rotation between two points is equivalent to the following diagonal matrixR = αx 0 00 αy 0 0 0 αz where three columns are scaled uniformly along X , Y and Z directions, respectively. 

S′ is corresponding to M ∈ S. By using Eq. (16) and (17), the relation between corresponding points can be written as follows, X ′ Y ′Z ′1 = αx 0 0 0 0 αy 0 0 0 0 αz 0 0 0 0 1 X Y Z 1 = K X Y Z 1 (18) where K is the transformation matrix in 3D Euclidean space. 

The ratio λ ′ i λi can be computed by solving equation (4).λ′i λi = ||((Hπmi)× e′N )|| ||m′i ×mi||(15)Equation (14) can be written asX ′i ∼= Xi k′xi kxi λ′i λi = Xiαxi (16)Similarly, the authors can write expressions for Y and Z directions as below,Y ′i ∼= Yi k′yi kyi λ′i λi = Yiαyi, Z ′ i ∼= Zi k′zi kzi λ′i λi = Ziαzi (17)Given reference measurements on the principal object along X , Y and Z axes, measurements of affinely repeated object can be computed up to a respective scale by Eq. (16) and (17).