
Distributed calibration of pan-tilt camera network using multi-layered belief propagation

TL;DR: A technique for distributed self-calibration of pan-tilt camera network using multi-layered belief propagation to obtain globally consistent estimates of the camera parameters for each camera with respect to a global world coordinate system.
Abstract: In this paper, we present a technique for distributed self-calibration of pan-tilt camera network using multi-layered belief propagation. Our goal is to obtain globally consistent estimates of the camera parameters for each camera with respect to a global world coordinate system. The network configuration changes with time as the cameras can pan and tilt. We also give a distributed algorithm for automatically finding which cameras have overlapping views at a certain point in time. We argue that using belief propagation it is sufficient to have correspondences between three cameras at a time for calibrating a larger set of (static) cameras with overlapping views. Our method gives an accurate and globally consistent estimate of the camera parameters of each camera in the network.

Summary (3 min read)

1. Introduction

  • The authors present a distributed algorithm for self-calibration of a pan-tilt camera network using multi-layered belief propagation.
  • As the cameras can pan and tilt, the camera network contains various mutually exclusive sub-networks, where all cameras in a sub-network view a common region.
  • The authors then propagate belief between sub-networks to obtain globally consistent and accurate estimates of the camera parameters for each camera in the network.
  • Automated surveillance requires that the camera network be calibrated with respect to a global WCS so that tasks such as 3D-tracking and recognition of objects, activities and events can be effectively performed.
  • The authors' distributed calibration also makes the system scalable: large camera networks spanning a wide geographical area would contain mutually exclusive sub-networks, so no communication or computation among the cameras of these sub-networks would be necessary for calibration.

3. Distributed calibration of pan-tilt camera network: an overview

  • The authors also assume that each camera has a processing unit attached with it and that there exists an underlying communication network such that each camera can communicate with every other camera.
  • In a pan-tilt camera network, there may exist many such mutually exclusive graphs at any point in time.
  • To perform multi-layered belief propagation between two graphs containing the same camera in different pan-tilt positions, the authors need to bring the cameras to their home (zero pan and zero tilt) position in both the graphs.
  • The authors also propose a protocol in Section 9, for aligning all the cameras’ home positions to a global WCS, to get a globally consistent estimate of the camera’s home position (zero pan, zero tilt position).
  • In the next section, the authors give a method for automatically finding correspondences between three images.

4. Automatically finding corresponding points

  • The authors propose a method for automatically finding corresponding points in three images.
  • But as the number of images increases, the error in correspondences also increases.
  • First, compute the SIFT features in all three images and then compute the SIFT matches between the pairs I1−I2, I1−I3 and I2−I3.
  • Figure 1 shows the common points found between three images taken by three different cameras.

5. Finding the graphs

  • Starting with the camera with the smallest number that does not belong to any graph currently, say Ci, find the camera with the next smallest number, say Cj , that has an overlap with Ci and which does not belong to any graph.
  • In general, there will be more than one graph in the pan-tilt camera network.
  • Moreover, each graph will be a complete graph.
  • In a wide area pan-tilt camera network it is possible that two sets of cameras are geographically so far apart that there will be no overlapping view between these two sets of cameras.

6. Camera calibration within a graph

  • The authors assume that the cameras in a graph, say Gk, remain static for a certain time period.
  • Thus, standard multi-camera self-calibration techniques can be used for calibrating the cameras within a graph.
  • The crucial point here is to automatically find multiview correspondences at each node.
  • The corresponding points between the nodes of Gik are found automatically as discussed in Section 4.
  • Belief propagation (discussed in Section 7) between the nodes of Gik gives a consistent estimate of the camera parameters for each camera in Gik.

7. Belief Propagation within a graph

  • For distributed calibration of cameras in a graph, say Gk, multi-camera self-calibration is carried out at each node, using the automatically found corresponding points.
  • As has been shown in [3], belief propagation can be directly applied on a graph which has cameras viewing a common scene as its nodes.
  • μ̃i,k and Σ̃i,k are the estimates of the camera parameters after belief propagation within graph Gk.
  • The covariance matrix is calculated based on the forward covariance propagation from bundle adjustment.

7.1. Multi-layered Belief Propagation

  • Since the graphs are dynamic and the same camera Ci can be a part of two graphs, say Gk−1 and Gk, in different pan-tilt orientations at different points in time, the authors perform belief propagation between graphs at each node, Ci, which is common in both Gk−1 and Gk.
  • Similarly, the authors can get to the pan-tilt position as: Pθφ = H−1 ∗ Phome.
  • In case the pan-tilt view of the camera does not have any overlap with the home position's view, a sequence of homographies can be used, again calculated automatically, as shown in Figure 2.
  • The home position is calculated in each graph using the image-to-image homography before applying the update equations for multi-layered belief propagation.

8. Forming new graphs

  • The multi-layered belief propagation mechanism can be utilized only if the graphs change across time.
  • Each camera will have information of all other cameras about the landmark they are viewing.
  • This also makes their system scalable, as the correspondences have to be calculated among only those cameras which view the same landmark and, in step 3, the messages have to be passed only between those cameras which can have overlapping views in some pan-tilt configuration.
  • In the current time period these cameras are not considered for calibration and therefore, remain idle.
  • In the next time period, they shall repeat the above protocol and become part of graphs with ≥ 3 nodes and hence, will be used for calibration and multi-layered belief propagation.

9. Aligning cameras to a global world coordinate system

  • The authors want the position and orientation of each camera’s home position with respect to a global WCS.
  • Moreover, belief propagation can be carried out only if all the cameras are aligned with respect to a common coordinate system in the world.
  • These two conditions establish a common coordinate system at the lowest numbered camera, say Ci, in each graph formed in the camera network.
  • All other cameras in Gj are aligned to this common coordinate system.
  • In case the global WCS is not pre-specified, the lowest numbered camera in the network may be assumed to be at the origin of the global WCS.

10. Results and Discussion

  • The authors use 6 SONY EVI-D70 PTZ cameras for their experiments.
  • If the authors randomly select one camera (all its parameters) from each node, for example, P1 from node C2, P2 from C3 and P3 from C1, then as seen in Figure 3(b) and (c) the reprojection error is high and varies based on which camera is selected from which node.
  • The reprojection error statistics are given in Table 1.
  • The authors consider five 3-cliques of the graph for calibrating this graph.
  • Multi-layered belief propagation at the nodes of the graph results in consistent and accurate camera parameters as seen in Figure 4.

11. Conclusion

  • The authors have presented a multi-layered belief propagation based distributed algorithm for self-calibration of a pan-tilt camera network.
  • The authors have shown that by using multi-layered belief propagation it is possible to get accurate and globally consistent estimates of the camera parameters for each pan-tilt camera in the network with respect to a global world coordinate system.
  • The authors' system does not require all the cameras to have overlapping views at all times.
  • The authors' method gives an accurate and globally consistent estimate of the camera parameters for the home position of each camera; using the method for automatically finding correspondences in two views, homographies between the home view and any pan/tilt view can be computed automatically.
  • Therefore, it is possible to obtain accurate and globally consistent camera parameters for any pan/tilt position of the pan-tilt cameras in the network with respect to a global world coordinate system.


Distributed Calibration of Pan-Tilt Camera Network using Multi-Layered Belief Propagation

Ayesha Choudhary¹, Gaurav Sharma², Santanu Chaudhury¹, Subhashis Banerjee¹
¹ Indian Institute of Technology, Delhi, India.
² University of Caen, France.
{ayesha, suban}@cse.iitd.ernet.in, santanuc@ee.iitd.ernet.in, gaurav.sharma@info.unicaen.fr
(The work was done when Gaurav Sharma was with the Multimedia Lab, Indian Institute of Technology, Delhi.)
Abstract

In this paper, we present a technique for distributed self-calibration of pan-tilt camera network using multi-layered belief propagation. Our goal is to obtain globally consistent estimates of the camera parameters for each camera with respect to a global world coordinate system. The network configuration changes with time as the cameras can pan and tilt. We also give a distributed algorithm for automatically finding which cameras have overlapping views at a certain point in time. We argue that using belief propagation it is sufficient to have correspondences between three cameras at a time for calibrating a larger set of (static) cameras with overlapping views. Our method gives an accurate and globally consistent estimate of the camera parameters of each camera in the network.
1. Introduction

In this paper, we present a distributed algorithm for self-calibration of a pan-tilt camera network using multi-layered belief propagation. The goal of our distributed calibration algorithm is to obtain a globally consistent and accurate estimate of each camera's parameters (intrinsic as well as extrinsic) with respect to a global world coordinate system (WCS). As the cameras can pan and tilt, the camera network contains various mutually exclusive sub-networks, where all cameras in a sub-network view a common region. For distributed calibration, we perform multi-camera self-calibration at each camera in a sub-network and apply belief propagation to obtain consistent camera parameters in each sub-network. We then propagate belief between sub-networks to obtain the globally consistent and accurate estimates of the camera parameters for each camera in the network.

In general, pan-tilt camera networks are well-suited for wide area surveillance. Automated surveillance requires that the camera network be calibrated with respect to a global WCS so that tasks such as 3D-tracking, recognition of objects, activities and events can be effectively performed. Moreover, this also requires that the camera parameters be consistent and accurate with respect to one another, which cannot be achieved by individually calibrating each camera. Self-calibration of a pan-tilt camera network is necessary as it is, in general, difficult and impractical to use an external calibration object.

Distributed calibration is advantageous for a pan-tilt camera network, as it is more robust against failures. In case of failure of a camera, the information can be retrieved from its neighbors. Moreover, unlike failure of the central server, which may lead to shutting down of the system, failure of a camera does not impact the complete network. Also, in case of distributed calibration, addition of new cameras to the network does not require re-calibration of the complete camera network. Our distributed calibration also makes the system scalable, as large camera networks spanning a wide geographical area would contain mutually exclusive sub-networks, and no communication and computation among the cameras of these sub-networks would be necessary for calibration. Therefore, in effect, cameras which do not view a common scene in any of their pan-tilt positions do not affect each other. Thus, our distributed algorithm calibrates the complete camera network by calibrating smaller sub-networks, making the system scalable.

Distributed calibration of the camera network may lead to inconsistencies in the estimation of the camera parameters, since these parameters are computed at each node of the network. We use belief propagation to leverage the information at each node of the camera network to arrive at a consistent and accurate estimate of the camera parameters of each camera in the network.

The configuration of a pan-tilt camera network is dynamic. The various sub-networks that exist in the system change across time, that is, cameras in different pan-tilt positions become a part of different sub-networks across time. Moreover, within a fixed time interval, a camera can be a part of only one sub-network. We give a technique to automatically find the sub-networks as well as a method to automatically control the cameras so that they become parts of different sub-networks across time, which is essential for propagating belief across various sub-networks. We discuss the related work in the next section.
2. Related Work

Multi-camera calibration is a well-studied problem in computer vision. Pan-tilt camera network calibration has also become an important area of research. Most of the multi-camera calibration methods are based on centralized processing. As camera networks are becoming larger, distributed algorithms are becoming a necessity. Recently, in [10], an online distributed algorithm has been proposed for cluster based calibration of a large wireless static camera network using features detected on known moving target objects. They assume that the intrinsic parameters are known and that each target object has known multiple distinctive features. In [7], 3D features and geographic hash tables are used, while in [5] object motion is used for calibration. Very recently, the authors in [4] have proposed a distributed algorithm for calibration of a camera sensor network, where they assume that one of the cameras is calibrated and use epipolar geometry based algorithms at each node to obtain its calibration parameters. They show that a globally consistent solution can be reached in a distributed manner by solving a set of linear equations.

In [1], a method for self-calibration of purely rotating cameras using the infinite homography constraint is proposed. Davis et al. [2] present a method for calibrating pan-tilt cameras and introduce a complete model of the pan-tilt rotation occurring around arbitrary axes. Both these methods are for calibrating a single camera and not for calibration of a pan-tilt camera network. The authors in [12] estimate both the internal and external parameters of a pan-tilt camera network without requiring any special calibration object. But their method is feature based and estimates the camera parameters by using the complete set of images captured at each pan-tilt-zoom configuration of the camera.

Radke et al. [3] give a distributed calibration method for a static camera network using belief propagation. They assume that the cameras form a graph where cameras are the nodes and an edge exists between the nodes if they have overlapping views. In their case, since the cameras are static, the configuration of the network does not change with time and the cameras form one connected graph. We extend this approach for distributed calibration of a pan-tilt camera network using multi-layered belief propagation. In our case, many mutually exclusive graphs exist at the same time and the same camera may belong to many different graphs across time. We also address the issue of automatically finding the various graphs in the system. In [3], they assume that the camera network forms a connected graph, whereas we give a method for automatically controlling the cameras to create connected graphs. Also, we propose the use of multi-layered belief propagation, first within a graph for a consistent measure of the camera parameters within the graph, and then between multiple graphs to get a consistent estimate of the camera parameters in the pan-tilt camera network.

The methods in [3, 10, 7, 4] are for distributed calibration of static camera networks, while we propose a technique for distributed calibration of a pan-tilt camera network. Moreover, unlike [4, 10], we do not require that the internal or external parameters of any camera be known, and we do not require any external calibration object. Also, unlike [12], our method does not consider every pan-tilt configuration of any camera in the network.
3. Distributed calibration of pan-tilt camera network: an overview

We assume that the camera network has N ≥ 3 cameras and each camera has a unique number n ∈ {1, 2, . . . , N} associated with it. We also assume that each camera has a processing unit attached with it and that there exists an underlying communication network such that each camera can communicate with every other camera. A sub-network in a pan-tilt camera network consists of cameras viewing a common area. The cameras which have overlapping views form a complete graph G = (V, E) where the cameras C_i ∈ V and there is an edge e_ij ∈ E between cameras C_i and C_j for all cameras in the graph. In a pan-tilt camera network, there may exist many such mutually exclusive graphs at any point in time. Moreover, if a camera pans and/or tilts, then it may cease to remain a part of one graph and become a part of another graph. In Section 5, we give a distributed algorithm for finding these graphs automatically.

We assume that the cameras remain in a certain pan-tilt position for a fixed period of time. During this time interval, the cameras in each graph are considered as static cameras. Corresponding points between the views of the cameras in each graph are found automatically and multi-camera self-calibration is performed at each node of the graph. It is well-known that finding automatic correspondences between multiple views is not an easy problem. We show that by using multi-layered belief propagation it is sufficient to have correspondences between only three cameras at a time for consistent calibration of a larger N > 3 static camera network. In Section 6, we give the method to calibrate a large N > 3 (static) camera network using multi-layered belief propagation by iteratively calibrating its 3-cliques. We discuss belief propagation and multi-layered belief propagation in Section 7 and discuss how multi-layered belief propagation is applied at each camera in the network. Since the information is combined from graphs containing the cameras in various pan-tilt configurations, it is unlikely that belief propagation will get stuck in a local minimum and hence, globally consistent estimates are achieved.

Figure 1. Example of common points found in three images. Note: all images are best viewed in color and at a high resolution.

In Section 8, we give a protocol for automatically controlling the cameras so that they become a part of various sub-networks across time, which is necessary for distributed calibration of the pan-tilt camera network. Otherwise, the network will remain divided into mutually exclusive sub-networks and there will be no exchange of information between various pan-tilt views of the same camera across time. To perform multi-layered belief propagation between two graphs containing the same camera in different pan-tilt positions, we need to bring the cameras to their home (zero pan and zero tilt) position in both the graphs. We show that the camera matrix for the home position of the camera can be computed by automatically finding pairwise correspondences to compute the homography or a sequence of homographies between the camera's pan-tilt view and the home view. We also propose a protocol in Section 9 for aligning all the cameras' home positions to a global WCS, to get a globally consistent estimate of the camera's home position (zero pan, zero tilt position). In the next section, we give a method for automatically finding correspondences between three images. The same method can be used for finding correspondences automatically between a pair of images.
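
To make this network model concrete, a minimal data-structure sketch follows (Python); all names here are our own illustrative choices, not from the paper:

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class PTCamera:
    """A node of the pan-tilt camera network (field names illustrative)."""
    number: int                        # unique id n in {1, 2, ..., N}
    pan: float = 0.0                   # current pan angle; (0, 0) is 'home'
    tilt: float = 0.0
    graph_id: Optional[int] = None     # at most one graph per time interval

@dataclass
class CameraGraph:
    """A complete graph G = (V, E) of cameras with mutually overlapping views."""
    graph_id: int
    nodes: List[PTCamera] = field(default_factory=list)

    def edges(self):
        # complete graph: an edge e_ij exists for every pair of member cameras
        return [(a.number, b.number)
                for i, a in enumerate(self.nodes) for b in self.nodes[i + 1:]]
```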
4. Automatically finding corresponding points between three images

We propose a method for automatically finding corresponding points in three images. It can also be used to find correspondences in a pair of images or in more than three images. But as the number of images increases, the error in correspondences also increases. Let I_1, I_2 and I_3 be three images taken by three different cameras of the same scene. We perform the following steps to automatically find correspondences between the three images. First, compute the SIFT features in all three images and then compute the SIFT matches between the pairs I_1−I_2, I_1−I_3 and I_2−I_3. Next, find the common SIFT matches between these three pairs, denoted by X = {x_1, x_2, x_3} for points in I_1, I_2 and I_3 respectively. Further, refine these points by fitting fundamental matrices between pairs of images and taking points which are common in all the three images. This is done by first fitting fundamental matrices to the pairs, F_12 = {x_1, x_2}, F_13 = {x_1, x_3} and F_23 = {x_2, x_3}, and then finding the common points between the inliers of F_12, F_13 and F_23, say y_1, y_2 and y_3. If the number of points is ≥ 50, then we say that there exists overlap between the three images, and y_1, y_2 and y_3 are the correspondences in the three views. Figure 1 shows the common points found between three images taken by three different cameras.
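
As an illustration, this pipeline can be sketched with OpenCV as below. The ratio-test and RANSAC thresholds are our own illustrative choices; only the overall SIFT-match / fundamental-matrix-refinement structure and the 50-point overlap test come from the text:

```python
import cv2
import numpy as np

MIN_COMMON_POINTS = 50  # overlap threshold used in the paper

def ratio_matches(des_a, des_b, ratio=0.75):
    """Ratio-test SIFT matching; returns {index in a: index in b}."""
    out = {}
    for pair in cv2.BFMatcher(cv2.NORM_L2).knnMatch(des_a, des_b, k=2):
        if len(pair) == 2 and pair[0].distance < ratio * pair[1].distance:
            out[pair[0].queryIdx] = pair[0].trainIdx
    return out

def three_view_correspondences(I1, I2, I3):
    sift = cv2.SIFT_create()
    kp, des = zip(*(sift.detectAndCompute(I, None) for I in (I1, I2, I3)))
    m12, m13, m23 = (ratio_matches(des[a], des[b])
                     for a, b in ((0, 1), (0, 2), (1, 2)))
    # common SIFT matches X = {x1, x2, x3}: consistent across all three pairs
    triples = [(i, m12[i], m13[i]) for i in m12
               if i in m13 and m23.get(m12[i]) == m13[i]]
    if len(triples) < 8:
        return False, None  # too few matches to fit fundamental matrices
    x1, x2, x3 = (np.float32([kp[v][t[v]].pt for t in triples])
                  for v in (0, 1, 2))
    # refine by fitting fundamental matrices pairwise; keep joint inliers
    masks = [cv2.findFundamentalMat(a, b, cv2.FM_RANSAC, 3.0)[1]
             for a, b in ((x1, x2), (x1, x3), (x2, x3))]
    keep = np.logical_and.reduce([m.ravel().astype(bool) for m in masks])
    y1, y2, y3 = x1[keep], x2[keep], x3[keep]
    return len(y1) >= MIN_COMMON_POINTS, (y1, y2, y3)
```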
5. Finding the graphs

We develop an algorithm to automatically find the graphs in the network. Starting with the camera with the smallest number that does not currently belong to any graph, say C_i, find the camera with the next smallest number, say C_j, that has an overlap with C_i and which does not belong to any graph. Form a graph G = (V, E) where V = {C_i, C_j} is the set of nodes and e_ij ∈ E is the edge between C_i and C_j. Incrementally, find all those cameras (by automatically finding the corresponding points) which have overlapping views with C_i and C_j and are not currently a part of any graph. Add them as nodes of G and add edges between all the nodes of G. Continue till either there is no camera in the system that does not belong to a graph, or no other camera has overlapping views with the nodes in graph G.

Repeat this with all the cameras in the network that are not a part of any graph. In general, there will be more than one graph in the pan-tilt camera network. Moreover, each graph will be a complete graph. A priori knowledge of the camera network topology can be used to reduce the amount of communication across cameras as well as the number of computations for SIFT matches. For example, in a wide area pan-tilt camera network it is possible that two sets of cameras are geographically so far apart that there will be no overlapping view between these two sets of cameras. Therefore, no communication or computation needs to be carried out between such mutually exclusive and distant camera sub-sets.
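
The following sketch captures this greedy procedure; `overlaps(group, cam)` is a placeholder for the overlap test built on the Section 4 correspondence method:

```python
def find_graphs(camera_numbers, overlaps):
    """Greedily partition cameras into complete graphs of mutually
    overlapping views (Section 5). `overlaps(group, cam)` is assumed to
    report whether `cam` has overlapping views with the cameras already
    in `group`, tested via automatically found correspondences."""
    free = sorted(camera_numbers)          # cameras not yet in any graph
    graphs = []
    while free:
        group = [free.pop(0)]              # smallest-numbered free camera C_i
        grew = True
        while grew:                        # incrementally grow the clique
            grew = False
            for cam in list(free):         # next smallest number first
                if overlaps(group, cam):
                    group.append(cam)
                    free.remove(cam)
                    grew = True
                    break
        graphs.append(group)               # node set of one complete graph
    return graphs
```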
6. Camera calibration within a graph

We assume that the cameras in a graph, say G_k, remain static for a certain time period. Thus, standard multi-camera self-calibration techniques can be used for calibrating the cameras within a graph. In a distributed system, multi-camera calibration is carried out at each node of the graph G_k. The crucial point here is to automatically find multi-view correspondences at each node. Since this is not an easy task, we show that it is possible to calibrate a graph of size N > 3 by calibrating its 3-cliques and using multi-layered belief propagation to reach a consistent estimate of the camera parameters of all the cameras in the graph.

We consider all possible 3-cliques of the graph G_k. Let G_k^i be the i-th 3-clique of G_k. The corresponding points between the nodes of G_k^i are found automatically as discussed in Section 4. A standard multi-camera self-calibration technique is used at each node of G_k^i to get estimates of the camera parameters of each camera in G_k^i. Belief propagation (discussed in Section 7) between the nodes of G_k^i gives a consistent estimate of the camera parameters for each camera in G_k^i. This is done for each of the 3-cliques of G_k, which will not be more than $\binom{n}{3}$ for a graph of size n. Therefore, there will be $\binom{n}{3}$ estimates of each camera after belief propagation is carried out within each 3-clique. Then, multi-layered belief propagation at each node of G_k is carried out between the estimates of the camera parameters of that node in the various (at most $\binom{n}{3}$) 3-cliques. If this procedure is carried out iteratively, then it is not necessary to calibrate all the $\binom{n}{3}$ 3-cliques. It is possible that a consistent estimate of the camera parameters for each camera in G_k can be reached with fewer 3-cliques than $\binom{n}{3}$. Thus, we are able to calibrate the complete graph of N > 3 cameras without knowing multi-view correspondences among all the nodes of the graph. Figure 4 shows a result of this technique for calibrating a graph of five cameras by using five 3-cliques of the graph. An important point to be noted here is that the camera matrices have to be aligned to a common WCS for this graph before propagating belief at a node between the subgraphs. The common WCS for this graph can be a predefined WCS or we can take the lowest numbered camera in the graph to be at the origin of the WCS.

Figure 2. These images are from one pan-tilt camera taken at different pan and tilt positions. To find the homography between (a) and (f), where (f) is the home position, we find a sequence of homographies: between (a) and (b), then between (b), (c) and (d), and then between (d), (e) and (f). The point correspondences for finding the homographies are automatically found as explained in the text.
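
The clique-by-clique procedure can be organized as in the sketch below. Here `calibrate_clique` and `fuse` are placeholders for, respectively, multi-camera self-calibration plus belief propagation inside one 3-clique (Section 7) and the per-node fusion of clique estimates; both are assumptions of this sketch, not code from the paper:

```python
from itertools import combinations

def calibrate_graph(nodes, calibrate_clique, fuse):
    """Calibrate a graph G_k with n > 3 cameras via its 3-cliques (Section 6).
    `calibrate_clique(clique)` should return {camera: (mu, Sigma)} estimates
    after belief propagation within the clique; `fuse(estimates)` combines a
    node's estimates from the different cliques it belongs to."""
    per_camera = {c: [] for c in nodes}
    for clique in combinations(sorted(nodes), 3):   # at most C(n, 3) cliques
        for cam, est in calibrate_clique(clique).items():
            per_camera[cam].append(est)
        # in practice the iteration can stop early, once the fused estimates
        # stop changing, so not all C(n, 3) cliques need to be calibrated
    return {cam: fuse(ests) for cam, ests in per_camera.items()}
```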
7. Belief Propagation within a graph

For distributed calibration of cameras in a graph, say G_k, multi-camera self-calibration is carried out at each node, using the automatically found corresponding points. Therefore, at each node C_i of G_k, we obtain an estimate of the camera parameters P^k_j for all j cameras in G_k. Let y_i be the true camera parameters for the i-th camera. Our aim is to find y_i from the estimates of the camera parameters computed at each node of G_k, using belief propagation. The estimates of the camera parameters of all cameras in G_k computed at each node are considered as the beliefs at each node. In general, the belief propagation algorithm is used for solving inference problems based on local message passing [11]. Each node updates its beliefs by using the estimates it receives from its neighbors in the form of "messages". These beliefs are iteratively updated until there is no change in the belief at a node. As has been shown in [3], belief propagation can be directly applied on a graph which has cameras viewing a common scene as its nodes. In this case, the update equations are of the form:

$$\tilde{\Sigma}_{i,k} \leftarrow \Big[\Sigma_{i,k}^{-1} + \sum_{j \in N(i,k)} \Sigma_{j,k}^{-1}\Big]^{-1}$$
$$\tilde{\mu}_{i,k} \leftarrow \tilde{\Sigma}_{i,k}\Big[\Sigma_{i,k}^{-1}\,\mu_{i,k} + \sum_{j \in N(i,k)} \Sigma_{j,k}^{-1}\,\mu_{j,k}\Big] \qquad (1)$$

Here, μ_{i,k} and Σ_{i,k} are the estimate and covariance of the camera parameters computed at the i-th camera C_i in the k-th graph, G_k. N(i, k) denotes the set of neighbors of camera C_i in graph G_k. Moreover, the i-th node C_i receives μ_{j,k} and Σ_{j,k} from C_j, its j-th neighbor, j ∈ N(i, k). μ̃_{i,k} and Σ̃_{i,k} are the estimates of the camera parameters after belief propagation within graph G_k. The covariance matrix is calculated based on the forward covariance propagation from bundle adjustment. We consider the diagonal terms of the covariance matrix only, resulting in it being a diagonal matrix which is positive definite and invertible. Moreover, we use all the 11 camera parameters [6] as the belief at a node.
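
Equation (1) is the standard information-form fusion of Gaussian beliefs, so it transcribes directly into NumPy. A sketch (ours), assuming each belief is the 11-parameter vector with its diagonal covariance:

```python
import numpy as np

def bp_update(mu_i, Sigma_i, neighbor_beliefs):
    """One update of Equation (1) at camera C_i in graph G_k.
    mu_i: length-11 vector of camera parameters estimated at C_i;
    Sigma_i: its (diagonal, hence invertible) covariance matrix;
    neighbor_beliefs: list of (mu_j, Sigma_j) messages from j in N(i, k)."""
    info = np.linalg.inv(Sigma_i)          # information (inverse covariance)
    weighted_mean = info @ mu_i
    for mu_j, Sigma_j in neighbor_beliefs:
        info_j = np.linalg.inv(Sigma_j)
        info += info_j
        weighted_mean += info_j @ mu_j
    Sigma_tilde = np.linalg.inv(info)
    mu_tilde = Sigma_tilde @ weighted_mean
    return mu_tilde, Sigma_tilde
```

Iterating this update at every node, with each node sending its current (μ, Σ) to its neighbors, continues until the beliefs stop changing.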
7.1. Multi-layered Belief Propagation

Since the graphs are dynamic and the same camera C_i can be a part of two graphs, say G_{k-1} and G_k, in different pan-tilt orientations at different points in time, we perform belief propagation between graphs at each node C_i which is common to both G_{k-1} and G_k. Here, the belief at C_i in G_{k-1} is the estimate of the camera matrix of C_i (after belief propagation within G_{k-1}) at its home position, obtained by using the homography between C_i's view in G_{k-1} and the image taken at the home position of C_i. Similarly, the belief at C_i in G_k is the estimate of the camera matrix of C_i (after belief propagation within G_k) at the home position, obtained using the homography between the view of C_i in G_k and the home view of C_i.

As is well known [6], two views of a camera in different pan-tilt positions are related by a 3 × 3 image-to-image homography. Therefore, we automatically compute the homography between the pan/tilt view and the home view of a camera by automatically finding corresponding points between the two images, using SIFT matches further refined by fitting fundamental matrices to the points obtained, as described in Section 4. This homography is then used to get the camera matrix of the home position from the camera matrix of the pan-tilt position. Let P_θφ be the camera matrix at pan θ and tilt φ position, P_home be the camera matrix at the home position, and H be the homography between the home view and the pan-tilt view. Then, if x = P_home X, x′ = P_θφ X and x = Hx′, we have P_home = H P_θφ. Similarly, we can get to the pan-tilt position as P_θφ = H⁻¹ P_home.
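
Once H (or a chain of homographies, each estimable from Section 4 style correspondences, e.g. with cv2.findHomography) is available, moving a camera matrix to or from the home position is a matrix product. A sketch, assuming each homography in the chain maps its view one step closer to the home view:

```python
import numpy as np
from functools import reduce

def to_home(P_pan_tilt, homography_chain):
    """P_home = H * P_thetaphi, where H is a single homography or the
    composition of a chain (Figure 2). Each element of `homography_chain`
    is a 3x3 image-to-image homography, ordered from the pan-tilt view
    towards the home view."""
    H = reduce(lambda acc, Hi: Hi @ acc, homography_chain, np.eye(3))
    return H @ P_pan_tilt                  # (3x3) @ (3x4) camera matrix

def to_pan_tilt(P_home, homography_chain):
    """Inverse direction: P_thetaphi = H^{-1} * P_home."""
    H = reduce(lambda acc, Hi: Hi @ acc, homography_chain, np.eye(3))
    return np.linalg.inv(H) @ P_home
```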
In case the pan-tilt view of the camera does not have any overlap with the home position's view, a sequence of homographies can be used, again calculated automatically, as shown in Figure 2. Let μ̃_{i,k} be the estimate of the camera parameters of C_i after belief propagation within graph G_k, where C_i is in pan θ_k and tilt φ_k position. A homography or a sequence of homographies is used to calculate the camera parameters for the home position of C_i, denoted by $P_{i_{home},k}$. These parameters, taken as a vector, are the belief at C_i in G_k, denoted by $\mu_{i_{home},k}$. Let $\tilde{\mu}^{k-1}_{i_{home}}$ and $\tilde{\Sigma}^{k-1}_{i_{home}}$ be the estimates of the camera parameters and the covariance matrix after the (k−1)-th iteration, at the home position of C_i, of multi-layered belief propagation between k−1 graphs containing C_i in different pan-tilt positions. The home position is calculated in each graph using the image-to-image homography before applying the update equations for multi-layered belief propagation. The belief is updated using Equations 2.

$$\tilde{\Sigma}^{k}_{i_{home}} \leftarrow \Big[\big(\tilde{\Sigma}^{k-1}_{i_{home}}\big)^{-1} + \Sigma^{-1}_{i_{home},k}\Big]^{-1}$$
$$\tilde{\mu}^{k}_{i_{home}} \leftarrow \tilde{\Sigma}^{k}_{i_{home}}\Big[\big(\tilde{\Sigma}^{k-1}_{i_{home}}\big)^{-1}\,\tilde{\mu}^{k-1}_{i_{home}} + \Sigma^{-1}_{i_{home},k}\,\mu_{i_{home},k}\Big] \qquad (2)$$

where $\tilde{\mu}^{k}_{i_{home}}$ denotes the estimate of the camera parameters and $\tilde{\Sigma}^{k}_{i_{home}}$ is the estimate of the covariance matrix of the home position of C_i after the k-th iteration.
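
Equation (2) has the same information-form structure, now fusing the running home-position estimate with the contribution of the latest graph. A NumPy sketch under the same assumptions as the Equation (1) sketch:

```python
import numpy as np

def multilayer_update(mu_prev, Sigma_prev, mu_home_k, Sigma_home_k):
    """One iteration of Equation (2) at the home position of C_i:
    (mu_prev, Sigma_prev) is the fused estimate after graphs 1..k-1;
    (mu_home_k, Sigma_home_k) is the belief from graph G_k, already
    mapped to the home position through the homography (chain)."""
    info_prev = np.linalg.inv(Sigma_prev)
    info_k = np.linalg.inv(Sigma_home_k)
    Sigma_new = np.linalg.inv(info_prev + info_k)
    mu_new = Sigma_new @ (info_prev @ mu_prev + info_k @ mu_home_k)
    return mu_new, Sigma_new
```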
8. Forming new graphs

The multi-layered belief propagation mechanism can be utilized only if the graphs change across time. We develop a protocol for automatically controlling the pan-tilt of the cameras so that the network configuration changes after a fixed time period. We define a set of landmarks L = {L_1, L_2, . . . , L_m} in the scene with respect to the global WCS. Initially, the graphs are found using the technique discussed in Section 5. Once the estimates of the camera parameters have been computed in each of these graphs by multi-camera self-calibration and belief propagation within each graph, these cameras are aligned to the global WCS. The camera parameter estimates after alignment are then used for controlling the cameras to form new graphs in the network. The protocol is:

1. For each camera, compute the pan-tilt rotations required to view all the landmarks. (It is possible that a camera may not be able to view all the landmarks; therefore, only those that are visible are considered.)

2. For each camera, rotate by the smallest pan-tilt angles such that it views a landmark other than the one it is currently viewing.

3. Send a message to all the other cameras about the new landmark that it is viewing. If it is known a priori that two cameras will never have overlapping views, they need not inform each other about the new landmark they are viewing, thereby reducing unnecessary communication.

4. Each camera will have information from all other cameras about the landmark they are viewing. It takes into consideration all the cameras, say set S, that are viewing the same landmark as itself.

5. For each camera, check whether the cameras in its set S form a graph by using the procedure given in Section 5.

This also makes our system scalable, as the correspondences have to be calculated among only those cameras which view the same landmark and, in step 3, the messages have to be passed only between those cameras which can have overlapping views in some pan-tilt configuration. In general, these will be much smaller in number compared to the size of the camera network. The above algorithm ensures that the graphs in the camera network change over time. This is essential because if the graphs remained static, since they are mutually exclusive, no information would be shared between the graphs and it would not be possible to calibrate the complete network. It is possible that there will be cameras which do not have overlapping views with any other camera, or graphs that have fewer than 3 cameras. In the current time period these cameras are not considered for calibration and therefore remain idle. In the next time period, they shall repeat the above protocol and become part of graphs with ≥ 3 nodes and hence will be used for calibration and multi-layered belief propagation.
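
In outline, the protocol could be driven by a routine like the sketch below; every callable (PTZ control, messaging, visibility tests) is a placeholder for infrastructure the paper assumes rather than specifies:

```python
def form_new_graphs(cameras, viewing_now, visible_landmarks, rotation_cost,
                    rotate_to, broadcast, find_graphs):
    """Sketch of the Section 8 re-pointing protocol (all names illustrative).
    viewing_now[c]: landmark camera c currently views;
    visible_landmarks(c): landmarks reachable by some pan-tilt of c (step 1);
    rotation_cost(c, L): size of the pan-tilt rotation needed to view L;
    find_graphs: the Section 5 procedure, applied per landmark (step 5)."""
    new_view = {}
    for c in cameras:
        options = [L for L in visible_landmarks(c) if L != viewing_now.get(c)]
        if not options:
            continue                          # camera idles this time period
        target = min(options, key=lambda L: rotation_cost(c, L))  # step 2
        rotate_to(c, target)
        broadcast(c, target)                  # step 3: announce new landmark
        new_view[c] = target
    # step 4: group cameras by the landmark they now view
    by_landmark = {}
    for c, L in new_view.items():
        by_landmark.setdefault(L, []).append(c)
    # step 5: within each group, test which cameras actually form graphs
    graphs = []
    for group in by_landmark.values():
        graphs.extend(find_graphs(group))
    # graphs with fewer than 3 nodes sit out until the next time period
    return [g for g in graphs if len(g) >= 3]
```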
9. Aligning cameras to a global world coordinate system

We want the position and orientation of each camera's home position with respect to a global WCS. Moreover, belief propagation can be carried out only if all the cameras are aligned with respect to a common coordinate system in the world. For the cameras to align themselves to a global
Citations
Journal ArticleDOI
TL;DR: In this study, the authors discuss various issues and problems in video analytics, proposed solutions and present some of the important current applications of video analytics.
Abstract: Video, rich in visual real-time content, is however, difficult to interpret and analyse. Video collections necessarily have large data volume. Video analytics strives to automatically discover patterns and correlations present in the large volume of video data, which can help the end-user to take informed and intelligent decisions as well as predict the future based on the patterns discovered across space and time. In this study, the authors discuss various issues and problems in video analytics, proposed solutions and present some of the important current applications of video analytics.

12 citations

Proceedings ArticleDOI
01 Mar 2016
TL;DR: This work proposes a camera network configuration that includes a stereo pair with known baseline separation, and analytically demonstrates Euclidean auto calibration of such network under mild conditions, and compares favorably with the well known Zhang and Pollefeys methods in terms of shape recovery.
Abstract: Metric auto calibration of a camera network from multiple views has been reported by several authors. Resulting 3D reconstruction recovers shape faithfully, but not scale. However, preservation of scale becomes critical in applications, such as multi-party telepresence, where multiple 3D scenes need to be fused into a single coordinate system. In this context, we propose a camera network configuration that includes a stereo pair with known baseline separation, and analytically demonstrate Euclidean auto calibration of such network under mild conditions. Further, we experimentally validate our theory using a four-camera network. Significantly, our method not only recovers scale, but also compares favorably with the well known Zhang and Pollefeys methods in terms of shape recovery.

6 citations




Book ChapterDOI
30 Jun 2015
TL;DR: This paper proposes a novel framework for real-time, distributed, multi-object tracking in a PTZ camera network with this capability and provides a tool to mark an object of interest such that the object is tracked at a certain size as it moves in the view of various cameras across space and time.
Abstract: A visual surveillance system should have the ability to view an object of interest at a certain size so that important information related to that object can be collected and analyzed as the object moves in the area observed by multiple cameras. In this paper, we propose a novel framework for real-time, distributed, multi-object tracking in a PTZ camera network with this capability. In our framework, the user is provided a tool to mark an object of interest such that the object is tracked at a certain size as it moves in the view of various cameras across space and time. The pan, tilt and zoom capabilities of the PTZ cameras are leveraged upon to ensure that the object of interest remains within the predefined size range as it is seamlessly tracked in the PTZ camera network. In our distributed system, each camera tracks the objects in its view using particle filter tracking and multi-layered belief propagation is used for seamlessly tracking objects across cameras.

4 citations

Proceedings ArticleDOI
12 Dec 2010
TL;DR: A novel probabilistic Latent Semantic Analysis based algorithm for pair-wise interaction recognition is proposed and presented as an application of the distributed composite event recognition framework, where the events are interactions between pairs of objects.
Abstract: In this paper, we propose a real-time distributed framework for composite event recognition in a calibrated pan-tilt camera network. A composite event comprises of events that occur simultaneously or sequentially at different locations across time. Distributed composite event recognition requires distributed multi-camera multi-object tracking and distributed multi-camera event recognition. We apply belief propagation to reach a consensus on the global identities of the objects in the pan-tilt camera network and to arrive at a consensus on the event recognized by multiple cameras simultaneously observing it. We propose a hidden Markov model based approach for composite event recognition. We also propose a novel probabilistic Latent Semantic Analysis based algorithm for pair-wise interaction recognition and present an application of our distributed composite event recognition framework, where the events are interactions between pairs of objects.

2 citations

References
Journal ArticleDOI
TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Abstract: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene. The features are invariant to image scale and rotation, and are shown to provide robust matching across a substantial range of affine distortion, change in 3D viewpoint, addition of noise, and change in illumination. The features are highly distinctive, in the sense that a single feature can be correctly matched with high probability against a large database of features from many images. This paper also describes an approach to using these features for object recognition. The recognition proceeds by matching individual features to a database of features from known objects using a fast nearest-neighbor algorithm, followed by a Hough transform to identify clusters belonging to a single object, and finally performing verification through least-squares solution for consistent pose parameters. This approach to recognition can robustly identify objects among clutter and occlusion while achieving near real-time performance.

46,906 citations

Book
01 Jan 2000
TL;DR: In this article, the authors provide comprehensive background material and explain how to apply the methods and implement the algorithms directly in a unified framework, including geometric principles and how to represent objects algebraically so they can be computed and applied.
Abstract: From the Publisher: A basic problem in computer vision is to understand the structure of a real world scene given several images of it. Recent major developments in the theory and practice of scene reconstruction are described in detail in a unified framework. The book covers the geometric principles and how to represent objects algebraically so they can be computed and applied. The authors provide comprehensive background material and explain how to apply the methods and implement the algorithms directly.

15,558 citations

01 Jan 2011
TL;DR: The Scale-Invariant Feature Transform (or SIFT) algorithm is a highly robust method to extract and consequently match distinctive invariant features from images that can then be used to reliably match objects in differing images.
Abstract: The Scale-Invariant Feature Transform (or SIFT) algorithm is a highly robust method to extract and consequently match distinctive invariant features from images. These features can then be used to reliably match objects in differing images. The algorithm was first proposed by Lowe [12] and further developed to increase performance, resulting in the classic paper [13] that served as the foundation for SIFT, which has played an important role in robotic and machine vision in the past decade.

14,708 citations


Proceedings Article
30 Jul 1999
TL;DR: This paper compares the marginals computed using loopy propagation to the exact ones in four Bayesian network architectures, including two real-world networks: ALARM and QMR, and finds that the loopy beliefs often converge and when they do, they give a good approximation to the correct marginals.
Abstract: Recently, researchers have demonstrated that "loopy belief propagation" -- the use of Pearl's polytree algorithm in a Bayesian network with loops -- can perform well in the context of error-correcting codes. The most dramatic instance of this is the near Shannon-limit performance of "Turbo Codes" -- codes whose decoding algorithm is equivalent to loopy belief propagation in a chain-structured Bayesian network. In this paper we ask: is there something special about the error-correcting code context, or does loopy propagation work as an approximate inference scheme in a more general setting? We compare the marginals computed using loopy propagation to the exact ones in four Bayesian network architectures, including two real-world networks: ALARM and QMR. We find that the loopy beliefs often converge and when they do, they give a good approximation to the correct marginals. However, on the QMR network, the loopy beliefs oscillated and had no obvious relationship to the correct posteriors. We present some initial investigations into the cause of these oscillations, and show that some simple methods of preventing them lead to the wrong results.

1,532 citations


"Distributed calibration of pan-tilt..." refers methods in this paper

  • ...In general, belief propagation algorithm is used for solving inference problems based on local message passing [11]....

    [...]

Frequently Asked Questions (8)
Q1. What contributions have the authors mentioned in the paper "Distributed calibration of pan-tilt camera network using multi-layered belief propagation"?

In this paper, the authors present a technique for distributed self-calibration of a pan-tilt camera network using multi-layered belief propagation.

The authors show that by using multi-layered belief propagation it is sufficient to have correspondences between only three cameras at a time for consistent calibration of a larger N > 3 static camera network. 

To perform multi-layered belief propagation between two graphs containing the same camera in different pan-tilt positions, the authors need to bring the cameras to their home (zero pan and zero tilt) position in both the graphs. 

The authors have shown that by using multi-layered belief propagation it is possible to get accurate and globally consistent estimates of the camera parameters for each pan-tilt camera in the network with respect to a global world coordinate system. 

Corresponding points between the views of the cameras in each graph are found automatically and multi-camera self-calibration is performed at each node of the graph.

A homography or a sequence of homographies is used to calculate the camera parameters for the home position of C_i, denoted by $P_{i_{home},k}$.

The authors show that the camera matrix for the home position of the camera can be computed by automatically finding pairwise correspondences to compute the homography or a sequence of homographies between the camera’s pan-tilt view and the home view. 

Very recently, authors in [4], have proposed a distributed algorithm for calibration of a camera sensor network, where they assume that one of the cameras is calibrated and use epipolar geometry based algorithms at each node to obtain its calibration parameters.