Markov Chain Monte Carlo Data Association for
General Multiple Target Tracking Problems
Songhwai Oh, Stuart Russell, Shankar Sastry
Abstract— In this paper, we consider the general multiple target tracking problem in which an unknown number of targets appears and disappears at random times and the goal is to find the tracks of targets from noisy observations. We propose an efficient real-time algorithm that solves the data association problem and is capable of initiating and terminating a varying number of tracks. We take the data-oriented, combinatorial optimization approach to the data association problem but avoid the enumeration of tracks by applying a sampling method called Markov chain Monte Carlo (MCMC). The MCMC data association algorithm can be considered a deferred logic method, since its decision about forming a track is based on both current and past observations. At the same time, it can be considered an approximation to the optimal Bayesian filter. The algorithm shows remarkable performance compared to the greedy algorithm and the multiple hypothesis tracker (MHT) under extreme conditions, such as a large number of targets in a dense environment, low detection probabilities, and a large number of false alarms.
I. INTRODUCTION
Multiple target tracking plays an important role in many areas of engineering such as surveillance, computer vision, and signal processing [1], [4]. Under the most
general setup, a varying number of indistinguishable targets
is moving around in a region with continuous motions and
the positions of moving targets are sampled at random
intervals. The measurements about the positions are noisy,
with detection probability less than one, and there is a
noise background of spurious position reports, i.e., false
alarms. Targets arise at random in space and time. Each
target persists independently for a random length of time
and ceases to exist. A track of a target is defined as a path
in space-time traveled by the target. The essence of the
multiple target tracking problem is to find tracks from the
noisy observations and it requires solutions to both data
association and state estimation problems [16].
The data association problem in multiple target tracking
is described as a problem of finding a partition of observa-
tions such that each element of a partition is a collection
of observations generated by a single target or clutter
[16]. However, due to the noise in state transitions and observations, we cannot expect to find the exact solution.
This data-oriented view of data association has been applied
and extended by many authors [9], [17], [15], [8], [5],
[14]. The most successful multiple target tracking algorithm
This work was supported by DARPA F30602-00-2-0538 and ARO
MURI DAAD 19-OZ-1-0383.
The authors are with the Department of Electrical Engineering and
Computer Sciences, University of California, Berkeley, CA 94720,
{sho,russell,sastry}@eecs.berkeley.edu.
based on this view is the multiple hypothesis tracker (MHT)
[15]. In MHT, each hypothesis associates past observations
with a target and, as a new set of observations arrives, a new
set of hypotheses is formed from the previous hypotheses.
Each hypothesis is scored by its posterior and the algorithm
returns a hypothesis with the highest score as a solution.
MHT is categorized as a deferred logic [14] in which the
decision about forming a new track or removing an existing
track is delayed until enough observations are collected.
Hence, MHT is capable of initiating and terminating a
varying number of tracks and suitable for surveillance
applications in which an autonomous tracker is required.
However, the construction of new hypotheses requires an enumeration of all possibilities, and the number of hypotheses grows exponentially. The initial implementation and later extensions proposed several heuristics, such as pruning, gating, clustering, and N-scan-back logic, to reduce the complexity of the problem [15], [8]. However, these heuristics come at the expense of optimality, and the algorithm can still suffer in a dense environment. Furthermore, the running time at each step of the algorithm cannot be bounded easily, making it difficult to deploy in a real-time surveillance system. As a method of pruning, an efficient method of finding the k-best hypotheses, based on the algorithm by Murty [10], is developed in [5].
A different approach to the data association problem is
the joint probabilistic data association filter (JPDAF) [1].
JPDAF is a suboptimal single-stage approximation to the
optimal Bayesian filter. Given a fixed number of targets,
JPDAF enumerates all possible associations between the
latest set of observations and the known tracks and clutter
and computes each association weight. For each association,
the conditional expectation of the state of a target is
estimated by a filtering algorithm. Then, the state of a target
is estimated by summing over the conditional expectations
weighted by association weights. JPDAF is a sequential
tracker in which the associations between the known targets
and the latest observations are made sequentially and the
associations made in the past are not reversible [14]. Since
only the current set of observations is considered, JPDAF
cannot initiate or terminate tracks. Also JPDAF assumes
a fixed number of targets and requires a good initial state
for each target. There are restricted extensions to JPDAF to
allow the formation of a new track (see [4] and references
within). Other multiple target tracking algorithms include
the multisensor multitarget mixture reduction (MTMR) [12]
and the probabilistic multi-hypothesis tracker (PMHT) [18].
But they also assume a fixed number of targets and cannot initiate or terminate tracks. Recently, a Bayesian model-based approach for tracking an unknown number of targets, which can initiate and terminate tracks, was presented in [11].
The sequential trackers are more efficient than deferred logic trackers such as MHT, but they are prone to making erroneous associations [14]. In addition, the exact calculation at each stage can be inferred to be NP-hard [3], since the related problem of computing the permanent of a 0-1 matrix is #P-complete [19]. In [6], a single-stage data association problem is considered and a leave-one-out heuristic is developed to avoid the enumeration of all possible associations. Later, the approach was extended to a multi-stage data association problem using Markov chain Monte Carlo (MCMC) [13].
The data association problem of multiple target tracking formulated under the data-oriented view is also known to be NP-hard [14]. Hence, we cannot expect to find an optimal solution in polynomial time unless P = NP.
An optimization approach to data association has been
applied as a 0-1 integer programming problem [9] and as
a multidimensional assignment problem [14]. In both cases
one needs to find a feasible set of tracks from all possible
tracks to prevent the exponential explosion and compute
the cost of each feasible track, such as the negative log
likelihood. Then the optimization routine finds a subset
from the feasible tracks such that the combined costs are
minimized while satisfying the constraints, i.e., each track
has at most one observation at each time and no two tracks
share the same observation. The gating method similar to
the ones described in [17], [15] is used to find a feasible
set of tracks. However, in a dense environment, the size of
the feasible tracks can be very large and the complexity
of the optimization routine increases dramatically, since the
number of parameters in the optimization routine depends
on the number of feasible tracks.
The main contribution of this paper is the development
of an efficient real-time algorithm that solves the data
association problem and is capable of initiating and ter-
minating a varying number of tracks. We take the data-
oriented, combinatorial optimization approach to the data
association problem but avoid the enumeration of tracks by
applying a sampling method called Markov chain Monte
Carlo (MCMC). The immediate benefit of using MCMC is
the low memory requirement. The MCMC data association
(MCMCDA) algorithm can be considered as a deferred
logic since its decision about forming a track is based on
the current and past observations. But, at the same time,
it can be considered as an approximation to the optimal
Bayesian filter if it is used to approximate the association
probabilities or expectations such as the average link travel
time as done in [13]. So, from the Bayesian point of view,
the algorithm can be considered as a generalization of
[13] to handle an unknown number of objects, missing
observations and false alarms. MCMCDA shows remarkable performance compared to the greedy algorithm and MHT under extreme conditions, such as a large number of targets in a dense environment, low detection probabilities, and a large number of false alarms.
The remainder of this paper is structured as follows. We
formally state the (discrete-time) general multiple target
tracking problem in Section II. In Section III, we present
a general purpose MCMCDA algorithm for multiple target
tracking. The algorithm is applied in simulation to extreme
situations and its performance is compared with the greedy
algorithm and MHT in Section IV.
II. GENERAL MULTIPLE TARGET TRACKING
A. Problem
Let T ∈ Z^+ be the duration of surveillance. Let K be the unknown number of objects moving around the surveillance region R for some duration [t^k_i, t^k_f] ⊂ [1, T] for k = 1, . . . , K. Let V be the volume of R. Each object arises at a random position in R at t^k_i, moves independently around R until t^k_f, and disappears. At each time, an existing target persists with probability (1 − p_z) and disappears with probability p_z. The number of objects arising at each time over R has a Poisson distribution with parameter λ_b V, where λ_b is the birth rate of new objects per unit time, per unit volume. The initial position of a new object is uniformly distributed over R.
Let F^k : R^d → R^d be the discrete-time dynamics of object k, where d is the dimension of the state variable, and let x^k_t ∈ R^d be the state of object k at time t for k = 1, 2, . . . , K. The object k moves according to

    x^k_{t+1} = F^k(x^k_t) + w^k_t    for t = t^k_i, . . . , t^k_f − 1,

where w^k_t ∈ R^d are white noise processes. The noisy observation of the state of an object is measured with detection probability p_d, which is less than unity. There are also false alarms; the number of false alarms has a Poisson distribution with parameter λ_f V, where λ_f is the false alarm rate per unit time, per unit volume. Let n_t be the number of observations at time t, which includes both noisy observations and false alarms. Let y^j_t ∈ R^m be the j-th observation at time t for j = 1, . . . , n_t, where m is the dimensionality of each observation vector. Each object generates a unique observation at each sampling time if it is detected. Let H^j : R^d → R^m be the observation model. Then the observations are generated as follows:

    y^j_t = H^j(x^k_t) + v^j_t    if the j-th observation is from x^k_t,
    y^j_t = u_t                   otherwise,

where v^j_t ∈ R^m are white noise processes and u_t ∼ Unif(R) is a random process for false alarms. Notice that, with probability 1 − p_d, an object is not detected; we call this a missing observation. We assume in this paper that targets are indistinguishable; but, if observations include target type or attribute information, the state variable can be extended to include it.
Under the data-oriented approach, the multiple target tracking problem is to partition the observations such that the posterior is maximized, i.e., to find the maximum a posteriori (MAP) estimate. Under the Bayesian approach, if we are given a function defined on Ω, the collection of all partitions of observations (see below for its definition), we seek the expected value of the function given the observations. The MAP estimate found under the data-oriented approach may not be robust in the Bayesian sense, but it is sometimes more convenient when estimating parameters whose dimension depends on the number of tracks, such as the states of targets. Since the size of x_t = ((x^1_t)^T, . . . , (x^K_t)^T)^T depends on the number of tracks K, the estimation of x_t without fixing the number of tracks is not meaningful. Hence, under the Bayesian approach, if a single set of state estimates is required, we might first estimate the most likely number of targets and then estimate the expected values of the states given the estimated number of targets.
B. Probabilistic Model
Let us first specify the dynamic and measurement models. Here we use the usual linear system model, but the method can be easily extended to non-linear models coupled with a non-linear regression algorithm. If an object is observed k times, at t_1, t_2, . . . , t_k, its dynamic and measurement models can be expressed as:

    x_{t_{i+1}} = A(t_{i+1} − t_i) x_{t_i} + G(t_{i+1} − t_i) w_{t_i}
    y_{t_i} = C x_{t_i} + v_{t_i}    for i = 1, . . . , k,    (1)

where w_{t_i} and v_{t_i} are white Gaussian noises with zero mean and covariances Q and R, respectively. A(·), G(·), and C are matrices of appropriate sizes. The entries of A(t_{i+1} − t_i) and G(t_{i+1} − t_i) are determined by the sampling interval t_{i+1} − t_i for each i. For clarity, the subsequence notation for the time index is suppressed for now. Let x̄_t be the expected value of x_t given y_1, . . . , y_{t−1}; P̄_t be the covariance of x_t given y_1, . . . , y_{t−1}; x̂_t be the expected value of x_t given y_1, . . . , y_t; and P̂_t be the covariance of x_t given y_1, . . . , y_t.
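These four quantities are produced by the standard Kalman prediction and update recursions for model (1). The following is a minimal NumPy sketch (the function names and container conventions are ours, not from the paper):

```python
import numpy as np

def kalman_predict(x_hat, P_hat, A, G, Q):
    """One prediction step: returns (x_bar, P_bar) from the previous filtered estimate."""
    x_bar = A @ x_hat
    P_bar = A @ P_hat @ A.T + G @ Q @ G.T
    return x_bar, P_bar

def kalman_update(x_bar, P_bar, y, C, R):
    """One measurement update; B = C P_bar C^T + R is the innovation covariance."""
    B = C @ P_bar @ C.T + R
    K = P_bar @ C.T @ np.linalg.inv(B)              # Kalman gain
    x_hat = x_bar + K @ (y - C @ x_bar)
    P_hat = (np.eye(len(x_bar)) - K @ C) @ P_bar
    return x_hat, P_hat, B
```

The innovation covariance B returned here is exactly the quantity B_t(τ) that enters the Gaussian factors of the posterior later in this section.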
Let y_t = {y^j_t : j = 1, . . . , n_t} and Y = ∪_{t∈{1,...,T}} y_t. Let Ω be the collection of partitions of Y such that, for ω ∈ Ω,
1) ω = {τ_0, τ_1, . . . , τ_K};
2) ∪_{k=0}^{K} τ_k = Y and τ_i ∩ τ_j = ∅ for i ≠ j;
3) τ_0 is a set of false alarms;
4) |τ_k ∩ y_t| ≤ 1 for k = 1, . . . , K and t = 1, . . . , T; and
5) |τ_k| > 1 for k = 1, . . . , K.
Here, K is the number of tracks for the given partition ω ∈ Ω. We call τ_k a track when there is no confusion, although the actual track is the set of estimated states from the observations τ_k. However, we assume there is a deterministic function that returns a set of estimated states given a set of observations, so no distinction is required. We denote by τ_k(t) the observation at time t that is assigned to the track τ_k. Notice that τ_k(t) can be empty if it is a missing observation. The fourth requirement says that a track can have at most one observation at each time; but, in the case of multiple sensors, we can easily relax this requirement to allow multiple observations per track. A track is assumed to contain at least two observations, since we cannot distinguish a track with a single observation from a false alarm. Once a partition ω ∈ Ω is chosen, the tracks τ_1, . . . , τ_K ∈ ω and the set of false alarms τ_0 ∈ ω are completely determined. Hence, for each track, we can estimate the states of an object independently, since each object moves independently of the other objects. For each track τ ∈ ω, we apply the Kalman filter to estimate the states x̄_t(τ) and covariances B_t(τ), where B_t(τ) = C P̄_t(τ) C^T + R is the conditional observation covariance at time t for the track τ.
Let e_t be the number of targets from time t − 1 and a_t be the number of new targets at time t. Let z_t be the number of targets terminated at time t and c_t = e_t − z_t. Let d_t be the number of detections at time t and u_t = e_t − z_t + a_t − d_t be the number of undetected targets. Finally, let f_t = n_t − d_t be the number of false alarms. It can be shown that the posterior of ω is:

    P(ω|Y) = (1/Z) ∏_{t=1}^{T} p_z^{z_t} (1 − p_z)^{c_t} p_d^{d_t} (1 − p_d)^{u_t} λ_b^{a_t} λ_f^{f_t}
             × ∏_{τ∈ω\{τ_0}} ∏_{i=1}^{|τ|−1} N(τ(t_{i+1}) | x̄_{t_{i+1}}(τ), B_{t_{i+1}}(τ)),    (2)

where Z is a normalizing constant and N(·|μ, Σ) is the Gaussian density function with mean μ and covariance matrix Σ. Now, under the data-oriented, combinatorial optimization approach, our goal is to find a partition of observations such that P(ω|Y) is maximized.
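For intuition, the combinatorial factor of (2) (everything except the Gaussian terms and 1/Z) is easiest to evaluate in log form. A hypothetical sketch, in which the per-scan counts z_t, c_t, d_t, u_t, a_t, f_t are assumed to be supplied by the caller:

```python
import math

def log_prior_factor(counts, p_z, p_d, lam_b, lam_f):
    """Log of the unnormalized combinatorial factor in P(w|Y):
    sum over scans t of  z_t log p_z + c_t log(1-p_z) + d_t log p_d
                       + u_t log(1-p_d) + a_t log lam_b + f_t log lam_f.
    `counts` is a list of dicts with keys 'z', 'c', 'd', 'u', 'a', 'f', one per scan."""
    lp = 0.0
    for c in counts:
        lp += (c['z'] * math.log(p_z) + c['c'] * math.log(1 - p_z)
               + c['d'] * math.log(p_d) + c['u'] * math.log(1 - p_d)
               + c['a'] * math.log(lam_b) + c['f'] * math.log(lam_f))
    return lp
```

Working in log space avoids numerical underflow and lets the acceptance ratio of the sampler in Section III be computed as an exponentiated difference of log posteriors.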
III. MCMC DATA ASSOCIATION ALGORITHM
In this section, we develop an MCMC sampler to solve the multiple target tracking problem. Solving complex problems by sampling methods such as Markov chain Monte Carlo (MCMC) has become more tractable due to increased computational power. MCMC-based algorithms play a significant role in many fields such as physics, statistics, economics, and engineering [2]. In some cases, MCMC is the only known general algorithm that finds a good approximate solution to a complex problem in polynomial time [7]. MCMC techniques have been applied to complex probability distribution integration problems, counting problems such as #P-complete problems, and combinatorial optimization problems [7], [2]. The MCMC approach applied to combinatorial optimization problems is generally known as simulated annealing.
The set Ω becomes the state space of the MCMC sampler and we sample from Ω such that its stationary distribution is P(ω|Y). If we are at state ω ∈ Ω, we propose ω′ ∈ Ω following the proposal distribution q(ω, ω′). The move is accepted with acceptance probability A(ω, ω′), where

    A(ω, ω′) = min { 1, [P(ω′|Y) q(ω′, ω)] / [P(ω|Y) q(ω, ω′)] },    (3)

otherwise the sampler stays at ω, so that detailed balance is satisfied. If we make sure that the chain is irreducible and aperiodic, then the chain converges to its stationary distribution. The sampler consists of five types of moves:
1) birth/death move pair;
2) split/merge move pair;
3) extension/reduction move pair;
4) track update move; and
5) track switch move.
We index each move by an integer such that m = 1 for a birth move, m = 2 for a death move, and so on. The move m is chosen randomly from the distribution ξ_K(m), where K is the number of tracks of the current partition ω. When there is no track, we can only propose a birth move, so we set ξ_0(m = 1) = 1 and 0 for all other moves. When there is only a single target, we cannot propose a merge or track switch move, so ξ_1(m = 4) = ξ_1(m = 8) = 0. For other values of K and m, we assume ξ_K(m) > 0. The MCMC data association (MCMCDA) algorithm is described in Algorithm 1. The inputs are the set of all observations Y, the number of samples n_mc, and the initial state ω_init. At each step of the algorithm, ω is the current state of the Markov chain. The acceptance probability A(ω, ω′) is defined in (3), where the posterior (2) is used.
Algorithm 1 (MCMC Data Association (MCMCDA)):
Input: Y, n_mc, ω_init
Output: ω̂ = arg max_ω P(ω(n)|Y)
  ω ← ω_init
  for n = 1 to n_mc
    sample m from ξ_K(·)
    propose ω′ based on m and ω (described below)
    sample U from Unif[0, 1]
    if U < A(ω, ω′)
      ω ← ω′
    end
    ω(n) ← ω
  end
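The loop in Algorithm 1 is an ordinary Metropolis–Hastings iteration. A minimal sketch, in which `propose`, `log_posterior`, and `log_q` are user-supplied stand-ins (our names, not the paper's) for the moves, the posterior (2), and the proposal density q:

```python
import math
import random

def mcmcda(omega_init, n_mc, propose, log_posterior, log_q):
    """Metropolis-Hastings search for the MAP partition, in the spirit of Algorithm 1.
    propose(omega) returns a candidate state; log_q(a, b) is log q(a, b)."""
    omega = omega_init
    best, best_lp = omega, log_posterior(omega)
    for _ in range(n_mc):
        omega_new = propose(omega)
        # log of the acceptance ratio in (3)
        log_A = (log_posterior(omega_new) - log_posterior(omega)
                 + log_q(omega_new, omega) - log_q(omega, omega_new))
        if random.random() < math.exp(min(0.0, log_A)):
            omega = omega_new                      # accept the move
        lp = log_posterior(omega)
        if lp > best_lp:                           # keep only the best state seen
            best, best_lp = omega, lp
    return best
```

Keeping only the best state seen so far, rather than the whole sample path, reflects the low memory requirement discussed below.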
In Algorithm 1, we use MCMC to find a solution to a combinatorial optimization problem, so it can be considered simulated annealing at a constant temperature. No burn-in samples are used since we are simply looking for a partition which maximizes the posterior. In addition, the memory requirement of the algorithm is at its bare minimum: instead of keeping all {ω(n)}_{n=1}^{n_mc}, we can simply keep the partition with the maximum posterior. If the algorithm is used to estimate E_{P(ω|Y)} f(ω) for some bounded function f, we will need burn-in samples and need to maintain the sufficient statistics for the desired expectation.
In order to make the algorithm more efficient, we make two additional assumptions: (1) the maximal directional speed of any target in R is less than v̄; and (2) the number of consecutive missing observations of any track is less than d̄. The first assumption is reasonable in a surveillance scenario since, in many cases, the maximal speed of a vehicle is generally known based on its type and terrain conditions. The second assumption introduces a user-defined parameter, which can be used as one of the criteria to distinguish the appearance of a new object from the continuation of an existing object. We will now assume that these two conditions are added to the definition of Ω, so each element ω ∈ Ω satisfies these two additional assumptions.
We now introduce a data structure which is used to propose a new partition ω′ in Algorithm 1. We define a neighborhood tree of observations as

    L_d(y^j_t) = {y^k_{t+d} ∈ y_{t+d} : ‖y^j_t − y^k_{t+d}‖ ≤ d · v̄}

for d = 1, . . . , d̄, j = 1, . . . , n_t and t = 1, . . . , T − 1. Here ‖·‖ is the usual Euclidean distance. This neighborhood tree groups temporally separated observations based on their distances. The parameter d allows missing observations. The use of this neighborhood tree makes the algorithm more scalable, since distant observations will be considered separately, and makes the computation of the proposal distribution easier. It is similar to the clustering technique used in MHT, but L_d(·) is fixed for a given set of observations Y.
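One way to realize L_d(·) in code is a dictionary keyed by (scan, observation index, gap). The following is a sketch under the assumption that observations are given per scan as NumPy position vectors (this container layout is ours, not from the paper):

```python
import numpy as np

def build_neighborhood(scans, d_max, v_max):
    """scans[t] is a list of observation vectors at time t (0-indexed scans).
    Returns L with L[(t, j, d)] = indices k such that
    ||scans[t][j] - scans[t+d][k]|| <= d * v_max."""
    T = len(scans)
    L = {}
    for t in range(T - 1):
        for j, y in enumerate(scans[t]):
            for d in range(1, d_max + 1):
                if t + d >= T:
                    break
                L[(t, j, d)] = [k for k, y2 in enumerate(scans[t + d])
                                if np.linalg.norm(y - y2) <= d * v_max]
    return L
```

Since the structure is fixed for a given Y, it can be built once before sampling and queried in constant time by every move described below.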
We now describe each move of the sampler in detail. First, let ζ(d) be a distribution of a random variable d taking values from {1, 2, . . . , d̄}. We assume the current state of the chain is ω = ω_0 ∪ ω_1, where ω_0 = {τ_0} and ω_1 = {τ_1, . . . , τ_K}. The proposed partition is denoted by ω′ = ω′_0 ∪ ω′_1. Note the abuse of notation below with the indexing of time, i.e., when we write τ(t_i), t_i means the time at which the target corresponding to the track τ is observed for the i-th time.
A. Birth and Death Moves
For a birth move, we increase the number of tracks from K to K′ = K + 1 and select t_1 uniformly at random (u.a.r.) from {1, . . . , T − 1} as the appearance time of a new track. Let τ_{K′} be the track of this new object. Then we choose d_1 from the distribution ζ. Let

    L^1_{d_1} = {y^j_{t_1} : L_{d_1}(y^j_{t_1}) ≠ ∅, y^j_{t_1} ∉ τ_k(t_1), j = 1, . . . , n_{t_1}, k = 1, . . . , K}.

L^1_{d_1} is the set of observations at t_1 such that, for any y ∈ L^1_{d_1}, y does not belong to another track and y has at least one descendant in L_{d_1}(y). We choose τ_{K′}(t_1) u.a.r. from L^1_{d_1}. If L^1_{d_1} is empty, the move is rejected since the move is not reversible. Once the initial observation is chosen, we choose the subsequent observations for the track τ_{K′}. For i = 2, 3, . . ., we choose d_i from ζ and choose τ_{K′}(t_i) u.a.r. from L_{d_i}(τ_{K′}(t_{i−1})) \ {τ_k(t_{i−1} + d_i) : k = 1, . . . , K} unless this set is empty. But, for i = 3, 4, . . ., the process of adding observations to τ_{K′} terminates with probability γ, where 0 < γ < 1. If |τ_{K′}| ≤ 1, the move is rejected. We then propose this modified partition, where ω′_1 = ω_1 ∪ {τ_{K′}} and ω′_0 = {τ_0 \ τ_{K′}}. For a death move, we simply choose k u.a.r. from {1, . . . , K}, delete the k-th track, and propose a new partition where ω′_1 = ω_1 \ {τ_k} and ω′_0 = {τ_0 ∪ τ_k}.
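Of the two moves, the death move is the simpler to implement: the deleted track's observations are simply reassigned as false alarms. A sketch on plain Python containers (the track and partition representations are our assumptions):

```python
import random

def death_move(omega_1, tau_0):
    """Delete a track chosen u.a.r.; its observations become false alarms.
    omega_1: list of tracks (each a set of observations), tau_0: set of false alarms.
    Returns the new (omega_1', tau_0') without mutating the inputs."""
    if not omega_1:
        return omega_1, tau_0            # no track to delete: move is rejected
    k = random.randrange(len(omega_1))
    removed = omega_1[k]
    new_omega_1 = omega_1[:k] + omega_1[k + 1:]
    new_tau_0 = tau_0 | removed          # reassign the observations as false alarms
    return new_omega_1, new_tau_0
```

Returning a fresh partition instead of mutating the current one keeps the rejection branch of the Metropolis-Hastings step trivial: the old state is still intact.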
B. Split and Merge Moves
For a split move, we select τ_s(t_r) u.a.r. from {τ_k(t_i) : |τ_k| ≥ 4, i = 2, . . . , |τ_k| − 2, k = 1, . . . , K}. Then we split the track τ_s into τ_{s1} and τ_{s2} such that τ_{s1} = {τ_s(t_i) : i = 1, . . . , r} and τ_{s2} = {τ_s(t_i) : i = r + 1, . . . , |τ_s|}. The modified track partition becomes ω′_1 = (ω_1 \ {τ_s}) ∪ {τ_{s1}} ∪ {τ_{s2}} and the false alarm partition ω′_0 is updated accordingly. For a merge move, we consider the set

    M = {(τ_{k1}(t_f), τ_{k2}(t_1)) : τ_{k2}(t_1) ∈ L_{t_1 − t_f}(τ_{k1}(t_f)), f = |τ_{k1}|, for k_1 ≠ k_2, 1 ≤ k_1, k_2 ≤ K}.

We select a pair (τ_{s1}(t_f), τ_{s2}(t_1)) u.a.r. from M. The tracks are combined into a single track τ_s = τ_{s1} ∪ τ_{s2}. Then we propose a new partition where ω′_1 = (ω_1 \ ({τ_{s1}} ∪ {τ_{s2}})) ∪ {τ_s} and ω′_0 with appropriate rearrangements.
C. Extension and Reduction Moves
In a track extension move, we select a track τ u.a.r. from the K available tracks in ω. We reassign the observations for τ after the disappearance time t_{|τ|} as done in the track birth move. For a track reduction move, we select a track τ u.a.r. from the K available tracks in ω and r u.a.r. from {2, . . . , |τ| − 1}. We shorten the track τ to {τ(t_1), . . . , τ(t_r)} by removing the observations assigned to τ from time t_{r+1} onward.
D. Track Update Move
In a track update move, we select a track τ u.a.r. from the K available tracks in ω. Then we pick r u.a.r. from {1, 2, . . . , |τ|} and reassign the observations for τ after the time t_r as done in the track birth move.
E. Track Switch Move
For a track switch move, we select a pair of observations (τ_{k1}(t_p), τ_{k2}(t_q)) from two different tracks such that τ_{k1}(t_{p+1}) ∈ L_d(τ_{k2}(t_q)) and τ_{k2}(t_{q+1}) ∈ L_{d′}(τ_{k1}(t_p)), where d = t_{p+1} − t_q, d′ = t_{q+1} − t_p, and 0 < d, d′ ≤ d̄. Then we let

    τ_{k1} = {τ_{k1}(t_1), . . . , τ_{k1}(t_p), τ_{k2}(t_{q+1}), . . . , τ_{k2}(t_{|τ_{k2}|})}
    τ_{k2} = {τ_{k2}(t_1), . . . , τ_{k2}(t_q), τ_{k1}(t_{p+1}), . . . , τ_{k1}(t_{|τ_{k1}|})}.
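On tracks stored as observation lists, the switch itself is a crossover of the two tails. A minimal sketch (checking the L_d(·) validity conditions above is assumed to happen before the swap):

```python
def track_switch(tau1, tau2, p, q):
    """Exchange the tails of two tracks after positions p and q (0-indexed):
    tau1 keeps its first p+1 observations and takes tau2's tail after q, and vice versa."""
    new_tau1 = tau1[:p + 1] + tau2[q + 1:]
    new_tau2 = tau2[:q + 1] + tau1[p + 1:]
    return new_tau1, new_tau2
```

Note that the swap preserves the total set of observations, so requirements 1) and 2) on the partition in Section II-B continue to hold automatically.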
Theorem 1: Suppose that 0 < p_z, p_d < 1 and λ_b, λ_f > 0. If ζ(d) > 0 for all d ∈ {1, . . . , d̄}, then the Markov chain designed by Algorithm 1 is irreducible.
Proof: The birth and death moves are sufficient to illustrate the irreducibility of the chain. Since 0 < p_z, p_d < 1 and λ_b, λ_f > 0, P(ω|Y) > 0 for all ω ∈ Ω. Take an arbitrary partition ω ∈ Ω, say ω = {τ_0, τ_1, . . . , τ_K}. Now consider the partition ω′ ∈ Ω such that ω′ = {τ′_0}, i.e., ω′ assigns all observations as false alarms. Since ω is arbitrary, the chain is irreducible if the chain can move from ω′ to ω and from ω to ω′. For the move from ω′ to ω, consider K consecutive birth moves: ω_0 = ω′, ω_1 = {{τ′_0 \ τ_1}, τ_1}, . . . , ω_K = {{τ′_0 \ {∪_{i=1}^{K} τ_i}}, τ_1, . . . , τ_K} = ω. Since ω ∈ Ω, all tracks τ_k are legal, i.e., τ_k satisfies the constraints described in Section II-B and, for i = 1, . . . , |τ_k| − 1, τ_k(t_{i+1}) ∈ L_d(τ_k(t_i)) for 1 ≤ d = t_{i+1} − t_i ≤ d̄. Thus, ω_k ∈ Ω for all k. Because ζ(d) > 0 and all tracks τ_k are legal, the probability of proposing τ_k at ω_{k−1} by the birth move is positive, i.e., q(ω_{k−1}, ω_k) > 0. For the move from ω to ω′, consider K consecutive death moves: ω_K = ω, ω_{K−1}, . . . , ω_0 = ω′. The probability of removing the track τ_k at ω_k by the death move is positive, i.e., q(ω_k, ω_{k−1}) > 0. Since P(ω_k|Y) > 0 for all k, the chain can move from ω′ to ω and from ω to ω′. Hence, the chain is irreducible.
The Markov chain designed by Algorithm 1 is irreducible (Theorem 1) and aperiodic, since there is always a positive probability of staying at the current state in the track update move. In addition, the transitions described in Algorithm 1 satisfy detailed balance. Hence, by the ergodic theorem, the chain converges to its stationary distribution. Notice that the other moves are designed to improve the performance of the algorithm.
Algorithm 2 (Greedy Multiple Target Tracking):
Input: Y, σ (threshold function)
Output: ω = ω_0 ∪ ω_1
  ω_1 ← ∅
  for t = 1 to T − 1
    repeat
      ω_t ← ∅
      G ← {(y^i_t, y^j_s) : 1 ≤ s − t ≤ d̄ and y^i_t, y^j_s ∉ ∪_{τ∈ω_1} τ}
      foreach (y^i_t, y^j_s) in G
        τ_t ← estimate an initial state from (y^i_t, y^j_s)
        τ_t(t_1) ← y^i_t and τ_t(t_2) ← y^j_s
        r ← s
        while r < T
          for d = 1 to d̄
            B ← {y ∈ L_d(τ_t(r)) : y ∉ ∪_{τ∈ω_1} τ}
            if B ≠ ∅
              τ_t(r + d) ← arg min_{y∈B} ‖y − x̄_{r+d}(τ_t)‖
              break
            end
          end
          r ← r + d
        end
        if p(τ_t|Y) ≥ σ(|τ_t|)
          ω_t ← ω_t ∪ {τ_t}
        end
      end
      ω_1 ← ω_1 ∪ {arg max_{τ∈ω_t} p(τ|Y)}
    until ω_t = ∅
  end
  ω_0 ← {Y \ (∪_{τ∈ω_1} τ)}
IV. SIMULATION RESULTS
For the simulations we consider the surveillance over a
rectangular region on a plane, R = [0, L]×[0, L] R
2
. The
state vector is x = [x, y, ˙x, ˙y]
T
where (x, y) is a position
on R along the usual x and y axes and ( ˙x, ˙y) is a velocity
vector. The linear system model (1) is used where δ is an
interval between observations and
A(δ) =
1 0 δ 0
0 1 0 δ
0 0 1 0
0 0 0 1
G(δ) =
δ
2
2
0
0
δ
2
2
δ 0
0 δ
C =
1 0
0 1
0 0
0 0
T
The covariance matrices are Q = diag(100, 100) and R =
diag(25, 25).
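These are the standard constant-velocity (white-noise acceleration) matrices. A sketch that builds them for a given sampling interval δ and simulates one noisy step of model (1):

```python
import numpy as np

def system_matrices(delta):
    """A(delta), G(delta), C for the planar constant-velocity model."""
    A = np.array([[1, 0, delta, 0],
                  [0, 1, 0, delta],
                  [0, 0, 1, 0],
                  [0, 0, 0, 1]], dtype=float)
    G = np.array([[delta**2 / 2, 0],
                  [0, delta**2 / 2],
                  [delta, 0],
                  [0, delta]], dtype=float)
    C = np.array([[1, 0, 0, 0],
                  [0, 1, 0, 0]], dtype=float)
    return A, G, C

def step(x, delta, Q, R, rng):
    """Propagate the state one interval and generate a position observation."""
    A, G, C = system_matrices(delta)
    w = rng.multivariate_normal(np.zeros(2), Q)   # process noise (acceleration)
    v = rng.multivariate_normal(np.zeros(2), R)   # measurement noise
    x_next = A @ x + G @ w
    y = C @ x_next + v
    return x_next, y
```

Chaining `step` over the scan times of a target, with the detection and false alarm processes layered on top, reproduces the observation sets used as input to the trackers compared below.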
The complexity of multiple target tracking problems can be measured by several metrics: (1) the intensity of the false alarm rate λ_f; (2) the detection probability p_d; (3) the number of tracks K; and (4) the density of tracks.
