Proceedings ArticleDOI

Filling in scenes by propagating probabilities through layers and into appearance models

Brendan J. Frey
Vol. 1, pp. 185–192
TLDR
A Bayesian network is constructed that describes the occlusion process and iterative probability propagation is used to approximately recover the identities and positions of the objects in the scene in time that is linear in K and L.
Abstract
Inferring the identities and positions of multiple occluding objects in a noisy image is a difficult problem, even when the shapes and appearances of the allowable objects are known. Methods that detect and analyze shape features, occlusion boundaries and optical flow break down when the image is noisy. In situations where we know the boundaries and appearances of the allowable objects, a brute force method can be used to perform MAP inference. If there are K possible objects (including translations, etc.) in up to L layers, the number of possible configurations of the scene is K^L, so exact inference is intractable for large numbers of objects and reasonably large numbers of layers. We construct a Bayesian network that describes the occlusion process and we use iterative probability propagation to approximately recover the identities and positions of the objects in the scene in time that is linear in K and L. Although iterative probability propagation is an approximate inference technique, it was recently used to obtain the world record in error-correcting decoding. Experiments show that when one explanation of the scene is most probable, the algorithm finds the solution. For a small problem, we show that as the number of iterations increases, iterative probability propagation performs better than a greedy technique and becomes closer to the exact MAP algorithm. Quite surprisingly, we also find that when the order of occlusion is ambiguous, the output of the algorithm may oscillate between plausible interpretations of the scene.
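The K^L blow-up and the greedy baseline mentioned in the abstract can be made concrete with a small sketch. This is illustrative code, not the paper's: the toy 1-D objects, the names `OBJECTS`, `compose`, and `greedy_map`, and the Gaussian noise model are all assumptions chosen for the example; only the occlusion-by-layering idea, the exact enumeration of K^L configurations, and the cheaper greedy sweep come from the abstract.

```python
# Illustrative sketch (not the paper's code) of brute-force MAP inference
# over layered occlusion configurations: with K candidate objects and L
# layers, exact inference enumerates all K**L scenes.
import itertools

# Each toy object: per-pixel appearance on a 1-D "image"; None = transparent.
OBJECTS = [
    [1.0, 1.0, None, None],   # object 0 covers the left half
    [None, None, 2.0, 2.0],   # object 1 covers the right half
    [None, 3.0, 3.0, None],   # object 2 covers the middle
]
K, L, W = len(OBJECTS), 3, 4  # K objects, L layers, W pixels

def compose(config):
    """Render a scene: config[0] is the front layer; at each pixel the
    nearest opaque layer wins (the occlusion process)."""
    img = []
    for p in range(W):
        val = 0.0                      # background
        for obj_id in config:
            v = OBJECTS[obj_id][p]
            if v is not None:
                val = v
                break
        img.append(val)
    return img

def log_likelihood(config, obs, sigma=0.5):
    """Log-probability of the observed image under isotropic Gaussian
    pixel noise, up to an additive constant."""
    img = compose(config)
    return -sum((a - b) ** 2 for a, b in zip(img, obs)) / (2 * sigma ** 2)

obs = [1.0, 3.1, 2.9, 2.0]  # noisy image: object 2 in front of 0 and 1

# Exact MAP: enumerate all K**L configurations (27 here; astronomical
# for realistic K and L).
best = max(itertools.product(range(K), repeat=L),
           key=lambda c: log_likelihood(c, obs))

# A greedy alternative (the kind of baseline the abstract compares
# against): coordinate ascent over layers, costing K * L likelihood
# evaluations per sweep instead of K**L in total.
def greedy_map(init, sweeps=10):
    config = list(init)
    for _ in range(sweeps):
        changed = False
        for layer in range(L):
            k_best = max(range(K),
                         key=lambda k: log_likelihood(
                             config[:layer] + [k] + config[layer + 1:], obs))
            if k_best != config[layer]:
                config[layer], changed = k_best, True
        if not changed:
            break
    return config
```

On this toy scene both `best` and `greedy_map([0, 0, 0])` render the same image. Note that configurations (2, 0, 1) and (2, 1, 0) render identically, so the MAP solution is not unique in occlusion order; this is the kind of ambiguity under which the abstract reports the propagation algorithm oscillating between plausible interpretations. The paper's actual contribution replaces the K**L enumeration with iterative probability propagation in a Bayesian network of the occlusion process, running in time linear in K and L.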


Citations
Journal ArticleDOI

Learning Low-Level Vision

TL;DR: A learning-based method for low-level vision problems: estimating scenes from images with Bayesian belief propagation, applied to the "super-resolution" problem (estimating high-frequency details from a low-resolution image) with good results.
Journal ArticleDOI

Robust online appearance models for visual tracking

TL;DR: A framework for learning robust, adaptive appearance models for motion-based tracking of natural objects, providing robustness in the face of image outliers while adapting to natural changes in appearance such as those due to facial expressions or variations in 3D pose.
Journal ArticleDOI

Computer vision and pattern recognition

TL;DR: This Special Issue of the International Journal of Computer Mathematics (IJCM) offers a venue for innovative approaches in computer vision and pattern recognition, fields that have changed everyday life dramatically over the last few years, and aims to provide readers with cutting-edge, topical information for their related research.
Proceedings ArticleDOI

Learning low-level vision

TL;DR: This work shows a learning-based method for low-level vision problems: estimating scenes from images with a Markov network. It applies VISTA to the "super-resolution" problem (estimating high-frequency details from a low-resolution image), showing good results.
Proceedings ArticleDOI

Robust online appearance models for visual tracking

TL;DR: A framework for learning robust, adaptive appearance models for motion-based tracking of natural objects, providing the ability to adapt to natural changes in appearance such as those due to facial expressions or variations in 3D pose.
References
Journal ArticleDOI

Good error-correcting codes based on very sparse matrices

TL;DR: It is proved that sequences of codes exist which, when optimally decoded, achieve information rates up to the Shannon limit, and experimental results for binary-symmetric channels and Gaussian channels demonstrate that practical performance substantially better than that of standard convolutional and concatenated codes can be achieved.
Journal ArticleDOI

Near optimum error correcting coding and decoding: turbo-codes

TL;DR: A new family of convolutional codes, nicknamed turbo-codes, built from a particular concatenation of two recursive systematic codes linked together by nonuniform interleaving; their performance appears to be close to the theoretical limit predicted by Shannon.
Book ChapterDOI

A view of the EM algorithm that justifies incremental, sparse, and other variants

TL;DR: In this paper, an incremental variant of the EM algorithm is proposed, in which the distribution for only one of the unobserved variables is recalculated in each E step, which is shown empirically to give faster convergence in a mixture estimation problem.

Good error-correcting codes based on very sparse matrices (vol 45, pg 339, 1999)

D. J. C. MacKay
TL;DR: It can be proved that, given an optimal decoder, Gallager's low density parity check codes asymptotically approach the Shannon limit.