M2Tracker: A Multi-view Approach to Segmenting and Tracking People in a Cluttered Scene Using Region-Based Stereo

doi:10.1007/3-540-47969-4_2

Book Chapter


Anurag Mittal, +1 more

- pp 18-36


TLDR

A system that segments, detects and tracks multiple people in a cluttered scene using multiple synchronized cameras located far from each other, together with a scheme that combines evidence gathered from different camera pairs using occlusion analysis to obtain globally optimal detection and tracking of objects.

Abstract:

We present a system that is capable of segmenting, detecting and tracking multiple people in a cluttered scene using multiple synchronized cameras located far from each other. The system improves upon existing systems in several ways: (1) We do not assume that a foreground connected component belongs to only one object; rather, we segment the views taking into account color models for the objects and the background. This helps us not only to separate foreground regions belonging to different objects, but also to obtain better background regions than traditional background subtraction methods, because the algorithm also uses foreground color models. (2) It is fully automatic and does not require any manual input or initialization of any kind. (3) Instead of making decisions about object detection and tracking from a single view or camera pair, we collect evidence from each pair and combine it to reach a final decision. This helps us obtain much better detection and tracking than traditional systems.

Several innovations help us tackle the problem. The first is a region-based stereo algorithm that can find 3D points inside an object if we know the regions belonging to the object in two views. No exact point matching is required. This is especially useful in wide-baseline camera systems, where exact point matching is very difficult due to self-occlusion and a substantial change in viewpoint. The second contribution is a scheme for setting priors for use in segmenting a view with Bayesian classification. The scheme, which assumes knowledge of the approximate shape and location of objects, dynamically assigns priors for different objects at each pixel so that occlusion information is encoded in the priors. The third contribution is a scheme for combining evidence gathered from different camera pairs using occlusion analysis so as to obtain a globally optimal detection and tracking of objects.

The system has been tested with different densities of people in the scene, which helps us determine the number of cameras required for a particular density of people.
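
The per-pixel Bayesian classification described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the one-dimensional Gaussian color models, the label names, and all numeric values below are assumptions chosen only for illustration. The sketch shows how a pixel whose color is equally likely under two people is assigned to whichever person the occlusion-aware prior favors at that pixel.

```python
import math


def gaussian_likelihood(value, mean, std):
    """Likelihood of a 1-D intensity under a Gaussian color model (assumed form)."""
    z = (value - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def classify_pixel(value, models, priors):
    """Classify one pixel with Bayes' rule: posterior is proportional to prior * likelihood."""
    scores = {
        label: priors[label] * gaussian_likelihood(value, mean, std)
        for label, (mean, std) in models.items()
    }
    total = sum(scores.values())
    posteriors = {label: score / total for label, score in scores.items()}
    best = max(posteriors, key=posteriors.get)
    return best, posteriors


# Hypothetical color models: label -> (mean intensity, standard deviation).
MODELS = {
    "background": (100.0, 20.0),
    "person_a": (150.0, 20.0),
    "person_b": (160.0, 20.0),
}

# Hypothetical priors at a single pixel. In a full system these would be
# derived from the approximate shape and location of each person, so that
# the person expected to be in front at this pixel gets the larger prior.
PRIORS_A_IN_FRONT = {"background": 0.1, "person_a": 0.8, "person_b": 0.1}
PRIORS_B_IN_FRONT = {"background": 0.1, "person_a": 0.1, "person_b": 0.8}

# Intensity 155 is equally likely under person_a and person_b,
# so the prior alone decides the label.
label_a, post_a = classify_pixel(155.0, MODELS, PRIORS_A_IN_FRONT)
label_b, post_b = classify_pixel(155.0, MODELS, PRIORS_B_IN_FRONT)
print(label_a, label_b)
```

In this toy setting the color evidence alone cannot separate the two people, which is the situation where spatially varying priors would be expected to matter most.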



