scispace - formally typeset
Proceedings ArticleDOI

Appearance management and cue fusion for 3D model-based tracking

TLDR
This paper presents a systematic approach to acquiring model appearance information online for monocular model-based tracking and shows that the presented algorithm is able to robustly track a wide variety of targets under challenging conditions.
Abstract
This paper presents a systematic approach to acquiring model appearance information online for monocular model-based tracking. The acquired information is used to drive a set of complementary imaging cues to obtain a highly discriminatory observation model. Appearance is modeled as a Markov random field of color distributions over the model surface. The online acquisition process estimates appearance-based on uncertain image measurements and is designed to greatly reduce the chance of mapping non-object image data onto the model. Confidences about the different appearance driven imaging cues are estimated in order to adaptively balance the contributions of the different cues. The discriminatory power of the resulting model is good enough to allow long-duration single-hypothesis model-based tracking with no prior appearance information. Careful evaluation based on real and semi-synthetic video sequences shows that the presented algorithm is able to robustly track a wide variety of targets under challenging conditions.

read more

Citations
More filters
Proceedings ArticleDOI

Activity Recognition using Visual Tracking and RFID

TL;DR: A framework that combines visual human motion tracking with RFID based object tracking is proposed that enables the accurate estimation of high-level interactions between people and objects for application domains such as retail, home-care, workplace-safety, manufacturing and others.
Book ChapterDOI

Automated Person Identification in Video

TL;DR: Progress in the automatic detection and identification of humans in video, given a minimal number of labelled faces as training data, is described.
Patent

Visually tracking an object in real world using 2D appearance and multicue depth estimations

TL;DR: In this paper, the 2D image information is used to track a 2D position of the object as well as its 2D size of the appearance and change in the appearance of the objects.
Proceedings ArticleDOI

Articulated models from video

TL;DR: A model-acquisition framework for acquiring articulated models directly from monocular video that has in particular the ability to process human as well as non-human targets and makes no assumptions with respect to the structure of the kinematic tree or complexity.
Patent

Method and system for multi-modal component-based tracking of an object using robust information fusion

TL;DR: In this article, a system and method for tracking an object is disclosed, where a video sequence including a plurality of image frames are received and a sample based representation of object appearance distribution is maintained.
References
More filters
Journal ArticleDOI

The Earth Mover's Distance as a Metric for Image Retrieval

TL;DR: This paper investigates the properties of a metric between two distributions, the Earth Mover's Distance (EMD), for content-based image retrieval, and compares the retrieval performance of the EMD with that of other distances.
Book

Markov Random Field Modeling in Computer Vision

TL;DR: This book presents a comprehensive study on the use of MRFs for solving computer vision problems, and covers the following parts essential to the subject: introduction to fundamental theories, formulations of MRF vision models, MRF parameter estimation, and optimization algorithms.
Proceedings ArticleDOI

Articulated body motion capture by annealed particle filtering

TL;DR: The principal contribution of the paper is the development of a modified particle filter for search in high dimensional configuration spaces that uses a continuation principle based on annealing to introduce the influence of narrow peaks in the fitness function, gradually.
Proceedings ArticleDOI

3-D model-based tracking of humans in action: a multi-view approach

TL;DR: A vision system for the 3-D model-based tracking of unconstrained human movement and initial tracking results from a large new Humans-in-Action database containing more than 2500 frames in each of four orthogonal views are presented.