scispace - formally typeset
Open AccessJournal ArticleDOI

A Novel Reconstruction and Tracking of 3D-Articulated Human Body from 2D Point Correspondences of a Monocular Image Sequence

TLDR
This work introduced a technique based on the concept of multiple hypothesis tracking with a motion-smoothness function between consecutive frames to automatically find the optimal solution for this ill-posed problem of reconstruction from 2D to 3D.
Abstract
A novel method is proposed to estimate the 3D relative positions of an articulated body from point correspondences in an uncalibrated monocular image sequence. It is based on a camera perspective model. Unlike previous approaches, our proposed method does not require camera parameters or a manual specification of the 3D pose at the first frame, nor does it require the assumption that at least one predefined segment in every frame is parallel to the image plane. Our work assumes a simpler assumption, for example, the actor stands vertically parallel to the image plane and not all of his/her joints lie on a plane parallel to the image plane in the first frame. Input into our algorithm consists of a topological skeleton model and 2D position data on the joints of a human actor. By geometric constraint of body parts in the skeleton model, 3D relative coordinates of the model are obtained. This reconstruction from 2D to 3D is an ill-posed problem due to non-uniqueness of solutions. Therefore, we introduced a technique based on the concept of multiple hypothesis tracking (MHT) with a motion-smoothness function between consecutive frames to automatically find the optimal solution for this ill-posed problem. Since reconstruction configurations are obtained from our closed-form equation, our technique is very efficient. Very accurate results were attained for both synthesized and real-world image sequences. We also compared our technique with both scaled-orthographic and existing perspective approaches. Our proposed method outperformed other approaches, especially in scenes with strong perspective effects and difficult poses.

read more

Citations
More filters
Journal ArticleDOI

A Dynamic Portrait Segmentation by Merging Colors and Depth Information

TL;DR: The results show that the proposed method to improve the dynamic portrait segmentation in Kinect can completely segment out the portrait image as well as reduce its error rate significantly.
Journal ArticleDOI

Portrait Vision Fusion for Augmented Reality

TL;DR: A method using superimposing a segmented human portrait on a panoramic background is proposed, then the limb interactive element is added into these videos involved with a dynamic portrait segmentation method, and the users can verbally and physically communicate through these video interactive environments.
References
More filters
Journal ArticleDOI

An efficient implementation of Reid's multiple hypothesis tracking algorithm and its evaluation for the purpose of visual tracking

TL;DR: An efficient implementation of Reid's multiple hypothesis tracking (MHT) algorithm is presented in which the k-best hypotheses are determined in polynomial time using an algorithm due to Murly (1968).

Motion Capture Using Joint Skeleton Tracking and Surface Estimation

TL;DR: This paper proposes a method for capturing the performance of a human or an animal from a multi-view video sequence and proposes a novel optimization scheme for skeleton-based pose estimation that exploits the skeleton's tree structure to split the optimization problem into a local one and a lower dimensional global one.
Journal ArticleDOI

Reconstruction of Articulated Objects from Point Correspondences in a Single Uncalibrated Image

TL;DR: This paper investigates the problem of recovering information about the configuration of an articulated object, such as a human figure, from point correspondences in a single image by considering the foreshortening of the segments of the model in the image.
Proceedings ArticleDOI

Motion capture using joint skeleton tracking and surface estimation

TL;DR: This paper proposes a method for capturing the performance of a human or an animal from a multi-view video sequence and proposes a novel optimization scheme for skeleton-based pose estimation that exploits the skeleton's tree structure to split the optimization problem into a local one and a lower dimensional global one.
Journal ArticleDOI

Recovering 3D human body configurations using shape contexts

TL;DR: The problem is to take a single two-dimensional image containing a human figure, locate the joint positions, and use these to estimate the body configuration and pose in three-dimensional space.
Related Papers (5)