Proceedings ArticleDOI
Efficient Hand Pose Estimation from a Single Depth Image
TLDR
This work tackles the practical problem of hand pose estimation from a single noisy depth image, and proposes a dedicated three-step pipeline that is able to work with Kinect-type noisy depth images, and reliably produces pose estimations of general motions efficiently.Abstract:
We tackle the practical problem of hand pose estimation from a single noisy depth image. A dedicated three-step pipeline is proposed: Initial estimation step provides an initial estimation of the hand in-plane orientation and 3D location, Candidate generation step produces a set of 3D pose candidate from the Hough voting space with the help of the rotational invariant depth features, Verification step delivers the final 3D hand pose as the solution to an optimization problem. We analyze the depth noises, and suggest tips to minimize their negative impacts on the overall performance. Our approach is able to work with Kinect-type noisy depth images, and reliably produces pose estimations of general motions efficiently (12 frames per second). Extensive experiments are conducted to qualitatively and quantitatively evaluate the performance with respect to the state-of-the-art methods that have access to additional RGB images. Our approach is shown to deliver on par or even better results.read more
Citations
More filters
Proceedings ArticleDOI
Going deeper with convolutions
Christian Szegedy,Wei Liu,Yangqing Jia,Pierre Sermanet,Scott Reed,Dragomir Anguelov,Dumitru Erhan,Vincent Vanhoucke,Andrew Rabinovich +8 more
TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Proceedings ArticleDOI
Hand Keypoint Detection in Single Images Using Multiview Bootstrapping
TL;DR: In this paper, a multi-camera system is used to train fine-grained detectors for keypoints that are prone to occlusion, such as the joints of a hand.
Proceedings ArticleDOI
Realtime and Robust Hand Tracking from Depth
TL;DR: A hybrid method that combines gradient based and stochastic optimization methods to achieve fast convergence and good accuracy is proposed and presented, making it the first system that achieves such robustness, accuracy, and speed simultaneously.
Proceedings ArticleDOI
GANerated Hands for Real-Time 3D Hand Tracking from Monocular RGB
Franziska Mueller,Florian Bernard,Oleksandr Sotnychenko,Dushyant Mehta,Srinath Sridhar,Dan Casas,Christian Theobalt +6 more
TL;DR: This work proposes a novel approach for the synthetic generation of training data that is based on a geometrically consistent image-to-image translation network, and uses a neural network that translates synthetic images to "real" images, such that the so-generated images follow the same statistical distribution as real-world hand images.
Proceedings ArticleDOI
Accurate, Robust, and Flexible Real-time Hand Tracking
Toby Sharp,Cem Keskin,Duncan Robertson,Jonathan Taylor,Jamie Shotton,David Kim,Christoph Rhemann,Ido Leichter,Alon Vinnikov,Yichen Wei,Daniel Freedman,Pushmeet Kohli,Eyal Krupka,Andrew Fitzgibbon,Shahram Izadi +14 more
TL;DR: A new real-time hand tracking system based on a single depth camera that can accurately reconstruct complex hand poses across a variety of subjects and is highly flexible, dramatically improving upon previous approaches which have focused on front-facing close-range scenarios.
References
More filters
Journal ArticleDOI
Random Forests
TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
Journal ArticleDOI
Mean shift: a robust approach toward feature space analysis
Dorin Comaniciu,Peter Meer +1 more
TL;DR: It is proved the convergence of a recursive mean shift procedure to the nearest stationary point of the underlying density function and, thus, its utility in detecting the modes of the density.
Proceedings ArticleDOI
Real-time human pose recognition in parts from single depth images
Jamie Shotton,Andrew Fitzgibbon,Mat Cook,Toby Sharp,Mark J. Finocchio,Richard E. Moore,Alex Aben-Athar Kipman,Andrew Blake +7 more
TL;DR: This work takes an object recognition approach, designing an intermediate body parts representation that maps the difficult pose estimation problem into a simpler per-pixel classification problem, and generates confidence-scored 3D proposals of several body joints by reprojecting the classification result and finding local modes.
Journal ArticleDOI
Real-time human pose recognition in parts from single depth images
Jamie Shotton,Toby Sharp,Alex Aben-Athar Kipman,Andrew Fitzgibbon,Mark J. Finocchio,Andrew Blake,Mat Cook,Richard Moore +7 more
TL;DR: This work takes an object recognition approach, designing an intermediate body parts representation that maps the difficult pose estimation problem into a simpler per-pixel classification problem, and generates confidence-scored 3D proposals of several body joints by reprojecting the classification result and finding local modes.