scispace - formally typeset
Open AccessProceedings ArticleDOI

DeepPrior++: Improving Fast and Accurate 3D Hand Pose Estimation

Markus Oberweger, +1 more
- pp 585-594
Reads0
Chats0
TLDR
Weger et al. as discussed by the authors proposed to add ResNet layers, data augmentation, and better initial hand localization, achieving better or similar performance than more sophisticated recent methods on the three main benchmarks.
Abstract
DeepPrior [18] is a simple approach based on Deep Learning that predicts the joint 3D locations of a hand given a depth map. Since its publication early 2015, it has been outperformed by several impressive works. Here we show that with simple improvements: adding ResNet layers, data augmentation, and better initial hand localization, we achieve better or similar performance than more sophisticated recent methods on the three main benchmarks (NYU, ICVL, MSRA) while keeping the simplicity of the original method. Our new implementation is available at https://github.com/moberweger/deep-prior-pp.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

V2V-PoseNet: Voxel-to-Voxel Prediction Network for Accurate 3D Hand and Human Pose Estimation from a Single Depth Map

TL;DR: This model is designed as a 3D CNN that provides accurate estimates while running in real-time and outperforms previous methods in almost all publicly available 3D hand and human pose estimation datasets and placed first in the HANDS 2017 frame-based3D hand pose estimation challenge.
Proceedings ArticleDOI

Cross-Modal Deep Variational Hand Pose Estimation

TL;DR: This work proposes a method to learn a statistical hand model represented by a cross-modal trained latent space via a generative deep neural network, which can be directly used to estimate 3D hand poses from RGB images, outperforming the state-of-the art in different settings.
Proceedings ArticleDOI

Hand PointNet: 3D Hand Pose Estimation Using Point Sets

TL;DR: The proposed Hand PointNet directly processes the 3D point cloud that models the visible surface of the hand for pose regression, and takes the normalized point cloud as the input to capture complex hand structures and accurately regress a low dimensional representation of the3D hand pose.
Proceedings ArticleDOI

Using a Single RGB Frame for Real Time 3D Hand Pose Estimation in the Wild

TL;DR: This work capitalize on the latest advancements of deep learning, combining them with the power of generative hand pose estimation techniques to achieve real-time monocular 3D hand Pose estimation in unrestricted scenarios.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.
Proceedings ArticleDOI

Going deeper with convolutions

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Related Papers (5)