Robust Part-Based Hand Gesture Recognition Using Kinect Sensor

doi:10.1109/TMM.2013.2246148

Citations

PDF

Open Access

More filters

Journal Article•DOI•

Survey on 3D Hand Gesture Recognition

[...]

Hong Cheng¹, Lu Yang¹, Zicheng Liu²•Institutions (2)

University of Electronic Science and Technology of China¹, Microsoft²

01 Sep 2016-IEEE Transactions on Circuits and Systems for Video Technology

TL;DR: This paper presents a survey of some recent works on hand gesture recognition using 3D depth sensors, and reviews the commercial depth sensors and public data sets that are widely used in this field.

...read moreread less

Abstract: Three-dimensional hand gesture recognition has attracted increasing research interests in computer vision, pattern recognition, and human-computer interaction. The emerging depth sensors greatly inspired various hand gesture recognition approaches and applications, which were severely limited in the 2D domain with conventional cameras. This paper presents a survey of some recent works on hand gesture recognition using 3D depth sensors. We first review the commercial depth sensors and public data sets that are widely used in this field. Then, we review the state-of-the-art research for 3D hand gesture recognition in four aspects: 1) 3D hand modeling; 2) static hand gesture recognition; 3) hand trajectory gesture recognition; and 4) continuous hand gesture recognition. While the emphasis is on 3D hand gesture recognition approaches, the related applications and typical systems are also briefly summarized for practitioners.

...read moreread less

291 citations

Cites methods from "Robust Part-Based Hand Gesture Reco..."

...After the hand detection and tracking, either stat ic hand gesture recognition [68]–[70] or hybrid hand gesture recognition [71, 72] can be applied....
[...]

Book Chapter•DOI•

Weakly-supervised 3D Hand Pose Estimation from Monocular RGB Images

[...]

Yujun Cai¹, Liuhao Ge¹, Jianfei Cai¹, Junsong Yuan²•Institutions (2)

Nanyang Technological University¹, University at Buffalo²

08 Sep 2018

TL;DR: A weakly-supervised method, adaptating from fully-annotated synthetic dataset toWeakly-labeled real-world dataset with the aid of a depth regularizer, which generates depth maps from predicted 3D pose and serves as weak supervision for3D pose regression.

...read moreread less

Abstract: Compared with depth-based 3D hand pose estimation, it is more challenging to infer 3D hand pose from monocular RGB images, due to substantial depth ambiguity and the difficulty of obtaining fully-annotated training data. Different from existing learning-based monocular RGB-input approaches that require accurate 3D annotations for training, we propose to leverage the depth images that can be easily obtained from commodity RGB-D cameras during training, while during testing we take only RGB inputs for 3D joint predictions. In this way, we alleviate the burden of the costly 3D annotations in real-world dataset. Particularly, we propose a weakly-supervised method, adaptating from fully-annotated synthetic dataset to weakly-labeled real-world dataset with the aid of a depth regularizer, which generates depth maps from predicted 3D pose and serves as weak supervision for 3D pose regression. Extensive experiments on benchmark datasets validate the effectiveness of the proposed depth regularizer in both weakly-supervised and fully-supervised settings.

...read moreread less

288 citations

Additional excerpts

...Inspired by the great improvement of CNN-based 3D hand pose estimation from depth images[24], deep learning has also been adopted in some recent works on monocular RGB-based applications [46, 18]....
[...]

Journal Article•DOI•

Superpixel-Based Hand Gesture Recognition With Kinect Depth Camera

[...]

Chong Wang¹, Zhong Liu¹, Shing-Chow Chan¹•Institutions (1)

University of Hong Kong¹

01 Jan 2015-IEEE Transactions on Multimedia

TL;DR: A novel distance metric, superpixel earth mover's distance (SP-EMD), is proposed to measure the dissimilarity between the hand gestures, which is robust to distortion and articulation, but also invariant to scaling, translation and rotation with proper preprocessing.

...read moreread less

Abstract: This paper presents a new superpixel-based hand gesture recognition system based on a novel superpixel earth mover's distance metric, together with Kinect depth camera The depth and skeleton information from Kinect are effectively utilized to produce markerless hand extraction The hand shapes, corresponding textures and depths are represented in the form of superpixels, which effectively retain the overall shapes and color of the gestures to be recognized Based on this representation, a novel distance metric, superpixel earth mover's distance (SP-EMD), is proposed to measure the dissimilarity between the hand gestures This measurement is not only robust to distortion and articulation, but also invariant to scaling, translation and rotation with proper preprocessing The effectiveness of the proposed distance metric and recognition algorithm are illustrated by extensive experiments with our own gesture dataset as well as two other public datasets Simulation results show that the proposed system is able to achieve high mean accuracy and fast recognition speed Its superiority is further demonstrated by comparisons with other conventional techniques and two real-life applications

...read moreread less

271 citations

Cites background or methods from "Robust Part-Based Hand Gesture Reco..."

...We now evaluate and compare the proposed hand gesture recognition system with various state-of-the-art recognition algorithms including Shape Context [27], Skeleton Matching [26], FEMD [12], Random Forest (RF) [32], HOG [24] and H3DF [23], using three different real world datasets, namely our joint color-depth hand gesture dataset, NTU hand digit dataset [12] and American Sign Language (ASL) finger spelling dataset [32]....
[...]
...Instead of representing the hand shape in contour [12], [27] or skeleton [26], we propose to use superpixels to simplify the hand shape but retain as much information as possible....
[...]
...Comparisons With Other Methods: To further illustrate the advantage of our system, we first compare it with other three state-of-the-art recognition algorithms, Shape Context [27], Skeleton Matching [26] and FEMD [12] on our dataset....
[...]
...Comparing with previous distance measures such as FEMD, shape context distance and path similarity, the proposed SP-EMD metric achieves better performance for hand gesture recognition....
[...]
...It is worth noting that FEMD is particularly designed for depth-camera based hand gesture recognition....
[...]

Proceedings Article•DOI•

Robust 3D Hand Pose Estimation in Single Depth Images: From Single-View CNN to Multi-View CNNs

[...]

Liuhao Ge¹, Hui Liang¹, Junsong Yuan¹, Daniel Thalmann¹•Institutions (1)

Nanyang Technological University¹

27 Jun 2016

TL;DR: This work proposes to first project the query depth image onto three orthogonal planes and utilize these multi-view projections to regress for 2D heat-maps which estimate the joint positions on each plane to produce final 3D hand pose estimation with learned pose priors.

...read moreread less

Abstract: Articulated hand pose estimation plays an important role in human-computer interaction. Despite the recent progress, the accuracy of existing methods is still not satisfactory, partially due to the difficulty of embedded highdimensional and non-linear regression problem. Different from the existing discriminative methods that regress for the hand pose with a single depth image, we propose to first project the query depth image onto three orthogonal planes and utilize these multi-view projections to regress for 2D heat-maps which estimate the joint positions on each plane. These multi-view heat-maps are then fused to produce final 3D hand pose estimation with learned pose priors. Experiments show that the proposed method largely outperforms state-of-the-art on a challenging dataset. Moreover, a cross-dataset experiment also demonstrates the good generalization ability of the proposed method.

...read moreread less

266 citations

Cites methods from "Robust Part-Based Hand Gesture Reco..."

...Index Terms—3D hand pose estimation, Convolutional neural networks, Multi-view CNNs F...
[...]

Journal Article•DOI•

A Survey of Applications and Human Motion Recognition with Microsoft Kinect

[...]

Roanna Lun¹, Wenbing Zhao¹•Institutions (1)

Cleveland State University¹

09 Jul 2015-International Journal of Pattern Recognition and Artificial Intelligence

TL;DR: A comprehensive survey on Kinect applications, and the latest research and development on motion recognition using data captured by the Kinect sensor, and a classification of motion recognition techniques to highlight the different approaches used in human motion recognition.

...read moreread less

Abstract: Microsoft Kinect, a low-cost motion sensing device, enables users to interact with computers or game consoles naturally through gestures and spoken commands without any other peripheral equipment. As such, it has commanded intense interests in research and development on the Kinect technology. In this paper, we present, a comprehensive survey on Kinect applications, and the latest research and development on motion recognition using data captured by the Kinect sensor. On the applications front, we review the applications of the Kinect technology in a variety of areas, including healthcare, education and performing arts, robotics, sign language recognition, retail services, workplace safety training, as well as 3D reconstructions. On the technology front, we provide an overview of the main features of both versions of the Kinect sensor together with the depth sensing technologies used, and review literatures on human motion recognition techniques used in Kinect applications. We provide a classification of motion recognition techniques to highlight the different approaches used in human motion recognition. Furthermore, we compile a list of publicly available Kinect datasets. These datasets are valuable resources for researchers to investigate better methods for human motion recognition and lower-level computer vision tasks such as segmentation, object detection and human pose estimation.

...read moreread less

261 citations

Collapse

Robust Part-Based Hand Gesture Recognition Using Kinect Sensor

Citations

Cites methods from "Robust Part-Based Hand Gesture Reco..."

Additional excerpts

Cites background or methods from "Robust Part-Based Hand Gesture Reco..."

Cites methods from "Robust Part-Based Hand Gesture Reco..."

References

"Robust Part-Based Hand Gesture Reco..." refers background or methods in this paper

"Robust Part-Based Hand Gesture Reco..." refers background in this paper

"Robust Part-Based Hand Gesture Reco..." refers background in this paper

Related Papers (5)