Journal ArticleDOI
Microsoft Kinect Sensor and Its Effect
TLDR
While the Kinect sensor incorporates several advanced sensing hardware, this article focuses on the vision aspect of the sensor and its impact beyond the gaming industry.Abstract:Â
Recent advances in 3D depth cameras such as Microsoft Kinect sensors (www.xbox.com/en-US/kinect) have created many opportunities for multimedia computing. The Kinect sensor lets the computer directly sense the third dimension (depth) of the players and the environment. It also understands when users talk, knows who they are when they walk up to it, and can interpret their movements and translate them into a format that developers can use to build new experiences. While the Kinect sensor incorporates several advanced sensing hardware, this article focuses on the vision aspect of the Kinect sensor and its impact beyond the gaming industry.read more
Citations
More filters
Journal ArticleDOI
Enhanced Computer Vision With Microsoft Kinect Sensor: A Review
TL;DR: A comprehensive review of recent Kinect-based computer vision algorithms and applications covering topics including preprocessing, object tracking and recognition, human activity analysis, hand gesture analysis, and indoor 3-D mapping.
Posted Content
NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis
TL;DR: In this paper, a large-scale dataset for RGB+D human action recognition was introduced with more than 56 thousand video samples and 4 million frames, collected from 40 distinct subjects.
Proceedings ArticleDOI
NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis
TL;DR: A large-scale dataset for RGB+D human action recognition with more than 56 thousand video samples and 4 million frames, collected from 40 distinct subjects is introduced and a new recurrent neural network structure is proposed to model the long-term temporal correlation of the features for each body part, and utilize them for better action classification.
Journal ArticleDOI
NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding
TL;DR: This work introduces a large-scale dataset for RGB+D human action recognition, which is collected from 106 distinct subjects and contains more than 114 thousand video samples and 8 million frames, and investigates a novel one-shot 3D activity recognition problem on this dataset.
Journal ArticleDOI
Enhanced skeleton visualization for view invariant human action recognition
Mengyuan Liu,Hong Liu,Chen Chen +2 more
TL;DR: Enhanced skeleton visualization method encodes spatio-temporal skeletons as visual and motion enhanced color images in a compact yet distinctive manner and consistently achieves the highest accuracies on four datasets, including the largest and most challenging NTU RGB+D dataset for skeleton-based action recognition.
References
More filters
Journal ArticleDOI
A flexible new technique for camera calibration
TL;DR: A flexible technique to easily calibrate a camera that only requires the camera to observe a planar pattern shown at a few (at least two) different orientations is proposed and advances 3D computer vision one more step from laboratory environments to real world use.
Proceedings ArticleDOI
Real-time human pose recognition in parts from single depth images
Jamie Shotton,Andrew Fitzgibbon,Mat Cook,Toby Sharp,Mark J. Finocchio,Richard E. Moore,Alex Aben-Athar Kipman,Andrew Blake +7 more
TL;DR: This work takes an object recognition approach, designing an intermediate body parts representation that maps the difficult pose estimation problem into a simpler per-pixel classification problem, and generates confidence-scored 3D proposals of several body joints by reprojecting the classification result and finding local modes.
Proceedings ArticleDOI
Action recognition based on a bag of 3D points
TL;DR: An action graph is employed to model explicitly the dynamics of the actions and a bag of 3D points to characterize a set of salient postures that correspond to the nodes in the action graph to recognize human actions from sequences of depth maps.
Proceedings ArticleDOI
Robust hand gesture recognition based on finger-earth mover's distance with a commodity depth camera
TL;DR: A novel distance metric for hand dissimilarity measure, called Finger-Earth Mover's Distance (FEMD), which only matches fingers while not the whole hand shape, can better distinguish hand gestures of slight differences.
Proceedings ArticleDOI
Encumbrance-free telepresence system with real-time 3D capture and display using commodity depth cameras
Andrew Maimone,Henry Fuchs +1 more
TL;DR: A proof-of-concept telepresence system that offers fully dynamic, real-time 3D scene capture and continuous-viewpoint, head-tracked stereo 3D display without requiring the user to wear any tracking or viewing apparatus is introduced.