Microsoft Kinect Sensor and Its Effect

doi:10.1109/MMUL.2012.24

Journal ArticleDOI

Microsoft Kinect Sensor and Its Effect

Zhengyou Zhang

- 01 Apr 2012 -

IEEE MultiMedia

- Vol. 19, Iss: 2, pp 4-10

TLDR

While the Kinect sensor incorporates several advanced sensing hardware, this article focuses on the vision aspect of the sensor and its impact beyond the gaming industry.

Abstract:

Recent advances in 3D depth cameras such as Microsoft Kinect sensors (www.xbox.com/en-US/kinect) have created many opportunities for multimedia computing. The Kinect sensor lets the computer directly sense the third dimension (depth) of the players and the environment. It also understands when users talk, knows who they are when they walk up to it, and can interpret their movements and translate them into a format that developers can use to build new experiences. While the Kinect sensor incorporates several advanced sensing hardware, this article focuses on the vision aspect of the Kinect sensor and its impact beyond the gaming industry.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Enhanced Computer Vision With Microsoft Kinect Sensor: A Review

Jungong Han, +3 more

- 25 Jun 2013 -

IEEE Transactions on Systems, Man, and C...

TL;DR: A comprehensive review of recent Kinect-based computer vision algorithms and applications covering topics including preprocessing, object tracking and recognition, human activity analysis, hand gesture analysis, and indoor 3-D mapping.

...read moreread less

Posted Content

NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis

Amir Shahroudy, +3 more

- 11 Apr 2016 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: In this paper, a large-scale dataset for RGB+D human action recognition was introduced with more than 56 thousand video samples and 4 million frames, collected from 40 distinct subjects.

...read moreread less

Proceedings ArticleDOI

NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis

Amir Shahroudy, +3 more

TL;DR: A large-scale dataset for RGB+D human action recognition with more than 56 thousand video samples and 4 million frames, collected from 40 distinct subjects is introduced and a new recurrent neural network structure is proposed to model the long-term temporal correlation of the features for each body part, and utilize them for better action classification.

...read moreread less

Journal ArticleDOI

NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding

Jun Liu, +5 more

- 01 Oct 2020 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This work introduces a large-scale dataset for RGB+D human action recognition, which is collected from 106 distinct subjects and contains more than 114 thousand video samples and 8 million frames, and investigates a novel one-shot 3D activity recognition problem on this dataset.

...read moreread less

Journal ArticleDOI

Enhanced skeleton visualization for view invariant human action recognition

Mengyuan Liu, +2 more

- 01 Aug 2017 -

Pattern Recognition

TL;DR: Enhanced skeleton visualization method encodes spatio-temporal skeletons as visual and motion enhanced color images in a compact yet distinctive manner and consistently achieves the highest accuracies on four datasets, including the largest and most challenging NTU RGB+D dataset for skeleton-based action recognition.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

A flexible new technique for camera calibration

ZhenQiu Zhang

- 01 Nov 2000 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A flexible technique to easily calibrate a camera that only requires the camera to observe a planar pattern shown at a few (at least two) different orientations is proposed and advances 3D computer vision one more step from laboratory environments to real world use.

...read moreread less

Proceedings ArticleDOI

Real-time human pose recognition in parts from single depth images

Jamie Shotton, +7 more

TL;DR: This work takes an object recognition approach, designing an intermediate body parts representation that maps the difficult pose estimation problem into a simpler per-pixel classification problem, and generates confidence-scored 3D proposals of several body joints by reprojecting the classification result and finding local modes.

...read moreread less

Proceedings ArticleDOI

Action recognition based on a bag of 3D points

Wanqing Li, +2 more

TL;DR: An action graph is employed to model explicitly the dynamics of the actions and a bag of 3D points to characterize a set of salient postures that correspond to the nodes in the action graph to recognize human actions from sequences of depth maps.

...read moreread less

Proceedings ArticleDOI

Robust hand gesture recognition based on finger-earth mover's distance with a commodity depth camera

Zhou Ren, +2 more

TL;DR: A novel distance metric for hand dissimilarity measure, called Finger-Earth Mover's Distance (FEMD), which only matches fingers while not the whole hand shape, can better distinguish hand gestures of slight differences.

...read moreread less

Proceedings ArticleDOI

Encumbrance-free telepresence system with real-time 3D capture and display using commodity depth cameras

Andrew Maimone, +1 more

TL;DR: A proof-of-concept telepresence system that offers fully dynamic, real-time 3D scene capture and continuous-viewpoint, head-tracked stereo 3D display without requiring the user to wear any tracking or viewing apparatus is introduced.

...read moreread less

Microsoft Kinect Sensor and Its Effect

Citations

Enhanced Computer Vision With Microsoft Kinect Sensor: A Review

NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis

NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis

NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding

Enhanced skeleton visualization for view invariant human action recognition

References

A flexible new technique for camera calibration

Real-time human pose recognition in parts from single depth images

Action recognition based on a bag of 3D points

Robust hand gesture recognition based on finger-earth mover's distance with a commodity depth camera

Encumbrance-free telepresence system with real-time 3D capture and display using commodity depth cameras

Related Papers (5)

Real-time human pose recognition in parts from single depth images

Deep Residual Learning for Image Recognition

Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields

KinectFusion: Real-time dense surface mapping and tracking

ImageNet Classification with Deep Convolutional Neural Networks