Multimodal Deep Autoencoder for Human Pose Recovery

doi:10.1109/TIP.2015.2487860

Journal Article

Chaoqun Hong, +4 more

- 07 Oct 2015 -

IEEE Transactions on Image Processing

- Vol. 24, Iss: 12, pp 5659-5670

TLDR

A novel pose recovery method that learns a non-linear mapping from 2D images to 3D poses with a multi-layered deep neural network trained by back-propagation, using a unified feature description obtained by standard eigen-decomposition of the hypergraph Laplacian matrix.

Abstract:

Video-based human pose recovery is usually conducted by retrieving relevant poses using image features. In the retrieval process, most traditional methods assume that the mapping between 2D images and 3D poses is linear. However, the relationship is inherently non-linear, which limits the recovery performance of these methods. In this paper, we propose a novel pose recovery method using a non-linear mapping with a multi-layered deep neural network. It is based on feature extraction with multimodal fusion and back-propagation deep learning. In multimodal fusion, we construct a hypergraph Laplacian with low-rank representation. In this way, we obtain a unified feature description by standard eigen-decomposition of the hypergraph Laplacian matrix. In back-propagation deep learning, we learn a non-linear mapping from 2D images to 3D poses with parameter fine-tuning. The experimental results on three data sets show that the recovery error is reduced by 20%–25%, which demonstrates the effectiveness of the proposed method.
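The eigen-decomposition step mentioned in the abstract can be illustrated with a minimal sketch. It assumes the commonly used normalized hypergraph Laplacian, L = I - Dv^(-1/2) H W De^(-1) H^T Dv^(-1/2), and a toy incidence matrix; the paper's own construction (with low-rank representation across several image features) is not reproduced here, and the names `hypergraph_laplacian`, `spectral_embedding`, and `H_toy` are hypothetical.

```python
import numpy as np


def hypergraph_laplacian(H, w=None):
    """Normalized hypergraph Laplacian (assumed form, not taken from the paper).

    H is an (n_vertices, n_edges) 0/1 incidence matrix in which every vertex
    belongs to at least one hyperedge; w holds optional hyperedge weights.
    """
    H = np.asarray(H, dtype=float)
    n_vertices, n_edges = H.shape
    w = np.ones(n_edges) if w is None else np.asarray(w, dtype=float)
    dv = H @ w                      # weighted vertex degrees
    de = H.sum(axis=0)              # hyperedge degrees (vertices per edge)
    dv_inv_sqrt = 1.0 / np.sqrt(dv)
    # Theta = Dv^(-1/2) H W De^(-1) H^T Dv^(-1/2)
    theta = (dv_inv_sqrt[:, None] * H) @ np.diag(w / de) @ (H.T * dv_inv_sqrt[None, :])
    return np.eye(n_vertices) - theta


def spectral_embedding(L, dim):
    """Return the `dim` smallest eigenvalues of L and their eigenvectors."""
    vals, vecs = np.linalg.eigh(L)  # eigh returns eigenvalues in ascending order
    return vals[:dim], vecs[:, :dim]


# Toy example: 5 samples (vertices) grouped by 3 hyperedges.
H_toy = np.array([
    [1, 0, 0],
    [1, 1, 0],
    [0, 1, 0],
    [0, 1, 1],
    [0, 0, 1],
])
L_toy = hypergraph_laplacian(H_toy)
eigvals, embedding = spectral_embedding(L_toy, dim=2)
```

Each row of `embedding` is then a low-dimensional description of one sample; in a pipeline like the one the abstract outlines, such a description would serve as the input to the learned non-linear mapping.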

Citations

Journal Article

A survey of deep neural network architectures and their applications

Weibo Liu, +6 more

- 19 Apr 2017 -

Neurocomputing

TL;DR: This work was supported in part by the Royal Society of the UK, the National Natural Science Foundation of China, and the Alexander von Humboldt Foundation of Germany.

Journal Article

A review on neural networks with random weights

Weipeng Cao, +3 more

- 31 Jan 2018 -

Neurocomputing

TL;DR: This paper objectively reviews the advantages and disadvantages of the NNRW model, tries to reveal the essence of NNRW, and provides some useful guidelines for users to choose a mechanism to train a feed-forward neural network.

Journal Article

1-D CNNs for structural damage detection: Verification on a structural health monitoring benchmark data

Osama Abdeljaber, +6 more

- 31 Jan 2018 -

Neurocomputing

TL;DR: This paper presents an enhanced CNN-based approach that requires only two measurement sets regardless of the size of the structure and successfully estimated the actual amount of damage for the nine damage scenarios of the benchmark study.

Journal Article

3D Human pose estimation

Nikolaos Sarafianos, +3 more

- 01 Nov 2016 -

Computer Vision and Image Understanding

TL;DR: An extensive experimental evaluation of state-of-the-art approaches in a synthetic dataset created specifically for 3D human pose estimation, which along with its ground truth is made publicly available for research purposes.

Journal Article

Multi-view low-rank sparse subspace clustering

Maria Brbic, +1 more

- 01 Jan 2018 -

Pattern Recognition

TL;DR: An approach to multi-view subspace clustering that learns a joint subspace representation by constructing affinity matrix shared among all views is presented, relying on the importance of both low-rank and sparsity constraints in the construction of the affinity matrix.

References

Journal Article

Dropout: a simple way to prevent neural networks from overfitting

Nitish Srivastava, +4 more

- 01 Jan 2014 -

Journal of Machine Learning Research

TL;DR: It is shown that dropout improves the performance of neural networks on supervised learning tasks in vision, speech recognition, document classification and computational biology, obtaining state-of-the-art results on many benchmark data sets.

Proceedings Article

Histograms of oriented gradients for human detection

Navneet Dalal, +1 more

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.

Book

Learning Deep Architectures for AI

Yoshua Bengio

TL;DR: The motivations and principles regarding learning algorithms for deep architectures are discussed, in particular those exploiting as building blocks unsupervised learning of single-layer models such as Restricted Boltzmann Machines, used to construct deeper models such as Deep Belief Networks.

Proceedings Article

Extracting and composing robust features with denoising autoencoders

Pascal Vincent, +3 more

TL;DR: This work introduces and motivates a new training principle for unsupervised learning of a representation based on the idea of making the learned representations robust to partial corruption of the input pattern.

Journal Article

Shape matching and object recognition using shape contexts

Serge Belongie, +2 more

- 01 Apr 2002 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This paper presents work on computing shape models that are computationally fast and invariant to basic transformations like translation, scaling and rotation, and proposes shape detection using a feature called shape context, which is descriptive of the shape of the object.
