scispace - formally typeset
Journal ArticleDOI

Multimodal Deep Autoencoder for Human Pose Recovery

TLDR
A novel pose recovery method using non-linear mapping with multi-layered deep neural network and back-propagation deep learning to obtain a unified feature description by standard eigen-decomposition of the hypergraph Laplacian matrix.
Abstract
Video-based human pose recovery is usually conducted by retrieving relevant poses using image features. In the retrieving process, the mapping between 2D images and 3D poses is assumed to be linear in most of the traditional methods. However, their relationships are inherently non-linear, which limits recovery performance of these methods. In this paper, we propose a novel pose recovery method using non-linear mapping with multi-layered deep neural network. It is based on feature extraction with multimodal fusion and back-propagation deep learning. In multimodal fusion, we construct hypergraph Laplacian with low-rank representation. In this way, we obtain a unified feature description by standard eigen-decomposition of the hypergraph Laplacian matrix. In back-propagation deep learning, we learn a non-linear mapping from 2D images to 3D poses with parameter fine-tuning. The experimental results on three data sets show that the recovery error has been reduced by 20%–25%, which demonstrates the effectiveness of the proposed method.

read more

Citations
More filters
Journal ArticleDOI

A survey of deep neural network architectures and their applications

TL;DR: This work was supported in part by the Royal Society of the UK, the National Natural Science Foundation of China, and the Alexander von Humboldt Foundation of Germany.
Journal ArticleDOI

A review on neural networks with random weights

TL;DR: This paper objectively reviews the advantages and disadvantages of N NRW model, tries to reveal the essence of NNRW, and provides some useful guidelines for users to choose a mechanism to train a feed-forward neural network.
Journal ArticleDOI

1-D CNNs for structural damage detection: Verification on a structural health monitoring benchmark data

TL;DR: This paper presents an enhanced CNN-based approach that requires only two measurement sets regardless of the size of the structure and successfully estimated the actual amount of damage for the nine damage scenarios of the benchmark study.
Journal ArticleDOI

3D Human pose estimation

TL;DR: An extensive experimental evaluation of state-of-the-art approaches in a synthetic dataset created specifically for 3D human pose estimation, which along with its ground truth is made publicly available for research purposes.
Journal ArticleDOI

Multi-view low-rank sparse subspace clustering

Maria Brbic, +1 more
- 01 Jan 2018 - 
TL;DR: An approach to multi-view subspace clustering that learns a joint subspace representation by constructing affinity matrix shared among all views is presented, relying on the importance of both low-rank and sparsity constraints in the construction of the affinity matrix.
References
More filters
Journal Article

Dropout: a simple way to prevent neural networks from overfitting

TL;DR: It is shown that dropout improves the performance of neural networks on supervised learning tasks in vision, speech recognition, document classification and computational biology, obtaining state-of-the-art results on many benchmark data sets.
Proceedings ArticleDOI

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Book

Learning Deep Architectures for AI

TL;DR: The motivations and principles regarding learning algorithms for deep architectures, in particular those exploiting as building blocks unsupervised learning of single-layer modelssuch as Restricted Boltzmann Machines, used to construct deeper models such as Deep Belief Networks are discussed.
Proceedings ArticleDOI

Extracting and composing robust features with denoising autoencoders

TL;DR: This work introduces and motivate a new training principle for unsupervised learning of a representation based on the idea of making the learned representations robust to partial corruption of the input pattern.
Journal ArticleDOI

Shape matching and object recognition using shape contexts

TL;DR: This paper presents work on computing shape models that are computationally fast and invariant basic transformations like translation, scaling and rotation, and proposes shape detection using a feature called shape context, which is descriptive of the shape of the object.