scispace - formally typeset
Search or ask a question
Author

Yue Ming

Bio: Yue Ming is an academic researcher from Beijing University of Posts and Telecommunications. The author has contributed to research in topics: Facial recognition system & Feature extraction. The author has an hindex of 15, co-authored 70 publications receiving 575 citations. Previous affiliations of Yue Ming include Tencent & Beijing Jiaotong University.


Papers
More filters
Journal ArticleDOI
TL;DR: Recently, a large body of deep learning methods have been proposed and has shown great promise in handling the traditional ill-posed problem of depth estimation as discussed by the authors, which is of great significance for many applications such as augmented reality, target tracking and autonomous driving.

94 citations

Proceedings ArticleDOI
21 Jul 2015
TL;DR: This paper proposes an effective deep learning framework by stacking multiple output features that learned through each stage of the Convolutional Neural Network (CNN), which is name as Stacked PCA Network (SPCANet).
Abstract: High-level features can represent the semantics of the original data and it is a plausible way to avoid the problem of hand-crafted features for face recognition. This paper proposes an effective deep learning framework by stacking multiple output features that learned through each stage of the Convolutional Neural Network (CNN). Different from the traditional deep learning network, we use Principal Component Analysis (PCA) to get the filter kernels of convolutional layer, which is name as Stacked PCA Network (SPCANet). Our SPCANet model follows the basic architecture of the CNN, which comprises three layers in each stage: convolutional filter layer, nonlinear processing layer and feature pooling layer. Firstly, in the convolutional filter layer of our model, PCA instead of stochastic gradient descent (SGD) is employed to learn filter kernels, and the output of all cascaded convolutional filter layers is used as the input of nonlinear processing layer. Secondly, the following nonlinear processing layer is also simplified. We use hashing method for nonlinear processing. Thirdly, the block based histograms instead of max-pooling technique are employed in the feature pooling layer. In the last output layer, the output of each stage is stacked together as one final feature output of our model. Extensive ex- periments conducted on many different face recognition scenarios demonstrate the effectiveness of our proposed approach.

42 citations

Journal ArticleDOI
TL;DR: A robust regional bounding spherical descriptor (RBSR) is proposed to facilitate 3D face recognition and emotion analysis and three largest available databases, FRGC v2, CASIA and BU-3DFE, are contributed to the performance comparison.

42 citations

Proceedings ArticleDOI
03 Dec 2010
TL;DR: The facial feature vectors with the information of slopes and the angles as the feature vectors got from the facial feature points, not only the distance information mentioned in the previous work are added.
Abstract: This paper describes a 3D facial expression recognition approach based on distance and angle features, which can be got from the localized facial feature points. The probabilistic Neutral Network (PNN) architecture is used to classify the facial expressions based on BU-3DFE database. This paper adds the facial feature vectors with the information of slopes and the angles as the feature vectors got from the facial feature points, not only the distance information mentioned in the previous work. Thus it receives a better performance with an average recognition rate of 90.2%.

36 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: This survey presents a brief survey on the advances that have occurred in the area of Deep Learning (DL), starting with the Deep Neural Network and goes on to cover Convolutional Neural Network, Recurrent Neural Network (RNN), and Deep Reinforcement Learning (DRL).
Abstract: In recent years, deep learning has garnered tremendous success in a variety of application domains. This new field of machine learning has been growing rapidly and has been applied to most traditional application domains, as well as some new areas that present more opportunities. Different methods have been proposed based on different categories of learning, including supervised, semi-supervised, and un-supervised learning. Experimental results show state-of-the-art performance using deep learning when compared to traditional machine learning approaches in the fields of image processing, computer vision, speech recognition, machine translation, art, medical imaging, medical information processing, robotics and control, bioinformatics, natural language processing, cybersecurity, and many others. This survey presents a brief survey on the advances that have occurred in the area of Deep Learning (DL), starting with the Deep Neural Network (DNN). The survey goes on to cover Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), including Long Short-Term Memory (LSTM) and Gated Recurrent Units (GRU), Auto-Encoder (AE), Deep Belief Network (DBN), Generative Adversarial Network (GAN), and Deep Reinforcement Learning (DRL). Additionally, we have discussed recent developments, such as advanced variant DL techniques based on these DL approaches. This work considers most of the papers published after 2012 from when the history of deep learning began. Furthermore, DL approaches that have been explored and evaluated in different application domains are also included in this survey. We also included recently developed frameworks, SDKs, and benchmark datasets that are used for implementing and evaluating deep learning approaches. There are some surveys that have been published on DL using neural networks and a survey on Reinforcement Learning (RL). However, those papers have not discussed individual advanced techniques for training large-scale deep learning models and the recently developed method of generative models.

922 citations

Journal ArticleDOI
TL;DR: An overview of the state-of-the-art attention models proposed in recent years is given and a unified model that is suitable for most attention structures is defined.

620 citations

Journal ArticleDOI
TL;DR: Developments in 3D facial data acquisition and tracking are discussed, and currently available 3D/4D face databases suitable for 3D and 4D facial expressions analysis are presented as well as the existing facial expression recognition systems that exploit either 3D or 4D data in detail.

360 citations

Journal ArticleDOI
TL;DR: Major deep learning concepts pertinent to face image analysis and face recognition are reviewed, and a concise overview of studies on specific face recognition problems is provided, such as handling variations in pose, age, illumination, expression, and heterogeneous face matching.

312 citations