Open Access Journal Article

Facial Expression Recognition Using Salient Features and Convolutional Neural Network

Md. Zia Uddin, +2 more
23 Nov 2017, Vol. 5, pp. 26146-26161
TLDR
A novel depth camera-based method is proposed for efficient facial expression recognition; it is compared with traditional expression recognition methods and shown to outperform them.
Abstract
A novel depth camera-based method is proposed for efficient facial expression recognition. For each pixel in a depth image, eight local directional strengths are obtained and ranked. Once the ranks have been obtained for all pixels, eight histograms are built, one for each of the eight surrounding directions. The histograms are then concatenated to represent the features of a depth image of a face. This approach is named the local directional rank histogram pattern (LDRHP). A second robust feature extraction technique, the local directional strength pattern (LDSP), is proposed to complement the LDRHP features. The typical local directional pattern (LDP) considers only the absolute values of a pixel's edge strengths. Because the sign is discarded, LDP may generate the same pattern for two different kinds of edge pixels. LDSP overcomes this problem by encoding, in binary, the positions of the directions with the highest and lowest original (signed) strengths. The highest strength indicates the strongest direction on the bright side of a pixel, and the lowest indicates the strongest direction on the dark side of that pixel. Hence, combining the binary positions of these two directions can generate more robust patterns than LDP. Moreover, the LDSP pattern of a pixel is six bits long, whereas traditional LDP-based patterns (e.g., the local directional deviation-based pattern and the local directional position pattern) are eight bits long. Thus, LDSP reduces the feature dimension while adding robustness. For each depth image in a depth video, LDSP features are augmented with LDRHP features, and kernel principal component analysis followed by generalized discriminant analysis is then applied to generate more robust features. Finally, a deep learning approach, a convolutional neural network (CNN), is trained on these features for facial expression recognition. The proposed approach is compared with traditional expression recognition methods and shown to outperform them.
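
The two descriptors in the abstract can be illustrated with a minimal NumPy sketch. It rests on several assumptions that the abstract does not confirm: the eight directional strengths are computed with Kirsch compass masks (a common choice for LDP-style features), each of the eight LDRHP histograms has eight bins (one per rank), ties are broken by direction index, and the six LDSP bits are laid out as three bits for the strongest direction followed by three bits for the weakest. The function names are hypothetical and the direction order is arbitrary.

```python
import numpy as np

# Outer ring of a 3x3 window, clockwise from the top-left corner.
RING = [(0, 0), (0, 1), (0, 2), (1, 2), (2, 2), (2, 1), (2, 0), (1, 0)]
# Ring values of the east Kirsch mask (assumption: Kirsch masks are used).
BASE = [-3, -3, 5, 5, 5, -3, -3, -3]


def kirsch_masks():
    """Return eight 3x3 compass masks, each a rotation of the east mask."""
    masks = []
    for shift in range(8):
        m = np.zeros((3, 3), dtype=np.int64)
        for k, (r, c) in enumerate(RING):
            m[r, c] = BASE[(k - shift) % 8]
        masks.append(m)
    return masks


def directional_strengths(img):
    """Signed edge response per direction; result has shape (8, H, W)."""
    img = np.asarray(img, dtype=np.int64)
    h, w = img.shape
    padded = np.pad(img, 1, mode="edge")  # replicate border pixels
    out = np.zeros((8, h, w), dtype=np.int64)
    for d, m in enumerate(kirsch_masks()):
        for r in range(3):
            for c in range(3):
                out[d] += m[r, c] * padded[r:r + h, c:c + w]
    return out


def ldrhp(strengths):
    """Rank the eight strengths at each pixel, then histogram the ranks
    per direction and concatenate (8 histograms x 8 bins = 64 values)."""
    order = np.argsort(strengths, axis=0, kind="stable")
    ranks = np.argsort(order, axis=0, kind="stable")  # 0 = weakest, 7 = strongest
    hists = [np.bincount(ranks[d].ravel(), minlength=8) for d in range(8)]
    return np.concatenate(hists)


def ldsp(strengths):
    """Six-bit code per pixel: 3 bits for the index of the highest signed
    strength, 3 bits for the index of the lowest (bit order is an assumption)."""
    top = np.argmax(strengths, axis=0)
    bottom = np.argmin(strengths, axis=0)
    return (top << 3) | bottom


# Toy "depth image": a vertical step edge.
image = np.zeros((5, 6), dtype=np.int64)
image[:, 3:] = 10
s = directional_strengths(image)
rank_features = ldrhp(s)   # length-64 vector
strength_codes = ldsp(s)   # per-pixel codes in the range 0..63
```

In this sketch the LDSP codes would still need to be turned into a histogram (64 bins) before being concatenated with the LDRHP vector; the dimension reduction and CNN stages described in the abstract are not covered here.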
