Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild

doi:10.1109/CVPR.2017.277

Proceedings ArticleDOI

Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild

Shan Li, +2 more

- pp 2584-2593

Chats0

TLDR

A new DLP-CNN (Deep Locality-Preserving CNN) method, which aims to enhance the discriminative power of deep features by preserving the locality closeness while maximizing the inter-class scatters, is proposed.

Abstract:

Past research on facial expressions have used relatively limited datasets, which makes it unclear whether current methods can be employed in real world. In this paper, we present a novel database, RAF-DB, which contains about 30000 facial images from thousands of individuals. Each image has been individually labeled about 40 times, then EM algorithm was used to filter out unreliable labels. Crowdsourcing reveals that real-world faces often express compound emotions, or even mixture ones. For all we know, RAF-DB is the first database that contains compound expressions in the wild. Our cross-database study shows that the action units of basic emotions in RAF-DB are much more diverse than, or even deviate from, those of lab-controlled ones. To address this problem, we propose a new DLP-CNN (Deep Locality-Preserving CNN) method, which aims to enhance the discriminative power of deep features by preserving the locality closeness while maximizing the inter-class scatters. The benchmark experiments on the 7-class basic expressions and 11-class compound expressions, as well as the additional experiments on SFEW and CK+ databases, show that the proposed DLP-CNN outperforms the state-of-the-art handcrafted features and deep learning based methods for the expression recognition in the wild.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Deep Facial Expression Recognition: A Survey

Shan Li, +1 more

- 23 Apr 2018 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A comprehensive survey on deep facial expression recognition (FER) can be found in this article, including datasets and algorithms that provide insights into the intrinsic problems of deep FER, including overfitting caused by lack of sufficient training data and expression-unrelated variations, such as illumination, head pose and identity bias.

...read moreread less

Journal ArticleDOI

Occlusion Aware Facial Expression Recognition Using CNN With Attention Mechanism

Yong Li, +3 more

- 01 May 2019 -

IEEE Transactions on Image Processing

TL;DR: Visualization results demonstrate that, compared with the CNN without Gate Unit, ACNNs are capable of shifting the attention from the occluded patches to other related but unobstructed ones and outperform other state-of-the-art methods on several widely used in thelab facial expression datasets under the cross-dataset evaluation protocol.

...read moreread less

Journal ArticleDOI

Reliable Crowdsourcing and Deep Locality-Preserving Learning for Unconstrained Facial Expression Recognition

Shan Li, +1 more

- 01 Jan 2019 -

IEEE Transactions on Image Processing

TL;DR: A new deep locality-preserving convolutional neural network (DLP-CNN) method that aims to enhance the discriminative power of deep features by preserving the locality closeness while maximizing the inter-class scatter is proposed.

...read moreread less

Proceedings ArticleDOI

Wing Loss for Robust Facial Landmark Localisation with Convolutional Neural Networks

Zhen-Hua Feng, +4 more

TL;DR: A new loss function, namely Wing loss, for robust facial landmark localisation with Convolutional Neural Networks (CNNs) is presented, and the superiority of the proposed method over the state-of-the-art approaches is proved.

...read moreread less

Journal ArticleDOI

Region Attention Networks for Pose and Occlusion Robust Facial Expression Recognition

Kai Wang, +4 more

- 29 Jan 2020 -

IEEE Transactions on Image Processing

TL;DR: Zhang et al. as mentioned in this paper proposed a region attention network (RAN) to adaptively capture the importance of facial regions for occlusion and pose variant FER by aggregating and embedding varied number of region features produced by a backbone convolutional neural network into a compact fixed-length representation.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

...read moreread less

Journal ArticleDOI

LIBSVM: A library for support vector machines

Chih-Chung Chang, +1 more

- 06 May 2011 -

ACM Transactions on Intelligent Systems ...

TL;DR: Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail.

...read moreread less

Proceedings ArticleDOI

Histograms of oriented gradients for human detection

Navneet Dalal, +1 more

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.

...read moreread less

Collapse

IEEE Signal Processing Letters

Going deeper in facial expression recognition using deep neural networks

Ali Mollahosseini, +2 more

Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild

Citations

Deep Facial Expression Recognition: A Survey

Occlusion Aware Facial Expression Recognition Using CNN With Attention Mechanism

Reliable Crowdsourcing and Deep Locality-Preserving Learning for Unconstrained Facial Expression Recognition

Wing Loss for Robust Facial Landmark Localisation with Convolutional Neural Networks

Region Attention Networks for Pose and Occlusion Robust Facial Expression Recognition

References

ImageNet Classification with Deep Convolutional Neural Networks

Very Deep Convolutional Networks for Large-Scale Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

LIBSVM: A library for support vector machines

Histograms of oriented gradients for human detection

Related Papers (5)

The Extended Cohn-Kanade Dataset (CK+): A complete dataset for action unit and emotion-specified expression

Deep Residual Learning for Image Recognition

Challenges in Representation Learning: A Report on Three Machine Learning Contests

Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks

Going deeper in facial expression recognition using deep neural networks