Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks

doi:10.1109/LSP.2016.2603342

Open AccessJournal ArticleDOI

Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks

Kaipeng Zhang, +3 more

- 11 Apr 2016 -

arXiv: Computer Vision and Pattern Recog...

Chats0

TLDR

A deep cascaded multitask framework that exploits the inherent correlation between detection and alignment to boost up their performance and achieves superior accuracy over the state-of-the-art techniques on the challenging face detection dataset and benchmark.

Abstract:

Face detection and alignment in unconstrained environment are challenging due to various poses, illuminations and occlusions. Recent studies show that deep learning approaches can achieve impressive performance on these two tasks. In this paper, we propose a deep cascaded multi-task framework which exploits the inherent correlation between them to boost up their performance. In particular, our framework adopts a cascaded structure with three stages of carefully designed deep convolutional networks that predict face and landmark location in a coarse-to-fine manner. In addition, in the learning process, we propose a new online hard sample mining strategy that can improve the performance automatically without manual sample selection. Our method achieves superior accuracy over the state-of-the-art techniques on the challenging FDDB and WIDER FACE benchmark for face detection, and AFLW benchmark for face alignment, while keeps real time performance.

Citations

PDF

Open Access

More filters

Posted Content

nuScenes: A multimodal dataset for autonomous driving

Holger Caesar, +9 more

- 26 Mar 2019 -

arXiv: Learning

TL;DR: nuScenes as mentioned in this paper is the first dataset to carry the full autonomous vehicle sensor suite: 6 cameras, 5 radars and 1 lidar, all with full 360 degree field of view.

...read moreread less

Journal ArticleDOI

Deep Learning for Generic Object Detection: A Survey

Li Liu, +7 more

- 01 Feb 2020 -

International Journal of Computer Vision

TL;DR: A comprehensive survey of the recent achievements in this field brought about by deep learning techniques, covering many aspects of generic object detection: detection frameworks, object feature representation, object proposal generation, context modeling, training strategies, and evaluation metrics.

...read moreread less

Proceedings ArticleDOI

nuScenes: A Multimodal Dataset for Autonomous Driving

Holger Caesar, +9 more

TL;DR: nuScenes as discussed by the authors is the first dataset to carry the full autonomous vehicle sensor suite: 6 cameras, 5 radars and 1 lidar, all with full 360 degree field of view.

...read moreread less

Journal ArticleDOI

A survey of the recent architectures of deep convolutional neural networks

Asifullah Khan, +3 more

- 01 Dec 2020 -

Artificial Intelligence Review

TL;DR: Deep Convolutional Neural Networks (CNNs) as mentioned in this paper are a special type of Neural Networks, which has shown exemplary performance on several competitions related to Computer Vision and Image Processing.

...read moreread less

Proceedings ArticleDOI

Sampling Matters in Deep Embedding Learning

R. Manmatha, +3 more

TL;DR: This paper proposes distance weighted sampling, which selects more informative and stable examples than traditional approaches, and shows that a simple margin based loss is sufficient to outperform all other loss functions.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Robust Real-Time Face Detection

Paul A. Viola, +1 more

- 01 May 2004 -

International Journal of Computer Vision

TL;DR: In this paper, a face detection framework that is capable of processing images extremely rapidly while achieving high detection rates is described. But the detection performance is limited to 15 frames per second.

...read moreread less

Proceedings ArticleDOI

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification

Kaiming He, +3 more

TL;DR: In this paper, a Parametric Rectified Linear Unit (PReLU) was proposed to improve model fitting with nearly zero extra computational cost and little overfitting risk, which achieved a 4.94% top-5 test error on ImageNet 2012 classification dataset.

...read moreread less

Proceedings ArticleDOI

Deep Learning Face Attributes in the Wild

Ziwei Liu, +3 more

TL;DR: A novel deep learning framework for attribute prediction in the wild that cascades two CNNs, LNet and ANet, which are fine-tuned jointly with attribute tags, but pre-trained differently.

...read moreread less

Journal ArticleDOI

Active appearance models

Timothy F. Cootes, +2 more

- 01 Jun 2001 -

IEEE Transactions on Pattern Analysis an...

Abstract: We describe a new method of matching statistical models of appearance to images. A set of model parameters control modes of shape and gray-level variation learned from a training set. We construct an efficient iterative matching algorithm by learning the relationship between perturbations in the model parameters and the induced image errors.

...read moreread less

Proceedings ArticleDOI

Supervised Descent Method and Its Applications to Face Alignment

Xuehan Xiong, +1 more

TL;DR: A Supervised Descent Method (SDM) is proposed for minimizing a Non-linear Least Squares (NLS) function and achieves state-of-the-art performance in the problem of facial feature detection.

...read moreread less

Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks

Citations

nuScenes: A multimodal dataset for autonomous driving

Deep Learning for Generic Object Detection: A Survey

nuScenes: A Multimodal Dataset for Autonomous Driving

A survey of the recent architectures of deep convolutional neural networks

Sampling Matters in Deep Embedding Learning

References

Robust Real-Time Face Detection

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification

Deep Learning Face Attributes in the Wild

Active appearance models

Supervised Descent Method and Its Applications to Face Alignment

Related Papers (5)

Deep Residual Learning for Image Recognition

FaceNet: A unified embedding for face recognition and clustering

Very Deep Convolutional Networks for Large-Scale Image Recognition

ImageNet Classification with Deep Convolutional Neural Networks

Going deeper with convolutions