scispace - formally typeset
Proceedings ArticleDOI

Detecting Masked Faces in the Wild with LLE-CNNs

Reads0
Chats0
TLDR
This paper introduces a dataset, denoted as MAFA, with 30, 811 Internet images and 35, 806 masked faces, and proposes LLE-CNNs for masked face detection, which consist of three major modules.
Abstract
Detecting faces with occlusions is a challenging task due to two main reasons: 1) the absence of large datasets of masked faces, and 2) the absence of facial cues from the masked regions. To address these two issues, this paper first introduces a dataset, denoted as MAFA, with 30, 811 Internet images and 35, 806 masked faces. Faces in the dataset have various orientations and occlusion degrees, while at least one part of each face is occluded by mask. Based on this dataset, we further propose LLE-CNNs for masked face detection, which consist of three major modules. The Proposal module first combines two pre-trained CNNs to extract candidate facial regions from the input image and represent them with high dimensional descriptors. After that, the Embedding module is incorporated to turn such descriptors into a similarity-based descriptor by using locally linear embedding (LLE) algorithm and the dictionaries trained on a large pool of synthesized normal faces, masked faces and non-faces. In this manner, many missing facial cues can be largely recovered and the influences of noisy cues introduced by diversified masks can be greatly alleviated. Finally, the Verification module is incorporated to identify candidate facial regions and refine their positions by jointly performing the classification and regression tasks within a unified CNN. Experimental results on the MAFA dataset show that the proposed approach remarkably outperforms 6 state-of-the-arts by at least 15.6%.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Binary Neural Networks: A Survey

TL;DR: A comprehensive survey of algorithms proposed for binary neural networks, mainly categorized into the native solutions directly conducting binarization, and the optimized ones using techniques like minimizing the quantization error, improving the network loss function, and reducing the gradient error are presented.
Journal ArticleDOI

Fighting against COVID-19: A novel deep learning model based on YOLO-v2 with ResNet-50 for medical face mask detection.

TL;DR: The objective of this paper is to annotate and localize the medical face mask objects in real-life images to improve the object detection process and it is concluded that the adam optimizer achieved the highest average precision percentage of 81% as a detector.
Journal ArticleDOI

SSDMNV2: A real time DNN-based face mask detection system using single shot multibox detector and MobileNetV2.

TL;DR: In this article, the authors proposed an approach using deep learning, TensorFlow, Keras, and OpenCV to detect face masks using Single Shot Multibox Detector as a face detector and MobilenetV2 architecture as a framework for the classifier.
Journal ArticleDOI

MaskedFace-Net – A dataset of correctly/incorrectly masked face images in the context of COVID-19

TL;DR: This work globally presents the applied mask-to-face deformable model for permitting the generation of other masked face images, notably with specific masks and their combination for the global masked face detection (MaskedFace-Net).
Posted Content

Face Attention Network: An effective Face Detector for the Occluded Faces

TL;DR: A novel face detector called Face Attention Network (FAN), which can significantly improve the recall of the face detection problem in the occluded case without compromising the speed, and is integrated with the anchor assign strategy and data augmentation techniques.
References
More filters
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Journal ArticleDOI

Gradient-based learning applied to document recognition

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.
Proceedings ArticleDOI

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Posted Content

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

TL;DR: Faster R-CNN as discussed by the authors proposes a Region Proposal Network (RPN) to generate high-quality region proposals, which are used by Fast R-NN for detection.
Proceedings ArticleDOI

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

TL;DR: RCNN as discussed by the authors combines CNNs with bottom-up region proposals to localize and segment objects, and when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost.
Related Papers (5)