scispace - formally typeset
Proceedings ArticleDOI

Multi-modal Multi-layer Fusion Network with Average Binary Center Loss for Face Anti-spoofing

TLDR
A novel Multi-modal Multi-layer Fusion Convolutional Neural Network (mmfCNN), which targets at finding a discriminative model for recognizing the subtle differences between live and spoof faces, and utilizes a multi-layer fusion model to further aggregate the features from different layers.
Abstract
Face anti-spoofing detection is critical to guarantee the security of biometric face recognition systems. Despite extensive advances in facial anti-spoofing based on single-model image, little work has been devoted to multi-modal anti-spoofing, which is however widely encountered in real-world scenarios. Following the recent progress, this paper mainly focuses on multi-modal face anti-spoofing and aims to solve the following two challenges: (1) how to effectively fuse multi-modal information; and (2) how to effectively learn distinguishable features despite single cross-entropy loss. We propose a novel Multi-modal Multi-layer Fusion Convolutional Neural Network (mmfCNN), which targets at finding a discriminative model for recognizing the subtle differences between live and spoof faces. The mmfCNN can fully use different information provided by diverse modalities, which is based on a weight-adaptation aggregation approach. Specifically, we utilize a multi-layer fusion model to further aggregate the features from different layers, which fuses the low-, mid- and high-level information from different modalities in a unified framework. Moreover, a novel Average Binary Center (ABC) loss is proposed to maximize the dissimilarity between the features of live and spoof faces, which helps to stabilize the training to generate a robust and discriminative model. Extensive experiments conducted on the CISIA-SURF and 3DMAD datasets verify the significance and generalization capability of the proposed method for the face anti-spoofing task. Code is available at: https://github.com/SkyKuang/Face-anti-spoofing.

read more

Citations
More filters
Journal ArticleDOI

Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition

TL;DR: In this article , the authors propose an Expansion Squeeze-Excitation Fusion Network (ESE-FN) to fuse multi-modal features in the modal and channel-wise ways.
Proceedings ArticleDOI

Face Anti-Spoofing Using Spatial Pyramid Pooling

TL;DR: Zhang et al. as mentioned in this paper proposed a face anti-spoofing approach using spatial pyramid pooling (SPP) to enhance local features while fusing multi-scale information.
Journal ArticleDOI

Hierarchical Group-Level Emotion Recognition

TL;DR: A novel method for group-level emotion recognition using a hierarchical classification approach and the exploitation of object-wise semantic information (labels) for the second classification, which allows a more detailed description of the scene context in the image and enables performance enhancement in the second Classification.
Journal ArticleDOI

Deep Learning for Face Anti-Spoofing: A Survey

TL;DR: In this paper , a comprehensive review of recent advances in deep learning based face anti-spoofing (FAS) is presented, which covers several novel and insightful components: 1) besides supervision with binary label (e.g., ‘0’ for bonafide vs. ‘1' for PAs), also investigate recent methods with pixel-wise supervision, and 2) in addition to traditional intra-dataset evaluation, collect and analyze the latest methods specially designed for domain generalization and open-set FAS; and 3) besides commercial RGB camera, summarize the deep learning applications under multi-modal (i.e. light field and flash) sensors.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Journal ArticleDOI

Squeeze-and-Excitation Networks

TL;DR: This work proposes a novel architectural unit, which is term the "Squeeze-and-Excitation" (SE) block, that adaptively recalibrates channel-wise feature responses by explicitly modelling interdependencies between channels and finds that SE blocks produce significant performance improvements for existing state-of-the-art deep architectures at minimal additional computational cost.
Proceedings Article

Rectified Linear Units Improve Restricted Boltzmann Machines

TL;DR: Restricted Boltzmann machines were developed using binary stochastic hidden units that learn features that are better for object recognition on the NORB dataset and face verification on the Labeled Faces in the Wild dataset.
Book ChapterDOI

A Discriminative Feature Learning Approach for Deep Face Recognition

TL;DR: This paper proposes a new supervision signal, called center loss, for face recognition task, which simultaneously learns a center for deep features of each class and penalizes the distances between the deep features and their corresponding class centers.
Journal ArticleDOI

A Survey of Affect Recognition Methods: Audio, Visual, and Spontaneous Expressions

TL;DR: In this paper, the authors discuss human emotion perception from a psychological perspective, examine available approaches to solving the problem of machine understanding of human affective behavior, and discuss important issues like the collection and availability of training and test data.
Related Papers (5)