Multi-modal Multi-layer Fusion Network with Average Binary Center Loss for Face Anti-spoofing

A novel Multi-modal Multi-layer Fusion Convolutional Neural Network (mmfCNN), which targets at finding a discriminative model for recognizing the subtle differences between live and spoof faces, and utilizes a multi-layer fusion model to further aggregate the features from different layers.

Abstract:

Face anti-spoofing detection is critical to guarantee the security of biometric face recognition systems. Despite extensive advances in facial anti-spoofing based on single-model image, little work has been devoted to multi-modal anti-spoofing, which is however widely encountered in real-world scenarios. Following the recent progress, this paper mainly focuses on multi-modal face anti-spoofing and aims to solve the following two challenges: (1) how to effectively fuse multi-modal information; and (2) how to effectively learn distinguishable features despite single cross-entropy loss. We propose a novel Multi-modal Multi-layer Fusion Convolutional Neural Network (mmfCNN), which targets at finding a discriminative model for recognizing the subtle differences between live and spoof faces. The mmfCNN can fully use different information provided by diverse modalities, which is based on a weight-adaptation aggregation approach. Specifically, we utilize a multi-layer fusion model to further aggregate the features from different layers, which fuses the low-, mid- and high-level information from different modalities in a unified framework. Moreover, a novel Average Binary Center (ABC) loss is proposed to maximize the dissimilarity between the features of live and spoof faces, which helps to stabilize the training to generate a robust and discriminative model. Extensive experiments conducted on the CISIA-SURF and 3DMAD datasets verify the significance and generalization capability of the proposed method for the face anti-spoofing task. Code is available at: https://github.com/SkyKuang/Face-anti-spoofing.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition

- 01 Aug 2022 -

IEEE Transactions on Circuits and System...

Show Less

TL;DR: In this article , the authors propose an Expansion Squeeze-Excitation Fusion Network (ESE-FN) to fuse multi-modal features in the modal and channel-wise ways.

...read moreread less

Proceedings ArticleDOI

Face Anti-Spoofing Using Spatial Pyramid Pooling

Lei Shi,Zhuo Zhou,Zhenhua Guo +2 moreTsinghua University

Show Less

TL;DR: Zhang et al. as mentioned in this paper proposed a face anti-spoofing approach using spatial pyramid pooling (SPP) to enhance local features while fusing multi-scale information.

...read moreread less

Journal ArticleDOI

Hierarchical Group-Level Emotion Recognition

Katsuya Fujii,Daisuke Sugimura,Takayuki Hamamoto +2 moreTokyo University of Science,Tsuda College

- 01 Jan 2021 -

IEEE Transactions on Multimedia

Show Less

TL;DR: A novel method for group-level emotion recognition using a hierarchical classification approach and the exploitation of object-wise semantic information (labels) for the second classification, which allows a more detailed description of the scene context in the image and enables performance enhancement in the second Classification.

...read moreread less

Journal ArticleDOI

Deep Learning for Face Anti-Spoofing: A Survey

- 01 Jan 2022 -

IEEE Transactions on Pattern Analysis an...

Show Less

TL;DR: In this paper , a comprehensive review of recent advances in deep learning based face anti-spoofing (FAS) is presented, which covers several novel and insightful components: 1) besides supervision with binary label (e.g., ‘0’ for bonafide vs. ‘1' for PAs), also investigate recent methods with pixel-wise supervision, and 2) in addition to traditional intra-dataset evaluation, collect and analyze the latest methods specially designed for domain generalization and open-set FAS; and 3) besides commercial RGB camera, summarize the deep learning applications under multi-modal (i.e. light field and flash) sensors.

...read moreread less

Journal ArticleDOI

Lightweight network-based multi-modal feature fusion for face anti-spoofing

Dan He,Xiping He,Rui-Ning Yuan,Yue Li,Chao Shen +4 more

- 19 Feb 2022 -

The Visual Computer

Show Less

1
2
3
4
…

Zhihong Zeng,Maja Pantic,Glenn I. Roisman,Thomas S. Huang +3 moreUniversity of Illinois at Urbana–Champaign,Imperial College London

- 01 Jan 2009 -

IEEE Transactions on Pattern Analysis an...

Show Less

TL;DR: In this paper, the authors discuss human emotion perception from a psychological perspective, examine available approaches to solving the problem of machine understanding of human affective behavior, and discuss important issues like the collection and availability of training and test data.

...read moreread less