scispace - formally typeset
Open AccessProceedings ArticleDOI

One Detector to Rule Them All: Towards a General Deepfake Attack Detection Framework

Reads0
Chats0
TLDR
Zhang et al. as mentioned in this paper proposed a Convolutional LSTM-based Residual Network (CLRNet) which adopts a unique model training strategy and explores spatial as well as the temporal information in deepfakes.
Abstract
Deep learning-based video manipulation methods have become widely accessible to the masses. With little to no effort, people can quickly learn how to generate deepfake (DF) videos. While deep learning-based detection methods have been proposed to identify specific types of DFs, their performance suffers for other types of deepfake methods, including real-world deepfakes, on which they are not sufficiently trained. In other words, most of the proposed deep learning-based detection methods lack transferability and generalizability. Beyond detecting a single type of DF from benchmark deepfake datasets, we focus on developing a generalized approach to detect multiple types of DFs, including deepfakes from unknown generation methods such as DeepFake-in-the-Wild (DFW) videos. To better cope with unknown and unseen deepfakes, we introduce a Convolutional LSTM-based Residual Network (CLRNet), which adopts a unique model training strategy and explores spatial as well as the temporal information in deepfakes. Through extensive experiments, we show that existing defense methods are not ready for real-world deployment. Whereas our defense method (CLRNet) achieves far better generalization when detecting various benchmark deepfake methods (97.57% on average). Furthermore, we evaluate our approach with a high-quality DeepFake-in-the-Wild dataset, collected from the Internet containing numerous videos and having more than 150,000 frames. Our CLRNet model demonstrated that it generalizes well against high-quality DFW videos by achieving 93.86% detection accuracy, outperforming existing state-of-the-art defense methods by a considerable margin.

read more

Citations
More filters
Proceedings ArticleDOI

CoReD: Generalizing Fake Media Detection with Continual Representation using Distillation

TL;DR: Continual Representation using Distillation (CoReD) as mentioned in this paper employs the concept of Continual Learning (CL), Representation Learning (RL), and Knowledge distillation (KD) to perform sequential domain adaptation tasks on new deepfake and GAN-generated synthetic face datasets.
Journal ArticleDOI

DeepFake Detection for Human Face Images and Videos: A Survey

- 01 Jan 2022 - 
TL;DR: DeepFake as mentioned in this paper is a generative deep learning algorithm that creates or modifies face features in a superrealistic form, in which it is difficult to distinguish between real and fake features.
Journal ArticleDOI

DeepFake Detection for Human Face Images and Videos: A Survey

TL;DR: This survey will summarize the DeepFake detection methods in face images and videos on the basis of their results, performance, methodology used and detection type, and review the existing types of DeepFake creation techniques and sort them into five major categories.
Proceedings ArticleDOI

Am I a Real or Fake Celebrity? Evaluating Face Recognition and Verification APIs under Deepfake Impersonation Attack

TL;DR: A case study on celebrity face recognition, which examines the robustness of black-box commercial face recognition web APIs and open-source tools against Deepfake Impersonation (DI) attacks, and demonstrates the vulnerability of face recognition technologies to DI attacks.
Proceedings ArticleDOI

Supervised Contrastive Learning for Generalizable and Explainable DeepFakes Detection

TL;DR: Zhang et al. as mentioned in this paper proposed a generalizable detection model that can detect novel and unknown/unseen DeepFakes using a supervised contrastive (SupCon) loss.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Proceedings ArticleDOI

Xception: Deep Learning with Depthwise Separable Convolutions

TL;DR: This work proposes a novel deep convolutional neural network architecture inspired by Inception, where Inception modules have been replaced with depthwise separable convolutions, and shows that this architecture, dubbed Xception, slightly outperforms Inception V3 on the ImageNet dataset, and significantly outperforms it on a larger image classification dataset.
Proceedings ArticleDOI

Learning Deep Features for Discriminative Localization

TL;DR: This work revisits the global average pooling layer proposed in [13], and sheds light on how it explicitly enables the convolutional neural network (CNN) to have remarkable localization ability despite being trained on imagelevel labels.
Posted Content

Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting

TL;DR: This paper proposes the convolutional LSTM (ConvLSTM) and uses it to build an end-to-end trainable model for the precipitation nowcasting problem and shows that it captures spatiotemporal correlations better and consistently outperforms FC-L STM and the state-of-the-art operational ROVER algorithm.
Proceedings ArticleDOI

Adversarial Discriminative Domain Adaptation

TL;DR: Adversarial Discriminative Domain Adaptation (ADDA) as mentioned in this paper combines discriminative modeling, untied weight sharing, and a generative adversarial network (GAN) loss.
Related Papers (5)