scispace - formally typeset
Proceedings ArticleDOI

Leveraging edges and optical flow on faces for deepfake detection

Reads0
Chats0
TLDR
XceptionNet* as discussed by the authors leverages a combination of visual frames, edge maps, and dense optical flow maps together as inputs to this architecture and achieves state-of-the-art performance on the FaceForensics++ and DFDC-mini datasets.
Abstract
Deepfakes can be used maliciously to sway public opinion, defame an individual, or commit fraud. Hence, it is vital for journalists and social media platforms, as well as the general public, to be able to detect deepfakes. Existing deepfake detection methods, while highly accurate on datasets they have been trained on, falter in open-world scenarios due to different deepfake generations algorithms, video formats, and compression levels. In this paper, we seek to address this by building on the XceptionNet-based deepfake detection technique that utilizes convolutional latent representations with recurrent structures. In particular, we explore how to leverage a combination of visual frames, edge maps, and dense optical flow maps together as inputs to this architecture. We evaluate these techniques using the FaceForensics++ and DFDC-mini datasets. We also perform extensive studies to evaluate the robustness of our network against adversarial post-processing as well as the generalization capabilities to out-of-domain datasets and manipulation strategies. Our methods, which we call XceptionNet*, achieve 100% accuracy on the popular Face-Forensics-s+ dataset and set new benchmark standards on the difficult DFDC-mini dataset. The XceptionNet* models are shown to exhibit superior performance on cross-domain testing and demonstrate surprising resilience to adversarial manipulations.

read more

Citations
More filters
Journal ArticleDOI

Deepfake generation and detection, a survey

TL;DR: A survey on state-ofthe-art deepfake generation methods, detection methods, and existing datasets is made and future trends on deepfake detection can be efficient, robust and systematical detection methods and high quality datasets.
Proceedings ArticleDOI

Practical Face Swapping Detection Based on Identity Spatial Constraints

TL;DR: Li et al. as mentioned in this paper designed a new detection framework based on identity spatial constraints (DISC), which consists of a backbone network and an identity semantic encoder (ISE), which utilizes a real facial image of a particular person as the reference to constrain the backbone to focus on the identity-related facial areas.
Posted Content

An Experimental Evaluation on Deepfake Detection using Deep Face Recognition.

TL;DR: In this article, the authors evaluate the efficacy of deep face recognition in identifying deepfakes, using different loss functions and deepfake generation techniques. But, their performance drops significantly in cross-dataset evaluation with samples generated using advanced deepfake generating techniques.
References
More filters
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Journal ArticleDOI

Generative Adversarial Nets

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.
Proceedings ArticleDOI

Rethinking the Inception Architecture for Computer Vision

TL;DR: In this article, the authors explore ways to scale up networks in ways that aim at utilizing the added computation as efficiently as possible by suitably factorized convolutions and aggressive regularization.
Proceedings ArticleDOI

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks

TL;DR: CycleGAN as discussed by the authors learns a mapping G : X → Y such that the distribution of images from G(X) is indistinguishable from the distribution Y using an adversarial loss.
Related Papers (5)