scispace - formally typeset
Open AccessPosted Content

DeepFakes and Beyond: A Survey of Face Manipulation and Fake Detection

TLDR
In this paper, the authors provide a thorough review of techniques for manipulating face images including DeepFake methods, and methods to detect such manipulations, including entire face synthesis, identity swap (DeepFakes), attribute manipulation and expression swap.
Abstract
The free access to large-scale public databases, together with the fast progress of deep learning techniques, in particular Generative Adversarial Networks, have led to the generation of very realistic fake content with its corresponding implications towards society in this era of fake news. This survey provides a thorough review of techniques for manipulating face images including DeepFake methods, and methods to detect such manipulations. In particular, four types of facial manipulation are reviewed: i) entire face synthesis, ii) identity swap (DeepFakes), iii) attribute manipulation, and iv) expression swap. For each manipulation group, we provide details regarding manipulation techniques, existing public databases, and key benchmarks for technology evaluation of fake detection methods, including a summary of results from those evaluations. Among all the aspects discussed in the survey, we pay special attention to the latest generation of DeepFakes, highlighting its improvements and challenges for fake detection. In addition to the survey information, we also discuss open issues and future trends that should be considered to advance in the field.

read more

Citations
More filters
Journal ArticleDOI

Security in Smart Cities: A Brief Review of Digital Forensic Schemes for Biometric Data

TL;DR: This work reviews state-of-the-art digital forensic schemes for audio-visual biometric data that can be leveraged by applications designed for smart cities.
Proceedings ArticleDOI

Detecting Deepfakes with Metric Learning

TL;DR: This work analyzes several deep learning approaches in the context of deepfakes classification in high compression scenarios and demonstrates that a proposed approach based on metric learning can be very effective in performing such a classification.
Proceedings ArticleDOI

Interpretable and Trustworthy Deepfake Detection via Dynamic Prototypes

TL;DR: In this paper, a human-centered approach for detecting forgery in face images, using dynamic prototypes as a form of visual explanations, is proposed, which utilizes dynamic representations (i.e., prototypes) to explain deepfake temporal artifacts.
Proceedings ArticleDOI

Detecting Deep-Fake Videos from Appearance and Behavior

TL;DR: In this paper, a biometric-based forensic technique for detecting face-swap deep fakes is proposed, which combines a static biometric based on facial recognition with a temporal, behavioral biometric, where the behavioral embedding is learned using a CNN with a metric-learning objective function.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Proceedings Article

Attention is All you Need

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Proceedings ArticleDOI

Going deeper with convolutions

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Related Papers (5)