FaceNet: A Unified Embedding for Face Recognition and Clustering

doi:10.1109/CVPR.2015.7298682

Open AccessProceedings ArticleDOI

FaceNet: A Unified Embedding for Face Recognition and Clustering

Florian Schroff, +2 more

- 12 Mar 2015 -

arXiv: Computer Vision and Pattern Recog...

TLDR

FaceNet as discussed by the authors uses a deep convolutional network trained to directly optimize the embedding itself, rather than an intermediate bottleneck layer as in previous deep learning approaches, and achieves state-of-the-art face recognition performance using only 128 bytes per face.

Abstract:

Despite significant recent advances in the field of face recognition, implementing face verification and recognition efficiently at scale presents serious challenges to current approaches. In this paper we present a system, called FaceNet, that directly learns a mapping from face images to a compact Euclidean space where distances directly correspond to a measure of face similarity. Once this space has been produced, tasks such as face recognition, verification and clustering can be easily implemented using standard techniques with FaceNet embeddings as feature vectors. Our method uses a deep convolutional network trained to directly optimize the embedding itself, rather than an intermediate bottleneck layer as in previous deep learning approaches. To train, we use triplets of roughly aligned matching / non-matching face patches generated using a novel online triplet mining method. The benefit of our approach is much greater representational efficiency: we achieve state-of-the-art face recognition performance using only 128-bytes per face. On the widely used Labeled Faces in the Wild (LFW) dataset, our system achieves a new record accuracy of 99.63%. On YouTube Faces DB it achieves 95.12%. Our system cuts the error rate in comparison to the best published result by 30% on both datasets. We also introduce the concept of harmonic embeddings, and a harmonic triplet loss, which describe different versions of face embeddings (produced by different networks) that are compatible to each other and allow for direct comparison between each other.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Recent advances in convolutional neural networks

Jiuxiang Gu, +10 more

- 01 May 2018 -

Pattern Recognition

TL;DR: A broad survey of the recent advances in convolutional neural networks can be found in this article, where the authors discuss the improvements of CNN on different aspects, namely, layer design, activation function, loss function, regularization, optimization and fast computation.

...read moreread less

Proceedings ArticleDOI

Learning Discriminative Features with Multiple Granularities for Person Re-Identification

Guanshuo Wang, +4 more

TL;DR: Comprehensive experiments implemented on the mainstream evaluation datasets including Market-1501, DukeMTMC-reid and CUHK03 indicate that the proposed end-to-end feature learning strategy robustly achieves state-of-the-art performances and outperforms any existing approaches by a large margin.

...read moreread less

Proceedings Article

Large-margin softmax loss for convolutional neural networks

Weiyang Liu, +3 more

TL;DR: A generalized large-margin softmax (L-Softmax) loss which explicitly encourages intra-class compactness and inter-class separability between learned features and which not only can adjust the desired margin but also can avoid overfitting is proposed.

...read moreread less

Proceedings Article

Towards K-means-friendly spaces: simultaneous deep learning and clustering

Bo Yang, +3 more

TL;DR: A joint DR and K-means clustering approach in which DR is accomplished via learning a deep neural network (DNN) while exploiting theDeep neural network's ability to approximate any nonlinear function is proposed.

...read moreread less

Journal ArticleDOI

DeepFakes and Beyond: A Survey of Face Manipulation and Fake Detection

Ruben Tolosana, +4 more

- 01 Jan 2020 -

Information Fusion

TL;DR: This survey provides a thorough review of techniques for manipulating face images including DeepFake methods, and methods to detect such manipulations, with special attention to the latest generation of DeepFakes.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Learning representations by back-propagating errors

David E. Rumelhart, +2 more

- 01 Jan 1988 -

Nature

TL;DR: Back-propagation repeatedly adjusts the weights of the connections in the network so as to minimize a measure of the difference between the actual output vector of the net and the desired output vector, which helps to represent important features of the task domain.

...read moreread less

Book ChapterDOI

Visualizing and Understanding Convolutional Networks

Matthew D. Zeiler, +1 more

TL;DR: A novel visualization technique is introduced that gives insight into the function of intermediate feature layers and the operation of the classifier in large Convolutional Network models, used in a diagnostic role to find model architectures that outperform Krizhevsky et al on the ImageNet classification benchmark.

...read moreread less

Journal ArticleDOI

Backpropagation applied to handwritten zip code recognition

Yann LeCun, +6 more

- 01 Dec 1989 -

Neural Computation

TL;DR: This paper demonstrates how constraints from the task domain can be integrated into a backpropagation network through the architecture of the network, successfully applied to the recognition of handwritten zip code digits provided by the U.S. Postal Service.

...read moreread less

Journal Article

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization

John C. Duchi, +2 more

- 01 Feb 2011 -

Journal of Machine Learning Research

TL;DR: This work describes and analyze an apparatus for adaptively modifying the proximal function, which significantly simplifies setting a learning rate and results in regret guarantees that are provably as good as the best proximal functions that can be chosen in hindsight.

...read moreread less

Proceedings ArticleDOI

DeepFace: Closing the Gap to Human-Level Performance in Face Verification

Yaniv Taigman, +3 more

TL;DR: This work revisits both the alignment step and the representation step by employing explicit 3D face modeling in order to apply a piecewise affine transformation, and derive a face representation from a nine-layer deep neural network.

...read moreread less

FaceNet: A Unified Embedding for Face Recognition and Clustering

Citations

Recent advances in convolutional neural networks

Learning Discriminative Features with Multiple Granularities for Person Re-Identification

Large-margin softmax loss for convolutional neural networks

Towards K-means-friendly spaces: simultaneous deep learning and clustering

DeepFakes and Beyond: A Survey of Face Manipulation and Fake Detection

References

Learning representations by back-propagating errors

Visualizing and Understanding Convolutional Networks

Backpropagation applied to handwritten zip code recognition

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization

DeepFace: Closing the Gap to Human-Level Performance in Face Verification

Related Papers (5)

Deep Residual Learning for Image Recognition

DeepFace: Closing the Gap to Human-Level Performance in Face Verification

ImageNet Classification with Deep Convolutional Neural Networks

Going deeper with convolutions

Labeled Faces in the Wild: A Database forStudying Face Recognition in Unconstrained Environments