ArcFace: Additive Angular Margin Loss for Deep Face Recognition

doi:10.1109/CVPR.2019.00482

Citations


Posted Content

Global-Local Bidirectional Reasoning for Unsupervised Representation Learning of 3D Point Clouds.

[...]

Yongming Rao¹, Jiwen Lu¹, Jie Zhou¹

Tsinghua University¹

29 Mar 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: This work hypothesizes that a powerful representation of a 3D object should model the attributes that are shared between parts and the whole object, and distinguishable from other objects, and proposes to learn point cloud representation by bidirectional reasoning between the local structures at different abstraction hierarchies and the global shape without human supervision.

Abstract: Local and global patterns of an object are closely related. Although each part of an object is incomplete, the underlying attributes about the object are shared among all parts, which makes reasoning the whole object from a single part possible. We hypothesize that a powerful representation of a 3D object should model the attributes that are shared between parts and the whole object, and distinguishable from other objects. Based on this hypothesis, we propose to learn point cloud representation by bidirectional reasoning between the local structures at different abstraction hierarchies and the global shape without human supervision. Experimental results on various benchmark datasets demonstrate the unsupervisedly learned representation is even better than supervised representation in discriminative power, generalization ability, and robustness. We show that unsupervisedly trained point cloud models can outperform their supervised counterparts on downstream classification tasks. Most notably, by simply increasing the channel width of an SSG PointNet++, our unsupervised model surpasses the state-of-the-art supervised methods on both synthetic and real-world 3D object classification datasets. We expect our observations to offer a new perspective on learning better representation from data structures instead of human annotations for point cloud understanding.

66 citations

Cites methods from "ArcFace: Additive Angular Margin Lo..."

...Inspired by the studies on metric learning for face recognition [9, 30, 48] that perform metric learning on features on a hypersphere, we normalize the outputs of prediction networks before computing similarities and use a constant value s = 64 [9] to re-scale the features....
[...]
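
The normalization and re-scaling step described in the excerpt above can be sketched in a few lines. This is a minimal illustration, not the citing paper's code: the function names `l2_normalize` and `scaled_cosine_similarity` are hypothetical, and only the scale value s = 64 comes from the excerpt.

```python
import math


def l2_normalize(v):
    """Project a vector onto the unit hypersphere (unit Euclidean length)."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in v]


def scaled_cosine_similarity(a, b, s=64.0):
    """Cosine similarity of a and b, re-scaled by the constant s.

    s = 64 is the value quoted in the excerpt; the helper names are
    illustrative assumptions.
    """
    a_hat, b_hat = l2_normalize(a), l2_normalize(b)
    return s * sum(x * y for x, y in zip(a_hat, b_hat))
```

After normalization the similarity depends only on the angle between the two vectors, so it always lies in the range [-s, s] regardless of the original feature magnitudes.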

Posted Content

img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation

[...]

Vitor Albiero¹, Xingyu Chen², Xi Yin², Guan Pang², Tal Hassner²

University of Notre Dame¹, Facebook²

14 Dec 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: Tests show that the proposed real-time, six degrees of freedom, 3D face pose estimation without face detection or landmark localization outperforms state of the art (SotA) face pose estimators and surpasses SotA models of comparable complexity on the WIDER FACE detection benchmark, despite not being optimized on bounding box labels.

Abstract: We propose real-time, six degrees of freedom (6DoF), 3D face pose estimation without face detection or landmark localization. We observe that estimating the 6DoF rigid transformation of a face is a simpler problem than facial landmark detection, often used for 3D face alignment. In addition, 6DoF offers more information than face bounding box labels. We leverage these observations to make multiple contributions: (a) We describe an easily trained, efficient, Faster R-CNN-based model which regresses 6DoF pose for all faces in the photo, without preliminary face detection. (b) We explain how pose is converted and kept consistent between the input photo and arbitrary crops created while training and evaluating our model. (c) Finally, we show how face poses can replace detection bounding box training labels. Tests on AFLW2000-3D and BIWI show that our method runs in real time and outperforms state of the art (SotA) face pose estimators. Remarkably, our method also surpasses SotA models of comparable complexity on the WIDER FACE detection benchmark, despite not being optimized on bounding box labels.

66 citations

Cites background from "ArcFace: Additive Angular Margin Lo..."

...Together, these two steps are the cornerstones of many face-based reasoning tasks, most notably recognition [18, 47, 48, 49, 74, 76] and 3D reconstruction [20, 30, 71, 72]....
[...]

Proceedings Article

Search to Distill: Pearls Are Everywhere but Not the Eyes

[...]

Yu Liu¹, Xuhui Jia², Mingxing Tan², Raviteja Vemulapalli², Yukun Zhu², Bradley Ray Green², Xiaogang Wang¹

The Chinese University of Hong Kong¹, Google²

14 Jun 2020

TL;DR: This work presents a new architecture-aware Knowledge Distillation approach that finds student models (pearls for the teacher) that are best for distilling the given teacher model and leverages Neural Architecture Search (NAS), equipped with the authors' KD-guided reward, to search for the best student architectures for a given teacher.

Abstract: Standard Knowledge Distillation (KD) approaches distill the knowledge of a cumbersome teacher model into the parameters of a student model with a pre-defined architecture. However, the knowledge of a neural network, which is represented by the network's output distribution conditioned on its input, depends not only on its parameters but also on its architecture. Hence, a more generalized approach for KD is to distill the teacher's knowledge into both the parameters and architecture of the student. To achieve this, we present a new Architecture-aware Knowledge Distillation (AKD) approach that finds student models (pearls for the teacher) that are best for distilling the given teacher model. In particular, we leverage Neural Architecture Search (NAS), equipped with our KD-guided reward, to search for the best student architectures for a given teacher. Experimental results show our proposed AKD consistently outperforms the conventional NAS plus KD approach, and achieves state-of-the-art results on the ImageNet classification task under various latency settings. Furthermore, the best AKD student architecture for the ImageNet classification task also transfers well to other tasks such as million level face recognition and ensemble learning.

66 citations

Additional excerpts

...3 + R100 [2] + EPolyFace [20] + IncRes-v2 [29] + SE154 [12]...
[...]

Posted Content

Survey on Deep Neural Networks in Speech and Vision Systems

[...]

M. S. Alam¹, Manar D. Samad¹, Lasitha Vidyaratne¹, Alexander Glandon¹, Khan M. Iftekharuddin¹

Tennessee State University¹

16 Aug 2019-arXiv: Computer Vision and Pattern Recognition

TL;DR: This survey presents a review of state-of-the-art deep neural network architectures, algorithms, and systems in vision and speech applications from the perspectives of both software and hardware systems.

Abstract: This survey presents a review of state-of-the-art deep neural network architectures, algorithms, and systems in vision and speech applications. Recent advances in deep artificial neural network algorithms and architectures have spurred rapid innovation and development of intelligent vision and speech systems. With availability of vast amounts of sensor data and cloud computing for processing and training of deep neural networks, and with increased sophistication in mobile and embedded technology, the next-generation intelligent systems are poised to revolutionize personal and commercial computing. This survey begins by providing background and evolution of some of the most successful deep learning models for intelligent vision and speech systems to date. An overview of large-scale industrial research and development efforts is provided to emphasize future trends and prospects of intelligent vision and speech systems. Robust and efficient intelligent systems demand low-latency and high fidelity in resource-constrained hardware platforms such as mobile devices, robots, and automobiles. Therefore, this survey also provides a summary of key challenges and recent successes in running deep neural networks on hardware-restricted platforms, i.e. within limited memory, battery life, and processing capabilities. Finally, emerging applications of vision and speech across disciplines such as affective computing, intelligent transportation, and precision medicine are discussed. To our knowledge, this paper provides one of the most comprehensive surveys on the latest developments in intelligent vision and speech applications from the perspectives of both software and hardware systems. Many of these emerging technologies using deep neural networks show tremendous promise to revolutionize research and development for future vision and speech systems.

66 citations

Cites background from "ArcFace: Additive Angular Margin Lo..."

...[103] reformulated the cost function for face recognition....
[...]

Posted Content

Boosting Few-Shot Learning With Adaptive Margin Loss

[...]

Aoxue Li¹, Weiran Huang², Xu Lan³, Jiashi Feng⁴, Zhenguo Li², Liwei Wang¹

Peking University¹, Huawei², Queen Mary University of London³, National University of Singapore⁴

28 May 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: An adaptive margin principle is proposed to improve the generalization ability of metric-based meta-learning approaches for few-shot learning problems by developing a class-relevant additive margin loss, where semantic similarity between each pair of classes is considered to separate samples in the feature embedding space from similar classes.

Abstract: Few-shot learning (FSL) has attracted increasing attention in recent years but remains challenging, due to the intrinsic difficulty in learning to generalize from a few examples. This paper proposes an adaptive margin principle to improve the generalization ability of metric-based meta-learning approaches for few-shot learning problems. Specifically, we first develop a class-relevant additive margin loss, where semantic similarity between each pair of classes is considered to separate samples in the feature embedding space from similar classes. Further, we incorporate the semantic context among all classes in a sampled training task and develop a task-relevant additive margin loss to better distinguish samples from different classes. Our adaptive margin method can be easily extended to a more realistic generalized FSL setting. Extensive experiments demonstrate that the proposed method can boost the performance of current metric-based meta-learning approaches, under both the standard FSL and generalized FSL settings.

65 citations

Cites background or methods from "ArcFace: Additive Angular Margin Lo..."

...By observing that the weights from the last fully connected layer of a classification DCNN trained on the softmax loss bear conceptual similarities with the centers of each class, the works in [4, 18, 33] proposed several margin losses to improve the discriminative power of the trained model....
[...]
...The two margin losses are: 1) Additive angular margin loss [4], which adds an additive angular margin to the angle between the weight vector and feature embeddings....
[...]
...That is, our method involves semantic similarity among classes in meta-training task to learn a more suitable margin penalty, compared with a fixed one generated by [4, 33]....
[...]
...[4] proposed an additive angular margin loss to further improve the discriminative power of feature embedding space....
[...]
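
The additive angular margin described in the excerpts above can be sketched for a single training sample as follows. This is an illustrative sketch under stated assumptions, not the implementation from any of the cited papers: the function names are hypothetical, the default margin m = 0.5 and scale s = 64 are assumed values, and the case where the angle plus the margin exceeds π is deliberately left unhandled.

```python
import math


def l2_normalize(v):
    """Project a vector onto the unit hypersphere."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in v]


def angular_margin_logits(feature, class_weights, label, s=64.0, m=0.5):
    """Return scaled logits with margin m added to the target-class angle.

    class_weights holds one weight vector per class (the rows of the last
    fully connected layer). s and m are assumed defaults.
    """
    f = l2_normalize(feature)
    logits = []
    for j, w in enumerate(class_weights):
        cos_theta = sum(x * y for x, y in zip(f, l2_normalize(w)))
        cos_theta = max(-1.0, min(1.0, cos_theta))  # guard acos against rounding
        if j == label:
            # Penalize only the ground-truth class: cos(theta + m).
            # Simplification: no special handling when theta + m > pi.
            logits.append(s * math.cos(math.acos(cos_theta) + m))
        else:
            logits.append(s * cos_theta)
    return logits


def cross_entropy(logits, label):
    """Softmax cross-entropy for one sample, computed stably."""
    peak = max(logits)
    log_sum = peak + math.log(sum(math.exp(z - peak) for z in logits))
    return log_sum - logits[label]
```

With m = 0 the function reduces to a plain normalized softmax, so comparing the two losses for the same sample shows how the margin makes the target class harder to satisfy and pushes embeddings closer to their class weight vector.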
