(Open Access) ArcFace: Additive Angular Margin Loss for Deep Face Recognition (2019) | Jiankang Deng

Citations

PDF

Open Access

More filters

Proceedings Article•DOI•

CMU-MOSEAS: A Multimodal Language Dataset for Spanish, Portuguese, German and French.

[...]

AmirAli Bagher Zadeh¹, Yansheng Cao¹, Simon Hessner¹, Paul Pu Liang¹, Soujanya Poria², Louis-Philippe Morency¹ - Show less +2 more•Institutions (2)

Carnegie Mellon University¹, Singapore University of Technology and Design²

01 Nov 2020

TL;DR: The first large-scale multimodal language dataset for Spanish, Portuguese, German and French, called CMU-MOSEAS (CMU Multimodal Opinion Sentiment, Emotions and Attributes), is introduced, which is the largest of its kind with 40, 000 total labelled sentences.

...read moreread less

Abstract: Modeling multimodal language is a core research area in natural language processing. While languages such as English have relatively large multimodal language resources, other widely spoken languages across the globe have few or no large-scale datasets in this area. This disproportionately affects native speakers of languages other than English. As a step towards building more equitable and inclusive multimodal systems, we introduce the first large-scale multimodal language dataset for Spanish, Portuguese, German and French. The proposed dataset, called CMU-MOSEAS (CMU Multimodal Opinion Sentiment, Emotions and Attributes), is the largest of its kind with 40,000 total labelled sentences. It covers a diverse set topics and speakers, and carries supervision of 20 labels including sentiment (and subjectivity), emotions, and attributes. Our evaluations on a state-of-the-art multimodal model demonstrates that CMU-MOSEAS enables further research for multilingual studies in multimodal language.

...read moreread less

30 citations

Cites methods from "ArcFace: Additive Angular Margin Lo..."

...The bounding box of the face is extracted using the RetinaFace (Deng et al., 2019b)....
[...]
...Identities are extracted using ArcFace (Deng et al., 2019a)....
[...]

Journal Article•DOI•

Deep reinforcement learning in computer vision: a comprehensive survey

[...]

Ngan Le¹, Ngan Le², Vidhiwar Singh Rathour², Vidhiwar Singh Rathour¹, Kashu Yamazaki², Kashu Yamazaki¹, Khoa Luu², Khoa Luu¹, Marios Savvides², Marios Savvides¹ - Show less +6 more•Institutions (2)

University of Arkansas¹, Carnegie Mellon University²

29 Sep 2021-Artificial Intelligence Review

TL;DR: In this paper, a detailed review of recent and state-of-the-art research advances of deep reinforcement learning in computer vision is provided, including landmark localization, object detection, object tracking, registration on both 2D image and 3D image volumetric data, image segmentation, and videos analysis.

...read moreread less

Abstract: Deep reinforcement learning augments the reinforcement learning framework and utilizes the powerful representation of deep neural networks. Recent works have demonstrated the remarkable successes of deep reinforcement learning in various domains including finance, medicine, healthcare, video games, robotics, and computer vision. In this work, we provide a detailed review of recent and state-of-the-art research advances of deep reinforcement learning in computer vision. We start with comprehending the theories of deep learning, reinforcement learning, and deep reinforcement learning. We then propose a categorization of deep reinforcement learning methodologies and discuss their advantages and limitations. In particular, we divide deep reinforcement learning into seven main categories according to their applications in computer vision, i.e. (i) landmark localization (ii) object detection; (iii) object tracking; (iv) registration on both 2D image and 3D image volumetric data (v) image segmentation; (vi) videos analysis; and (vii) other applications. Each of these categories is further analyzed with reinforcement learning techniques, network design, and performance. Moreover, we provide a comprehensive analysis of the existing publicly available datasets and examine source code availability. Finally, we present some open issues and discuss future research directions on deep reinforcement learning in computer vision.

...read moreread less

30 citations

Posted Content•

iQIYI-VID: A Large Dataset for Multi-modal Person Identification.

[...]

Yuanliu Liu, Peipei Shi, Bo Peng, He Yan, Yong Zhou, Bing Han, Yi Zheng, Chao Lin, Jianbin Jiang, Yin Fan, Tingwei Gao, Ganwen Wang, Jian Liu, Xiangju Lu, Danming Xie - Show less +11 more

19 Nov 2018-arXiv: Computer Vision and Pattern Recognition

TL;DR: This paper introduces iQIYI-VID, the largest video dataset for multi-modal person identification, and proposed a Multi- modal Attention module to fuse multi-Modal features that can improve person identification considerably.

...read moreread less

Abstract: Person identification in the wild is very challenging due to great variation in poses, face quality, clothes, makeup and so on. Traditional research, such as face recognition, person re-identification, and speaker recognition, often focuses on a single modal of information, which is inadequate to handle all the situations in practice. Multi-modal person identification is a more promising way that we can jointly utilize face, head, body, audio features, and so on. In this paper, we introduce iQIYI-VID, the largest video dataset for multi-modal person identification. It is composed of 600K video clips of 5,000 celebrities. These video clips are extracted from 400K hours of online videos of various types, ranging from movies, variety shows, TV series, to news broadcasting. All video clips pass through a careful human annotation process, and the error rate of labels is lower than 0.2\%. We evaluated the state-of-art models of face recognition, person re-identification, and speaker recognition on the iQIYI-VID dataset. Experimental results show that these models are still far from being perfect for the task of person identification in the wild. We proposed a Multi-modal Attention module to fuse multi-modal features that can improve person identification considerably. We have released the dataset online to promote multi-modal person identification research.

...read moreread less

30 citations

Cites background or methods from "ArcFace: Additive Angular Margin Lo..."

...In the area of face recognition, ArcFace [11] reached a precision of 99....
[...]
...It should be mentioned that, ArcFace [11] achieved a precision of 99....
[...]
...The faces are detected by the SSH model [39], and then recognized by the ArcFace model [11]....
[...]
...We train a head classifier based on the ArcFace model [11]....
[...]
...The state-of-art method, ArcFace [11] achieved a face verification accuracy of 99....
[...]

Proceedings Article•

BroadFace: Looking at Tens of Thousands of People at once for Face Recognition.

[...]

Yonghyun Kim¹, Wonpyo Park¹, Jongju Shin¹•Institutions (1)

Pohang University of Science and Technology¹

15 Aug 2020

TL;DR: This work proposes a novel method called BroadFace, which is a learning process to consider a massive set of identities, comprehensively, and achieves the state-of-the-art results with significant improvements on nine datasets in 1:1 face verification and 1:N face identification tasks, and is also effective in image retrieval.

...read moreread less

Abstract: The datasets of face recognition contain an enormous number of identities and instances. However, conventional methods have difficulty in reflecting the entire distribution of the datasets because a mini-batch of small size contains only a small portion of all identities. To overcome this difficulty, we propose a novel method called BroadFace, which is a learning process to consider a massive set of identities, comprehensively. In BroadFace, a linear classifier learns optimal decision boundaries among identities from a large number of embedding vectors accumulated over past iterations. By referring more instances at once, the optimality of the classifier is naturally increased on the entire datasets. Thus, the encoder is also globally optimized by referring the weight matrix of the classifier. Moreover, we propose a novel compensation method to increase the number of referenced instances in the training stage. BroadFace can be easily applied on many existing methods to accelerate a learning process and obtain a significant improvement in accuracy without extra computational burden at inference stage. We perform extensive ablation studies and experiments on various datasets to show the effectiveness of BroadFace, and also empirically prove the validity of our compensation method. BroadFace achieves the state-of-the-art results with significant improvements on nine datasets in 1:1 face verification and 1:N face identification tasks, and is also effective in image retrieval.

...read moreread less

30 citations

Cites background or methods from "ArcFace: Additive Angular Margin Lo..."

...As pre-processing, we normalize a face image to 112× 112 by warping a face-region using five facial points from two eyes, nose and two corners of mouth [6,22,40]....
[...]
...The recent adoption [6,7,22,32,37,39,40,41] of Convolutional Neural Networks (CNNs) has dramatically increased recognition accuracy....
[...]
...A backbone network is ResNet-100 [11] that is used in the recent works [6,15]....
[...]
...1 ArcFace [6] 99....
[...]
...The computed embedding vectors and the weight vectors of the linear classifier are L2-normalized and trained by the ArcFace [6]....
[...]

Book Chapter•DOI•

Exclusivity-Consistency Regularized Knowledge Distillation for Face Recognition

[...]

Xiaobo Wang, Tianyu Fu, Shengcai Liao, Shuo Wang, Zhen Lei¹, Tao Mei - Show less +2 more•Institutions (1)

Chinese Academy of Sciences¹

23 Aug 2020

TL;DR: A novel position-aware exclusivity is proposed to encourage large diversity among different filters of the same layer to alleviate the low-capability of student network and investigates the effect of several prevailing knowledge for face recognition distillation to conclude that the knowledge of feature consistency is more flexible and preserves much more information than others.

...read moreread less

Abstract: Knowledge distillation is an effective tool to compress large pre-trained Convolutional Neural Networks (CNNs) or their ensembles into models applicable to mobile and embedded devices. The success of which mainly comes from two aspects: the designed student network and the exploited knowledge. However, current methods usually suffer from the low-capability of mobile-level student network and the unsatisfactory knowledge for distillation. In this paper, we propose a novel position-aware exclusivity to encourage large diversity among different filters of the same layer to alleviate the low-capability of student network. Moreover, we investigate the effect of several prevailing knowledge for face recognition distillation and conclude that the knowledge of feature consistency is more flexible and preserves much more information than others. Experiments on a variety of face recognition benchmarks have revealed the superiority of our method over the state-of-the-arts.

...read moreread less

30 citations

Cites background or methods from "ArcFace: Additive Angular Margin Lo..."

...In face recognition, it is very important to perform open-set evaluation [28, 42, 7], i....
[...]
...To achieve better performance, large CNNs like ResNet [7] or AttentionNet [46] are usually employed, which makes them hard to deploy on mobile and embedded devices....
[...]
...There are many kinds of network architectures [28, 3, 41] and several loss functions [7, 46] for face recognition....
[...]
...Specifically, rather than the traditional softmax loss, face recognition is usually supervised by margin-based softmax losses [28, 24, 42, 47, 7, 46], metric learning losses [37] or both [39]....
[...]
...Without loss of generality, we use SEResNet50-IR [7] as the teacher model, which was trained by SV-AMSoftmax loss [46]....
[...]

Collapse

ArcFace: Additive Angular Margin Loss for Deep Face Recognition

Citations

Cites methods from "ArcFace: Additive Angular Margin Lo..."

Cites background or methods from "ArcFace: Additive Angular Margin Lo..."

Cites background or methods from "ArcFace: Additive Angular Margin Lo..."

Cites background or methods from "ArcFace: Additive Angular Margin Lo..."

References

Related Papers (5)