Author

Jinbin Wang

Bio: Jinbin Wang is an academic researcher. The author has contributed to research in topics: Deep learning & Ranking (information retrieval). The author has an h-index of 1 and has co-authored 1 publication receiving 829 citations.

Papers

Posted Content

Learning Fine-grained Image Similarity with Deep Ranking

Jiang Wang¹, Yang Song, Thomas Leung, Charles J. Rosenberg, Jinbin Wang, James Philbin, Bo Chen², Ying Wu¹

Northwestern University¹, California Institute of Technology²

17 Apr 2014 - arXiv: Computer Vision and Pattern Recognition

TL;DR: A deep ranking model that employs deep learning techniques to learn a similarity metric directly from images has higher learning capability than models based on hand-crafted features and deep classification models.

Abstract: Learning fine-grained image similarity is a challenging task. It needs to capture between-class and within-class image differences. This paper proposes a deep ranking model that employs deep learning techniques to learn a similarity metric directly from images. It has higher learning capability than models based on hand-crafted features. A novel multiscale network structure has been developed to describe the images effectively. An efficient triplet sampling algorithm is proposed to learn the model with distributed asynchronized stochastic gradient. Extensive experiments show that the proposed algorithm outperforms models based on hand-crafted visual features and deep classification models.
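As a rough illustration of what learning from triplets can look like, the sketch below computes a hinge-style ranking loss on one (query, positive, negative) triplet. The abstract does not give the loss formula, so the hinge form, the squared Euclidean distance, the `margin` value and the function names are all assumptions made for this example.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def triplet_hinge_loss(query, positive, negative, margin=1.0):
    """Hinge ranking loss for one triplet (assumed form, not taken from the abstract).

    The loss is zero once the negative is farther from the query than the
    positive by at least `margin`; otherwise it grows linearly.
    """
    return max(0.0, margin
               + squared_distance(query, positive)
               - squared_distance(query, negative))


# Toy 2-D embeddings, chosen only for illustration.
query = [0.0, 0.0]
positive = [0.0, 1.0]
far_negative = [3.0, 0.0]
near_negative = [1.0, 0.0]

loss_far = triplet_hinge_loss(query, positive, far_negative)    # 0.0, ranking already satisfied
loss_near = triplet_hinge_loss(query, positive, near_negative)  # 1.0, margin violated
```

Triplets like `far_negative` contribute no gradient under this loss, which is one reason the choice of triplets matters for training efficiency.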

967 citations

Cited by

Proceedings Article

FaceNet: A unified embedding for face recognition and clustering

Florian Schroff¹, Dmitry Kalenichenko¹, James Philbin¹

Google¹

07 Jun 2015

TL;DR: A system that directly learns a mapping from face images to a compact Euclidean space where distances directly correspond to a measure of face similarity, and achieves state-of-the-art face recognition performance using only 128 bytes per face.

Abstract: Despite significant recent advances in the field of face recognition [10, 14, 15, 17], implementing face verification and recognition efficiently at scale presents serious challenges to current approaches. In this paper we present a system, called FaceNet, that directly learns a mapping from face images to a compact Euclidean space where distances directly correspond to a measure of face similarity. Once this space has been produced, tasks such as face recognition, verification and clustering can be easily implemented using standard techniques with FaceNet embeddings as feature vectors.
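If distances in the embedding space track face similarity, verification can reduce to comparing a distance against a threshold. The sketch below is a minimal example of that idea. The 3-D vectors stand in for real embeddings, and the `threshold` value and function names are hypothetical, not taken from the paper.

```python
def squared_l2(a, b):
    """Squared Euclidean distance between two embedding vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def same_identity(emb_a, emb_b, threshold=1.0):
    """Declare two faces the same person when their embeddings are close.

    `threshold` is a hypothetical value; in practice it would be tuned on
    held-out pairs.
    """
    return squared_l2(emb_a, emb_b) < threshold


# Toy unit-length "embeddings" for illustration only.
face_1 = [0.6, 0.8, 0.0]
face_2 = [0.8, 0.6, 0.0]    # close to face_1
face_3 = [-0.6, -0.8, 0.0]  # opposite direction from face_1

match = same_identity(face_1, face_2)
mismatch = same_identity(face_1, face_3)
```

Recognition and clustering can be sketched the same way, by running nearest-neighbour search or a standard clustering routine over the same vectors.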

8,289 citations

Proceedings Article

Deep Metric Learning via Lifted Structured Feature Embedding

Hyun Oh Song¹, Yu Xiang¹, Stefanie Jegelka², Silvio Savarese¹

Stanford University¹, Massachusetts Institute of Technology²

01 Jun 2016

TL;DR: In this article, the authors propose to lift the vector of pairwise distances within the batch to the matrix of pairwise distances, which enables the algorithm to learn the state-of-the-art feature embedding by optimizing a novel structured prediction objective on the lifted problem.

Abstract: Learning the distance metric between pairs of examples is of great importance for learning and visual recognition. With the remarkable success from the state of the art convolutional neural networks, recent works [1, 31] have shown promising results on discriminatively training the networks to learn semantic feature embeddings where similar examples are mapped close to each other and dissimilar examples are mapped farther apart. In this paper, we describe an algorithm for taking full advantage of the training batches in the neural network training by lifting the vector of pairwise distances within the batch to the matrix of pairwise distances. This step enables the algorithm to learn the state of the art feature embedding by optimizing a novel structured prediction objective on the lifted problem. Additionally, we collected the Stanford Online Products dataset: 120k images of 23k classes of online products for metric learning. Our experiments on the CUB-200-2011 [37], CARS196 [19], and Stanford Online Products datasets demonstrate significant improvement over existing deep feature embedding methods on all experimented embedding sizes with the GoogLeNet [33] network. The source code and the dataset are available at: https://github.com/rksltnl/Deep-Metric-Learning-CVPR16.
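The "lifting" step can be pictured as building the full matrix of distances between every pair of embeddings in a batch, rather than only the distances for pre-selected pairs. The sketch below shows that matrix construction only. It does not implement the structured objective, and the function name and toy batch are assumptions for illustration.

```python
import math


def pairwise_distance_matrix(batch):
    """Return the symmetric n x n matrix of Euclidean distances for a batch."""
    n = len(batch)
    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = math.dist(batch[i], batch[j])  # requires Python 3.8+
            matrix[i][j] = d
            matrix[j][i] = d
    return matrix


# Toy batch of three 2-D embeddings.
batch = [[0.0, 0.0], [3.0, 4.0], [6.0, 8.0]]
distances = pairwise_distance_matrix(batch)
```

With all n(n-1)/2 distances available, a loss can compare each positive pair against every negative in the batch instead of a single sampled one.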

1,599 citations

Proceedings Article

Improved deep metric learning with multi-class N-pair loss objective

Kihyuk Sohn

05 Dec 2016

TL;DR: This paper proposes a new metric learning objective called multi-class N-pair loss, which generalizes triplet loss by allowing joint comparison among more than one negative example and reduces the computational burden of evaluating deep embedding vectors via an efficient batch construction strategy using only N pairs of examples.

Abstract: Deep metric learning has gained much popularity in recent years, following the success of deep learning. However, existing frameworks of deep metric learning based on contrastive loss and triplet loss often suffer from slow convergence, partially because they employ only one negative example while not interacting with the other negative classes in each update. In this paper, we propose to address this problem with a new metric learning objective called multi-class N-pair loss. The proposed objective function firstly generalizes triplet loss by allowing joint comparison among more than one negative example - more specifically, N-1 negative examples - and secondly reduces the computational burden of evaluating deep embedding vectors via an efficient batch construction strategy using only N pairs of examples, instead of (N+1) x N. We demonstrate the superiority of our proposed loss to the triplet loss as well as other competing loss functions for a variety of tasks on several visual recognition benchmarks, including fine-grained object recognition and verification, image clustering and retrieval, and face verification and identification.
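The batch construction can be sketched as follows: given N (anchor, positive) pairs from N different classes, each anchor treats the other N-1 positives as its negatives, so only 2N embeddings need to be evaluated. The abstract does not state the loss formula, so the softmax-style form with inner-product similarity and the function names below are assumptions.

```python
import math


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def n_pair_loss(anchors, positives):
    """Mean N-pair loss over N (anchor, positive) pairs (assumed form).

    For anchor i, positives[i] is its positive and positives[j] for j != i
    act as its N-1 negatives.
    """
    n = len(anchors)
    total = 0.0
    for i in range(n):
        pos_score = dot(anchors[i], positives[i])
        neg_sum = sum(math.exp(dot(anchors[i], positives[j]) - pos_score)
                      for j in range(n) if j != i)
        total += math.log1p(neg_sum)  # log(1 + sum of exp(negative - positive))
    return total / n


# Two classes with toy 2-D embeddings.
anchors = [[1.0, 0.0], [0.0, 1.0]]
positives = [[1.0, 0.0], [0.0, 1.0]]

aligned_loss = n_pair_loss(anchors, positives)
swapped_loss = n_pair_loss(anchors, positives[::-1])  # each positive paired with the wrong anchor
```

Here `swapped_loss` exceeds `aligned_loss`, since every anchor is then more similar to a negative than to its own positive.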

1,454 citations

Proceedings Article

Person Re-identification by Multi-Channel Parts-Based CNN with Improved Triplet Loss Function

De Cheng¹, Yihong Gong¹, Sanping Zhou¹, Jinjun Wang¹, Nanning Zheng¹

Xi'an Jiaotong University¹

27 Jun 2016

TL;DR: A novel multi-channel parts-based convolutional neural network model under the triplet framework for person re-identification that significantly outperforms many state-of-the-art approaches, including both traditional and deep network-based ones, on the challenging i-LIDS, VIPeR, PRID2011 and CUHK01 datasets.

Abstract: Person re-identification across cameras remains a very challenging problem, especially when there are no overlapping fields of view between cameras. In this paper, we present a novel multi-channel parts-based convolutional neural network (CNN) model under the triplet framework for person re-identification. Specifically, the proposed CNN model consists of multiple channels to jointly learn both the global full-body and local body-parts features of the input persons. The CNN model is trained by an improved triplet loss function that serves to pull the instances of the same person closer, and at the same time push the instances belonging to different persons farther from each other in the learned feature space. Extensive comparative evaluations demonstrate that our proposed method significantly outperforms many state-of-the-art approaches, including both traditional and deep network-based ones, on the challenging i-LIDS, VIPeR, PRID2011 and CUHK01 datasets.

1,265 citations

Posted Content

SphereFace: Deep Hypersphere Embedding for Face Recognition

Weiyang Liu¹, Yandong Wen², Zhiding Yu², Ming Li³, Bhiksha Raj², Le Song¹

Georgia Institute of Technology¹, Carnegie Mellon University², Sun Yat-sen University³

26 Apr 2017 - arXiv: Computer Vision and Pattern Recognition

TL;DR: This paper proposes the angular softmax (A-Softmax) loss that enables convolutional neural networks (CNNs) to learn angularly discriminative features for the deep face recognition (FR) problem under the open-set protocol.

Abstract: This paper addresses the deep face recognition (FR) problem under the open-set protocol, where ideal face features are expected to have smaller maximal intra-class distance than minimal inter-class distance under a suitably chosen metric space. However, few existing algorithms can effectively achieve this criterion. To this end, we propose the angular softmax (A-Softmax) loss that enables convolutional neural networks (CNNs) to learn angularly discriminative features. Geometrically, A-Softmax loss can be viewed as imposing discriminative constraints on a hypersphere manifold, which intrinsically matches the prior that faces also lie on a manifold. Moreover, the size of the angular margin can be quantitatively adjusted by a parameter $m$. We further derive specific $m$ to approximate the ideal feature criterion. Extensive analysis and experiments on Labeled Faces in the Wild (LFW), YouTube Faces (YTF) and MegaFace Challenge show the superiority of A-Softmax loss in FR tasks. The code has also been made publicly available.
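The abstract says only that the angular margin is controlled by a parameter $m$. One way such a margin can be realised, assumed here rather than taken from the text above, is to replace cos(θ) for the target class with a monotonically decreasing extension of cos(mθ) on [0, π]. The function name `angular_margin_logit` is hypothetical.

```python
import math


def angular_margin_logit(theta, m):
    """Monotone extension of cos(m * theta) over theta in [0, pi] (assumed form).

    On the k-th segment [k*pi/m, (k+1)*pi/m] the value is
    (-1)**k * cos(m * theta) - 2*k, which is continuous and decreasing in theta.
    With m = 1 this is plain cos(theta), i.e. no margin.
    """
    k = min(int(theta * m / math.pi), m - 1)
    return (-1) ** k * math.cos(m * theta) - 2 * k


# Compare the target-class logit with and without a margin at the same angle.
theta = math.pi / 6
no_margin = angular_margin_logit(theta, 1)    # equals cos(theta)
with_margin = angular_margin_logit(theta, 4)  # smaller, so the sample is penalised more
```

Because the margin version is never larger than cos(θ), a feature has to sit at a smaller angle to its class direction to reach the same logit, which is what pushes classes apart on the hypersphere.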

1,215 citations
