Joint Learning of Single-Image and Cross-Image Representations for Person Re-identification

doi:10.1109/CVPR.2016.144

Proceedings ArticleDOI

Joint Learning of Single-Image and Cross-Image Representations for Person Re-identification

Faqiang Wang, +4 more

- pp 1288-1296

Chats0

TLDR

This work proposes a joint learning frame-work to unify SIR and CIR using convolutional neural network (CNN), and finds that the representations learned with pairwise comparison and triplet comparison objectives can be combined to improve matching performance.

Abstract:

Person re-identification has been usually solved as either the matching of single-image representation (SIR) or the classification of cross-image representation (CIR). In this work, we exploit the connection between these two categories of methods, and propose a joint learning frame-work to unify SIR and CIR using convolutional neural network (CNN). Specifically, our deep architecture contains one shared sub-network together with two sub-networks that extract the SIRs of given images and the CIRs of given image pairs, respectively. The SIR sub-network is required to be computed once for each image (in both the probe and gallery sets), and the depth of the CIR sub-network is required to be minimal to reduce computational burden. Therefore, the two types of representation can be jointly optimized for pursuing better matching accuracy with moderate computational cost. Furthermore, the representations learned with pairwise comparison and triplet comparison objectives can be combined to improve matching performance. Experiments on the CUHK03, CUHK01 and VIPeR datasets show that the proposed method can achieve favorable accuracy while compared with state-of-the-arts.

Citations

PDF

Open Access

More filters

Posted Content

In Defense of the Triplet Loss for Person Re-Identification

Alexander Hermans, +2 more

- 22 Mar 2017 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: It is shown that, for models trained from scratch as well as pretrained ones, using a variant of the triplet loss to perform end-to-end deep metric learning outperforms most other published methods by a large margin.

...read moreread less

Proceedings ArticleDOI

Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in Vitro

Zhedong Zheng, +2 more

TL;DR: A simple semisupervised pipeline that only uses the original training set without collecting extra data, which effectively improves the discriminative ability of learned CNN embeddings and proposes the label smoothing regularization for outliers (LSRO).

...read moreread less

Proceedings ArticleDOI

Harmonious Attention Network for Person Re-identification

Wei Li, +2 more

TL;DR: A novel Harmonious Attention CNN (HA-CNN) model is formulated for joint learning of soft pixel attention and hard regional attention along with simultaneous optimisation of feature representations, dedicated to optimise person re-id in uncontrolled (misaligned) images.

...read moreread less

Posted Content

Beyond triplet loss: a deep quadruplet network for person re-identification

Weihua Chen, +3 more

- 06 Apr 2017 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A quadruplet deep network using a margin-based online hard negative mining is proposed based on the quadruplet loss for the person ReID, which can lead to the model output with a larger inter- class variation and a smaller intra-class variation compared to the triplet loss.

...read moreread less

Proceedings ArticleDOI

Beyond Triplet Loss: A Deep Quadruplet Network for Person Re-identification

Weihua Chen, +3 more

TL;DR: In this article, a quadruplet deep network using a margin-based online hard negative mining is proposed based on the quadruplet loss for person ReID, which can lead to the model output with a larger interclass variation and a smaller intra-class variation compared to the triplet loss.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

The Nature of Statistical Learning Theory

Vladimir Vapnik

TL;DR: Setting of the learning problem consistency of learning processes bounds on the rate of convergence ofLearning processes controlling the generalization ability of learning process constructing learning algorithms what is important in learning theory?

...read moreread less

Posted Content

Caffe: Convolutional Architecture for Fast Feature Embedding

Yangqing Jia, +7 more

- 20 Jun 2014 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Caffe as discussed by the authors is a BSD-licensed C++ library with Python and MATLAB bindings for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.

...read moreread less

Journal ArticleDOI

Object Detection with Discriminatively Trained Part-Based Models

Pedro F. Felzenszwalb, +3 more

- 01 Sep 2010 -

IEEE Transactions on Pattern Analysis an...

TL;DR: An object detection system based on mixtures of multiscale deformable part models that is able to represent highly variable object classes and achieves state-of-the-art results in the PASCAL object detection challenges is described.

...read moreread less

Proceedings ArticleDOI

Caffe: Convolutional Architecture for Fast Feature Embedding

Yangqing Jia, +7 more

TL;DR: Caffe provides multimedia scientists and practitioners with a clean and modifiable framework for state-of-the-art deep learning algorithms and a collection of reference models for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.

...read moreread less

Proceedings ArticleDOI

FaceNet: A unified embedding for face recognition and clustering

Florian Schroff, +2 more

TL;DR: A system that directly learns a mapping from face images to a compact Euclidean space where distances directly correspond to a measure offace similarity, and achieves state-of-the-art face recognition performance using only 128-bytes perface.

...read moreread less

Collapse

Joint Learning of Single-Image and Cross-Image Representations for Person Re-identification

Citations

In Defense of the Triplet Loss for Person Re-Identification

Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in Vitro

Harmonious Attention Network for Person Re-identification

Beyond triplet loss: a deep quadruplet network for person re-identification

Beyond Triplet Loss: A Deep Quadruplet Network for Person Re-identification

References

The Nature of Statistical Learning Theory

Caffe: Convolutional Architecture for Fast Feature Embedding

Object Detection with Discriminatively Trained Part-Based Models

Caffe: Convolutional Architecture for Fast Feature Embedding

FaceNet: A unified embedding for face recognition and clustering

Related Papers (5)

DeepReID: Deep Filter Pairing Neural Network for Person Re-identification

Person re-identification by Local Maximal Occurrence representation and metric learning

An improved deep learning architecture for person re-identification

Scalable Person Re-identification: A Benchmark

Large scale metric learning from equivalence constraints