scispace - formally typeset
Open AccessProceedings ArticleDOI

Learning Deep Neural Networks for Vehicle Re-ID with Visual-spatio-Temporal Path Proposals

TLDR
In this article, a Siamese-CNN+Path-LSTM model was proposed to incorporate complex spatio-temporal information for regularizing the re-ID results.
Abstract
Vehicle re-identification is an important problem and has many applications in video surveillance and intelligent transportation. It gains increasing attention because of the recent advances of person re-identification techniques. However, unlike person re-identification, the visual differences between pairs of vehicle images are usually subtle and even challenging for humans to distinguish. Incorporating additional spatio-temporal information is vital for solving the challenging re-identification task. Existing vehicle re-identification methods ignored or used oversimplified models for the spatio-temporal relations between vehicle images. In this paper, we propose a two-stage framework that incorporates complex spatio-temporal information for effectively regularizing the re-identification results. Given a pair of vehicle images with their spatiotemporal information, a candidate visual-spatio-temporal path is first generated by a chain MRF model with a deeply learned potential function, where each visual-spatiotemporal state corresponds to an actual vehicle image with its spatio-temporal information. A Siamese-CNN+Path- LSTM model takes the candidate path as well as the pairwise queries to generate their similarity score. Extensive experiments and analysis show the effectiveness of our proposed method and individual components.

read more

Citations
More filters
Book ChapterDOI

Person Re-identification with Deep Similarity-Guided Graph Neural Network

TL;DR: Zhang et al. as mentioned in this paper proposed a Similarity-Guided Graph Neural Network (SGGNN) to estimate visual similarities between person images and gallery images in an end-to-end manner.
Proceedings ArticleDOI

Viewpoint-Aware Attentive Multi-view Inference for Vehicle Re-identification

TL;DR: A Viewpoint-aware Attentive Multi-view Inference (VAMI) model that only requires visual information to solve the multi-view vehicle reID problem and achieves consistent improvements over state-of-the-art vehicle re-ID methods on two public datasets: VeRi and VehicleID.
Proceedings ArticleDOI

VERI-Wild: A Large Dataset and a New Method for Vehicle Re-Identification in the Wild

TL;DR: A new method for vehicle ReID is proposed, in which, the ReID model is coupled into a Feature Distance Adversarial Network (FDA-Net), and a novel feature distance adversary scheme is designed to generate hard negative samples in feature space to facilitate Re ID model training.
Proceedings ArticleDOI

Part-Regularized Near-Duplicate Vehicle Re-Identification

TL;DR: This paper proposes a simple but efficient part-regularized discriminative feature preserving method which enhances the perceptive ability of subtle discrepancies in vehicle re-identification and develops a novel framework to integrate part constrains with the global Re-ID modules by introducing an detection branch.
Proceedings ArticleDOI

RAM: A Region-Aware Deep Model for Vehicle Re-Identification

TL;DR: A novel learning algorithm is introduced to jointly use vehicle IDs, types/models, and colors to train the Region-Aware deep Model (RAM), which fuses more cues for training and results in more discriminative global and regional features.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Journal ArticleDOI

Long short-term memory

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
Proceedings ArticleDOI

Going deeper with convolutions

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Related Papers (5)