Proceedings ArticleDOI
Orientation Invariant Feature Embedding and Spatial Temporal Regularization for Vehicle Re-identification
Zhongdao Wang,Luming Tang,Xihui Liu,Zhuliang Yao,Shuai Yi,Jing Shao,Junjie Yan,Shengjin Wang,Hongsheng Li,Xiaogang Wang +9 more
- pp 379-387
Reads0
Chats0
TLDR
Both the orientation invariant feature embedding and the spatio-temporal regularization achieve considerable improvements in the vehicle Re-identification problem.Abstract:
In this paper, we tackle the vehicle Re-identification (ReID) problem which is of great importance in urban surveillance and can be used for multiple applications. In our vehicle ReID framework, an orientation invariant feature embedding module and a spatial-temporal regularization module are proposed. With orientation invariant feature embedding, local region features of different orientations can be extracted based on 20 key point locations and can be well aligned and combined. With spatial-temporal regularization, the log-normal distribution is adopted to model the spatial-temporal constraints and the retrieval results can be refined. Experiments are conducted on public vehicle ReID datasets and our proposed method achieves state-of-the-art performance. Investigations of the proposed framework is conducted, including the landmark regressor and comparisons with attention mechanism. Both the orientation invariant feature embedding and the spatio-temporal regularization achieve considerable improvements.read more
Citations
More filters
Posted Content
Unsupervised Vehicle Re-identification with Progressive Adaptation
TL;DR: A novel progressive adaptation learning method is proposed for vehicle reID, named PAL, which infers from the abundant data without annotations, and a weighted label smoothing (WLS) loss is proposed, which considers the similarity between samples with different clusters to balance the confidence of pseudo labels.
Book ChapterDOI
MRNet: A Keypoint Guided Multi-scale Reasoning Network for Vehicle Re-identification
TL;DR: In this article, the authors propose an end-to-end framework called Keypoint Guided Multi-Scale Reasoning Network (MRNet) to infer multi-view vehicle features from a one-view image.
Journal ArticleDOI
A Multi-granularity Retrieval System for Natural Language-based Vehicle Retrieval
Jiacheng Zhang,Xiangru Lin,Minyue Jiang,Yue Yu,Chenting Gong,Wei Ting Zhang,Xiao Tan,Yingying Li,Errui Ding,Guanbin Li +9 more
TL;DR: A multi-granularity retrieval system, consisting of three main modules that aims to obtain the fine-grained vehicle attributes from the language descriptions, has achieved the 1st place on the 6th AI City Challenge, yielding a strong performance on the private test set.
Journal ArticleDOI
Learning latent features with local channel drop network for vehicle re-identification
TL;DR: Li et al. as mentioned in this paper proposed a local channel drop network (LCDNet) which focuses on seeking the latent features by releasing the constraint of most attentive features, and the batch ranking loss is introduced to split the samples into two groups in a batch and regularize them by enforcing a margin, which ensures the model to learn meaningful features to distinct vehicles.
Journal ArticleDOI
An efficient global representation constrained by Angular Triplet loss for vehicle re-identification
TL;DR: A simple Angular Triplet loss is introduced on the basis of analysis of different feature representations constrained by softmax loss and triplet loss, which can cooperate with softmax consistently and be seen as an effective baseline for vehicle re-identification task.
References
More filters
Proceedings ArticleDOI
Going deeper with convolutions
Christian Szegedy,Wei Liu,Yangqing Jia,Pierre Sermanet,Scott Reed,Dragomir Anguelov,Dumitru Erhan,Vincent Vanhoucke,Andrew Rabinovich +8 more
TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Journal Article
Visualizing Data using t-SNE
TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.
Proceedings ArticleDOI
FaceNet: A unified embedding for face recognition and clustering
TL;DR: A system that directly learns a mapping from face images to a compact Euclidean space where distances directly correspond to a measure offace similarity, and achieves state-of-the-art face recognition performance using only 128-bytes perface.
Book ChapterDOI
Stacked Hourglass Networks for Human Pose Estimation
TL;DR: This work introduces a novel convolutional network architecture for the task of human pose estimation that is described as a “stacked hourglass” network based on the successive steps of pooling and upsampling that are done to produce a final set of predictions.
Proceedings ArticleDOI
Scalable Person Re-identification: A Benchmark
TL;DR: A minor contribution, inspired by recent advances in large-scale image search, an unsupervised Bag-of-Words descriptor is proposed that yields competitive accuracy on VIPeR, CUHK03, and Market-1501 datasets, and is scalable on the large- scale 500k dataset.