scispace - formally typeset
Proceedings ArticleDOI

Orientation Invariant Feature Embedding and Spatial Temporal Regularization for Vehicle Re-identification

Reads0
Chats0
TLDR
Both the orientation invariant feature embedding and the spatio-temporal regularization achieve considerable improvements in the vehicle Re-identification problem.
Abstract
In this paper, we tackle the vehicle Re-identification (ReID) problem which is of great importance in urban surveillance and can be used for multiple applications. In our vehicle ReID framework, an orientation invariant feature embedding module and a spatial-temporal regularization module are proposed. With orientation invariant feature embedding, local region features of different orientations can be extracted based on 20 key point locations and can be well aligned and combined. With spatial-temporal regularization, the log-normal distribution is adopted to model the spatial-temporal constraints and the retrieval results can be refined. Experiments are conducted on public vehicle ReID datasets and our proposed method achieves state-of-the-art performance. Investigations of the proposed framework is conducted, including the landmark regressor and comparisons with attention mechanism. Both the orientation invariant feature embedding and the spatio-temporal regularization achieve considerable improvements.

read more

Citations
More filters
Posted Content

Unsupervised Vehicle Re-identification with Progressive Adaptation

TL;DR: A novel progressive adaptation learning method is proposed for vehicle reID, named PAL, which infers from the abundant data without annotations, and a weighted label smoothing (WLS) loss is proposed, which considers the similarity between samples with different clusters to balance the confidence of pseudo labels.
Book ChapterDOI

MRNet: A Keypoint Guided Multi-scale Reasoning Network for Vehicle Re-identification

TL;DR: In this article, the authors propose an end-to-end framework called Keypoint Guided Multi-Scale Reasoning Network (MRNet) to infer multi-view vehicle features from a one-view image.
Journal ArticleDOI

A Multi-granularity Retrieval System for Natural Language-based Vehicle Retrieval

TL;DR: A multi-granularity retrieval system, consisting of three main modules that aims to obtain the fine-grained vehicle attributes from the language descriptions, has achieved the 1st place on the 6th AI City Challenge, yielding a strong performance on the private test set.
Journal ArticleDOI

Learning latent features with local channel drop network for vehicle re-identification

TL;DR: Li et al. as mentioned in this paper proposed a local channel drop network (LCDNet) which focuses on seeking the latent features by releasing the constraint of most attentive features, and the batch ranking loss is introduced to split the samples into two groups in a batch and regularize them by enforcing a margin, which ensures the model to learn meaningful features to distinct vehicles.
Journal ArticleDOI

An efficient global representation constrained by Angular Triplet loss for vehicle re-identification

TL;DR: A simple Angular Triplet loss is introduced on the basis of analysis of different feature representations constrained by softmax loss and triplet loss, which can cooperate with softmax consistently and be seen as an effective baseline for vehicle re-identification task.
References
More filters
Proceedings ArticleDOI

Going deeper with convolutions

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Journal Article

Visualizing Data using t-SNE

TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.
Proceedings ArticleDOI

FaceNet: A unified embedding for face recognition and clustering

TL;DR: A system that directly learns a mapping from face images to a compact Euclidean space where distances directly correspond to a measure offace similarity, and achieves state-of-the-art face recognition performance using only 128-bytes perface.
Book ChapterDOI

Stacked Hourglass Networks for Human Pose Estimation

TL;DR: This work introduces a novel convolutional network architecture for the task of human pose estimation that is described as a “stacked hourglass” network based on the successive steps of pooling and upsampling that are done to produce a final set of predictions.
Proceedings ArticleDOI

Scalable Person Re-identification: A Benchmark

TL;DR: A minor contribution, inspired by recent advances in large-scale image search, an unsupervised Bag-of-Words descriptor is proposed that yields competitive accuracy on VIPeR, CUHK03, and Market-1501 datasets, and is scalable on the large- scale 500k dataset.
Related Papers (5)