scispace - formally typeset
Proceedings ArticleDOI

Orientation Invariant Feature Embedding and Spatial Temporal Regularization for Vehicle Re-identification

Reads0
Chats0
TLDR
Both the orientation invariant feature embedding and the spatio-temporal regularization achieve considerable improvements in the vehicle Re-identification problem.
Abstract
In this paper, we tackle the vehicle Re-identification (ReID) problem which is of great importance in urban surveillance and can be used for multiple applications. In our vehicle ReID framework, an orientation invariant feature embedding module and a spatial-temporal regularization module are proposed. With orientation invariant feature embedding, local region features of different orientations can be extracted based on 20 key point locations and can be well aligned and combined. With spatial-temporal regularization, the log-normal distribution is adopted to model the spatial-temporal constraints and the retrieval results can be refined. Experiments are conducted on public vehicle ReID datasets and our proposed method achieves state-of-the-art performance. Investigations of the proposed framework is conducted, including the landmark regressor and comparisons with attention mechanism. Both the orientation invariant feature embedding and the spatio-temporal regularization achieve considerable improvements.

read more

Citations
More filters
Proceedings ArticleDOI

Local-guided Global Collaborative Learning Transformer for Vehicle Reidentification

TL;DR: Li et al. as mentioned in this paper proposed a global collaborative learning Transformer guided by local abstract features, which aims to highlight the highest-attention regions of vehicle images, and adopted Vision Transformer(ViT) as their backbone to extract global features and obtain all local tokens.
Proceedings ArticleDOI

A Vehicle Re-Identification Method Based on Fine-Grained Features and Metric Learning

He Yan, +1 more
TL;DR: In this paper , the fine-grained features of vehicles are extracted by using triplet constraints, and then combined with the global features extracted by the backbone network as vehicle features.
Journal ArticleDOI

Bi-Level Implicit Semantic Data Augmentation for Vehicle Re-Identification

TL;DR: Zhang et al. as mentioned in this paper proposed the Bi-level Implicit Semantic Data Augmentation (BIDA) framework to enhance the robustness of Re-ID models by augmenting the images semantically in the feature space according to the identity-level and superclass-level intra-class variations.
Proceedings ArticleDOI

Vehicle re-identification method based on semantic information enhancement and feature complementarity guided by keypoints

Huicheng Luo
TL;DR: Zhang et al. as mentioned in this paper proposed a vehicle re-ID method based on keypoint-guided semantic feature alignment and graph matching strategy to enhance complementary information under the transformer framework.
Journal ArticleDOI

Multiple Soft Attention Network for Vehicle Re-Identification

TL;DR: In this article , the authors proposed a multiple soft attention network to provide part-aware attention weights and extract more representative and robust features for vehicle ReID, which achieved state-of-the-art performance among the approaches that did not use metadata.
References
More filters
Proceedings ArticleDOI

Going deeper with convolutions

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Journal Article

Visualizing Data using t-SNE

TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.
Proceedings ArticleDOI

FaceNet: A unified embedding for face recognition and clustering

TL;DR: A system that directly learns a mapping from face images to a compact Euclidean space where distances directly correspond to a measure offace similarity, and achieves state-of-the-art face recognition performance using only 128-bytes perface.
Book ChapterDOI

Stacked Hourglass Networks for Human Pose Estimation

TL;DR: This work introduces a novel convolutional network architecture for the task of human pose estimation that is described as a “stacked hourglass” network based on the successive steps of pooling and upsampling that are done to produce a final set of predictions.
Proceedings ArticleDOI

Scalable Person Re-identification: A Benchmark

TL;DR: A minor contribution, inspired by recent advances in large-scale image search, an unsupervised Bag-of-Words descriptor is proposed that yields competitive accuracy on VIPeR, CUHK03, and Market-1501 datasets, and is scalable on the large- scale 500k dataset.
Related Papers (5)