Proceedings ArticleDOI

A Structured Graph Attention Network for Vehicle Re-Identification

TL;DR: A Structured Graph ATtention network (SGAT) is proposed to fully exploit the inherent and extrinsic structured relationships among vehicle images, allowing message propagation to update the features of graph nodes; it achieves significant improvements over state-of-the-art methods.
Abstract: Vehicle re-identification aims to identify the same vehicle across different surveillance cameras and plays an important role in public security. Existing approaches mainly focus on exploring informative regions or learning an appropriate distance metric. However, they not only neglect the inherent structured relationship between discriminative regions within an image, but also ignore the extrinsic structured relationship among images. Both relationships are crucial to learning effective vehicle representations. In this paper, we propose a Structured Graph ATtention network (SGAT) to fully exploit these relationships and allow message propagation to update the features of graph nodes. SGAT creates two graphs for one probe image. One is an inherent structured graph based on the geometric relationships between landmarks, in which each node enhances its features with those of its neighbors. The other is an extrinsic structured graph, guided by attribute similarity, that updates image representations. Experimental results on two public vehicle re-identification datasets, VeRi-776 and VehicleID, show that our proposed method achieves significant improvements over state-of-the-art methods.
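The node update that SGAT performs on its graphs can be illustrated with a single graph-attention layer in the GAT style: neighbors are weighted by learned attention scores and aggregated into each node. The sketch below is a minimal, illustrative implementation, not the paper's code; all function names, shapes, and parameters are assumptions.

```python
import numpy as np

def graph_attention_layer(H, A, W, a, leaky=0.2):
    """One graph-attention message-passing step (GAT-style).

    H : (N, F) node features, A : (N, N) 0/1 adjacency (self-loops included),
    W : (F, F') projection, a : (2*F',) attention vector.
    """
    Z = H @ W                                    # project node features
    N = Z.shape[0]
    # pairwise attention logits e_ij = LeakyReLU(a^T [z_i || z_j])
    e = np.zeros((N, N))
    for i in range(N):
        for j in range(N):
            s = a @ np.concatenate([Z[i], Z[j]])
            e[i, j] = s if s > 0 else leaky * s
    e = np.where(A > 0, e, -1e9)                 # mask out non-neighbors
    alpha = np.exp(e - e.max(axis=1, keepdims=True))
    alpha /= alpha.sum(axis=1, keepdims=True)    # softmax over neighbors
    return np.maximum(alpha @ Z, 0.0)            # aggregate + ReLU

# toy landmark graph: 3 nodes, node 0 connected to nodes 1 and 2
rng = np.random.default_rng(0)
H = rng.standard_normal((3, 4))
A = np.array([[1, 1, 1], [1, 1, 0], [1, 0, 1]], float)
W = rng.standard_normal((4, 4))
a = rng.standard_normal(8)
H_new = graph_attention_layer(H, A, W, a)
print(H_new.shape)  # (3, 4)
```

In SGAT's setting, the adjacency for the inherent graph would come from landmark geometry and, for the extrinsic graph, from attribute similarity; here it is a hand-written toy matrix.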
Citations
Posted Content
TL;DR: In this paper, the authors propose to represent videos as space-time region graphs which capture temporal shape dynamics and functional relationships between humans and objects, and perform reasoning on this graph representation via Graph Convolutional Networks.
Abstract: How do humans recognize the action "opening a book" ? We argue that there are two important cues: modeling temporal shape dynamics and modeling functional relationships between humans and objects. In this paper, we propose to represent videos as space-time region graphs which capture these two important cues. Our graph nodes are defined by the object region proposals from different frames in a long range video. These nodes are connected by two types of relations: (i) similarity relations capturing the long range dependencies between correlated objects and (ii) spatial-temporal relations capturing the interactions between nearby objects. We perform reasoning on this graph representation via Graph Convolutional Networks. We achieve state-of-the-art results on both Charades and Something-Something datasets. Especially for Charades, we obtain a huge 4.4% gain when our model is applied in complex environments.
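The "similarity relations" described above amount to building a graph whose edges connect regions with similar features. A minimal sketch of that construction, assuming cosine similarity and hypothetical names:

```python
import numpy as np

def similarity_adjacency(X, k=2):
    """Build a k-nearest-neighbor similarity graph over region features.

    X : (N, F) region features; returns an (N, N) 0/1 adjacency matrix
    connecting each node to its k most similar nodes (plus itself).
    """
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    S = Xn @ Xn.T                          # pairwise cosine similarity
    A = np.zeros_like(S)
    for i in range(S.shape[0]):
        nbrs = np.argsort(-S[i])[:k + 1]   # top-k neighbors; self is included
        A[i, nbrs] = 1.0
    return A

# three toy region features: the first two are similar, the third differs
X = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]])
A = similarity_adjacency(X, k=1)
print(A)
```

The resulting adjacency would then feed a graph convolution for reasoning, as the abstract describes.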

278 citations

Journal ArticleDOI
TL;DR: Zhang et al. propose a novel end-to-end three-branch embedding network (TBE-Net) with feature complementary learning and part-aware ability, which integrates complementary features, global appearance, and local region features into a unified framework for subtle feature learning.
Abstract: Vehicle re-identification (Re-ID) is one of the promising applications in the field of computer vision. Existing vehicle Re-ID methods mainly focus on global appearance features or pre-defined local region features, which have difficulties in handling inter-class similarities and intra-class differences among vehicles in various traffic scenarios. This paper proposes a novel end-to-end three-branch embedding network (TBE-Net) with feature complementary learning and part-aware ability. The proposed TBE-Net integrates complementary features, global appearance, and local region features into a unified framework for subtle feature learning, thereby obtaining more integral and diverse vehicle features to re-identify the vehicle from similar ones. The local region feature branch in the proposed TBE-Net contains an attention module that highlights the major differences among local regions by adaptively assigning large weights to the critical local regions and small weights to insignificant local regions, thereby enhancing the perception sensitivity of the network to subtle discrepancies. The complementary branch in the proposed TBE-Net exploits different pooling operations to obtain more comprehensive structural features and multi-granularity features as a supplement to the global appearance and local region features. The abundant features help accommodate the ever-changing critical local regions in vehicles’ images due to the sensors’ settings, such as the position and shooting angle of surveillance cameras. The extensive experiments on VehicleID and VeRi-776 datasets show that the proposed TBE-Net outperforms the state-of-the-art methods.

21 citations

Journal ArticleDOI
TL;DR: Zhang et al. propose a vehicle attribute transformer (VAT) for vehicle re-identification that treats color and model as the most intuitive vehicle attributes, since both are relatively stable and easy to distinguish.
Abstract: With the continuous development of intelligent transportation systems, vehicle-related fields have seen a research boom in detection, tracking, and retrieval. Vehicle re-identification, a popular research direction, aims to judge whether a specific vehicle appears in a video stream. Previous research has shown that the transformer, which treats a visual image as a series of patch sequences, is an efficient method in computer vision. However, efficient vehicle re-identification should consider the image feature and the attribute feature simultaneously. In this work, we propose a vehicle attribute transformer (VAT) for vehicle re-identification. First, we consider color and model the most intuitive attributes of a vehicle: they are relatively stable and easy to distinguish. Therefore, the color feature and the model feature are embedded in a transformer. Second, because the shooting angle of each image may differ, we encode the viewpoint of the vehicle image as an additional attribute. Moreover, different attributes should carry different importance. Based on this, we design a multi-attribute adaptive aggregation network that compares different attributes and assigns different weights to the corresponding features. Finally, to optimize the proposed transformer network, we design a multi-sample dispersion triplet (MDT) loss, which considers not only the hardest samples from a hard-mining strategy but also extra positive and negative samples. The dispersion of these multiple samples dynamically adjusts the loss, guiding the network to learn a better-optimized division of the feature space. Extensive experiments on popular vehicle re-identification datasets verify that the proposed method achieves state-of-the-art performance.
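The MDT loss extends hard mining with extra samples and a dispersion term. As background, the standard batch-hard triplet loss it builds on can be sketched as follows; this is the generic baseline (Hermans et al. style), not the paper's MDT implementation.

```python
import numpy as np

def batch_hard_triplet_loss(feats, labels, margin=0.3):
    """Batch-hard triplet loss: for each anchor, take its hardest
    (farthest) positive and hardest (closest) negative in the batch.

    feats : (N, F) embeddings, labels : (N,) identity labels.
    """
    # pairwise Euclidean distance matrix
    d = np.linalg.norm(feats[:, None, :] - feats[None, :, :], axis=2)
    same = labels[:, None] == labels[None, :]
    losses = []
    for i in range(len(labels)):
        pos = d[i][same[i] & (np.arange(len(labels)) != i)]
        neg = d[i][~same[i]]
        losses.append(max(0.0, pos.max() - neg.min() + margin))
    return float(np.mean(losses))

# two well-separated identities: the margin is satisfied, so the loss is 0
feats = np.array([[0.0, 0.0], [0.1, 0.0], [1.0, 1.0], [1.1, 1.0]])
labels = np.array([0, 0, 1, 1])
print(batch_hard_triplet_loss(feats, labels))  # 0.0
```

MDT, as described above, would additionally sample extra positives and negatives and use their dispersion to reweight this objective.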

19 citations


Proceedings ArticleDOI
12 Oct 2020
TL;DR: This paper proposes a part perspective transformation (PPT) module to map the different parts of a vehicle into a unified perspective, together with a dynamic batch-hard triplet loss that selects the common visible regions of the compared vehicles.
Abstract: Given a query image, vehicle re-identification is to search for the same vehicle in multi-camera scenarios, a task that has attracted much attention in recent years. However, vehicle ReID severely suffers from the perspective-variation problem: for different vehicles with similar color and type taken from different perspectives, all visual patterns are misaligned and warped, making it hard for the model to find the exact discriminative regions. In this paper, we propose a part perspective transformation (PPT) module to map the different parts of a vehicle into a unified perspective. PPT disentangles the vehicle features of different perspectives and then aligns them at a fine-grained level. Further, we propose a dynamic batch-hard triplet loss to select the common visible regions of the compared vehicles. Our approach helps the model generate perspective-invariant features and find the exact distinguishable regions for vehicle ReID. Extensive experiments on three standard vehicle ReID datasets show the effectiveness of our method.
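Mapping a part into a unified perspective is, at bottom, a projective (homography) warp. The sketch below shows that basic operation on 2D points; the learned PPT module itself is not reproduced here, and the toy homography is an assumption for illustration.

```python
import numpy as np

def warp_points(H, pts):
    """Apply a 3x3 homography H to 2D points, the elementary operation
    behind mapping a vehicle part into a unified (fronto-parallel) view.

    pts : (N, 2) -> warped (N, 2)
    """
    ones = np.ones((pts.shape[0], 1))
    hom = np.hstack([pts, ones]) @ H.T      # lift to homogeneous coordinates
    return hom[:, :2] / hom[:, 2:3]         # project back to Cartesian

# toy homography: a pure translation by (2, 3)
H = np.array([[1.0, 0.0, 2.0],
              [0.0, 1.0, 3.0],
              [0.0, 0.0, 1.0]])
pts = np.array([[0.0, 0.0], [1.0, 1.0]])
warped = warp_points(H, pts)  # maps (0,0)->(2,3) and (1,1)->(3,4)
```

A real PPT-style module would predict such transform parameters per part and warp feature maps rather than bare points.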

11 citations


Cites background from "A Structured Graph Attention Network for Vehicle Re-Identification"

  • ...It has attracted much attention recently as it plays an important role in the field of intelligent transportation systems and smart cities [3, 4, 13, 14, 16, 17, 19, 20, 41]....

References
Proceedings Article
01 Jan 2015
TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.
Abstract: In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough evaluation of networks of increasing depth using an architecture with very small (3x3) convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers. These findings were the basis of our ImageNet Challenge 2014 submission, where our team secured the first and the second places in the localisation and classification tracks respectively. We also show that our representations generalise well to other datasets, where they achieve state-of-the-art results. We have made our two best-performing ConvNet models publicly available to facilitate further research on the use of deep visual representations in computer vision.

49,914 citations

Posted Content
TL;DR: A scalable approach for semi-supervised learning on graph-structured data, based on an efficient variant of convolutional neural networks that operates directly on graphs and outperforms related methods by a significant margin.
Abstract: We present a scalable approach for semi-supervised learning on graph-structured data that is based on an efficient variant of convolutional neural networks which operate directly on graphs. We motivate the choice of our convolutional architecture via a localized first-order approximation of spectral graph convolutions. Our model scales linearly in the number of graph edges and learns hidden layer representations that encode both local graph structure and features of nodes. In a number of experiments on citation networks and on a knowledge graph dataset we demonstrate that our approach outperforms related methods by a significant margin.
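The layer-wise propagation rule of this model, commonly written H' = σ(D^{-1/2}(A+I)D^{-1/2} H W), can be sketched directly; shapes and the toy graph below are illustrative assumptions.

```python
import numpy as np

def gcn_layer(A, H, W):
    """One propagation step of the GCN rule (Kipf & Welling):
    H' = ReLU(D^{-1/2} (A + I) D^{-1/2} H W).

    A : (N, N) adjacency, H : (N, F) node features, W : (F, F') weights.
    """
    A_hat = A + np.eye(A.shape[0])              # add self-loops
    d = A_hat.sum(axis=1)
    D_inv_sqrt = np.diag(1.0 / np.sqrt(d))
    A_norm = D_inv_sqrt @ A_hat @ D_inv_sqrt    # symmetric normalization
    return np.maximum(A_norm @ H @ W, 0.0)

# two connected nodes with one-hot features: each output row averages
# the node's own features with its neighbor's
A = np.array([[0, 1], [1, 0]], float)
H = np.eye(2)
out = gcn_layer(A, H, np.eye(2))
```

Stacking such layers mixes information over increasingly large neighborhoods, which is what makes the model linear in the number of edges.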

15,696 citations


"A Structured Graph Attention Networ..." refers background in this paper

  • ...Recently, Graph Neural Network [2, 3, 7, 12] has drawn increasing attention in machine learning and multimedia community because it can generalize neural networks for structured data and propagate messages between different nodes in a topology structured graph....

  • ...Graph Neural Network....

28 Oct 2017
TL;DR: The automatic differentiation module of PyTorch, a library designed to enable rapid research on machine learning models, is described; it focuses on differentiation of purely imperative programs, with an emphasis on extensibility and low overhead.
Abstract: In this article, we describe an automatic differentiation module of PyTorch — a library designed to enable rapid research on machine learning models. It builds upon a few projects, most notably Lua Torch, Chainer, and HIPS Autograd [4], and provides a high performance environment with easy access to automatic differentiation of models executed on different devices (CPU and GPU). To make prototyping easier, PyTorch does not follow the symbolic approach used in many other deep learning frameworks, but focuses on differentiation of purely imperative programs, with a focus on extensibility and low overhead. Note that this preprint is a draft of certain sections from an upcoming paper covering all PyTorch features.
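The define-by-run idea, recording the computation graph while ordinary imperative code executes, can be illustrated with a minimal scalar reverse-mode autodiff class. This is a toy analogue of the concept, not PyTorch's actual implementation; all names are hypothetical.

```python
class Var:
    """Minimal define-by-run reverse-mode autodiff on scalars: the graph
    is recorded as side effects of ordinary arithmetic, the same idea
    PyTorch's autograd applies to tensors."""
    def __init__(self, value, parents=()):
        self.value = value
        self.grad = 0.0
        self._parents = parents        # (parent_var, local_gradient) pairs

    def __add__(self, other):
        other = other if isinstance(other, Var) else Var(other)
        return Var(self.value + other.value, ((self, 1.0), (other, 1.0)))

    def __mul__(self, other):
        other = other if isinstance(other, Var) else Var(other)
        return Var(self.value * other.value,
                   ((self, other.value), (other, self.value)))

    def backward(self, seed=1.0):
        """Propagate gradients back along every recorded path."""
        self.grad += seed
        for parent, local in self._parents:
            parent.backward(seed * local)

x = Var(2.0)
y = Var(3.0)
z = x * y + x          # the graph is built as this line executes
z.backward()
print(x.grad, y.grad)  # dz/dx = y + 1 = 4.0, dz/dy = x = 2.0
```

Because the graph is rebuilt on every run, control flow (loops, branches) in the host language works transparently, which is the "imperative" advantage the abstract refers to.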

13,268 citations


"A Structured Graph Attention Networ..." refers methods in this paper

  • ...Our proposed method is implemented on the Pytorch [30] platform and trained with four NVIDIA GTX 1080ti GPUs....

Posted Content
TL;DR: In this article, a spectral graph theory formulation of convolutional neural networks (CNNs) was proposed to learn local, stationary, and compositional features on graphs, and the proposed technique offers the same linear computational complexity and constant learning complexity as classical CNNs while being universal to any graph structure.
Abstract: In this work, we are interested in generalizing convolutional neural networks (CNNs) from low-dimensional regular grids, where image, video and speech are represented, to high-dimensional irregular domains, such as social networks, brain connectomes or words' embedding, represented by graphs. We present a formulation of CNNs in the context of spectral graph theory, which provides the necessary mathematical background and efficient numerical schemes to design fast localized convolutional filters on graphs. Importantly, the proposed technique offers the same linear computational complexity and constant learning complexity as classical CNNs, while being universal to any graph structure. Experiments on MNIST and 20NEWS demonstrate the ability of this novel deep learning system to learn local, stationary, and compositional features on graphs.
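The fast localized filters described here are Chebyshev polynomials of the rescaled graph Laplacian, evaluated by recurrence rather than eigendecomposition. A minimal sketch of applying such a filter; the coefficients and toy graph are assumptions.

```python
import numpy as np

def chebyshev_filter(A, X, theta):
    """Apply a Chebyshev spectral filter sum_k theta_k T_k(L~) X, where
    L~ is the graph Laplacian rescaled so its spectrum lies in [-1, 1]
    (Defferrard-style localized filtering).

    A : (N, N) adjacency, X : (N, F) graph signals, theta : K coefficients.
    """
    N = A.shape[0]
    L = np.diag(A.sum(axis=1)) - A              # combinatorial Laplacian
    lam_max = np.linalg.eigvalsh(L).max()
    L_tilde = 2.0 * L / lam_max - np.eye(N)     # rescale spectrum to [-1, 1]
    T_prev, T_curr = X, L_tilde @ X             # T_0 X and T_1 X
    out = theta[0] * T_prev
    if len(theta) > 1:
        out = out + theta[1] * T_curr
    for k in range(2, len(theta)):
        T_next = 2.0 * (L_tilde @ T_curr) - T_prev   # Chebyshev recurrence
        out = out + theta[k] * T_next
        T_prev, T_curr = T_curr, T_next
    return out

A = np.array([[0, 1], [1, 0]], float)
X = np.array([[1.0], [0.0]])
filtered = chebyshev_filter(A, X, [0.5, 0.5])
```

A K-term filter only mixes nodes up to K hops apart, which is why the cost stays linear in the number of edges, as the abstract claims.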

4,562 citations

Journal ArticleDOI
TL;DR: This work proposes EdgeConv, a new neural network module suitable for CNN-based high-level tasks on point clouds, including classification and segmentation, which acts on graphs dynamically computed in each layer of the network.
Abstract: Point clouds provide a flexible geometric representation suitable for countless applications in computer graphics; they also comprise the raw output of most 3D data acquisition devices. While hand-designed features on point clouds have long been proposed in graphics and vision, the recent overwhelming success of convolutional neural networks (CNNs) for image analysis suggests the value of adapting insight from CNNs to the point cloud world. Point clouds inherently lack topological information, so designing a model to recover topology can enrich the representation power of point clouds. To this end, we propose a new neural network module dubbed EdgeConv suitable for CNN-based high-level tasks on point clouds, including classification and segmentation. EdgeConv acts on graphs dynamically computed in each layer of the network. It is differentiable and can be plugged into existing architectures. Compared to existing modules operating in extrinsic space or treating each point independently, EdgeConv has several appealing properties: it incorporates local neighborhood information; it can be stacked to learn global shape properties; and in multi-layer systems affinity in feature space captures semantic characteristics over potentially long distances in the original embedding. We show the performance of our model on standard benchmarks, including ModelNet40, ShapeNetPart, and S3DIS.
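EdgeConv's core step, forming edge features from each point and its neighbors and max-pooling over the neighborhood, can be sketched as follows. For clarity the shared MLP is replaced by the identity map, so this is an illustrative reduction, not the full module.

```python
import numpy as np

def edge_conv(X, k=2):
    """One EdgeConv-style aggregation: for each point, gather its k nearest
    neighbors, build edge features [x_i, x_j - x_i], and max-pool over the
    neighborhood (the learned shared map is omitted here).

    X : (N, F) point features -> (N, 2F) updated features.
    """
    # the graph is recomputed from the current features: "dynamic" graph
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    out = []
    for i in range(X.shape[0]):
        nbrs = np.argsort(d[i])[1:k + 1]            # exclude the point itself
        edges = [np.concatenate([X[i], X[j] - X[i]]) for j in nbrs]
        out.append(np.max(edges, axis=0))           # channel-wise max-pool
    return np.array(out)

X = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 2.0]])
Y = edge_conv(X, k=1)   # each row: [own coords, offset to nearest neighbor]
```

Recomputing the k-NN graph in feature space at each layer is what lets deeper layers connect semantically similar but spatially distant points.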

3,727 citations


"A Structured Graph Attention Networ..." refers background in this paper

  • ...It has been adopted successfully in many multimedia tasks, such as image classification [26], visual question answering [29], graph classification [13, 47], object tracking [5], point clouds processing [37], action recognition [36] and person search [44] etc....
