scispace - formally typeset
Open AccessJournal ArticleDOI

Adaptive Graph Representation Learning for Video Person Re-identification

Reads0
Chats0
TLDR
This work proposes an innovative adaptive graph representation learning scheme for video person Re-ID, which enables the contextual interactions between relevant regional features and proposes a novel temporal resolution-aware regularization, which enforces the consistency among different temporal resolutions for the same identities.
Abstract
Recent years have witnessed the remarkable progress of applying deep learning models in video person re-identification (Re-ID). A key factor for video person Re-ID is to effectively construct discriminative and robust video feature representations for many complicated situations. Part-based approaches employ spatial and temporal attention to extract representative local features. While correlations between parts are ignored in the previous methods, to leverage the relations of different parts, we propose an innovative adaptive graph representation learning scheme for video person Re-ID, which enables the contextual interactions between relevant regional features. Specifically, we exploit the pose alignment connection and the feature affinity connection to construct an adaptive structure-aware adjacency graph, which models the intrinsic relations between graph nodes. We perform feature propagation on the adjacency graph to refine regional features iteratively, and the neighbor nodes' information is taken into account for part feature representation. To learn compact and discriminative representations, we further propose a novel temporal resolution-aware regularization, which enforces the consistency among different temporal resolutions for the same identities. We conduct extensive evaluations on four benchmarks, i.e. iLIDS-VID, PRID2011, MARS, and DukeMTMC-VideoReID, experimental results achieve the competitive performance which demonstrates the effectiveness of our proposed method. The code is available at this https URL.

read more

Citations
More filters
Proceedings ArticleDOI

PhD Learning: Learning with Pompeiu-hausdorff Distances for Video-based Vehicle Re-Identification

TL;DR: Li et al. as mentioned in this paper proposed a new Pompeiu-hausdorff distance (PhD) learning method for video-to-video matching, which can alleviate the data noise problem caused by the occlusion in videos and thus improve re-ID performance significantly.
Journal ArticleDOI

Hypergraph video pedestrian re-identification based on posture structure relationship and action constraints

TL;DR: A hypergraph video pedestrian re-identification method based on posture structure relationships and action constraint (PA-HVPReid) is proposed, which aims to make full use of pedestrian walking postures to obtain more discriminative features.
Journal ArticleDOI

Relation-based global-partial feature learning network for video-based person re-identification

TL;DR: Wang et al. as mentioned in this paper proposed a relation-based global-partial feature learning framework to explore discriminative spatio-temporal features with the help of the global and partial relationship between frames.
Journal ArticleDOI

A sparse graph wavelet convolution neural network for video-based person re-identification

TL;DR: Wang et al. as discussed by the authors proposed a novel sparse graph wavelet convolution neural network (SGWCNN) for video-based person Re-ID, which exploited the weighted sparse graph to model the semantic relation among the local patches of pedestrians in the video.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
Posted Content

Inductive Representation Learning on Large Graphs

TL;DR: GraphSAGE is presented, a general, inductive framework that leverages node feature information (e.g., text attributes) to efficiently generate node embeddings for previously unseen data and outperforms strong baselines on three inductive node-classification benchmarks.
Proceedings ArticleDOI

Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization

TL;DR: This work combines existing fine-grained visualizations to create a high-resolution class-discriminative visualization, Guided Grad-CAM, and applies it to image classification, image captioning, and visual question answering (VQA) models, including ResNet-based architectures.
Journal ArticleDOI

The Graph Neural Network Model

TL;DR: A new neural network model, called graph neural network (GNN) model, that extends existing neural network methods for processing the data represented in graph domains, and implements a function tau(G,n) isin IRm that maps a graph G and one of its nodes n into an m-dimensional Euclidean space.
Related Papers (5)