Siamese Box Adaptive Network for Visual Tracking

Open AccessPosted Content

Siamese Box Adaptive Network for Visual Tracking

Zedu Chen, +4 more

- 15 Mar 2020 -

arXiv: Computer Vision and Pattern Recog...

Chats0

TLDR

SiamBAN as discussed by the authors views the visual tracking problem as a parallel classification and regression problem, and thus directly classifies objects and regresses their bounding boxes in a unified FCN.

Abstract:

Most of the existing trackers usually rely on either a multi-scale searching scheme or pre-defined anchor boxes to accurately estimate the scale and aspect ratio of a target. Unfortunately, they typically call for tedious and heuristic configurations. To address this issue, we propose a simple yet effective visual tracking framework (named Siamese Box Adaptive Network, SiamBAN) by exploiting the expressive power of the fully convolutional network (FCN). SiamBAN views the visual tracking problem as a parallel classification and regression problem, and thus directly classifies objects and regresses their bounding boxes in a unified FCN. The no-prior box design avoids hyper-parameters associated with the candidate boxes, making SiamBAN more flexible and general. Extensive experiments on visual tracking benchmarks including VOT2018, VOT2019, OTB100, NFS, UAV123, and LaSOT demonstrate that SiamBAN achieves state-of-the-art performance and runs at 40 FPS, confirming its effectiveness and efficiency. The code will be available at this https URL.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Deep Learning for Visual Tracking: A Comprehensive Survey

Seyed Mojtaba Marvasti-Zadeh, +3 more

- 28 Jan 2021 -

IEEE Transactions on Intelligent Transpo...

TL;DR: This survey aims to systematically investigate the current DL-based visual tracking methods, benchmark datasets, and evaluation metrics, and extensively evaluates and analyzes the leading visualtracking methods.

...read moreread less

Posted Content

Graph Attention Tracking

Dongyan Guo, +5 more

- 23 Nov 2020 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A simple target-aware Siamese graph attention network for general object tracking that establishes part-to-part correspondence between the target and the search region with a complete bipartite graph, and applies the graph attention mechanism to propagate target information from the template feature to the search feature.

...read moreread less

Posted Content

Siamese Network for RGB-D Salient Object Detection and Beyond.

Keren Fu, +5 more

- 26 Aug 2020 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: The proposed JL-DCF module provides robust saliency feature learning by exploiting cross-modal commonality via a Siamese network, while the DCF module is introduced for complementary feature discovery.

...read moreread less

Posted Content

STMTrack: Template-free Visual Tracking with Space-time Memory Networks

Zhihong Fu, +3 more

- 01 Apr 2021 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A novel tracking framework built on top of a space-time memory network that is competent to make full use of historical information related to the target for better adapting to appearance variations during tracking is proposed.

...read moreread less

Proceedings ArticleDOI

MixFormer: End-to-End Tracking with Iterative Mixed Attention

Yutao Cui, +3 more

TL;DR: This paper proposes a compact tracking framework, termed as MixFormer, built upon transformers, to utilize the flexibility of attention operations, and proposes a Mixed Attention Module (MAM) for simultaneous feature extraction and target information integration.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

Journal ArticleDOI

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Dec 2015 -

International Journal of Computer Vision

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.

...read moreread less

Book ChapterDOI

Microsoft COCO: Common Objects in Context

Tsung-Yi Lin, +7 more

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.

...read moreread less

Proceedings ArticleDOI

You Only Look Once: Unified, Real-Time Object Detection

Joseph Redmon, +3 more

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

...read moreread less

Posted Content

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren, +3 more

- 04 Jun 2015 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Faster R-CNN as discussed by the authors proposes a Region Proposal Network (RPN) to generate high-quality region proposals, which are used by Fast R-NN for detection.

...read moreread less

Collapse

Related Papers (5)

SiamRPN++: Evolution of Siamese Visual Tracking With Very Deep Networks

Bo Li, +5 more

Object Tracking Benchmark

Yi Wu, +2 more

- 01 Sep 2015 -

IEEE Transactions on Pattern Analysis an...

Siamese Box Adaptive Network for Visual Tracking

Citations

Deep Learning for Visual Tracking: A Comprehensive Survey

Graph Attention Tracking

Siamese Network for RGB-D Salient Object Detection and Beyond.

STMTrack: Template-free Visual Tracking with Space-time Memory Networks

MixFormer: End-to-End Tracking with Iterative Mixed Attention

References

Deep Residual Learning for Image Recognition

ImageNet Large Scale Visual Recognition Challenge

Microsoft COCO: Common Objects in Context

You Only Look Once: Unified, Real-Time Object Detection

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Related Papers (5)

SiamRPN++: Evolution of Siamese Visual Tracking With Very Deep Networks

Object Tracking Benchmark

Deep Residual Learning for Image Recognition

Fully-Convolutional Siamese Networks for Object Tracking

High Performance Visual Tracking with Siamese Region Proposal Network