The Eighth Visual Object Tracking VOT2020 Challenge Results

doi:10.1007/978-3-030-68238-5_39

Book ChapterDOI

The Eighth Visual Object Tracking VOT2020 Challenge Results

- pp 547-601

TLDR

A significant novelty is introduction of a new VOT short-term tracking evaluation methodology, and introduction of segmentation ground truth in the VOT-ST2020 challenge – bounding boxes will no longer be used in theVDT challenges.

Abstract:

The Visual Object Tracking challenge VOT2020 is the eighth annual tracker benchmarking activity organized by the VOT initiative. Results of 58 trackers are presented; many are state-of-the-art trackers published at major computer vision conferences or in journals in the recent years. The VOT2020 challenge was composed of five sub-challenges focusing on different tracking domains: (i) VOT-ST2020 challenge focused on short-term tracking in RGB, (ii) VOT-RT2020 challenge focused on “real-time” short-term tracking in RGB, (iii) VOT-LT2020 focused on long-term tracking namely coping with target disappearance and reappearance, (iv) VOT-RGBT2020 challenge focused on short-term tracking in RGB and thermal imagery and (v) VOT-RGBD2020 challenge focused on long-term tracking in RGB and depth imagery. Only the VOT-ST2020 datasets were refreshed. A significant novelty is introduction of a new VOT short-term tracking evaluation methodology, and introduction of segmentation ground truth in the VOT-ST2020 challenge – bounding boxes will no longer be used in the VOT-ST challenges. A new VOT Python toolkit that implements all these novelites was introduced. Performance of the tested trackers typically by far exceeds standard baselines. The source code for most of the trackers is publicly available from the VOT page. The dataset, the evaluation kit and the results are publicly available at the challenge website (http://votchallenge.net).

The Eighth Visual Object Tracking VOT2020 Challenge Results

Citations

RFN-Nest: An end-to-end residual fusion network for infrared and visible images

Alpha-Refine: Boosting Tracking Performance by Precise Bounding Box Estimation

STMTrack: Template-free Visual Tracking with Space-time Memory Networks

STMTrack: Template-free Visual Tracking with Space-time Memory Networks

MixFormer: End-to-End Tracking with Iterative Mixed Attention

References

U-Net: Convolutional Networks for Biomedical Image Segmentation

ImageNet Large Scale Visual Recognition Challenge

Microsoft COCO: Common Objects in Context

U-Net: Convolutional Networks for Biomedical Image Segmentation

Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation

Related Papers (5)

High Performance Visual Tracking with Siamese Region Proposal Network

SiamRPN++: Evolution of Siamese Visual Tracking With Very Deep Networks

Fully-Convolutional Siamese Networks for Object Tracking

Fast Online Object Tracking and Segmentation: A Unifying Approach

High-Speed Tracking with Kernelized Correlation Filters