Author

Bao Xin Chen

Bio: Bao Xin Chen is an academic researcher from York University. The author has contributed to research in topics: Bulk synchronous parallel & Computer science. The author has an h-index of 6 and has co-authored 13 publications receiving 346 citations.

Papers

Proceedings Article•DOI•

The Seventh Visual Object Tracking VOT2019 Challenge Results

Matej Kristan¹, Amanda Berg², Linyu Zheng³, Litu Rout⁴ +176 more•Institutions (43)

01 Oct 2019

TL;DR: The Visual Object Tracking challenge VOT2019 is the seventh annual tracker benchmarking activity organized by the VOT initiative; results of 81 trackers are presented, many of which are state-of-the-art trackers published at major computer vision conferences or in journals in recent years.

Abstract: The Visual Object Tracking challenge VOT2019 is the seventh annual tracker benchmarking activity organized by the VOT initiative. Results of 81 trackers are presented; many are state-of-the-art trackers published at major computer vision conferences or in journals in recent years. The evaluation included the standard VOT and other popular methodologies for short-term tracking analysis as well as the standard VOT methodology for long-term tracking analysis. The VOT2019 challenge was composed of five challenges focusing on different tracking domains: (i) VOT-ST2019 challenge focused on short-term tracking in RGB, (ii) VOT-RT2019 challenge focused on "real-time" short-term tracking in RGB, (iii) VOT-LT2019 focused on long-term tracking, namely coping with target disappearance and reappearance. Two new challenges have been introduced: (iv) VOT-RGBT2019 challenge focused on short-term tracking in RGB and thermal imagery and (v) VOT-RGBD2019 challenge focused on long-term tracking in RGB and depth imagery. The VOT-ST2019, VOT-RT2019 and VOT-LT2019 datasets were refreshed while new datasets were introduced for VOT-RGBT2019 and VOT-RGBD2019. The VOT toolkit has been updated to support standard short-term tracking, long-term tracking, and tracking with multi-channel imagery. Performance of the tested trackers typically by far exceeds standard baselines. The source code for most of the trackers is publicly available from the VOT page. The dataset, the evaluation kit and the results are publicly available at the challenge website.

393 citations

Book Chapter•DOI•

Integrating Stereo Vision with a CNN Tracker for a Person-Following Robot

Bao Xin Chen¹, Raghavender Sahdev¹, John K. Tsotsos¹•Institutions (1)

York University¹

10 Jul 2017

TL;DR: A stereo vision based CNN tracker for a person following robot able to track a person in real-time using an online convolutional neural network that enables the robot to follow a target under challenging situations.

Abstract: In this paper, we introduce a stereo vision based CNN tracker for a person following robot. The tracker is able to track a person in real-time using an online convolutional neural network. Our approach enables the robot to follow a target under challenging situations such as occlusions, appearance changes, pose changes, crouching, illumination changes or people wearing the same clothes in different environments. The robot follows the target around corners even when it is momentarily unseen by estimating and replicating the local path of the target. We build an extensive dataset for person following robots under challenging situations. We evaluate the proposed system quantitatively by comparing our tracking approach with existing real-time tracking algorithms.

54 citations

Proceedings Article•DOI•

Person Following Robot Using Selected Online Ada-Boosting with Stereo Camera

Bao Xin Chen¹, Raghavender Sahdev¹, John K. Tsotsos¹•Institutions (1)

York University¹

16 May 2017

TL;DR: A modified Online Ada-Boosting (OAB) tracking algorithm with integrated scene depth information obtained from a stereo camera which runs in real-time on a mobile robot.

Abstract: Person following behavior is an important task for social robots. To enable robots to follow a person, we have to track the target in real-time without critical failures. There are many situations where the robot will potentially lose tracking in a dynamic environment, e.g., occlusion, illumination, pose-changes, etc. Often, people use a complex tracking algorithm to improve robustness. However, the trade-off is that their approaches may not be able to run in real-time on mobile robots. In this paper, we present the Selected Online Ada-Boosting (SOAB) technique, a modified Online Ada-Boosting (OAB) tracking algorithm with integrated scene depth information obtained from a stereo camera, which runs in real-time on a mobile robot. We build and share our results on the performance of our technique on a new stereo dataset for the task of person following. The dataset covers different challenging situations like squatting, partial and complete occlusion of the target being tracked, people wearing similar clothes, appearance changes, walking facing the front and back side of the person to the robot, and normal walking.

42 citations

Proceedings Article•DOI•

Dynamic Stale Synchronous Parallel Distributed Training for Deep Learning

Xing Zhao¹, Aijun An¹, Junfeng Liu², Bao Xin Chen¹•Institutions (2)

York University¹, IBM²

07 Jul 2019

TL;DR: In this article, a distributed paradigm on the parameter server framework called Dynamic Stale Synchronous Parallel (DSSP) is presented, which improves the state-of-the-art Stale Synchronous Parallel (SSP) paradigm by dynamically determining the staleness threshold at run time.

Abstract: Deep learning is a popular machine learning technique and has been applied to many real-world problems, ranging from computer vision to natural language processing. However, training a deep neural network is very time-consuming, especially on big data. It has become difficult for a single machine to train a large model over large datasets. A popular solution is to distribute and parallelize the training process across multiple machines using the parameter server framework. In this paper, we present a distributed paradigm on the parameter server framework called Dynamic Stale Synchronous Parallel (DSSP) which improves the state-of-the-art Stale Synchronous Parallel (SSP) paradigm by dynamically determining the staleness threshold at run time. Conventionally, to run distributed training in SSP, the user needs to specify a particular staleness threshold as a hyper-parameter. However, a user does not usually know how to set the threshold and thus often finds a threshold value through trial and error, which is time-consuming. Based on workers' recent processing time, our approach DSSP adaptively adjusts the threshold per iteration at run time to reduce the waiting time of faster workers for synchronization of the globally shared parameters (the weights of the model), and consequently increases the frequency of parameter updates (increases iteration throughput), which speeds up convergence. We compare DSSP with other paradigms such as Bulk Synchronous Parallel (BSP), Asynchronous Parallel (ASP), and SSP by running deep neural network (DNN) models over GPU clusters in both homogeneous and heterogeneous environments. The results show that in a heterogeneous environment where the cluster consists of mixed models of GPUs, DSSP converges to a higher accuracy much earlier than SSP and BSP and performs similarly to ASP. In a homogeneous distributed cluster, DSSP has more stable and slightly better performance than SSP and ASP, and converges much faster than BSP.
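
The staleness idea in the abstract above can be illustrated with a toy sketch. It is not the authors' implementation: `StalenessGate` models the generic SSP rule (a worker may run ahead of the slowest worker by at most a fixed number of iterations), and `pick_threshold` is a hypothetical heuristic, invented here for illustration, that chooses a threshold inside a user-given range from recent per-worker iteration times. The class name, function name, and the ratio-based rule are all assumptions.

```python
class StalenessGate:
    """Toy SSP-style gate: a worker may lead the slowest worker
    by at most `threshold` iterations before it has to wait."""

    def __init__(self, num_workers, threshold):
        self.threshold = threshold
        self.clocks = [0] * num_workers  # completed iterations per worker

    def tick(self, worker):
        """Record that `worker` finished one more iteration."""
        self.clocks[worker] += 1

    def can_proceed(self, worker):
        """True if `worker` is within the staleness bound."""
        return self.clocks[worker] - min(self.clocks) <= self.threshold


def pick_threshold(recent_times, low, high):
    """Hypothetical heuristic (an assumption, not the DSSP rule):
    allow more staleness when the slowest worker is much slower
    than the fastest one, clamped to the range [low, high]."""
    fastest = min(recent_times)
    if fastest <= 0:
        raise ValueError("iteration times must be positive")
    ratio = max(recent_times) / fastest
    extra = int(ratio) - 1  # 0 when all workers are equally fast
    return max(low, min(high, low + extra))
```

With a fixed threshold the gate behaves like plain SSP; re-evaluating something like `pick_threshold` every iteration and assigning the result to `gate.threshold` gives the flavour of a run-time-adjusted bound, which is the general direction the abstract describes.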

33 citations

Posted Content•

Fast Visual Object Tracking with Rotated Bounding Boxes

Bao Xin Chen, John K. Tsotsos¹•Institutions (1)

York University¹

08 Jul 2019-arXiv: Computer Vision and Pattern Recognition

TL;DR: A novel algorithm that uses ellipse fitting to estimate the bounding box rotation angle and size with the segmentation (mask) on the target for online and real-time visual object tracking.

Abstract: In this paper, we demonstrate a novel algorithm that uses ellipse fitting to estimate the bounding box rotation angle and size with the segmentation (mask) on the target for online and real-time visual object tracking. Our method, SiamMask_E, improves the bounding box fitting procedure of the state-of-the-art object tracking algorithm SiamMask and still retains a fast tracking frame rate (80 fps) on a system equipped with GPU (GeForce GTX 1080 Ti or higher). We tested our approach on the visual object tracking datasets (VOT2016, VOT2018, and VOT2019) that were labeled with rotated bounding boxes. Compared with the original SiamMask, we achieved an Accuracy of 0.645 and an EAO of 0.303 on VOT2019, which are 0.049 and 0.02 higher than the original SiamMask, respectively. Our project website is available at this http URL.
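
To make the idea of recovering a rotation angle and size from a segmentation mask concrete, here is a minimal sketch. It does not reproduce the paper's ellipse-fitting procedure: it swaps in a simpler image-moments approximation (the ellipse with the same second moments as the mask pixels), and the function name `mask_to_rotated_box` and the list-of-rows mask format are assumptions made for this example.

```python
import math


def mask_to_rotated_box(mask):
    """Approximate a binary mask by an ellipse with matching second moments.

    mask: list of rows containing 0/1 values.
    Returns (cx, cy, major, minor, angle_deg), where the angle is measured
    from the x-axis in image coordinates (y grows downward).
    """
    pts = [(x, y) for y, row in enumerate(mask) for x, v in enumerate(row) if v]
    if not pts:
        raise ValueError("empty mask")
    n = len(pts)
    cx = sum(x for x, _ in pts) / n
    cy = sum(y for _, y in pts) / n
    # Central second moments (covariance of the pixel coordinates).
    sxx = sum((x - cx) ** 2 for x, _ in pts) / n
    syy = sum((y - cy) ** 2 for _, y in pts) / n
    sxy = sum((x - cx) * (y - cy) for x, y in pts) / n
    # Orientation of the principal (major) axis.
    angle = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    # Closed-form eigenvalues of the 2x2 covariance matrix.
    mean = (sxx + syy) / 2.0
    diff = math.hypot((sxx - syy) / 2.0, sxy)
    lam_major = mean + diff
    lam_minor = max(mean - diff, 0.0)  # guard against tiny negative round-off
    # A filled ellipse with semi-axis a has variance a^2 / 4 along that axis,
    # so the full axis length is 4 * sqrt(variance).
    major = 4.0 * math.sqrt(lam_major)
    minor = 4.0 * math.sqrt(lam_minor)
    return cx, cy, major, minor, math.degrees(angle)
```

The returned centre, axis lengths, and angle can be turned into a rotated rectangle by taking the box whose sides are parallel to the two ellipse axes. A least-squares ellipse fit on the mask contour, as the abstract describes, would typically be more robust to ragged mask boundaries than this moments-based stand-in.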

32 citations

Cited by

Journal Article•DOI•

GOT-10k: A Large High-Diversity Benchmark for Generic Object Tracking in the Wild

Lianghua Huang¹, Xin Zhao¹, Kaiqi Huang¹•Institutions (1)

Chinese Academy of Sciences¹

01 May 2021-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: A large tracking database that offers an unprecedentedly wide coverage of common moving objects in the wild, called GOT-10k, and the first video trajectory dataset that uses the semantic hierarchy of WordNet to guide class population, which ensures a comprehensive and relatively unbiased coverage of diverse moving objects.

Abstract: We introduce here a large tracking database that offers an unprecedentedly wide coverage of common moving objects in the wild, called GOT-10k. Specifically, GOT-10k is built upon the backbone of WordNet structure [1] and it populates the majority of over 560 classes of moving objects and 87 motion patterns, magnitudes wider than the most recent similar-scale counterparts [19], [20], [23], [26]. By releasing the large high-diversity database, we aim to provide a unified training and evaluation platform for the development of class-agnostic, generic purposed short-term trackers. The features of GOT-10k and the contributions of this article are summarized in the following. (1) GOT-10k offers over 10,000 video segments with more than 1.5 million manually labeled bounding boxes, enabling unified training and stable evaluation of deep trackers. (2) GOT-10k is by far the first video trajectory dataset that uses the semantic hierarchy of WordNet to guide class population, which ensures a comprehensive and relatively unbiased coverage of diverse moving objects. (3) For the first time, GOT-10k introduces the one-shot protocol for tracker evaluation, where the training and test classes are zero-overlapped. The protocol avoids biased evaluation results towards familiar objects and it promotes generalization in tracker development. (4) GOT-10k offers additional labels such as motion classes and object visible ratios, facilitating the development of motion-aware and occlusion-aware trackers. (5) We conduct extensive tracking experiments with 39 typical tracking algorithms and their variants on GOT-10k and analyze their results in this paper. (6) Finally, we develop a comprehensive platform for the tracking community that offers full-featured evaluation toolkits, an online evaluation server, and a responsive leaderboard. The annotations of GOT-10k’s test data are kept private to avoid tuning parameters on it.

852 citations

Proceedings Article•DOI•

The Seventh Visual Object Tracking VOT2019 Challenge Results

Matej Kristan¹, Amanda Berg², Linyu Zheng³, Litu Rout⁴ +176 more•Institutions (43)

01 Oct 2019

393 citations

Proceedings Article•DOI•

Siamese Box Adaptive Network for Visual Tracking

Zedu Chen¹, Bineng Zhong², Guorong Li³, Shengping Zhang⁴, Rongrong Ji⁵ +1 more•Institutions (5)

Huaqiao University¹, Nanjing University of Science and Technology², Chinese Academy of Sciences³, Harbin Institute of Technology⁴, Xiamen University⁵

14 Jun 2020

TL;DR: SiamBAN views the visual tracking problem as a parallel classification and regression problem, and thus directly classifies objects and regresses their bounding boxes in a unified FCN; its no-prior box design makes SiamBAN more flexible and general.

Abstract: Most of the existing trackers usually rely on either a multi-scale searching scheme or pre-defined anchor boxes to accurately estimate the scale and aspect ratio of a target. Unfortunately, they typically call for tedious and heuristic configurations. To address this issue, we propose a simple yet effective visual tracking framework (named Siamese Box Adaptive Network, SiamBAN) by exploiting the expressive power of the fully convolutional network (FCN). SiamBAN views the visual tracking problem as a parallel classification and regression problem, and thus directly classifies objects and regresses their bounding boxes in a unified FCN. The no-prior box design avoids hyper-parameters associated with the candidate boxes, making SiamBAN more flexible and general. Extensive experiments on visual tracking benchmarks including VOT2018, VOT2019, OTB100, NFS, UAV123, and LaSOT demonstrate that SiamBAN achieves state-of-the-art performance and runs at 40 FPS, confirming its effectiveness and efficiency. The code will be available at https://github.com/hqucv/siamban.

358 citations

Posted Content•

Siamese Box Adaptive Network for Visual Tracking

Zedu Chen¹, Bineng Zhong¹, Guorong Li², Shengping Zhang³, Rongrong Ji⁴ +1 more•Institutions (4)

Huaqiao University¹, Chinese Academy of Sciences², Harbin Institute of Technology³, Xiamen University⁴

15 Mar 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: SiamBAN as discussed by the authors views the visual tracking problem as a parallel classification and regression problem, and thus directly classifies objects and regresses their bounding boxes in a unified FCN.

275 citations

Journal Article•DOI•

NestFuse: An Infrared and Visible Image Fusion Architecture Based on Nest Connection and Spatial/Channel Attention Models

Hui Li¹, Xiaojun Wu¹, Tariq S. Durrani²•Institutions (2)

Jiangnan University¹, University of Strathclyde²

29 Jun 2020-IEEE Transactions on Instrumentation and Measurement

TL;DR: A novel method for infrared and visible image fusion is proposed, built on a nest connection-based network and on spatial/channel attention models that describe the importance of each spatial position and of each channel with deep features.

Abstract: In this article, we propose a novel method for infrared and visible image fusion where we develop a nest connection-based network and spatial/channel attention models. The nest connection-based network can preserve significant amounts of information from input data in a multiscale perspective. The approach comprises three key elements: encoder, fusion strategy, and decoder. In our proposed fusion strategy, spatial attention models and channel attention models are developed that describe the importance of each spatial position and of each channel with deep features. First, the source images are fed into the encoder to extract multiscale deep features. The novel fusion strategy is then developed to fuse these features for each scale. Finally, the fused image is reconstructed by the nest connection-based decoder. Experiments are performed on publicly available data sets. These show that our proposed approach has better fusion performance than other state-of-the-art methods. This claim is justified through both subjective and objective evaluations. The code of our fusion method is available at https://github.com/hli1221/imagefusion-nestfuse.

235 citations
