Jack Valmadre

Other affiliations: Commonwealth Scientific and Industrial Research Organisation, Queensland University of Technology, University of Queensland

Bio: Jack Valmadre is an academic researcher at the University of Oxford. He has contributed to research on video tracking and structure from motion. He has an h-index of 21 and has co-authored 32 publications that have received 8,323 citations. His previous affiliations include the Commonwealth Scientific and Industrial Research Organisation and Queensland University of Technology.

Papers

Book Chapter

Fully-Convolutional Siamese Networks for Object Tracking

Luca Bertinetto¹, Jack Valmadre¹, João F. Henriques¹, Andrea Vedaldi¹, Philip H. S. Torr¹

University of Oxford¹

08 Oct 2016

TL;DR: A basic tracking algorithm is equipped with a novel fully-convolutional Siamese network trained end-to-end on the ILSVRC15 dataset for object detection in video and achieves state-of-the-art performance in multiple benchmarks.

Abstract: The problem of arbitrary object tracking has traditionally been tackled by learning a model of the object’s appearance exclusively online, using as sole training data the video itself. Despite the success of these methods, their online-only approach inherently limits the richness of the model they can learn. Recently, several attempts have been made to exploit the expressive power of deep convolutional networks. However, when the object to track is not known beforehand, it is necessary to perform Stochastic Gradient Descent online to adapt the weights of the network, severely compromising the speed of the system. In this paper we equip a basic tracking algorithm with a novel fully-convolutional Siamese network trained end-to-end on the ILSVRC15 dataset for object detection in video. Our tracker operates at frame-rates beyond real-time and, despite its extreme simplicity, achieves state-of-the-art performance in multiple benchmarks.

2,936 citations

Posted Content

Fully-Convolutional Siamese Networks for Object Tracking

Luca Bertinetto¹, Jack Valmadre¹, João F. Henriques¹, Andrea Vedaldi¹, Philip H. S. Torr¹

University of Oxford¹

30 Jun 2016 - arXiv: Computer Vision and Pattern Recognition

TL;DR: A basic tracking algorithm is equipped with a fully-convolutional Siamese network trained end-to-end on the ILSVRC15 dataset for object detection in video; the resulting tracker achieves state-of-the-art performance in multiple benchmarks.

1,613 citations

Proceedings Article

End-to-End Representation Learning for Correlation Filter Based Tracking

Jack Valmadre¹, Luca Bertinetto¹, João F. Henriques¹, Andrea Vedaldi¹, Philip H. S. Torr¹

University of Oxford¹

21 Jul 2017

TL;DR: The Correlation Filter learner, which has a closed-form solution, is interpreted as a differentiable layer in a deep neural network, which enables learning deep features that are tightly coupled to the Correlation Filter.

Abstract: The Correlation Filter is an algorithm that trains a linear template to discriminate between images and their translations. It is well suited to object tracking because its formulation in the Fourier domain provides a fast solution, enabling the detector to be re-trained once per frame. Previous works that use the Correlation Filter, however, have adopted features that were either manually designed or trained for a different task. This work is the first to overcome this limitation by interpreting the Correlation Filter learner, which has a closed-form solution, as a differentiable layer in a deep neural network. This enables learning deep features that are tightly coupled to the Correlation Filter. Experiments illustrate that our method has the important practical benefit of allowing lightweight architectures to achieve state-of-the-art performance at high framerates.

1,329 citations

Proceedings Article

Staple: Complementary Learners for Real-Time Tracking

Luca Bertinetto¹, Jack Valmadre¹, Stuart Golodetz¹, Ondrej Miksik¹, Philip H. S. Torr¹

University of Oxford¹

27 Jun 2016

TL;DR: It is shown that a simple tracker combining complementary cues in a ridge regression framework can operate faster than 80 FPS and outperform not only all entries in the popular VOT14 competition, but also recent and far more sophisticated trackers according to multiple benchmarks.

Abstract: Correlation Filter-based trackers have recently achieved excellent performance, showing great robustness to challenging situations exhibiting motion blur and illumination changes. However, since the model that they learn depends strongly on the spatial layout of the tracked object, they are notoriously sensitive to deformation. Models based on colour statistics have complementary traits: they cope well with variation in shape, but suffer when illumination is not consistent throughout a sequence. Moreover, colour distributions alone can be insufficiently discriminative. In this paper, we show that a simple tracker combining complementary cues in a ridge regression framework can operate faster than 80 FPS and outperform not only all entries in the popular VOT14 competition, but also recent and far more sophisticated trackers according to multiple benchmarks.

1,285 citations

Book Chapter

The Visual Object Tracking VOT2016 Challenge Results

Matej Kristan¹, Ales Leonardis², Jiří Matas³, Michael Felsberg⁴, Roman Pflugfelder⁵, Luka Cehovin¹, Tomas Vojir³, Gustav Häger⁴, Alan Lukežič¹, Gustavo Fernandez⁵, Abhinav Gupta⁶, Alfredo Petrosino⁷, Alireza Memarmoghadam⁸, Alvaro Garcia-Martin⁹, Andres Solis Montero¹⁰, Andrea Vedaldi¹¹, Andreas Robinson⁴, Andy J. Ma¹², Anton Varfolomieiev¹³, A. Aydin Alatan¹⁴, Aykut Erdem¹⁵, Bernard Ghanem¹⁶, Bin Liu, Bohyung Han¹⁷, Brais Martinez¹⁸, Chang-Ming Chang¹⁹, Changsheng Xu²⁰, Chong Sun²¹, Daijin Kim¹⁷, Dapeng Chen²², Dawei Du²⁰, Deepak Mishra²³, Dit-Yan Yeung²⁴, Erhan Gundogdu²⁵, Erkut Erdem¹⁵, Fahad Shahbaz Khan⁴, Fatih Porikli²⁶, Fatih Porikli²⁷, Fei Zhao²⁰, Filiz Bunyak²⁸, Francesco Battistone⁷, Gao Zhu²⁷, Giorgio Roffo²⁹, Gorthi R. K. Sai Subrahmanyam²³, Guilherme Sousa Bastos³⁰, Guna Seetharaman³¹, Henry Medeiros³², Hongdong Li²⁷, Honggang Qi²⁰, Horst Bischof³³, Horst Possegger³³, Huchuan Lu²¹, Hyemin Lee¹⁷, Hyeonseob Nam³⁴, Hyung Jin Chang³⁵, Isabela Drummond³⁰, Jack Valmadre¹¹, Jae-chan Jeong³⁶, Jaeil Cho³⁶, Jae-Yeong Lee³⁶, Jianke Zhu³⁷, Jiayi Feng²⁰, Jin Gao²⁰, Jin-Young Choi, Jingjing Xiao², Ji-Wan Kim³⁶, Jiyeoup Jeong, João F. Henriques¹¹, Jochen Lang¹⁰, Jongwon Choi, José M. Martínez⁹, Junliang Xing²⁰, Junyu Gao²⁰, Kannappan Palaniappan²⁸, Karel Lebeda³⁸, Ke Gao²⁸, Krystian Mikolajczyk³⁵, Lei Qin²⁰, Lijun Wang²¹, Longyin Wen¹⁹, Luca Bertinetto¹¹, Madan Kumar Rapuru²³, Mahdieh Poostchi²⁸, Mario Edoardo Maresca⁷, Martin Danelljan⁴, Matthias Mueller¹⁶, Mengdan Zhang²⁰, Michael Arens, Michel Valstar¹⁸, Ming Tang²⁰, Mooyeol Baek¹⁷, Muhammad Haris Khan¹⁸, Naiyan Wang²⁴, Nana Fan³⁹, Noor M. Al-Shakarji²⁸, Ondrej Miksik¹¹, Osman Akin¹⁵, Payman Moallem⁸, Pedro Senna³⁰, Philip H. S. Torr¹¹, Pong C. Yuen¹², Qingming Huang²⁰, Qingming Huang³⁹, Rafael Martin-Nieto⁹, Rengarajan Pelapur²⁸, Richard Bowden³⁸, Robert Laganiere¹⁰, Rustam Stolkin², Ryan Walsh³², Sebastian B. Krah, Shengkun Li¹⁹, Shengping Zhang³⁹, Shizeng Yao²⁸, Simon Hadfield³⁸, Simone Melzi²⁹, Siwei Lyu¹⁹, Siyi Li²⁴, Stefan Becker, Stuart Golodetz¹¹, Sumithra Kakanuru²³, Sunglok Choi³⁶, Tao Hu²⁰, Thomas Mauthner³³, Tianzhu Zhang²⁰, Tony P. Pridmore¹⁸, Vincenzo Santopietro⁷, Weiming Hu²⁰, Wenbo Li⁴⁰, Wolfgang Hübner, Xiangyuan Lan¹², Xiaomeng Wang¹⁸, Xin Li³⁹, Yang Li³⁷, Yiannis Demiris³⁵, Yifan Wang²¹, Yuankai Qi³⁹, Zejian Yuan²², Zexiong Cai¹², Zhan Xu³⁷, Zhenyu He³⁹, Zhizhen Chi²¹

University of Ljubljana¹, University of Birmingham², Czech Technical University in Prague³, Linköping University⁴, Austrian Institute of Technology⁵, Carnegie Mellon University⁶, Parthenope University of Naples⁷, University of Isfahan⁸, Autonomous University of Madrid⁹, University of Ottawa¹⁰, University of Oxford¹¹, Hong Kong Baptist University¹², Kyiv Polytechnic Institute¹³, Middle East Technical University¹⁴, Hacettepe University¹⁵, King Abdullah University of Science and Technology¹⁶, Pohang University of Science and Technology¹⁷, University of Nottingham¹⁸, University at Albany, SUNY¹⁹, Chinese Academy of Sciences²⁰, Dalian University of Technology²¹, Xi'an Jiaotong University²², Indian Institute of Space Science and Technology²³, Hong Kong University of Science and Technology²⁴, ASELSAN²⁵, Commonwealth Scientific and Industrial Research Organisation²⁶, Australian National University²⁷, University of Missouri²⁸, University of Verona²⁹, Universidade Federal de Itajubá³⁰, United States Naval Research Laboratory³¹, Marquette University³², Graz University of Technology³³, Naver Corporation³⁴, Imperial College London³⁵, Electronics and Telecommunications Research Institute³⁶, Zhejiang University³⁷, University of Surrey³⁸, Harbin Institute of Technology³⁹, Lehigh University⁴⁰

08 Oct 2016

TL;DR: The Visual Object Tracking challenge VOT2016 goes beyond its predecessors by introducing a new semi-automatic ground truth bounding box annotation methodology and extending the evaluation system with the no-reset experiment.

Abstract: The Visual Object Tracking challenge VOT2016 aims at comparing short-term single-object visual trackers that do not apply pre-learned models of object appearance. Results of 70 trackers are presented, with a large number of trackers being published at major computer vision conferences and journals in the recent years. The number of tested state-of-the-art trackers makes the VOT 2016 the largest and most challenging benchmark on short-term tracking to date. For each participating tracker, a short description is provided in the Appendix. The VOT2016 goes beyond its predecessors by (i) introducing a new semi-automatic ground truth bounding box annotation methodology and (ii) extending the evaluation system with the no-reset experiment. The dataset, the evaluation kit as well as the results are publicly available at the challenge website (http://votchallenge.net).

744 citations

Cited by

Proceedings Article

Optimization as a Model for Few-Shot Learning

Sachin Ravi¹, Hugo Larochelle²

Princeton University¹, Université de Sherbrooke²

24 Apr 2017

TL;DR: In this paper, an LSTM-based meta-learner model is proposed to learn the exact optimization algorithm used to train another learner neural network in the few-shot regime.

Abstract: Though deep neural networks have shown great success in the large data domain, they generally perform poorly on few-shot learning tasks, where a model has to quickly generalize after seeing very few examples from each class. The general belief is that gradient-based optimization in high capacity models requires many iterative steps over many examples to perform well. Here, we propose an LSTM-based meta-learner model to learn the exact optimization algorithm used to train another learner neural network in the few-shot regime. The parametrization of our model allows it to learn appropriate parameter updates specifically for the scenario where a set amount of updates will be made, while also learning a general initialization of the learner network that allows for quick convergence of training. We demonstrate that this meta-learning model is competitive with deep metric-learning techniques for few-shot learning.

2,981 citations

Book Chapter

Fully-Convolutional Siamese Networks for Object Tracking

Luca Bertinetto¹, Jack Valmadre¹, João F. Henriques¹, Andrea Vedaldi¹, Philip H. S. Torr¹

University of Oxford¹

08 Oct 2016

2,936 citations

Proceedings Article

Learning to Compare: Relation Network for Few-Shot Learning

Flood Sung¹, Yongxin Yang, Li Zhang², Tao Xiang¹, Philip H. S. Torr², Timothy M. Hospedales³

Queen Mary University of London¹, University of Oxford², University of Edinburgh³

18 Jun 2018

TL;DR: Presents a conceptually simple, flexible, and general framework for few-shot learning, where a classifier must learn to recognise new classes given only a few examples from each; the framework is easily extended to zero-shot learning.

Abstract: We present a conceptually simple, flexible, and general framework for few-shot learning, where a classifier must learn to recognise new classes given only few examples from each. Our method, called the Relation Network (RN), is trained end-to-end from scratch. During meta-learning, it learns to learn a deep distance metric to compare a small number of images within episodes, each of which is designed to simulate the few-shot setting. Once trained, a RN is able to classify images of new classes by computing relation scores between query images and the few examples of each new class without further updating the network. Besides providing improved performance on few-shot learning, our framework is easily extended to zero-shot learning. Extensive experiments on five benchmarks demonstrate that our simple approach provides a unified and effective approach for both of these two tasks.

2,496 citations

Posted Content

Learning to Compare: Relation Network for Few-Shot Learning

Flood Sung¹, Yongxin Yang, Li Zhang², Tao Xiang¹, Philip H. S. Torr², Timothy M. Hospedales³

Queen Mary University of London¹, University of Oxford², University of Edinburgh³

16 Nov 2017 - arXiv: Computer Vision and Pattern Recognition

TL;DR: During meta-learning, the Relation Network (RN) learns a deep distance metric to compare a small number of images within episodes, each of which is designed to simulate the few-shot setting.

2,077 citations

Proceedings Article

High Performance Visual Tracking with Siamese Region Proposal Network

Bo Li¹, Junjie Yan², Wei Wu³, Zheng Zhu⁴, Xiaolin Hu²

Beihang University¹, Tsinghua University², SenseTime³, Chinese Academy of Sciences⁴

18 Jun 2018

TL;DR: The Siamese region proposal network (Siamese-RPN) is proposed for visual object tracking. It is trained end-to-end off-line with large-scale image pairs and consists of a Siamese subnetwork for feature extraction and a region proposal subnetwork with a classification branch and a regression branch.

Abstract: Visual object tracking has been a fundamental topic in recent years and many deep learning based trackers have achieved state-of-the-art performance on multiple benchmarks. However, most of these trackers can hardly get top performance with real-time speed. In this paper, we propose the Siamese region proposal network (Siamese-RPN) which is end-to-end trained off-line with large-scale image pairs. Specifically, it consists of Siamese subnetwork for feature extraction and region proposal subnetwork including the classification branch and regression branch. In the inference phase, the proposed framework is formulated as a local one-shot detection task. We can pre-compute the template branch of the Siamese subnetwork and formulate the correlation layers as trivial convolution layers to perform online tracking. Benefit from the proposal refinement, traditional multi-scale test and online fine-tuning can be discarded. The Siamese-RPN runs at 160 FPS while achieving leading performance in VOT2015, VOT2016 and VOT2017 real-time challenges.

2,016 citations
