Author

Jiří Matas

Other affiliations: University of Surrey

Bio: Jiří Matas is an academic researcher at Czech Technical University in Prague. The author has contributed to research on topics including video tracking and computer science. The author has an h-index of 24 and has co-authored 52 publications that have received 2,981 citations. Previous affiliations of Jiří Matas include the University of Surrey.

Papers published on a yearly basis

[Chart: publications per year; years shown: 2002, 2003, 2006-2010, 2012-2014, 2016-2023]

Papers

Book Chapter•DOI•

The Visual Object Tracking VOT2016 Challenge Results

Matej Kristan¹, Ales Leonardis², Jiří Matas³, Michael Felsberg⁴, Roman Pflugfelder⁵, Luka Cehovin¹, Tomas Vojir³, Gustav Häger⁴, Alan Lukežič¹, Gustavo Fernandez⁵, Abhinav Gupta⁶, Alfredo Petrosino⁷, Alireza Memarmoghadam⁸, Alvaro Garcia-Martin⁹, Andres Solis Montero¹⁰, Andrea Vedaldi¹¹, Andreas Robinson⁴, Andy J. Ma¹², Anton Varfolomieiev¹³, A. Aydin Alatan¹⁴, Aykut Erdem¹⁵, Bernard Ghanem¹⁶, Bin Liu, Bohyung Han¹⁷, Brais Martinez¹⁸, Chang-Ming Chang¹⁹, Changsheng Xu²⁰, Chong Sun²¹, Daijin Kim¹⁷, Dapeng Chen²², Dawei Du²⁰, Deepak Mishra²³, Dit-Yan Yeung²⁴, Erhan Gundogdu²⁵, Erkut Erdem¹⁵, Fahad Shahbaz Khan⁴, Fatih Porikli²⁶,²⁷, Fei Zhao²⁰, Filiz Bunyak²⁸, Francesco Battistone⁷, Gao Zhu²⁶, Giorgio Roffo²⁹, Gorthi R. K. Sai Subrahmanyam²³, Guilherme Sousa Bastos³⁰, Guna Seetharaman³¹, Henry Medeiros³², Hongdong Li²⁶, Honggang Qi²⁰, Horst Bischof³³, Horst Possegger³³, Huchuan Lu²¹, Hyemin Lee¹⁷, Hyeonseob Nam³⁴, Hyung Jin Chang³⁵, Isabela Drummond³⁰, Jack Valmadre¹¹, Jae-chan Jeong³⁶, Jaeil Cho³⁶, Jae-Yeong Lee³⁶, Jianke Zhu³⁷, Jiayi Feng²⁰, Jin Gao²⁰, Jin-Young Choi, Jingjing Xiao², Ji-Wan Kim³⁶, Jiyeoup Jeong, João F. Henriques¹¹, Jochen Lang¹⁰, Jongwon Choi, José M. Martínez⁹, Junliang Xing²⁰, Junyu Gao²⁰, Kannappan Palaniappan²⁸, Karel Lebeda³⁸, Ke Gao²⁸, Krystian Mikolajczyk³⁵, Lei Qin²⁰, Lijun Wang²¹, Longyin Wen¹⁹, Luca Bertinetto¹¹, Madan Kumar Rapuru²³, Mahdieh Poostchi²⁸, Mario Edoardo Maresca⁷, Martin Danelljan⁴, Matthias Mueller¹⁶, Mengdan Zhang²⁰, Michael Arens, Michel Valstar¹⁸, Ming Tang²⁰, Mooyeol Baek¹⁷, Muhammad Haris Khan¹⁸, Naiyan Wang²⁴, Nana Fan³⁹, Noor M. Al-Shakarji²⁸, Ondrej Miksik¹¹, Osman Akin¹⁵, Payman Moallem⁸, Pedro Senna³⁰, Philip H. S. Torr¹¹, Pong C. Yuen¹², Qingming Huang²⁰,³⁹, Rafael Martin-Nieto⁹, Rengarajan Pelapur²⁸, Richard Bowden³⁸, Robert Laganiere¹⁰, Rustam Stolkin², Ryan Walsh³², Sebastian B. Krah, Shengkun Li¹⁹, Shengping Zhang³⁹, Shizeng Yao²⁸, Simon Hadfield³⁸, Simone Melzi²⁹, Siwei Lyu¹⁹, Siyi Li²⁴, Stefan Becker, Stuart Golodetz¹¹, Sumithra Kakanuru²³, Sunglok Choi³⁶, Tao Hu²⁰, Thomas Mauthner³³, Tianzhu Zhang²⁰, Tony P. Pridmore¹⁸, Vincenzo Santopietro⁷, Weiming Hu²⁰, Wenbo Li⁴⁰, Wolfgang Hübner, Xiangyuan Lan¹², Xiaomeng Wang¹⁸, Xin Li³⁹, Yang Li³⁷, Yiannis Demiris³⁵, Yifan Wang²¹, Yuankai Qi³⁹, Zejian Yuan²², Zexiong Cai¹², Zhan Xu³⁷, Zhenyu He³⁹, Zhizhen Chi²¹•Institutions (40)

University of Ljubljana¹, University of Birmingham², Czech Technical University in Prague³, Linköping University⁴, Austrian Institute of Technology⁵, Carnegie Mellon University⁶, Parthenope University of Naples⁷, University of Isfahan⁸, Autonomous University of Madrid⁹, University of Ottawa¹⁰, University of Oxford¹¹, Hong Kong Baptist University¹², Kyiv Polytechnic Institute¹³, Middle East Technical University¹⁴, Hacettepe University¹⁵, King Abdullah University of Science and Technology¹⁶, Pohang University of Science and Technology¹⁷, University of Nottingham¹⁸, University at Albany, SUNY¹⁹, Chinese Academy of Sciences²⁰, Dalian University of Technology²¹, Xi'an Jiaotong University²², Indian Institute of Space Science and Technology²³, Hong Kong University of Science and Technology²⁴, ASELSAN²⁵, Australian National University²⁶, Commonwealth Scientific and Industrial Research Organisation²⁷, University of Missouri²⁸, University of Verona²⁹, Universidade Federal de Itajubá³⁰, United States Naval Research Laboratory³¹, Marquette University³², Graz University of Technology³³, Naver Corporation³⁴, Imperial College London³⁵, Electronics and Telecommunications Research Institute³⁶, Zhejiang University³⁷, University of Surrey³⁸, Harbin Institute of Technology³⁹, Lehigh University⁴⁰

08 Oct 2016

TL;DR: The Visual Object Tracking challenge VOT2016 goes beyond its predecessors by introducing a new semi-automatic ground truth bounding box annotation methodology and extending the evaluation system with the no-reset experiment.

Abstract: The Visual Object Tracking challenge VOT2016 aims at comparing short-term single-object visual trackers that do not apply pre-learned models of object appearance. Results of 70 trackers are presented, with a large number of trackers being published at major computer vision conferences and journals in the recent years. The number of tested state-of-the-art trackers makes the VOT 2016 the largest and most challenging benchmark on short-term tracking to date. For each participating tracker, a short description is provided in the Appendix. The VOT2016 goes beyond its predecessors by (i) introducing a new semi-automatic ground truth bounding box annotation methodology and (ii) extending the evaluation system with the no-reset experiment. The dataset, the evaluation kit as well as the results are publicly available at the challenge website (http://votchallenge.net).

744 citations

Book Chapter•DOI•

The sixth visual object tracking VOT2018 challenge results

Matej Kristan¹, Ales Leonardis², Jiří Matas³, Michael Felsberg⁴ +155 more•Institutions (47)

23 Jan 2019

TL;DR: The Visual Object Tracking challenge VOT2018 is the sixth annual tracker benchmarking activity organized by the VOT initiative; results of over eighty trackers are presented; many are state-of-the-art trackers published at major computer vision conferences or in journals in the recent years.

Abstract: The Visual Object Tracking challenge VOT2018 is the sixth annual tracker benchmarking activity organized by the VOT initiative. Results of over eighty trackers are presented; many are state-of-the-art trackers published at major computer vision conferences or in journals in the recent years. The evaluation included the standard VOT and other popular methodologies for short-term tracking analysis and a “real-time” experiment simulating a situation where a tracker processes images as if provided by a continuously running sensor. A long-term tracking subchallenge has been introduced to the set of standard VOT sub-challenges. The new subchallenge focuses on long-term tracking properties, namely coping with target disappearance and reappearance. A new dataset has been compiled and a performance evaluation methodology that focuses on long-term tracking capabilities has been adopted. The VOT toolkit has been updated to support both standard short-term and the new long-term tracking subchallenges. Performance of the tested trackers typically by far exceeds standard baselines. The source code for most of the trackers is publicly available from the VOT page. The dataset, the evaluation kit and the results are publicly available at the challenge website (http://votchallenge.net).

639 citations

Book Chapter•DOI•

Rotation Invariant Image Description with Local Binary Pattern Histogram Fourier Features

Timo Ahonen¹, Jiří Matas², Chu He¹, Matti Pietikäinen¹•Institutions (2)

University of Oulu¹, Czech Technical University in Prague²

14 Jul 2009

TL;DR: In the experiments, it is shown that these features outperform non-invariant and earlier version of rotation invariant LBP and the MR8 descriptor in texture classification, material categorization and face recognition tests.

Abstract: In this paper, we propose Local Binary Pattern Histogram Fourier features (LBP-HF), a novel rotation invariant image descriptor computed from discrete Fourier transforms of local binary pattern (LBP) histograms. Unlike most other histogram based invariant texture descriptors which normalize rotation locally, the proposed invariants are constructed globally for the whole region to be described. In addition to being rotation invariant, the LBP-HF features retain the highly discriminative nature of LBP histograms. In the experiments, it is shown that these features outperform non-invariant and earlier version of rotation invariant LBP and the MR8 descriptor in texture classification, material categorization and face recognition tests.

441 citations
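
The LBP-HF abstract above states that the descriptor is built from discrete Fourier transforms of LBP histograms. The sketch below is not the authors' implementation; it only illustrates the underlying property that DFT magnitudes do not change under a cyclic shift of the input. It assumes that one row of histogram counts holds a single pattern observed at each of its rotations, so that rotating the image cyclically shifts that row. The function names and the sample counts are hypothetical.

```python
import cmath


def dft_magnitudes(hist_row):
    """Return the magnitude of each DFT coefficient of one histogram row."""
    n = len(hist_row)
    mags = []
    for u in range(n):
        coeff = sum(
            h * cmath.exp(-2j * cmath.pi * u * r / n)
            for r, h in enumerate(hist_row)
        )
        mags.append(abs(coeff))
    return mags


def cyclic_shift(row, k):
    """Shift a list to the right by k positions, wrapping around."""
    k %= len(row)
    return row[-k:] + row[:-k]


# Hypothetical counts of one pattern at 8 rotations (assumed data).
row = [5, 1, 0, 3, 7, 2, 2, 4]
# Under the stated assumption, an image rotation shifts the row cyclically.
rotated = cyclic_shift(row, 3)

original_feats = dft_magnitudes(row)
rotated_feats = dft_magnitudes(rotated)

# The two magnitude vectors agree up to floating-point error.
print(all(abs(a - b) < 1e-9 for a, b in zip(original_feats, rotated_feats)))
```

A cyclic shift only multiplies each DFT coefficient by a unit-modulus phase factor, which is why taking magnitudes discards the rotation while keeping the shape of the histogram row.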

Journal Article•DOI•

Discriminative Correlation Filter Tracker with Channel and Spatial Reliability

Alan Lukežič¹, Tomáš Vojíř², Luka Čehovin Zajc¹, Jiří Matas², Matej Kristan¹•Institutions (2)

University of Ljubljana¹, Czech Technical University in Prague²

01 Jul 2018-International Journal of Computer Vision

TL;DR: In this article, the authors introduce the channel and spatial reliability concepts to discriminative correlation filters (DCF) and provide a learning algorithm for its efficient and seamless integration in the filter update and the tracking process.

Abstract: Short-term tracking is an open and challenging problem for which discriminative correlation filters (DCF) have shown excellent performance. We introduce the channel and spatial reliability concepts to DCF tracking and provide a learning algorithm for its efficient and seamless integration in the filter update and the tracking process. The spatial reliability map adjusts the filter support to the part of the object suitable for tracking. This both allows to enlarge the search region and improves tracking of non-rectangular objects. Reliability scores reflect channel-wise quality of the learned filters and are used as feature weighting coefficients in localization. Experimentally, with only two simple standard feature sets, HoGs and colornames, the novel CSR-DCF method--DCF with channel and spatial reliability--achieves state-of-the-art results on VOT 2016, VOT 2015 and OTB100. The CSR-DCF runs close to real-time on a CPU.

228 citations

Journal Article•DOI•

Discriminative Correlation Filter with Channel and Spatial Reliability

Alan Lukežič, Tomáš Vojíř, Luka Cehovin, Jiří Matas, Matej Kristan

25 Nov 2016-arXiv: Computer Vision and Pattern Recognition

TL;DR: In this article, the authors introduce the channel and spatial reliability concepts to DCF tracking and provide a novel learning algorithm for its efficient and seamless integration in the filter update and the tracking process.

Abstract: Short-term tracking is an open and challenging problem for which discriminative correlation filters (DCF) have shown excellent performance. We introduce the channel and spatial reliability concepts to DCF tracking and provide a novel learning algorithm for its efficient and seamless integration in the filter update and the tracking process. The spatial reliability map adjusts the filter support to the part of the object suitable for tracking. This both allows to enlarge the search region and improves tracking of non-rectangular objects. Reliability scores reflect channel-wise quality of the learned filters and are used as feature weighting coefficients in localization. Experimentally, with only two simple standard features, HoGs and Colornames, the novel CSR-DCF method -- DCF with Channel and Spatial Reliability -- achieves state-of-the-art results on VOT 2016, VOT 2015 and OTB100. The CSR-DCF runs in real-time on a CPU.

203 citations

Cited by

Book•

Computer Vision: Algorithms and Applications

Richard Szeliski

30 Sep 2010

TL;DR: Computer Vision: Algorithms and Applications explores the variety of techniques commonly used to analyze and interpret images and takes a scientific approach to basic vision problems, formulating physical models of the imaging process before inverting them to produce descriptions of a scene.

Abstract: Humans perceive the three-dimensional structure of the world with apparent ease. However, despite all of the recent advances in computer vision research, the dream of having a computer interpret an image at the same level as a two-year old remains elusive. Why is computer vision such a challenging problem and what is the current state of the art? Computer Vision: Algorithms and Applications explores the variety of techniques commonly used to analyze and interpret images. It also describes challenging real-world applications where vision is being successfully used, both for specialized applications such as medical imaging, and for fun, consumer-level tasks such as image editing and stitching, which students can apply to their own personal photos and videos. More than just a source of recipes, this exceptionally authoritative and comprehensive textbook/reference also takes a scientific approach to basic vision problems, formulating physical models of the imaging process before inverting them to produce descriptions of a scene. These problems are also analyzed using statistical models and solved using rigorous engineering techniques. Topics and features: structured to support active curricula and project-oriented courses, with tips in the Introduction for using the book in a variety of customized courses; presents exercises at the end of each chapter with a heavy emphasis on testing algorithms and containing numerous suggestions for small mid-term projects; provides additional material and more detailed mathematical topics in the Appendices, which cover linear algebra, numerical techniques, and Bayesian estimation theory; suggests additional reading at the end of each chapter, including the latest research in each sub-field, in addition to a full Bibliography at the end of the book; supplies supplementary course material for students at the associated website, http://szeliski.org/Book/.
Suitable for an upper-level undergraduate or graduate-level course in computer science or engineering, this textbook focuses on basic techniques that work under real-world conditions and encourages students to push their creative boundaries. Its design and exposition also make it eminently suitable as a unique reference to the fundamental techniques and current research literature in computer vision.

4,146 citations

Posted Content•

CNN Features off-the-shelf: an Astounding Baseline for Recognition

Ali Sharif Razavian, Hossein Azizpour, Josephine Sullivan, Stefan Carlsson

23 Mar 2014-arXiv: Computer Vision and Pattern Recognition

TL;DR: A series of experiments conducted for different recognition tasks using the publicly available code and model of the OverFeat network which was trained to perform object classification on ILSVRC13 suggest that features obtained from deep learning with convolutional nets should be the primary candidate in most visual recognition tasks.

Abstract: Recent results indicate that the generic descriptors extracted from the convolutional neural networks are very powerful. This paper adds to the mounting evidence that this is indeed the case. We report on a series of experiments conducted for different recognition tasks using the publicly available code and model of the OverFeat network which was trained to perform object classification on ILSVRC13. We use features extracted from the OverFeat network as a generic image representation to tackle the diverse range of recognition tasks of object image classification, scene recognition, fine grained recognition, attribute detection and image retrieval applied to a diverse set of datasets. We selected these tasks and datasets as they gradually move further away from the original task and data the OverFeat network was trained to solve. Astonishingly, we report consistent superior results compared to the highly tuned state-of-the-art systems in all the visual classification tasks on various datasets. For instance retrieval it consistently outperforms low memory footprint methods except for sculptures dataset. The results are achieved using a linear SVM classifier (or L2 distance in case of retrieval) applied to a feature representation of size 4096 extracted from a layer in the net. The representations are further modified using simple augmentation techniques e.g. jittering. The results strongly suggest that features obtained from deep learning with convolutional nets should be the primary candidate in most visual recognition tasks.

4,033 citations

Computer Vision: A Modern Approach (Chinese edition title: 计算机视觉：一种现代的方法)

David Forsyth, Jean Ponce

01 Jan 2004

TL;DR: Comprehensive and up-to-date, this book includes essential topics that either reflect practical significance or are of theoretical importance and describes numerous important application areas such as image based rendering and digital libraries.

Abstract: From the Publisher: The accessible presentation of this book gives both a general view of the entire computer vision enterprise and also offers sufficient detail to be able to build useful applications. Users learn techniques that have proven to be useful by first-hand experience and a wide range of mathematical methods. A CD-ROM with every copy of the text contains source code for programming practice, color images, and illustrative movies. Comprehensive and up-to-date, this book includes essential topics that either reflect practical significance or are of theoretical importance. Topics are discussed in substantial and increasing depth. Application surveys describe numerous important application areas such as image based rendering and digital libraries. Many important algorithms are broken down and illustrated in pseudocode. Appropriate for use by engineers as a comprehensive reference to the computer vision enterprise.

3,627 citations

Journal Article•DOI•

Robust wide-baseline stereo from maximally stable extremal regions

Jiri Matas¹, Ondrej Chum, Martin Urban, Tomas Pajdla•Institutions (1)

University of Surrey¹

01 Sep 2004-Image and Vision Computing

TL;DR: The high utility of MSERs, multiple measurement regions and the robust metric is demonstrated in wide-baseline experiments on image pairs from both indoor and outdoor scenes.

3,422 citations

Proceedings Article•DOI•

CNN Features Off-the-Shelf: An Astounding Baseline for Recognition

Ali Sharif Razavian, Hossein Azizpour, Josephine Sullivan, Stefan Carlsson

23 Jun 2014

TL;DR: In this paper, features extracted from the OverFeat network are used as a generic image representation to tackle the diverse range of recognition tasks of object image classification, scene recognition, fine grained recognition, attribute detection and image retrieval applied to a diverse set of datasets.

Abstract: Recent results indicate that the generic descriptors extracted from the convolutional neural networks are very powerful. This paper adds to the mounting evidence that this is indeed the case. We report on a series of experiments conducted for different recognition tasks using the publicly available code and model of the OverFeat network which was trained to perform object classification on ILSVRC13. We use features extracted from the OverFeat network as a generic image representation to tackle the diverse range of recognition tasks of object image classification, scene recognition, fine grained recognition, attribute detection and image retrieval applied to a diverse set of datasets. We selected these tasks and datasets as they gradually move further away from the original task and data the OverFeat network was trained to solve. Astonishingly, we report consistent superior results compared to the highly tuned state-of-the-art systems in all the visual classification tasks on various datasets. For instance retrieval it consistently outperforms low memory footprint methods except for sculptures dataset. The results are achieved using a linear SVM classifier (or L2 distance in case of retrieval) applied to a feature representation of size 4096 extracted from a layer in the net. The representations are further modified using simple augmentation techniques e.g. jittering. The results strongly suggest that features obtained from deep learning with convolutional nets should be the primary candidate in most visual recognition tasks.

3,346 citations
