Jiri Matas

Other affiliations: University of Surrey, IEEE Computer Society, Tampere University of Technology

Bio: Jiri Matas is an academic researcher at Czech Technical University in Prague. The author has contributed to research on topics including RANSAC and video tracking. The author has an h-index of 78 and has co-authored 345 publications that have received 44,739 citations. Previous affiliations of Jiri Matas include the University of Surrey and the IEEE Computer Society.

Topics: RANSAC, Video tracking, Object detection, Affine transformation, Image retrieval, ...

Papers published on a yearly basis (chart, 1992 to 2023)

Papers

Journal Article•DOI•

USAC: A Universal Framework for Random Sample Consensus

Rahul Raguram¹, Ondrej Chum, Marc Pollefeys², Jiri Matas, Jan-Michael Frahm³

Apple Inc.¹, ETH Zurich², University of North Carolina at Chapel Hill³

01 Aug 2013-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: A comprehensive overview of recent research in RANSAC-based robust estimation is presented by analyzing and comparing various approaches that have been explored over the years and by introducing a new framework for robust estimation, called Universal RANSAC (USAC).

Abstract: A computational problem that arises frequently in computer vision is that of estimating the parameters of a model from data that have been contaminated by noise and outliers. More generally, any practical system that seeks to estimate quantities from noisy data measurements must have at its core some means of dealing with data contamination. The random sample consensus (RANSAC) algorithm is one of the most popular tools for robust estimation. Recent years have seen an explosion of activity in this area, leading to the development of a number of techniques that improve upon the efficiency and robustness of the basic RANSAC algorithm. In this paper, we present a comprehensive overview of recent research in RANSAC-based robust estimation by analyzing and comparing various approaches that have been explored over the years. We provide a common context for this analysis by introducing a new framework for robust estimation, which we call Universal RANSAC (USAC). USAC extends the simple hypothesize-and-verify structure of standard RANSAC to incorporate a number of important practical and computational considerations. In addition, we provide a general-purpose C++ software library that implements the USAC framework by leveraging state-of-the-art algorithms for the various modules. This implementation thus addresses many of the limitations of standard RANSAC within a single unified package. We benchmark the performance of the algorithm on a large collection of estimation problems. The implementation we provide can be used by researchers either as a stand-alone tool for robust estimation or as a benchmark for evaluating new techniques.

501 citations
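
For readers unfamiliar with the hypothesize-and-verify structure that the USAC abstract refers to, the following is a minimal sketch of plain RANSAC for 2D line fitting. It is not the USAC library or its API: the function names (`required_iterations`, `fit_line`, `ransac_line`), the inlier threshold and the default parameters are assumptions chosen for illustration. The stopping rule is the standard one, k = log(1 - p) / log(1 - w^m), where w is the inlier ratio, m the sample size and p the desired confidence.

```python
import math
import random


def required_iterations(inlier_ratio, sample_size, confidence=0.99):
    """Samples needed so that at least one all-inlier sample is drawn
    with the given confidence: k = log(1 - p) / log(1 - w^m)."""
    p_good_sample = inlier_ratio ** sample_size
    if p_good_sample <= 0.0:
        return math.inf
    if p_good_sample >= 1.0:
        return 1
    return math.ceil(math.log(1.0 - confidence) / math.log(1.0 - p_good_sample))


def fit_line(p, q):
    """Line through two points as (a, b, c) with a*x + b*y + c = 0, a^2 + b^2 = 1."""
    (x1, y1), (x2, y2) = p, q
    a, b = y2 - y1, x1 - x2
    norm = math.hypot(a, b)
    if norm == 0.0:
        return None  # degenerate sample: both points coincide
    a, b = a / norm, b / norm
    return a, b, -(a * x1 + b * y1)


def ransac_line(points, threshold=0.1, confidence=0.99, max_iters=1000, seed=0):
    """Hypothesize-and-verify loop; threshold is an assumed inlier distance."""
    rng = random.Random(seed)
    best_model, best_inliers = None, []
    iters_needed, i = max_iters, 0
    while i < min(iters_needed, max_iters):
        i += 1
        # Hypothesize: fit a model to a minimal random sample (two points).
        model = fit_line(*rng.sample(points, 2))
        if model is None:
            continue
        a, b, c = model
        # Verify: count the points within the distance threshold of the line.
        inliers = [pt for pt in points if abs(a * pt[0] + b * pt[1] + c) <= threshold]
        if len(inliers) > len(best_inliers):
            best_model, best_inliers = model, inliers
            # Update the stopping criterion from the best inlier ratio so far.
            iters_needed = required_iterations(len(inliers) / len(points), 2, confidence)
    return best_model, best_inliers
```

Calling `ransac_line(points)` on a list of `(x, y)` tuples returns the best line found and its consensus set. The extensions surveyed in the paper (for example, different sampling or verification strategies) would replace individual steps of this loop.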

Proceedings Article•DOI•

The Visual Object Tracking VOT2017 Challenge Results

Matej Kristan¹, Ales Leonardis², Jiri Matas³, Michael Felsberg⁴, Roman Pflugfelder⁵, Luka Čehovin Zajc¹, Tomas Vojir³, Gustav Häger⁴, Alan Lukezic¹, Abdelrahman Eldesokey⁴, Gustavo Fernandez⁵, Alvaro Garcia-Martin⁶, Andrej Muhič¹, Alfredo Petrosino⁷, Alireza Memarmoghadam⁸, Andrea Vedaldi⁹, Antoine Manzanera¹⁰, Antoine Tran¹⁰, A. Aydin Alatan¹¹, Bogdan Mocanu, Boyu Chen¹², Chang Huang, Changsheng Xu¹³, Chong Sun¹², Dalong Du, David Zhang, Dawei Du¹³, Deepak Mishra, Erhan Gundogdu¹¹, Erhan Gundogdu¹⁴, Erik Velasco-Salido, Fahad Shahbaz Khan⁴, Francesco Battistone, Gorthi R. K. Sai Subrahmanyam, Goutam Bhat⁴, Guan Huang, Guilherme Sousa Bastos, Guna Seetharaman¹⁵, Hongliang Zhang¹⁶, Houqiang Li¹⁷, Huchuan Lu¹², Isabela Drummond, Jack Valmadre⁹, Jae-chan Jeong¹⁸, Jaeil Cho¹⁸, Jae-Yeong Lee¹⁸, Jana Noskova, Jianke Zhu¹⁹, Jin Gao¹³, Jingyu Liu¹³, Ji-Wan Kim¹⁸, João F. Henriques⁹, José M. Martínez, Junfei Zhuang²⁰, Junliang Xing¹³, Junyu Gao¹³, Kai Chen²¹, Kannappan Palaniappan²², Karel Lebeda, Ke Gao²², Kris M. Kitani²³, Lei Zhang, Lijun Wang¹², Lingxiao Yang, Longyin Wen²⁴, Luca Bertinetto⁹, Mahdieh Poostchi²², Martin Danelljan⁴, Matthias Mueller²⁵, Mengdan Zhang¹³, Ming-Hsuan Yang²⁶, Nianhao Xie¹⁶, Ning Wang¹⁷, Ondrej Miksik⁹, Payman Moallem⁸, Pallavi Venugopal M, Pedro Senna, Philip H. S. Torr⁹, Qiang Wang¹³, Qifeng Yu¹⁶, Qingming Huang¹³, Rafael Martin-Nieto, Richard Bowden²⁷, Risheng Liu¹², Ruxandra Tapu, Simon Hadfield²⁷, Siwei Lyu²⁸, Stuart Golodetz⁹, Sunglok Choi¹⁸, Tianzhu Zhang¹³, Titus Zaharia, Vincenzo Santopietro, Wei Zou¹³, Weiming Hu¹³, Wenbing Tao²¹, Wenbo Li²⁸, Wengang Zhou¹⁷, Xianguo Yu¹⁶, Xiao Bian²⁴, Yang Li¹⁹, Yifan Xing²³, Yingruo Fan²⁰, Zheng Zhu¹³, Zhipeng Zhang¹³, Zhiqun He²⁰

University of Ljubljana¹, University of Birmingham², Czech Technical University in Prague³, Linköping University⁴, Austrian Institute of Technology⁵, Autonomous University of Madrid⁶, Parthenope University of Naples⁷, University of Isfahan⁸, University of Oxford⁹, Superior National School of Advanced Techniques¹⁰, Middle East Technical University¹¹, Dalian University of Technology¹², Chinese Academy of Sciences¹³, ASELSAN¹⁴, United States Naval Research Laboratory¹⁵, National University of Defense Technology¹⁶, University of Science and Technology of China¹⁷, Electronics and Telecommunications Research Institute¹⁸, Zhejiang University¹⁹, Beijing University of Posts and Telecommunications²⁰, Huazhong University of Science and Technology²¹, University of Missouri²², Carnegie Mellon University²³, General Electric²⁴, King Abdullah University of Science and Technology²⁵, University of California, Merced²⁶, University of Surrey²⁷, University at Albany, SUNY²⁸

01 Jul 2017

TL;DR: The Visual Object Tracking challenge VOT2017 is the fifth annual tracker benchmarking activity organized by the VOT initiative; results of 51 trackers are presented; many are state-of-the-art published at major computer vision conferences or journals in recent years.

Abstract: The Visual Object Tracking challenge VOT2017 is the fifth annual tracker benchmarking activity organized by the VOT initiative. Results of 51 trackers are presented; many are state-of-the-art published at major computer vision conferences or journals in recent years. The evaluation included the standard VOT and other popular methodologies and a new "real-time" experiment simulating a situation where a tracker processes images as if provided by a continuously running sensor. Performance of the tested trackers typically by far exceeds standard baselines. The source code for most of the trackers is publicly available from the VOT page. The VOT2017 goes beyond its predecessors by (i) improving the VOT public dataset and introducing a separate VOT2017 sequestered dataset, (ii) introducing a real-time tracking experiment and (iii) releasing a redesigned toolkit that supports complex experiments. The dataset, the evaluation kit and the results are publicly available at the challenge website.

485 citations

Book Chapter•DOI•

The BANCA database and evaluation protocol

Enrique Bailly-Bailliére¹, Samy Bengio, Frédéric Bimbot², M. Hamouz³, Josef Kittler³, Johnny Mariéthoz, Jiri Matas³, Kieron Messer³, Vlad Popovici⁴, Fabienne Poree², Belén Ruiz¹, Jean-Philippe Thiran⁴

Carlos III Health Institute¹, French Institute for Research in Computer Science and Automation², University of Surrey³, École Polytechnique Fédérale de Lausanne⁴

09 Jun 2003-Lecture Notes in Computer Science

TL;DR: A protocol for evaluating verification algorithms on the BANCA database, a new large, realistic and challenging multi-modal database intended for training and testing multi-modal verification systems, is described.

Abstract: In this paper we describe the acquisition and content of a new large, realistic and challenging multi-modal database intended for training and testing multi-modal verification systems. The BANCA database was captured in four European languages in two modalities (face and voice). For recording, both high and low quality microphones and cameras were used. The subjects were recorded in three different scenarios (controlled, degraded and adverse) over a period of three months. In total 208 people were captured, half men and half women. In this paper we also describe a protocol for evaluating verification algorithms on the database. The database will be made available to the research community through http://www.ee.surrey.ac.uk/Research/VSSP/banca.

470 citations

Proceedings Article•

All you need is a good init

Dmytro Mishkin¹, Jiri Matas¹

Czech Technical University in Prague¹

01 Jan 2016

TL;DR: Performance is evaluated on GoogLeNet, CaffeNet, FitNets and Residual nets and the state-of-the-art, or very close to it, is achieved on the MNIST, CIFAR-10/100 and ImageNet datasets.

Abstract: Layer-sequential unit-variance (LSUV) initialization, a simple method for weight initialization for deep net learning, is proposed. The method consists of two steps. First, pre-initialize weights of each convolution or inner-product layer with orthonormal matrices. Second, proceed from the first to the final layer, normalizing the variance of the output of each layer to be equal to one. Experiments with different activation functions (maxout, ReLU-family, tanh) show that the proposed initialization leads to learning of very deep nets that (i) produces networks with test accuracy better than or equal to standard methods and (ii) is at least as fast as the complex schemes proposed specifically for very deep nets such as FitNets (Romero et al. (2015)) and Highway (Srivastava et al. (2015)). Performance is evaluated on GoogLeNet, CaffeNet, FitNets and Residual nets and the state-of-the-art, or very close to it, is achieved on the MNIST, CIFAR-10/100 and ImageNet datasets.

416 citations
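
The two steps described in the LSUV abstract can be sketched for a small fully connected network as follows. This is an illustrative sketch and not the authors' implementation: the helper names, the use of Gram-Schmidt to obtain orthonormal rows (which assumes each layer has no more outputs than inputs), the ReLU between layers, the tolerance and the choice to measure variance over the whole pre-activation output of a batch are all assumptions.

```python
import math
import random


def orthonormal_rows(n_out, n_in, rng):
    """Step 1: orthonormal weight rows via Gram-Schmidt (assumes n_out <= n_in)."""
    rows = []
    for _ in range(n_out):
        v = [rng.gauss(0.0, 1.0) for _ in range(n_in)]
        for u in rows:
            dot = sum(a * b for a, b in zip(v, u))
            v = [a - dot * b for a, b in zip(v, u)]
        norm = math.sqrt(sum(a * a for a in v))
        rows.append([a / norm for a in v])
    return rows


def linear(weights, batch):
    """Apply a bias-free linear layer to every sample in the batch."""
    return [[sum(w * x for w, x in zip(row, sample)) for row in weights]
            for sample in batch]


def variance(batch):
    """Variance over all output values of the batch."""
    values = [v for sample in batch for v in sample]
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def lsuv_init(layer_sizes, batch, tol=0.01, max_iters=10, seed=0):
    """Step 2: go layer by layer, rescaling weights until output variance is ~1."""
    rng = random.Random(seed)
    layers, activations = [], batch
    for n_in, n_out in zip(layer_sizes, layer_sizes[1:]):
        weights = orthonormal_rows(n_out, n_in, rng)
        for _ in range(max_iters):
            var = variance(linear(weights, activations))
            if var == 0.0 or abs(var - 1.0) < tol:
                break
            scale = 1.0 / math.sqrt(var)
            weights = [[w * scale for w in row] for row in weights]
        layers.append(weights)
        # Assumed ReLU nonlinearity feeding the next layer.
        activations = [[max(0.0, v) for v in sample]
                       for sample in linear(weights, activations)]
    return layers
```

For example, `lsuv_init([8, 8, 4], batch)` with a batch of 8-dimensional samples returns two weight matrices whose pre-activation outputs on that batch have variance close to one.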

Journal Article•DOI•

Optimal Randomized RANSAC

Ondrej Chum, Jiri Matas

01 Aug 2008-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: A randomized model verification strategy for RANSAC that removes the requirement for a priori knowledge of the fraction of outliers and estimates the quantity online, and has performance close to the theoretically optimal and is up to four times faster than previously published methods.

Abstract: A randomized model verification strategy for RANSAC is presented. The proposed method finds, like RANSAC, a solution that is optimal with user-specified probability. The solution is found in time that is close to the shortest possible and superior to any deterministic verification strategy. A provably fastest model verification strategy is designed for the (theoretical) situation when the contamination of data by outliers is known. In this case, the algorithm is the fastest possible (on the average) of all randomized RANSAC algorithms guaranteeing a confidence in the solution. The derivation of the optimality property is based on Wald's theory of sequential decision making, in particular, a modified sequential probability ratio test (SPRT). Next, the R-RANSAC with SPRT algorithm is introduced. The algorithm removes the requirement for a priori knowledge of the fraction of outliers and estimates the quantity online. We show experimentally that on standard test data, the method has performance close to the theoretically optimal and is 2 to 10 times faster than standard RANSAC and is up to four times faster than previously published methods.

415 citations
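
The sequential test mentioned in the abstract can be illustrated with a short sketch of SPRT-style model verification. This is a simplified illustration, not the paper's full algorithm: the function name `sprt_verify` and its parameters are hypothetical, the decision threshold is passed in as a fixed value rather than derived, and the online estimation of the inlier fraction is omitted. Here `epsilon` is the assumed probability that a point is consistent with a good model and `delta` the assumed probability that a point is consistent with a bad one.

```python
def sprt_verify(is_consistent, points, epsilon, delta, threshold):
    """Sequentially test a model hypothesis against data points.

    Returns (accepted, points_checked). The likelihood ratio compares the
    'bad model' hypothesis against the 'good model' hypothesis; the model is
    rejected as soon as the ratio exceeds the threshold.
    """
    likelihood_ratio = 1.0
    for checked, point in enumerate(points, start=1):
        if is_consistent(point):
            # A consistent point is more likely under a good model.
            likelihood_ratio *= delta / epsilon
        else:
            # An inconsistent point is more likely under a bad model.
            likelihood_ratio *= (1.0 - delta) / (1.0 - epsilon)
        if likelihood_ratio > threshold:
            return False, checked  # rejected early, remaining points skipped
    return True, len(points)
```

The speed-up comes from the early exit: a model that disagrees with the first few points is discarded without evaluating the rest of the data, whereas standard RANSAC verifies every hypothesis against all points.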


Cited by

Journal Article•DOI•

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe¹

University of British Columbia¹

01 Nov 2004-International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

Abstract: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene. The features are invariant to image scale and rotation, and are shown to provide robust matching across a substantial range of affine distortion, change in 3D viewpoint, addition of noise, and change in illumination. The features are highly distinctive, in the sense that a single feature can be correctly matched with high probability against a large database of features from many images. This paper also describes an approach to using these features for object recognition. The recognition proceeds by matching individual features to a database of features from known objects using a fast nearest-neighbor algorithm, followed by a Hough transform to identify clusters belonging to a single object, and finally performing verification through least-squares solution for consistent pose parameters. This approach to recognition can robustly identify objects among clutter and occlusion while achieving near real-time performance.

46,906 citations

Journal Article•DOI•

Fiji: an open-source platform for biological-image analysis

Johannes Schindelin¹, Ignacio Arganda-Carreras², Erwin Frise³, Verena Kaynig⁴, Mark Longair⁴, Tobias Pietzsch¹, Stephan Preibisch¹, Curtis Rueden⁵, Stephan Saalfeld¹, Benjamin Schmid¹, Jean-Yves Tinevez¹, Daniel J. White¹, Volker Hartenstein¹, Kevin W. Eliceiri⁵, Pavel Tomancak¹, Albert Cardona¹

Max Planck Society¹, Massachusetts Institute of Technology², Lawrence Berkeley National Laboratory³, ETH Zurich⁴, University of Wisconsin-Madison⁵

01 Jul 2012-Nature Methods

TL;DR: Fiji is a distribution of the popular open-source software ImageJ focused on biological-image analysis that facilitates the transformation of new algorithms into ImageJ plugins that can be shared with end users through an integrated update system.

Abstract: Fiji is a distribution of the popular open-source software ImageJ focused on biological-image analysis. Fiji uses modern software engineering practices to combine powerful software libraries with a broad range of scripting languages to enable rapid prototyping of image-processing algorithms. Fiji facilitates the transformation of new algorithms into ImageJ plugins that can be shared with end users through an integrated update system. We propose Fiji as a platform for productive collaboration between computer science and biology research communities.

43,540 citations

Proceedings Article•DOI•

Going deeper with convolutions

Christian Szegedy¹, Wei Liu², Yangqing Jia¹, Pierre Sermanet¹, Scott Reed³, Dragomir Anguelov¹, Dumitru Erhan¹, Vincent Vanhoucke¹, Andrew Rabinovich

Google¹, University of North Carolina at Chapel Hill², University of Michigan³

07 Jun 2015

TL;DR: Inception is a deep convolutional neural network architecture that achieved a new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

Abstract: We propose a deep convolutional neural network architecture codenamed Inception that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14). The main hallmark of this architecture is the improved utilization of the computing resources inside the network. By a carefully crafted design, we increased the depth and width of the network while keeping the computational budget constant. To optimize quality, the architectural decisions were based on the Hebbian principle and the intuition of multi-scale processing. One particular incarnation used in our submission for ILSVRC14 is called GoogLeNet, a 22 layers deep network, the quality of which is assessed in the context of classification and detection.

40,257 citations

Book•

Deep Learning

Ian Goodfellow¹, Yoshua Bengio², Aaron Courville²

Google¹, Université de Montréal²

18 Nov 2016

TL;DR: Deep learning is a form of machine learning that enables computers to learn from experience and understand the world in terms of a hierarchy of concepts; its applications include natural language processing, speech recognition, computer vision, online recommendation systems, bioinformatics, and videogames.

Abstract: Deep learning is a form of machine learning that enables computers to learn from experience and understand the world in terms of a hierarchy of concepts. Because the computer gathers knowledge from experience, there is no need for a human computer operator to formally specify all the knowledge that the computer needs. The hierarchy of concepts allows the computer to learn complicated concepts by building them out of simpler ones; a graph of these hierarchies would be many layers deep. This book introduces a broad range of topics in deep learning. The text offers mathematical and conceptual background, covering relevant concepts in linear algebra, probability theory and information theory, numerical computation, and machine learning. It describes deep learning techniques used by practitioners in industry, including deep feedforward networks, regularization, optimization algorithms, convolutional networks, sequence modeling, and practical methodology; and it surveys such applications as natural language processing, speech recognition, computer vision, online recommendation systems, bioinformatics, and videogames. Finally, the book offers research perspectives, covering such theoretical topics as linear factor models, autoencoders, representation learning, structured probabilistic models, Monte Carlo methods, the partition function, approximate inference, and deep generative models. Deep Learning can be used by undergraduate or graduate students planning careers in either industry or research, and by software engineers who want to begin using deep learning in their products or platforms. A website offers supplementary material for both readers and instructors.

38,208 citations

Distinctive Image Features from Scale-Invariant Keypoints

Matthijs Dorst

01 Jan 2011

TL;DR: The Scale-Invariant Feature Transform (or SIFT) algorithm is a highly robust method to extract and consequently match distinctive invariant features from images that can then be used to reliably match objects in differing images.

Abstract: The Scale-Invariant Feature Transform (or SIFT) algorithm is a highly robust method to extract and consequently match distinctive invariant features from images. These features can then be used to reliably match objects in differing images. The algorithm was first proposed by Lowe [12] and further developed to increase performance, resulting in the classic paper [13] that served as the foundation for SIFT, which has played an important role in robotic and machine vision in the past decade.

14,708 citations
