Home
/
Authors
/
Jure Zbontar

Author

Jure Zbontar

Other affiliations: University of Ljubljana

Bio: Jure Zbontar is an academic researcher from Facebook. The author has contributed to research in topics: Autoencoder & Rank (linear algebra). The author has an hindex of 10, co-authored 19 publications receiving 1361 citations. Previous affiliations of Jure Zbontar include University of Ljubljana.

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Computing the stereo matching cost with a convolutional neural network

[...]

Jure Zbontar¹, Yann LeCun²•Institutions (2)

University of Ljubljana¹, New York University²

07 Jun 2015

TL;DR: This work trains a convolutional neural network to predict how well two image patches match and uses it to compute the stereo matching cost, which achieves an error rate of 2.61% on the KITTI stereo dataset.

...read moreread less

Abstract: We present a method for extracting depth information from a rectified image pair. We train a convolutional neural network to predict how well two image patches match and use it to compute the stereo matching cost. The cost is refined by cross-based cost aggregation and semiglobal matching, followed by a left-right consistency check to eliminate errors in the occluded regions. Our stereo method achieves an error rate of 2.61% on the KITTI stereo dataset and is currently (August 2014) the top performing method on this dataset.

...read moreread less

762 citations

Posted Content•

fastMRI: An Open Dataset and Benchmarks for Accelerated MRI.

[...]

Jure Zbontar, Florian Knoll, Anuroop Sriram, Matthew J. Muckley, Mary Bruno, Aaron Defazio, Marc Parente, Krzysztof J. Geras, Joe Katsnelson, Hersh Chandarana, Zizhao Zhang, Michal Drozdzal, Adriana Romero, Michael G. Rabbat, Pascal Vincent, James Pinkerton, Duo Wang, Nafissa Yakubova, Erich James Owens, C. Lawrence Zitnick, Michael P. Recht, Daniel K. Sodickson, Yvonne W. Lui - Show less +19 more

21 Nov 2018-arXiv: Computer Vision and Pattern Recognition

TL;DR: The fastMRI dataset is introduced, a large-scale collection of both raw MR measurements and clinical MR images that can be used for training and evaluation of machine-learning approaches to MR image reconstruction.

...read moreread less

Abstract: Accelerating Magnetic Resonance Imaging (MRI) by taking fewer measurements has the potential to reduce medical costs, minimize stress to patients and make MRI possible in applications where it is currently prohibitively slow or expensive. We introduce the fastMRI dataset, a large-scale collection of both raw MR measurements and clinical MR images, that can be used for training and evaluation of machine-learning approaches to MR image reconstruction. By introducing standardized evaluation criteria and a freely-accessible dataset, our goal is to help the community make rapid advances in the state of the art for MR image reconstruction. We also provide a self-contained introduction to MRI for machine learning researchers with no medical imaging background.

...read moreread less

480 citations

Journal Article•DOI•

fastMRI: A Publicly Available Raw k-Space and DICOM Dataset of Knee Images for Accelerated MR Image Reconstruction Using Machine Learning.

[...]

Florian Knoll¹, Jure Zbontar², Anuroop Sriram², Matthew J. Muckley¹, Mary Bruno², Aaron Defazio³, Marc Parente¹, Krzysztof J. Geras¹, Joe Katsnelson¹, Hersh Chandarana¹, Zizhao Zhang³, Michal Drozdzalv², Adriana Romero², Michael G. Rabbat², Pascal Vincent², James Pinkerton², Duo Wang¹, Nafissa Yakubova¹, Erich James Owens², C. Lawrence Zitnick², Michael P. Recht¹, Daniel K. Sodickson, Yvonne W. Lui¹ - Show less +19 more•Institutions (3)

New York University¹, Facebook², University of Florida³

29 Jan 2020

TL;DR: A publicly available dataset containing k-space data as well as Digital Imaging and Communications in Medicine image data of knee images for accelerated MR image reconstruction using machine learning is presented.

...read moreread less

Abstract: A publicly available dataset containing k-space data as well as Digital Imaging and Communications in Medicine image data of knee images for accelerated MR image reconstruction using machine learning is presented.

...read moreread less

211 citations

Book Chapter•DOI•

End-to-End Variational Networks for Accelerated MRI Reconstruction

[...]

Anuroop Sriram¹, Jure Zbontar¹, Tullie Murrell¹, Aaron Defazio¹, C. Lawrence Zitnick¹, Nafissa Yakubova¹, Florian Knoll², Patricia M. Johnson² - Show less +4 more•Institutions (2)

Facebook¹, New York University²

04 Oct 2020

TL;DR: This paper presents a new approach to this problem that extends previously proposed variational methods by learning fully end-to-end and obtains new state-of-the-art results on the fastMRI dataset for both brain and knee MRIs.

...read moreread less

Abstract: The slow acquisition speed of magnetic resonance imaging (MRI) has led to the development of two complementary methods: acquiring multiple views of the anatomy simultaneously (parallel imaging) and acquiring fewer samples than necessary for traditional signal processing methods (compressed sensing). While the combination of these methods has the potential to allow much faster scan times, reconstruction from such undersampled multi-coil data has remained an open problem. In this paper, we present a new approach to this problem that extends previously proposed variational methods by learning fully end-to-end. Our method obtains new state-of-the-art results on the fastMRI dataset [16] for both brain and knee MRIs.

...read moreread less

121 citations

Journal Article•DOI•

Advancing machine learning for MR image reconstruction with an open competition: Overview of the 2019 fastMRI challenge

[...]

Florian Knoll¹, Tullie Murrell², Anuroop Sriram², Nafissa Yakubova², Jure Zbontar², Michael G. Rabbat², Aaron Defazio², Matthew J. Muckley¹, Daniel K. Sodickson¹, C. Lawrence Zitnick², Michael P. Recht¹ - Show less +7 more•Institutions (2)

New York University¹, Facebook²

06 Jan 2020-arXiv: Image and Video Processing

TL;DR: To advance research in the field of machine learning for MR image reconstruction with an open challenge.

...read moreread less

Abstract: Purpose: To advance research in the field of machine learning for MR image reconstruction with an open challenge. Methods: We provided participants with a dataset of raw k-space data from 1,594 consecutive clinical exams of the knee. The goal of the challenge was to reconstruct images from these data. In order to strike a balance between realistic data and a shallow learning curve for those not already familiar with MR image reconstruction, we ran multiple tracks for multi-coil and single-coil data. We performed a two-stage evaluation based on quantitative image metrics followed by evaluation by a panel of radiologists. The challenge ran from June to December of 2019. Results: We received a total of 33 challenge submissions. All participants chose to submit results from supervised machine learning approaches. Conclusion: The challenge led to new developments in machine learning for image reconstruction, provided insight into the current state of the art in the field, and highlighted remaining hurdles for clinical adoption.

...read moreread less

111 citations

1
2
3
4
…

Cited by

PDF

Open Access

More filters

Proceedings Article•DOI•

FlowNet: Learning Optical Flow with Convolutional Networks

[...]

Alexey Dosovitskiy¹, Philipp Fischery, Eddy Ilg¹, Philip Häusser², Caner Hazirbas², Vladimir Golkov², Patrick van der Smagt², Daniel Cremers², Thomas Brox¹ - Show less +5 more•Institutions (2)

University of Freiburg¹, Technische Universität München²

07 Dec 2015

TL;DR: In this paper, the authors propose and compare two architectures: a generic architecture and another one including a layer that correlates feature vectors at different image locations, and show that networks trained on this unrealistic data still generalize very well to existing datasets such as Sintel and KITTI.

...read moreread less

Abstract: Convolutional neural networks (CNNs) have recently been very successful in a variety of computer vision tasks, especially on those linked to recognition. Optical flow estimation has not been among the tasks CNNs succeeded at. In this paper we construct CNNs which are capable of solving the optical flow estimation problem as a supervised learning task. We propose and compare two architectures: a generic architecture and another one including a layer that correlates feature vectors at different image locations. Since existing ground truth data sets are not sufficiently large to train a CNN, we generate a large synthetic Flying Chairs dataset. We show that networks trained on this unrealistic data still generalize very well to existing datasets such as Sintel and KITTI, achieving competitive accuracy at frame rates of 5 to 10 fps.

...read moreread less

3,833 citations

Journal Article•DOI•

Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources

[...]

Xiao Xiang Zhu¹, Devis Tuia², Lichao Mou¹, Gui-Song Xia³, Liangpei Zhang³, Feng Xu⁴, Friedrich Fraundorfer⁵ - Show less +3 more•Institutions (5)

Technische Universität München¹, Wageningen University and Research Centre², Wuhan University³, Fudan University⁴, Graz University of Technology⁵

01 Dec 2017-IEEE Geoscience and Remote Sensing Magazine

TL;DR: The challenges of using deep learning for remote-sensing data analysis are analyzed, recent advances are reviewed, and resources are provided that hope will make deep learning in remote sensing seem ridiculously simple.

...read moreread less

Abstract: Central to the looming paradigm shift toward data-intensive science, machine-learning techniques are becoming increasingly important. In particular, deep learning has proven to be both a major breakthrough and an extremely powerful tool in many fields. Shall we embrace deep learning as the key to everything? Or should we resist a black-box solution? These are controversial issues within the remote-sensing community. In this article, we analyze the challenges of using deep learning for remote-sensing data analysis, review recent advances, and provide resources we hope will make deep learning in remote sensing seem ridiculously simple. More importantly, we encourage remote-sensing scientists to bring their expertise into deep learning and use it as an implicit general model to tackle unprecedented, large-scale, influential challenges, such as climate change and urbanization.

...read moreread less

2,095 citations

Proceedings Article•DOI•

Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-scale Convolutional Architecture

[...]

David Eigen¹, Rob Fergus²•Institutions (2)

New York University¹, Facebook²

07 Dec 2015

TL;DR: This paper addresses three different computer vision tasks using a single basic architecture: depth prediction, surface normal estimation, and semantic labeling using a multiscale convolutional network that is able to adapt easily to each task using only small modifications.

...read moreread less

Abstract: In this paper we address three different computer vision tasks using a single basic architecture: depth prediction, surface normal estimation, and semantic labeling. We use a multiscale convolutional network that is able to adapt easily to each task using only small modifications, regressing from the input image to the output map directly. Our method progressively refines predictions using a sequence of scales, and captures many image details without any superpixels or low-level segmentation. We achieve state-of-the-art performance on benchmarks for all three tasks.

...read moreread less

2,046 citations

Proceedings Article•DOI•

Learning to compare image patches via convolutional neural networks

[...]

Sergey Zagoruyko¹, Nikos Komodakis¹•Institutions (1)

École des ponts ParisTech¹

07 Jun 2015

TL;DR: This paper shows how to learn directly from image data a general similarity function for comparing image patches, which is a task of fundamental importance for many computer vision problems, and opts for a CNN-based model that is trained to account for a wide variety of changes in image appearance.

...read moreread less

Abstract: In this paper we show how to learn directly from image data (i.e., without resorting to manually-designed features) a general similarity function for comparing image patches, which is a task of fundamental importance for many computer vision problems. To encode such a function, we opt for a CNN-based model that is trained to account for a wide variety of changes in image appearance. To that end, we explore and study multiple neural network architectures, which are specifically adapted to this task. We show that such an approach can significantly outperform the state-of-the-art on several problems and benchmark datasets.

...read moreread less

1,364 citations

Book Chapter•DOI•

Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue

[...]

Ravi Garg¹, B. G. Vijay Kumar¹, Gustavo Carneiro¹, Ian Reid¹•Institutions (1)

University of Adelaide¹

08 Oct 2016

TL;DR: This work proposes a unsupervised framework to learn a deep convolutional neural network for single view depth prediction, without requiring a pre-training stage or annotated ground-truth depths, and shows that this network trained on less than half of the KITTI dataset gives comparable performance to that of the state-of-the-art supervised methods for singleView depth estimation.

...read moreread less

Abstract: A significant weakness of most current deep Convolutional Neural Networks is the need to train them using vast amounts of manually labelled data. In this work we propose a unsupervised framework to learn a deep convolutional neural network for single view depth prediction, without requiring a pre-training stage or annotated ground-truth depths. We achieve this by training the network in a manner analogous to an autoencoder. At training time we consider a pair of images, source and target, with small, known camera motion between the two such as a stereo pair. We train the convolutional encoder for the task of predicting the depth map for the source image. To do so, we explicitly generate an inverse warp of the target image using the predicted depth and known inter-view displacement, to reconstruct the source image; the photometric error in the reconstruction is the reconstruction loss for the encoder. The acquisition of this training data is considerably simpler than for equivalent systems, requiring no manual annotation, nor calibration of depth sensor to camera. We show that our network trained on less than half of the KITTI dataset gives comparable performance to that of the state-of-the-art supervised methods for single view depth estimation.

...read moreread less

1,238 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse