Home
/
Authors
/
Moritz Menze

Author

Moritz Menze

Bio: Moritz Menze is an academic researcher from Leibniz University of Hanover. The author has contributed to research in topics: Optical flow & Motion estimation. The author has an hindex of 6, co-authored 9 publications receiving 1862 citations.

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Object scene flow for autonomous vehicles

[...]

Moritz Menze¹, Andreas Geiger•Institutions (1)

Leibniz University of Hanover¹

07 Jun 2015

TL;DR: A novel model and dataset for 3D scene flow estimation with an application to autonomous driving by representing each element in the scene by its rigid motion parameters and each superpixel by a 3D plane as well as an index to the corresponding object.

...read moreread less

Abstract: This paper proposes a novel model and dataset for 3D scene flow estimation with an application to autonomous driving. Taking advantage of the fact that outdoor scenes often decompose into a small number of independently moving objects, we represent each element in the scene by its rigid motion parameters and each superpixel by a 3D plane as well as an index to the corresponding object. This minimal representation increases robustness and leads to a discrete-continuous CRF where the data term decomposes into pairwise potentials between superpixels and objects. Moreover, our model intrinsically segments the scene into its constituting dynamic components. We demonstrate the performance of our model on existing benchmarks as well as a novel realistic dataset with scene flow ground truth. We obtain this dataset by annotating 400 dynamic scenes from the KITTI raw data collection using detailed 3D CAD models for all vehicles in motion. Our experiments also reveal novel challenges which cannot be handled by existing methods.

...read moreread less

1,918 citations

Journal Article•DOI•

Joint 3d Estimation of Vehicles and Scene Flow

[...]

Moritz Menze¹, Christian Heipke¹, Andreas Geiger²•Institutions (2)

Leibniz University of Hanover¹, Max Planck Society²

20 Aug 2015-ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences

TL;DR: A novel unified approach which reasons jointly about 3D scene flow as well as the pose, shape and motion of vehicles in the scene is proposed and the results provide a prove of concept and demonstrate the usefulness of the method.

...read moreread less

Abstract: . driving. While much progress has been made in recent years, imaging conditions in natural outdoor environments are still very challenging for current reconstruction and recognition methods. In this paper, we propose a novel unified approach which reasons jointly about 3D scene flow as well as the pose, shape and motion of vehicles in the scene. Towards this goal, we incorporate a deformable CAD model into a slanted-plane conditional random field for scene flow estimation and enforce shape consistency between the rendered 3D models and the parameters of all superpixels in the image. The association of superpixels to objects is established by an index variable which implicitly enables model selection. We evaluate our approach on the challenging KITTI scene flow dataset in terms of object and scene flow estimation. Our results provide a prove of concept and demonstrate the usefulness of our method.

...read moreread less

315 citations

Journal Article•DOI•

Object Scene Flow

[...]

Moritz Menze¹, Christian Heipke¹, Andreas Geiger², Andreas Geiger³•Institutions (3)

Leibniz University of Hanover¹, ETH Zurich², Max Planck Society³

22 Nov 2017-Isprs Journal of Photogrammetry and Remote Sensing

TL;DR: A unified random field model which reasons jointly about 3D scene flow as well as the location, shape and motion of vehicles in the observed scene is proposed, which is the first to provide stereo and optical flow ground truth for dynamic real-world urban scenes at large scale.

...read moreread less

Abstract: This work investigates the estimation of dense three-dimensional motion fields, commonly referred to as scene flow. While great progress has been made in recent years, large displacements and adverse imaging conditions as observed in natural outdoor environments are still very challenging for current approaches to reconstruction and motion estimation. In this paper, we propose a unified random field model which reasons jointly about 3D scene flow as well as the location, shape and motion of vehicles in the observed scene. We formulate the problem as the task of decomposing the scene into a small number of rigidly moving objects sharing the same motion parameters. Thus, our formulation effectively introduces long-range spatial dependencies which commonly employed local rigidity priors are lacking. Our inference algorithm then estimates the association of image segments and object hypotheses together with their three-dimensional shape and motion. We demonstrate the potential of the proposed approach by introducing a novel challenging scene flow benchmark which allows for a thorough comparison of the proposed scene flow approach with respect to various baseline models. In contrast to previous benchmarks, our evaluation is the first to provide stereo and optical flow ground truth for dynamic real-world urban scenes at large scale. Our experiments reveal that rigid motion segmentation can be utilized as an effective regularizer for the scene flow problem, improving upon existing two-frame scene flow methods. At the same time, our method yields plausible object segmentations without requiring an explicitly trained recognition model for a specific object class.

...read moreread less

198 citations

Book Chapter•DOI•

Discrete Optimization for Optical Flow

[...]

Moritz Menze¹, Christian Heipke¹, Andreas Geiger²•Institutions (2)

Leibniz University of Hanover¹, Max Planck Society²

07 Oct 2015

TL;DR: Three different strategies are investigated, each able to reduce computation and memory demands by several orders of magnitude, and their combination allows us to estimate large-displacement optical flow both accurately and efficiently and demonstrates the potential of discrete optimization for optical flow.

...read moreread less

Abstract: We propose to look at large-displacement optical flow from a discrete point of view. Motivated by the observation that sub-pixel accuracy is easily obtained given pixel-accurate optical flow, we conjecture that computing the integral part is the hardest piece of the problem. Consequently, we formulate optical flow estimation as a discrete inference problem in a conditional random field, followed by sub-pixel refinement. Naive discretization of the 2D flow space, however, is intractable due to the resulting size of the label set. In this paper, we therefore investigate three different strategies, each able to reduce computation and memory demands by several orders of magnitude. Their combination allows us to estimate large-displacement optical flow both accurately and efficiently and demonstrates the potential of discrete optimization for optical flow. We obtain state-of-the-art performance on MPI Sintel and KITTI.

...read moreread less

128 citations

Proceedings Article•

CamInSens - demonstration of a distributed smart camera system for in-situ threat detection

[...]

Carsten Grenz¹, Uwe Jänen¹, Jörg Hähner¹, Colin Kuntzsch², Moritz Menze², David D'Angelo, Manfred Bogen, Eduardo Monari - Show less +4 more•Institutions (2)

Augsburg College¹, Leibniz University of Hanover²

01 Oct 2012

TL;DR: Robust multi-camera multi-person tracking is combined with a flexible analysis module, which uses online learning classification algorithms as well as user-generated filters to process the persons' trajectories in the surveillance space.

...read moreread less

Abstract: The CamInSens system is a next-generation self-organizing video surveillance system that combines research being done in the fields of person-tracking, trajectory analysis, visual analytics, and self-organizing system management algorithms. Its purpose is the online threat detection by analysing anomalies in persons' trajectories. Therefore, robust multi-camera multi-person tracking is combined with a flexible analysis module, which uses online learning classification algorithms as well as user-generated filters to process the persons' trajectories in the surveillance space.

...read moreread less

8 citations

Cited by

PDF

Open Access

More filters

Proceedings Article•DOI•

PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume

[...]

Deqing Sun¹, Xiaodong Yang¹, Ming-Yu Liu¹, Jan Kautz¹•Institutions (1)

Nvidia¹

01 Jun 2018

TL;DR: PWC-Net as discussed by the authors uses the current optical flow estimate to warp the CNN features of the second image, which is processed by a CNN to estimate the optical flow, and achieves state-of-the-art performance on the MPI Sintel final pass and KITTI 2015 benchmarks.

...read moreread less

Abstract: We present a compact but effective CNN model for optical flow, called PWC-Net. PWC-Net has been designed according to simple and well-established principles: pyramidal processing, warping, and the use of a cost volume. Cast in a learnable feature pyramid, PWC-Net uses the current optical flow estimate to warp the CNN features of the second image. It then uses the warped features and features of the first image to construct a cost volume, which is processed by a CNN to estimate the optical flow. PWC-Net is 17 times smaller in size and easier to train than the recent FlowNet2 model. Moreover, it outperforms all published optical flow methods on the MPI Sintel final pass and KITTI 2015 benchmarks, running at about 35 fps on Sintel resolution (1024 A— 436) images. Our models are available on our project website.

...read moreread less

2,231 citations

Proceedings Article•DOI•

A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation

[...]

Nikolaus Mayer¹, Eddy Ilg¹, Philip Häusser², Philipp Fischer¹, Daniel Cremers², Alexey Dosovitskiy¹, Thomas Brox¹ - Show less +3 more•Institutions (2)

University of Freiburg¹, Technische Universität München²

07 Dec 2015-arXiv: Computer Vision and Pattern Recognition

TL;DR: In this article, a large-scale synthetic stereo video dataset is proposed to enable training and evaluation of optical flow estimation with a convolutional network and disparity estimation with CNNs.

...read moreread less

Abstract: Recent work has shown that optical flow estimation can be formulated as a supervised learning task and can be successfully solved with convolutional networks. Training of the so-called FlowNet was enabled by a large synthetically generated dataset. The present paper extends the concept of optical flow estimation via convolutional networks to disparity and scene flow estimation. To this end, we propose three synthetic stereo video datasets with sufficient realism, variation, and size to successfully train large networks. Our datasets are the first large-scale datasets to enable training and evaluating scene flow methods. Besides the datasets, we present a convolutional network for real-time disparity estimation that provides state-of-the-art results. By combining a flow and disparity estimation network and training it jointly, we demonstrate the first scene flow estimation with a convolutional network.

...read moreread less

1,759 citations

Proceedings Article•DOI•

End-to-End Learning of Geometry and Context for Deep Stereo Regression

[...]

Alex Kendall, Hayk Martirosyan, Saumitro Dasgupta, Peter Henry

01 Oct 2017

TL;DR: A novel deep learning architecture for regressing disparity from a rectified pair of stereo images is proposed, leveraging knowledge of the problem’s geometry to form a cost volume using deep feature representations and incorporating contextual information using 3-D convolutions over this volume.

...read moreread less

Abstract: We propose a novel deep learning architecture for regressing disparity from a rectified pair of stereo images. We leverage knowledge of the problem’s geometry to form a cost volume using deep feature representations. We learn to incorporate contextual information using 3-D convolutions over this volume. Disparity values are regressed from the cost volume using a proposed differentiable soft argmin operation, which allows us to train our method end-to-end to sub-pixel accuracy without any additional post-processing or regularization. We evaluate our method on the Scene Flow and KITTI datasets and on KITTI we set a new stateof-the-art benchmark, while being significantly faster than competing approaches.

...read moreread less

1,204 citations

Proceedings Article•DOI•

A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation

[...]

Nikolaus Mayer¹, Eddy Ilg¹, Philip Häusser², Philipp Fischer¹, Daniel Cremers², Alexey Dosovitskiy¹, Thomas Brox¹ - Show less +3 more•Institutions (2)

University of Freiburg¹, Technische Universität München²

27 Jun 2016

TL;DR: This paper proposes three synthetic stereo video datasets with sufficient realism, variation, and size to successfully train large networks and presents a convolutional network for real-time disparity estimation that provides state-of-the-art results.

...read moreread less

Abstract: Recent work has shown that optical flow estimation can be formulated as a supervised learning task and can be successfully solved with convolutional networks. Training of the so-called FlowNet was enabled by a large synthetically generated dataset. The present paper extends the concept of optical flow estimation via convolutional networks to disparity and scene flow estimation. To this end, we propose three synthetic stereo video datasets with sufficient realism, variation, and size to successfully train large networks. Our datasets are the first large-scale datasets to enable training and evaluation of scene flow methods. Besides the datasets, we present a convolutional network for real-time disparity estimation that provides state-of-the-art results. By combining a flow and disparity estimation network and training it jointly, we demonstrate the first scene flow estimation with a convolutional network.

...read moreread less

1,184 citations

Book Chapter•DOI•

RAFT: Recurrent All-Pairs Field Transforms for Optical Flow

[...]

Zachary Teed¹, Jia Deng¹•Institutions (1)

Princeton University¹

23 Aug 2020

TL;DR: RAFT as mentioned in this paper extracts per-pixel features, builds multi-scale 4D correlation volumes for all pairs of pixels, and iteratively updates a flow field through a recurrent unit that performs lookups on the correlation volumes.

...read moreread less

Abstract: We introduce Recurrent All-Pairs Field Transforms (RAFT), a new deep network architecture for optical flow. RAFT extracts per-pixel features, builds multi-scale 4D correlation volumes for all pairs of pixels, and iteratively updates a flow field through a recurrent unit that performs lookups on the correlation volumes. RAFT achieves state-of-the-art performance. On KITTI, RAFT achieves an F1-all error of 5.10%, a 16% error reduction from the best published result (6.10%). On Sintel (final pass), RAFT obtains an end-point-error of 2.855 pixels, a 30% error reduction from the best published result (4.098 pixels). In addition, RAFT has strong cross-dataset generalization as well as high efficiency in inference time, training speed, and parameter count. Code is available at https://github.com/princeton-vl/RAFT.

...read moreread less

1,006 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse