Live Metric 3D Reconstruction on Mobile Phones

doi:10.1109/ICCV.2013.15

Home
/
Papers
/
Live Metric 3D Reconstruction on Mobile Phones

Proceedings Article•DOI•

Live Metric 3D Reconstruction on Mobile Phones

Petri Tanskanen¹, Kalin Kolev¹, Lorenz Meier¹, Federico Camposeco¹, Olivier Saurer¹, Marc Pollefeys¹ - Show less +2 more•Institutions (1)

ETH Zurich¹

01 Dec 2013-pp 65-72

TL;DR: This paper proposes a complete on-device 3D reconstruction pipeline for mobile monocular hand-held devices, which generates dense 3D models with absolute scale on-site while simultaneously supplying the user with real-time interactive feedback.

read less

Abstract: In this paper, we propose a complete on-device 3D reconstruction pipeline for mobile monocular hand-held devices, which generates dense 3D models with absolute scale on-site while simultaneously supplying the user with real-time interactive feedback. The method fills a gap in current cloud-based mobile reconstruction services as it ensures at capture time that the acquired image set fulfills desired quality and completeness criteria. In contrast to existing systems, the developed framework offers multiple innovative solutions. In particular, we investigate the usability of the available on-device inertial sensors to make the tracking and mapping process more resilient to rapid motions and to estimate the metric scale of the captured scene. Moreover, we propose an efficient and accurate scheme for dense stereo matching which allows to reduce the processing time to interactive speed. We demonstrate the performance of the reconstruction pipeline on multiple challenging indoor and outdoor scenes of different size and depth variability.

...read moreread less

Content maybe subject to copyright Report

Citations

PDF

Open Access

More filters

Journal Article•DOI•

Tanks and temples: benchmarking large-scale scene reconstruction

[...]

A. Knapitsch¹, Jaesik Park¹, Qian-Yi Zhou¹, Vladlen Koltun¹•Institutions (1)

Intel¹

20 Jul 2017-ACM Transactions on Graphics

TL;DR: A benchmark for image-based 3D reconstruction with high-resolution video sequences provided as input, supporting the development of novel pipelines that take advantage of video input to increase reconstruction fidelity.

...read moreread less

Abstract: We present a benchmark for image-based 3D reconstruction. The benchmark sequences were acquired outside the lab, in realistic conditions. Ground-truth data was captured using an industrial laser scanner. The benchmark includes both outdoor scenes and indoor environments. High-resolution video sequences are provided as input, supporting the development of novel pipelines that take advantage of video input to increase reconstruction fidelity. We report the performance of many image-based 3D reconstruction pipelines on the new benchmark. The results point to exciting challenges and opportunities for future work.

...read moreread less

553 citations

Cites background from "Live Metric 3D Reconstruction on Mo..."

...…have considered 3D reconstruction from video [Frahm et al. 2010; Kolev et al. 2014; Newcombe et al. 2011; Pollefeys et al. 2008; Schöps et al. 2015; Tanskanen et al. 2013; Vogiatzis and Hernández 2011; Wendel et al. 2012], much more work in the literature is devoted to reconstruction from image…...
[...]
...While a number of projects have considered 3D reconstruction from video [Frahm et al. 2010; Kolev et al. 2014; Newcombe et al. 2011; Pollefeys et al. 2008; Schöps et al. 2015; Tanskanen et al. 2013; Vogiatzis and Hernández 2011; Wendel et al. 2012], much more work in the literature is devoted to reconstruction from image collections....
[...]

Proceedings Article•DOI•

A Multi-view Stereo Benchmark with High-Resolution Images and Multi-camera Videos

[...]

Thomas Schops¹, Johannes L. Schonberger¹, Silvano Galliani¹, Torsten Sattler¹, Konrad Schindler¹, Marc Pollefeys¹, Andreas Geiger² - Show less +3 more•Institutions (2)

ETH Zurich¹, Max Planck Society²

21 Jul 2017

TL;DR: This benchmark is the first to cover the important use case of hand-held mobile devices while also providing high-resolution DSLR camera images and provides data at significantly higher temporal and spatial resolution.

...read moreread less

Abstract: Motivated by the limitations of existing multi-view stereo benchmarks, we present a novel dataset for this task. Towards this goal, we recorded a variety of indoor and outdoor scenes using a high-precision laser scanner and captured both high-resolution DSLR imagery as well as synchronized low-resolution stereo videos with varying fields-of-view. To align the images with the laser scans, we propose a robust technique which minimizes photometric errors conditioned on the geometry. In contrast to previous datasets, our benchmark provides novel challenges and covers a diverse set of viewpoints and scene types, ranging from natural scenes to man-made indoor and outdoor environments. Furthermore, we provide data at significantly higher temporal and spatial resolution. Our benchmark is the first to cover the important use case of hand-held mobile devices while also providing high-resolution DSLR camera images. We make our datasets and an online evaluation server available at http://www.eth3d.net.

...read moreread less

537 citations

Cites background from "Live Metric 3D Reconstruction on Mo..."

...Applications range from 3D reconstruction of objects [4] and larger scenes [3, 5, 35] over dense sensing for autonomous vehicles [6–8, 11, 30] or obstacle detection [10] to 3D reconstruction from mobile devices [14, 20, 28, 36, 41]....
[...]
...(ii) By now, mobile devices have become powerful enough for real-time stereo [20,28,30,36,41], creating the need for benchmark datasets that model the acquisition process typical for such hand-held devices....
[...]

Book•

Multi-View Stereo: A Tutorial

[...]

Yasutaka Furukawa¹, Carlos Hernández²•Institutions (2)

Washington University in St. Louis¹, Google²

30 May 2015

TL;DR: This tutorial presents a hands-on view of the field of multi-view stereo with a focus on practical algorithms, describing in detail its main two ingredients: robust implementations of photometric consistency measures, and efficient optimization algorithms.

...read moreread less

Abstract: This tutorial presents a hands-on view of the field of multi-view stereo with a focus on practical algorithms. Multi-view stereo algorithms are able to construct highly detailed 3D models from images alone. They take a possibly very large set of images and construct a 3D plausible geometry that explains the images under some reasonable assumptions, the most important being scene rigidity. The tutorial frames the multiview stereo problem as an image/geometry consistency optimization problem. It describes in detail its main two ingredients: robust implementations of photometric consistency measures, and efficient optimization algorithms. It then presents how these main ingredients are used by some of the most successful algorithms, applied into real applications, and deployed as products in the industry. Finally it describes more advanced approaches exploiting domain-specific knowledge such as structural priors, and gives an overview of the remaining challenges and future research directions.

...read moreread less

459 citations

Cites background from "Live Metric 3D Reconstruction on Mo..."

...Note however that VSLAM has made very quick progress recently in the context of MVS [145, 180]....
[...]

Journal Article•DOI•

A Survey of Surface Reconstruction from Point Clouds

[...]

Matthew Berger¹, Andrea Tagliasacchi², Lee M. Seversky¹, Pierre Alliez³, Gaël Guennebaud³, Joshua A. Levine⁴, Andrei Sharf⁵, Cláudio T. Silva⁶ - Show less +4 more•Institutions (6)

Air Force Research Laboratory¹, University of Victoria², French Institute for Research in Computer Science and Automation³, Clemson University⁴, Ben-Gurion University of the Negev⁵, New York University⁶

01 Jan 2017-Computer Graphics Forum

TL;DR: A holistic view of surface reconstruction is considered, which shows a detailed characterization of the field, highlights similarities between diverse reconstruction techniques and provides directions for future work in surface reconstruction.

...read moreread less

Abstract: The area of surface reconstruction has seen substantial progress in the past two decades. The traditional problem addressed by surface reconstruction is to recover the digital representation of a physical shape that has been scanned, where the scanned data contain a wide variety of defects. While much of the earlier work has been focused on reconstructing a piece-wise smooth representation of the original shape, recent work has taken on more specialized priors to address significantly challenging data imperfections, where the reconstruction can take on different representations-not necessarily the explicit geometry. We survey the field of surface reconstruction, and provide a categorization with respect to priors, data imperfections and reconstruction output. By considering a holistic view of surface reconstruction, we show a detailed characterization of the field, highlight similarities between diverse reconstruction techniques and provide directions for future work in surface reconstruction.

...read moreread less

405 citations

Proceedings Article•DOI•

State of the Art in Surface Reconstruction from Point Clouds

[...]

Matthew Berger¹, Andrea Tagliasacchi, Lee M. Seversky¹, Pierre Alliez, Joshua A. Levine², Andrei Sharf³, Cláudio T. Silva - Show less +3 more•Institutions (3)

Air Force Research Laboratory¹, University of Utah², Ben-Gurion University of the Negev³

07 Apr 2014

TL;DR: A holistic view of surface reconstruction is considered, providing a detailed characterization of the field, highlights similarities between diverse reconstruction techniques, and provides directions for future work in surface reconstruction.

...read moreread less

Abstract: The area of surface reconstruction has seen substantial progress in the past two decades. The traditional problem addressed by surface reconstruction is to recover the digital representation of a physical shape that has been scanned, where the scanned data contains a wide variety of defects. While much of the earlier work has been focused on reconstructing a piece-wise smooth representation of the original shape, recent work has taken on more specialized priors to address significantly challenging data imperfections, where the reconstruction can take on different representations -- not necessarily the explicit geometry. This state-of-the-art report surveys the field of surface reconstruction, providing a categorization with respect to priors, data imperfections, and reconstruction output. By considering a holistic view of surface reconstruction, this report provides a detailed characterization of the field, highlights similarities between diverse reconstruction techniques, and provides directions for future work in surface reconstruction.

...read moreread less

330 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48

Collapse

References

PDF

Open Access

More filters

Book•

Multiple view geometry in computer vision

[...]

Richard Hartley¹, Andrew Zisserman²•Institutions (2)

Australian National University¹, University of Oxford²

01 Jan 2000

TL;DR: In this article, the authors provide comprehensive background material and explain how to apply the methods and implement the algorithms directly in a unified framework, including geometric principles and how to represent objects algebraically so they can be computed and applied.

...read moreread less

Abstract: From the Publisher: A basic problem in computer vision is to understand the structure of a real world scene given several images of it. Recent major developments in the theory and practice of scene reconstruction are described in detail in a unified framework. The book covers the geometric principles and how to represent objects algebraically so they can be computed and applied. The authors provide comprehensive background material and explain how to apply the methods and implement the algorithms directly.

...read moreread less

15,558 citations

Multiple View Geometry in Computer Vision.

[...]

Bernhard P. Wrobel

01 Jan 2001

TL;DR: This book is referred to read because it is an inspiring book to give you more chance to get experiences and also thoughts and it will show the best book collections and completed collections.

...read moreread less

Abstract: Downloading the book in this website lists can give you more advantages. It will show you the best book collections and completed collections. So many books can be found in this website. So, this is not only this multiple view geometry in computer vision. However, this book is referred to read because it is an inspiring book to give you more chance to get experiences and also thoughts. This is simple, read the soft file of the book and you get it.

...read moreread less

14,282 citations

"Live Metric 3D Reconstruction on Mo..." refers methods in this paper

...Its implementation is based on the method using the Schur complement trick that is described in [2]....
[...]

Proceedings Article•DOI•

ORB: An efficient alternative to SIFT or SURF

[...]

Ethan Rublee¹, Vincent Rabaud¹, Kurt Konolige¹, Gary Bradski¹•Institutions (1)

Willow Garage¹

06 Nov 2011

TL;DR: This paper proposes a very fast binary descriptor based on BRIEF, called ORB, which is rotation invariant and resistant to noise, and demonstrates through experiments how ORB is at two orders of magnitude faster than SIFT, while performing as well in many situations.

...read moreread less

Abstract: Feature matching is at the base of many computer vision problems, such as object recognition or structure from motion. Current methods rely on costly descriptors for detection and matching. In this paper, we propose a very fast binary descriptor based on BRIEF, called ORB, which is rotation invariant and resistant to noise. We demonstrate through experiments how ORB is at two orders of magnitude faster than SIFT, while performing as well in many situations. The efficiency is tested on several real-world applications, including object detection and patch-tracking on a smart phone.

...read moreread less

8,702 citations

"Live Metric 3D Reconstruction on Mo..." refers methods in this paper

...ORB features [14] are extracted from both frames and matched....
[...]

Proceedings Article•DOI•

Good features to track

[...]

Jianbo Shi¹, Tomasi²•Institutions (2)

Cornell University¹, Stanford University²

21 Jun 1994

TL;DR: A feature selection criterion that is optimal by construction because it is based on how the tracker works, and a feature monitoring method that can detect occlusions, disocclusions, and features that do not correspond to points in the world are proposed.

...read moreread less

Abstract: No feature-based vision system can work unless good features can be identified and tracked from frame to frame. Although tracking itself is by and large a solved problem, selecting features that can be tracked well and correspond to physical points in the world is still hard. We propose a feature selection criterion that is optimal by construction because it is based on how the tracker works, and a feature monitoring method that can detect occlusions, disocclusions, and features that do not correspond to points in the world. These methods are based on a new tracking algorithm that extends previous Newton-Raphson style search methods to work under affine image transformations. We test performance with several simulations and experiments. >

...read moreread less

8,432 citations

"Live Metric 3D Reconstruction on Mo..." refers background in this paper

...To this end, a list of candidates is created from non maximum suppressed FAST corners that have a Shi-Tomasi score [18] above a certain threshold....
[...]

Journal Article•DOI•

A taxonomy and evaluation of dense two-frame stereo correspondence algorithms

[...]

Daniel Scharstein¹, Richard Szeliski², Ramin Zabih³•Institutions (3)

Middlebury College¹, Microsoft², Cornell University³

09 Dec 2001-International Journal of Computer Vision

TL;DR: This paper has designed a stand-alone, flexible C++ implementation that enables the evaluation of individual components and that can easily be extended to include new algorithms.

...read moreread less

Abstract: Stereo matching is one of the most active research areas in computer vision. While a large number of algorithms for stereo correspondence have been developed, relatively little work has been done on characterizing their performance. In this paper, we present a taxonomy of dense, two-frame stereo methods designed to assess the different components and design decisions made in individual stereo algorithms. Using this taxonomy, we compare existing stereo methods and present experiments evaluating the performance of many different variants. In order to establish a common software platform and a collection of data sets for easy evaluation, we have designed a stand-alone, flexible C++ implementation that enables the evaluation of individual components and that can be easily extended to include new algorithms. We have also produced several new multiframe stereo data sets with ground truth, and are making both the code and data sets available on the Web.

...read moreread less

7,458 citations