The Geometry of Multiple Images: The Laws That Govern the Formation of Multiple Images of a Scene and Some of Their Applications

Home
/
Papers
/
The Geometry of Multiple Images: The Laws That Govern the Formation of Multiple Images of a Scene and Some of Their Applications

Book•

The Geometry of Multiple Images: The Laws That Govern the Formation of Multiple Images of a Scene and Some of Their Applications

Olivier Faugeras, Quang-Tuan Luong, T. Papadopoulou

05 Mar 2001-

TL;DR: The state of knowledge in one subarea of vision is described, the geometric laws that relate different views of a scene from the perspective of various types of geometries, which is a unified framework for thinking about many geometric problems relevant to vision.

read less

Abstract: From the Publisher: with contributions from Theo Papadopoulo Over the last forty years, researchers have made great strides in elucidating the laws of image formation, processing, and understanding by animals, humans, and machines. This book describes the state of knowledge in one subarea of vision, the geometric laws that relate different views of a scene. Geometry, one of the oldest branches of mathematics, is the natural language for describing three-dimensional shapes and spatial relations. Projective geometry, the geometry that best models image formation, provides a unified framework for thinking about many geometric problems relevant to vision. The book formalizes and analyzes the relations between multiple views of a scene from the perspective of various types of geometries. A key feature is that it considers Euclidean and affine geometries as special cases of projective geometry. Images play a prominent role in computer communications. Producers and users of images, in particular three-dimensional images, require a framework for stating and solving problems. The book offers a number of conceptual tools and theoretical results useful for the design of machine vision algorithms. It also illustrates these tools and results with many examples of real applications.

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

A taxonomy and evaluation of dense two-frame stereo correspondence algorithms

[...]

Daniel Scharstein¹, Richard Szeliski², Ramin Zabih³•Institutions (3)

Middlebury College¹, Microsoft², Cornell University³

09 Dec 2001-International Journal of Computer Vision

TL;DR: This paper has designed a stand-alone, flexible C++ implementation that enables the evaluation of individual components and that can easily be extended to include new algorithms.

...read moreread less

Abstract: Stereo matching is one of the most active research areas in computer vision. While a large number of algorithms for stereo correspondence have been developed, relatively little work has been done on characterizing their performance. In this paper, we present a taxonomy of dense, two-frame stereo methods designed to assess the different components and design decisions made in individual stereo algorithms. Using this taxonomy, we compare existing stereo methods and present experiments evaluating the performance of many different variants. In order to establish a common software platform and a collection of data sets for easy evaluation, we have designed a stand-alone, flexible C++ implementation that enables the evaluation of individual components and that can be easily extended to include new algorithms. We have also produced several new multiframe stereo data sets with ground truth, and are making both the code and data sets available on the Web.

...read moreread less

7,458 citations

Proceedings Article•DOI•

Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV

[...]

Christoph Fehn

21 May 2004-electronic imaging

TL;DR: Details of a system that allows for an evolutionary introduction of depth perception into the existing 2D digital TV framework are presented and a comparison with the classical approach of "stereoscopic" video is compared.

...read moreread less

Abstract: This paper presents details of a system that allows for an evolutionary introduction of depth perception into the existing 2D digital TV framework. The work is part of the European Information Society Technologies (IST) project “Advanced Three-Dimensional Television System Technologies” (ATTEST), an activity, where industries, research centers and universities have joined forces to design a backwards-compatible, flexible and modular broadcast 3D-TV system. At the very heart of the described new concept is the generation and distribution of a novel data representation format, which consists of monoscopic color video and associated perpixel depth information. From these data, one or more “virtual” views of a real-world scene can be synthesized in real-time at the receiver side (i. e. a 3D-TV set-top box) by means of so-called depth-image-based rendering (DIBR) techniques. This publication will provide: (1) a detailed description of the fundamentals of this new approach on 3D-TV; (2) a comparison with the classical approach of “stereoscopic” video; (3) a short introduction to DIBR techniques in general; (4) the development of a specific DIBR algorithm that can be used for the efficient generation of high-quality “virtual” stereoscopic views; (5) a number of implementation details that are specific to the current state of the development; (6) research on the backwards-compatible compression and transmission of 3D imagery using state-of-the-art MPEG (Moving Pictures Expert Group) tools.

...read moreread less

1,560 citations

Journal Article•DOI•

Advances in computational stereo

[...]

Myron Z. Brown¹, Darius Burschka¹, Gregory D. Hager¹•Institutions (1)

Johns Hopkins University¹

01 Aug 2003-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: This work reviews recent advances in computational stereo, focusing primarily on three important topics: correspondence methods, methods for occlusion, and real-time implementations.

...read moreread less

Abstract: Extraction of three-dimensional structure of a scene from stereo images is a problem that has been studied by the computer vision community for decades. Early work focused on the fundamentals of image correspondence and stereo geometry. Stereo research has matured significantly throughout the years and many advances in computational stereo continue to be made, allowing stereo to be applied to new and more demanding problems. We review recent advances in computational stereo, focusing primarily on three important topics: correspondence methods, methods for occlusion, and real-time implementations. Throughout, we present tables that summarize and draw distinctions among key ideas and approaches. Where available, we provide comparative analyses and we make suggestions for analyses yet to be done.

...read moreread less

1,274 citations

Journal Article•DOI•

Visual Modeling with a Hand-Held Camera

[...]

Marc Pollefeys¹, Luc Van Gool², Maarten Vergauwen², Frank Verbiest², Kurt Cornelis², Jan Tops², Reinhard Koch³ - Show less +3 more•Institutions (3)

University of North Carolina at Chapel Hill¹, Katholieke Universiteit Leuven², University of Kiel³

21 Sep 2004-International Journal of Computer Vision

TL;DR: A complete system to build visual models from camera images is presented and a combined approach with view-dependent geometry and texture is presented, as an application fusion of real and virtual scenes is also shown.

...read moreread less

Abstract: In this paper a complete system to build visual models from camera images is presented. The system can deal with uncalibrated image sequences acquired with a hand-held camera. Based on tracked or matched features the relations between multiple views are computed. From this both the structure of the scene and the motion of the camera are retrieved. The ambiguity on the reconstruction is restricted from projective to metric through self-calibration. A flexible multi-view stereo matching scheme is used to obtain a dense estimation of the surface geometry. From the computed data different types of visual models are constructed. Besides the traditional geometry- and image-based approaches, a combined approach with view-dependent geometry and texture is presented. As an application fusion of real and virtual scenes is also shown.

...read moreread less

1,029 citations

Cites background from "The Geometry of Multiple Images: Th..."

...A more in depth description can be found in Faugeras et al. (2001) and Hartley and Zisserman (2000)....
[...]

Journal Article•DOI•

Robust Mapping and Localization in Indoor Environments Using Sonar Data

[...]

Juan D. Tardós¹, José Neira¹, Paul Newman², John J. Leonard²•Institutions (2)

University of Zaragoza¹, Massachusetts Institute of Technology²

01 Apr 2002-The International Journal of Robotics Research

TL;DR: A perceptual grouping process that permits the robust identification and localization of environmental features from the sparse and noisy sonar data, and a map joining technique that allows the system to build a sequence of independent limited-size stochastic maps and join them in a globally consistent way.

...read moreread less

Abstract: In this paper we describe a new technique for the creation of featurebased stochastic maps using standard Polaroid sonar sensors. The fundamental contributions of our proposal are: (1) a perceptual grouping process that permits the robust identification and localization of environmental features, such as straight segments and corners, from the sparse and noisy sonar data; (2) a map joining technique that allows the system to build a sequence of independent limited-size stochastic maps and join them in a globally consistent way; (3) a robust mechanism to determine which features in a stochastic map correspond to the same environment feature, allowing the system to update the stochastic map accordingly, and perform tasks such as revisiting and loop closing. We demonstrate the practicality of this approach by building a geometric map of a medium size, real indoor environment, with several people moving around the robot. Maps built from laser data for the same experiment are provided for comparison.

...read moreread less

577 citations

Cites background from "The Geometry of Multiple Images: Th..."

...However, a great deal of research in the last decade has shown that, using a single moving camera, it is possible to determine both the camera motion and the environment structure (Hartley and Zisserman 2000; Faugeras and Luong 2001)....
[...]