
Showing papers by "Jana Kosecka published in 2009"


Proceedings ArticleDOI
01 Sep 2009
TL;DR: This work proposes an integrated multi-view sensor fusion approach that combines information from multiple color cameras and multiple ToF depth sensors to obtain high quality dense and detailed 3D models of scenes challenging for stereo alone, while simultaneously reducing complex noise of ToF sensors.
Abstract: Multi-view stereo methods frequently fail to properly reconstruct 3D scene geometry if visible texture is sparse or the scene exhibits difficult self-occlusions. Time-of-Flight (ToF) depth sensors can provide 3D information regardless of texture, but with only limited resolution and accuracy. To find an optimal reconstruction, we propose an integrated multi-view sensor fusion approach that combines information from multiple color cameras and multiple ToF depth sensors. First, multi-view ToF sensor measurements are combined to obtain a coarse but complete model. Then, the initial model is refined by means of a probabilistic multi-view fusion framework, optimizing over an energy function that aggregates ToF depth sensor information with multi-view stereo and silhouette constraints. We obtain high-quality, dense, and detailed 3D models of scenes challenging for stereo alone, while simultaneously reducing the complex noise of ToF sensors.
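The core idea of combining a ToF data term with a stereo cost in one energy can be sketched as a per-pixel hypothesis selection. This is our own simplification (function names, weights, and the discrete hypothesis set are assumptions; the paper's actual framework is probabilistic and also includes silhouette constraints):

```python
import numpy as np

def fuse_depth(tof_depth, stereo_cost, hypotheses, w_tof=1.0, w_stereo=1.0):
    """Pick, per pixel, the depth hypothesis minimizing a combined energy.

    tof_depth:   (H, W) coarse ToF depth measurement
    stereo_cost: (K, H, W) multi-view photoconsistency cost per hypothesis
    hypotheses:  (K,) candidate depth values
    """
    # Data term: squared deviation from the ToF measurement, per hypothesis.
    tof_term = (hypotheses[:, None, None] - tof_depth[None]) ** 2
    energy = w_tof * tof_term + w_stereo * stereo_cost
    best = np.argmin(energy, axis=0)      # (H, W) index of best hypothesis
    return hypotheses[best]

# Toy example: 2x2 image, three depth hypotheses, uninformative stereo cost,
# so the ToF term alone decides and the fused map snaps to the ToF values.
tof = np.array([[1.0, 2.0], [2.0, 3.0]])
hyp = np.array([1.0, 2.0, 3.0])
cost = np.zeros((3, 2, 2))
fused = fuse_depth(tof, cost, hyp)
```

In the paper the two terms are balanced so that stereo refines detail where texture is available while the ToF term keeps the solution complete where it is not.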

174 citations


Proceedings ArticleDOI
20 Jun 2009
TL;DR: This work demonstrates how to robustly estimate camera poses without a need for bundle adjustment and proposes a multi-view stereo method which operates directly on panoramas, while enforcing the piecewise planarity constraints in the sweeping stage.
Abstract: City environments often lack textured areas, contain repetitive structures and strong lighting changes, and are therefore very difficult for standard 3D modeling pipelines. We present a novel unified framework for creating 3D city models which overcomes these difficulties by exploiting image segmentation cues as well as the presence of dominant scene orientations and piecewise planar structures. Given panoramic street view sequences, we first demonstrate how to robustly estimate camera poses without the need for bundle adjustment, and propose a multi-view stereo method which operates directly on panoramas while enforcing the piecewise planarity constraints in the sweeping stage. Finally, we propose a new depth fusion method which exploits the constraints of urban environments and combines the advantages of volumetric and viewpoint-based fusion methods. Our technique avoids expensive voxelization of space, operates directly on 3D reconstructed points through an effective kd-tree representation, and obtains the final surface by tessellation of back-projections of those points into the reference image.
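The point-based fusion step can be given a minimal flavour as clustering of nearby reconstructed 3D points into representatives. This sketch uses brute-force neighbour queries for clarity; the paper replaces them with a kd-tree so the fusion scales to city-sized point sets, and it additionally tessellates back-projections into a reference image (all names here are our own):

```python
import numpy as np

def merge_points(points, radius):
    """Greedily merge 3D points closer than `radius` into cluster centroids."""
    used = np.zeros(len(points), dtype=bool)
    merged = []
    for i in range(len(points)):
        if used[i]:
            continue
        # Gather all not-yet-merged points within `radius` of the seed point.
        near = np.linalg.norm(points - points[i], axis=1) <= radius
        near &= ~used
        used |= near
        merged.append(points[near].mean(axis=0))
    return np.array(merged)

# Two nearly coincident points and one distant point -> two fused points.
pts = np.array([[0.0, 0.0, 0.0], [0.01, 0.0, 0.0], [5.0, 5.0, 5.0]])
fused = merge_points(pts, radius=0.1)
```

Avoiding voxelization means memory grows with the number of reconstructed points rather than with the volume of the scene, which matters at street-sequence scale.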

159 citations


Proceedings ArticleDOI
01 Sep 2009
TL;DR: Presents an extensive experimental validation of a global gist descriptor computed for portions of panoramic images, together with a simple similarity measure between two panoramas that is robust to changes in vehicle orientation when traversing the same areas in different directions.
Abstract: In this paper we investigate large-scale view-based localization in urban areas using panoramic images. The presented approach utilizes a global gist descriptor computed for portions of panoramic images and a simple similarity measure between two panoramas, which is robust to changes in vehicle orientation while traversing the same areas in different directions. The global gist feature [14] has been demonstrated previously to be a very effective conventional image descriptor, capturing the basic structure of different types of scenes in a very compact way. We present an extensive experimental validation of our panoramic gist approach on a large-scale Street View data set of panoramic images for place recognition or topological localization.
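An orientation-robust similarity of the kind described can be sketched by comparing per-portion gist vectors under circular shifts and a reversed ordering. This is our own simplification (in reality the per-portion gist of a reversed traversal is also mirrored, which this sketch ignores):

```python
import numpy as np

def panorama_similarity(a, b):
    """Distance between two panoramas described by per-portion gist vectors.

    a, b: (N, D) arrays, one D-dim gist descriptor per angular portion.
    Minimizing over circular shifts (and the reversed portion order) makes
    the measure invariant to vehicle heading, including traversing the
    same street in the opposite direction.
    """
    n = a.shape[0]
    best = np.inf
    for c in (b, b[::-1]):                 # same and opposite direction
        for s in range(n):
            d = np.linalg.norm(a - np.roll(c, s, axis=0))
            best = min(best, d)
    return best

# A rotated copy of the same panorama should match (near-)perfectly.
rng = np.random.default_rng(0)
pano = rng.random((4, 8))
rotated = np.roll(pano, 2, axis=0)
```

The minimum over shifts is what decouples place identity from the heading of the vehicle at capture time.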

93 citations


Proceedings ArticleDOI
01 Sep 2009
TL;DR: The main novelty of this generative approach is the introduction of an explicit model of spatial co-occurrence of visual words associated with super-pixels and utilization of appearance, geometry and contextual cues in a probabilistic framework.
Abstract: We present a novel approach for image semantic segmentation of street scenes into coherent regions, while simultaneously categorizing each region as one of the predefined categories representing commonly encountered object and background classes. We formulate the segmentation on small blob-based superpixels and exploit a visual vocabulary tree as an intermediate image representation. The main novelty of this generative approach is the introduction of an explicit model of spatial co-occurrence of visual words associated with superpixels and the utilization of appearance, geometry, and contextual cues in a probabilistic framework. We demonstrate how individual cues contribute towards global segmentation accuracy and how their combination yields performance superior to the best known method on a challenging benchmark dataset which exhibits a diversity of street scenes with varying viewpoints and a large number of categories, captured in daylight and dusk.
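The raw statistic behind a spatial co-occurrence model of this kind can be sketched as counting which visual words appear on adjacent superpixels (a hypothetical helper; the paper's generative model would normalize such counts into probabilities and combine them with appearance and geometry cues):

```python
import numpy as np

def cooccurrence_counts(words, adjacency, vocab_size):
    """Count co-occurrences of visual words on neighbouring superpixels.

    words:     (S,) visual-word id assigned to each superpixel
    adjacency: iterable of (i, j) index pairs of adjacent superpixels
    Returns a symmetric (vocab_size, vocab_size) count matrix.
    """
    C = np.zeros((vocab_size, vocab_size), dtype=int)
    for i, j in adjacency:
        C[words[i], words[j]] += 1
        C[words[j], words[i]] += 1
    return C

# Three superpixels in a chain, labelled with words 0, 1, 1.
words = np.array([0, 1, 1])
adjacency = [(0, 1), (1, 2)]
C = cooccurrence_counts(words, adjacency, vocab_size=2)
```

Such counts capture context like "road words border sidewalk words", which per-superpixel appearance alone cannot express.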

79 citations


Book ChapterDOI
11 May 2009
TL;DR: A system is developed that uses high-level reasoning to influence the selection of landmarks along a navigation path and lower-level reasoning to select appropriate images of those landmarks, producing a more natural navigation plan and more understandable images in a fully automatic way.
Abstract: Computer vision techniques can enhance landmark-based navigation by better utilizing online photo collections. We use spatial reasoning to compute camera poses, which are then registered to the world using GPS information extracted from the image tags. Computed camera pose is used to augment the images with navigational arrows that fit the environment. We develop a system to use high-level reasoning to influence the selection of landmarks along a navigation path, and lower-level reasoning to select appropriate images of those landmarks. We also utilize an image matching pipeline based on robust local descriptors to give users of the system the ability to capture an image and receive navigational instructions overlaid on their current context. These enhancements to our previous navigation system produce a more natural navigation plan and more understandable images in a fully automatic way.
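A typical building block of a robust local-descriptor matching pipeline like the one mentioned is nearest-neighbour matching with a ratio test. This sketch is our own illustration, not the paper's implementation (the specific descriptors and thresholds used there are not stated here):

```python
import numpy as np

def match_descriptors(query, db, ratio=0.8):
    """Nearest-neighbour matching with Lowe's ratio test.

    query, db: (N, D) arrays of local descriptors. A match is kept only if
    the best neighbour is clearly closer than the second best, rejecting
    ambiguous matches on repetitive structures.
    """
    matches = []
    for qi, q in enumerate(query):
        d = np.linalg.norm(db - q, axis=1)   # distances to all db descriptors
        i1, i2 = np.argsort(d)[:2]
        if d[i1] < ratio * d[i2]:
            matches.append((qi, int(i1)))
    return matches

# One query descriptor with an unambiguous nearest neighbour in the database.
db = np.array([[0.0, 0.0], [10.0, 10.0]])
query = np.array([[0.1, 0.0]])
matches = match_descriptors(query, db)   # -> [(0, 0)]
```

Enough such matches against a registered landmark image are what allow navigational instructions to be overlaid on the user's own photograph.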

77 citations


Proceedings ArticleDOI
01 Sep 2009
TL;DR: An optimal view-selection algorithm for selecting a small set of views for texture mapping that best describe the structure, while minimizing warping and stitching artifacts, and producing a consistent visual representation is proposed.
Abstract: We present a method for automatically constructing compact, photo-realistic architectural 3D models. This method uses simple 3D building outlines obtained from existing GIS databases to bootstrap reconstruction and works with both structured and unstructured image datasets. We propose an optimal view-selection algorithm for selecting a small set of views for texture mapping that best describe the structure, while minimizing warping and stitching artifacts, and producing a consistent visual representation. The proposed method is fully automatic and can process large structured datasets in close to real-time, making it suitable for large scale urban modeling and 3D map construction.
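Selecting a small set of views that covers the structure is, at its core, a set-cover-style problem, which can be sketched with a greedy cost-effectiveness heuristic. All names and the cost model here are our own assumptions; the paper's optimal view-selection algorithm is not reproduced:

```python
import numpy as np

def select_views(coverage, costs):
    """Greedy cost-effective view selection (a set-cover heuristic).

    coverage: (V, F) boolean; coverage[v, f] = view v sees facade face f.
    costs:    (V,) per-view penalty, e.g. expected warping/stitching artifacts.
    Returns a small list of view indices covering every coverable face.
    """
    covered = np.zeros(coverage.shape[1], dtype=bool)
    chosen = []
    while True:
        gain = (coverage & ~covered).sum(axis=1)  # new faces each view adds
        if gain.max() == 0:
            break                                 # nothing left to cover
        ratio = np.full(len(costs), np.inf)
        ratio[gain > 0] = costs[gain > 0] / gain[gain > 0]
        v = int(np.argmin(ratio))                 # cheapest cost per new face
        chosen.append(v)
        covered |= coverage[v]
    return chosen

# Three candidate views over four facade faces; two views suffice.
coverage = np.array([[1, 1, 0, 0],
                     [0, 0, 1, 1],
                     [1, 0, 0, 0]], dtype=bool)
costs = np.array([1.0, 1.0, 1.0])
views = select_views(coverage, costs)   # -> [0, 1]
```

Folding artifact penalties into the per-view cost is one way to trade coverage against warping and stitching quality, in the spirit of the objective described.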

22 citations