Author

Peter Rander

Other affiliations: Carnegie Mellon University
Bio: Peter Rander is an academic researcher from Uber. The author has contributed to research in topics: Visual odometry & Mobile robot. The author has an h-index of 24, co-authored 52 publications receiving 3,654 citations. Previous affiliations of Peter Rander include Carnegie Mellon University.


Papers
Journal ArticleDOI
TL;DR: In this paper, the authors present Virtualized Reality, a new visual medium that immerses viewers in a virtual reconstruction of real-world events, consisting of real images and depth information computed from these images.
Abstract: A new visual medium, Virtualized Reality, immerses viewers in a virtual reconstruction of real-world events. The Virtualized Reality world model consists of real images and depth information computed from these images. Stereoscopic reconstructions provide a sense of complete immersion, and users can select their own viewpoints at view time, independent of the actual camera positions used to capture the event.

677 citations

Journal ArticleDOI
TL;DR: Three algorithms are described, the first two for computing scene flow from optical flows and the third for constraining scene structure from the inconsistencies in multiple optical flows.
Abstract: Just as optical flow is the two-dimensional motion of points in an image, scene flow is the three-dimensional motion of points in the world. The fundamental difficulty with optical flow is that only the normal flow can be computed directly from the image measurements, without some form of smoothing or regularization. In this paper, we begin by showing that the same fundamental limitation applies to scene flow, no matter how many cameras are used to image the scene. There are then two choices when computing scene flow: 1) perform the regularization in the images or 2) perform the regularization on the surface of the object in the scene. In this paper, we choose to compute scene flow using regularization in the images. We describe three algorithms, the first two for computing scene flow from optical flows and the third for constraining scene structure from the inconsistencies in multiple optical flows.

520 citations

Proceedings ArticleDOI
20 Sep 1999
TL;DR: This work presents a framework for the computation of dense, non-rigid scene flow from optical flow and shows that multiple estimates of the normal flow cannot be used to estimate dense scene flow directly without some form of smoothing or regularization.
Abstract: Scene flow is the three-dimensional motion field of points in the world, just as optical flow is the two-dimensional motion field of points in an image. Any optical flow is simply the projection of the scene flow onto the image plane of a camera. We present a framework for the computation of dense, non-rigid scene flow from optical flow. Our approach leads to straightforward linear algorithms and a classification of the task into three major scenarios: complete instantaneous knowledge of the scene structure; knowledge only of correspondence information; and no knowledge of the scene structure. We also show that multiple estimates of the normal flow cannot be used to estimate dense scene flow directly without some form of smoothing or regularization.
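The core relation above (each optical flow is the image-plane projection of the scene flow) can be sketched for a pinhole camera. The following is a minimal illustration, not the paper's algorithm: the unit focal length, the axis-aligned two-camera setup, and all function names are assumptions made here.

```python
import numpy as np

def projection_jacobian(X, f=1.0):
    """2x3 Jacobian of the pinhole projection (x, y) = f * (X/Z, Y/Z)."""
    Xc, Yc, Zc = X
    return np.array([[f / Zc, 0.0, -f * Xc / Zc**2],
                     [0.0, f / Zc, -f * Yc / Zc**2]])

def optical_flow_of(X, dX, f=1.0):
    """Optical flow induced at the image of point X by 3D scene flow dX."""
    return projection_jacobian(X, f) @ dX

# With axis-aligned cameras that differ only by a translation t, each view
# contributes two linear constraints u_i = J_i dX; stacking constraints from
# two views gives an overdetermined linear system whose solution is dX.
X = np.array([0.1, -0.2, 2.0])          # point in camera-1 coordinates
t = np.array([0.5, 0.0, 0.0])           # camera-2 position (same orientation)
dX = np.array([0.01, 0.02, -0.03])      # true scene flow

J = np.vstack([projection_jacobian(X), projection_jacobian(X - t)])
u = np.concatenate([optical_flow_of(X, dX), optical_flow_of(X - t, dX)])
recovered, *_ = np.linalg.lstsq(J, u, rcond=None)
print(np.allclose(recovered, dX))       # True
```

A single camera yields only two constraints on the three unknowns of dX, which is the dense analogue of the aperture problem the abstract refers to; the second view makes the system full rank.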

335 citations

Proceedings ArticleDOI
04 Jan 1998
TL;DR: Virtualized Reality, a technique to create virtual worlds out of dynamic events using densely distributed stereo views, combines the intensity image and depth map for each camera view at each time instant to form a Visible Surface Model.
Abstract: We present Virtualized Reality, a technique to create virtual worlds out of dynamic events using densely distributed stereo views. The intensity image and depth map for each camera view at each time instant are combined to form a Visible Surface Model. Immersive interaction with the virtualized event is possible using a dense collection of such models. Additionally, a Complete Surface Model of each instant can be built by merging the depth maps from different cameras into a common volumetric space. The corresponding model is compatible with traditional virtual models and can be interacted with immersively using standard tools. Because both VSMs and CSMs are fully three-dimensional, virtualized models can also be combined and modified to build larger, more complex environments, an important capability for many non-trivial applications. We present results from 3D Dome, our facility to create virtualized models.
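The merge of per-camera depth maps into a common volumetric space can be illustrated with a simple truncated signed-distance average over voxel centers. This is a simplified stand-in under assumptions made here (the TSDF-style formulation, the toy orthographic camera, and all names are illustrative), not the paper's actual Complete Surface Model algorithm.

```python
import numpy as np

def fuse_depth_maps(depth_maps, cameras, voxels, trunc=0.05):
    """Average truncated signed distances from several depth maps over a
    set of voxel centers, producing one volumetric field whose zero level
    approximates the merged surface."""
    tsdf = np.zeros(len(voxels))
    weight = np.zeros(len(voxels))
    for depth, cam in zip(depth_maps, cameras):
        for i, p in enumerate(voxels):
            u, v, z = cam(p)                      # project voxel into the view
            if 0 <= v < depth.shape[0] and 0 <= u < depth.shape[1]:
                d = depth[int(v), int(u)] - z     # signed distance to surface
                if d > -trunc:                    # skip voxels far behind it
                    tsdf[i] += np.clip(d, -trunc, trunc)
                    weight[i] += 1.0
    return np.divide(tsdf, weight, out=np.zeros_like(tsdf), where=weight > 0)

# Toy orthographic "camera" and a flat surface at depth 0.5: a voxel in
# front of the surface gets a positive (truncated) distance, while a voxel
# well behind it is dropped from the average.
cam = lambda p: (p[0], p[1], p[2])
depth = np.full((2, 2), 0.5)
voxels = [np.array([0.0, 0.0, 0.4]), np.array([0.0, 0.0, 0.6])]
fused = fuse_depth_maps([depth], [cam], voxels)
print(fused)   # front voxel clipped to +trunc, rear voxel zero-weighted
```

With several real cameras, each depth map votes in the shared grid and the weighted average suppresses per-view noise, which is what makes the merged model compatible with standard mesh-extraction tools.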

302 citations

Patent
20 Jun 1996
TL;DR: In this paper, a method of creating virtual reality from images of a real event is described, comprising the steps of capturing a plurality of images of each time instant of a real event using a plurality of cameras positioned at a plurality of angles.
Abstract: A method of virtualizing reality, i.e., a method of creating virtual reality from images of a real event, comprises the steps of capturing a plurality of images of each time instant of a real event using a plurality of cameras positioned at a plurality of angles. Each image is stored as intensity and/or color information. A suitable internal representation is computed from these images and the information regarding the camera angles. An image of each time instant may then be generated from any viewing angle using this internal representation. The virtual viewpoints could be displayed on a single TV screen or using a stereoscopic display device for a true three-dimensional effect. The event thus virtualized can be navigated through, and interacted with, using any virtual reality system.

227 citations


Cited by
Proceedings ArticleDOI
01 Aug 1996
TL;DR: This paper describes a sampled representation for light fields that allows for both efficient creation and display of inward and outward looking views, and describes a compression system that is able to compress the light fields generated by more than a factor of 100:1 with very little loss of fidelity.
Abstract: A number of techniques have been proposed for flying through scenes by redisplaying previously rendered or digitized views. Techniques have also been proposed for interpolating between views by warping input images, using depth information or correspondences between multiple images. In this paper, we describe a simple and robust method for generating new views from arbitrary camera positions without depth information or feature matching, simply by combining and resampling the available images. The key to this technique lies in interpreting the input images as 2D slices of a 4D function: the light field. This function completely characterizes the flow of light through unobstructed space in a static scene with fixed illumination. We describe a sampled representation for light fields that allows for both efficient creation and display of inward and outward looking views. We have created light fields from large arrays of both rendered and digitized images. The latter are acquired using a video camera mounted on a computer-controlled gantry. Once a light field has been created, new views may be constructed in real time by extracting slices in appropriate directions. Since the success of the method depends on having a high sample rate, we describe a compression system that is able to compress the light fields we have generated by more than a factor of 100:1 with very little loss of fidelity. We also address the issues of antialiasing during creation, and resampling during slice extraction. CR Categories: I.3.2 [Computer Graphics]: Picture/Image Generation — Digitizing and scanning, Viewing algorithms; I.4.2 [Computer Graphics]: Compression — Approximate methods. Additional keywords: image-based rendering, light field, holographic stereogram, vector quantization, epipolar analysis
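The slice-extraction step can be sketched with a two-plane light field stored as a 4D array L[u, v, s, t], where (u, v) indexes camera positions and (s, t) indexes pixels: a novel view between captured positions is obtained by interpolating across the camera plane. The array layout and bilinear-only (rather than full quadralinear) interpolation are simplifying assumptions made here, not the paper's exact parameterization.

```python
import numpy as np

def render_view(L, u, v):
    """Bilinearly interpolate the camera plane of a two-plane light field
    L[u, v, s, t] to synthesize a view from a virtual position (u, v)."""
    U, V = L.shape[:2]
    u0, v0 = int(np.floor(u)), int(np.floor(v))
    u1, v1 = min(u0 + 1, U - 1), min(v0 + 1, V - 1)
    fu, fv = u - u0, v - v0
    return ((1 - fu) * (1 - fv) * L[u0, v0] + fu * (1 - fv) * L[u1, v0] +
            (1 - fu) * fv * L[u0, v1] + fu * fv * L[u1, v1])

# Two captured views (s-t images) at u = 0 and u = 1; a virtual camera
# halfway between them blends the two images equally.
L = np.zeros((2, 1, 2, 2))
L[0, 0] = 0.0
L[1, 0] = 2.0
view = render_view(L, 0.5, 0.0)
print(view)   # every pixel is 1.0
```

Note that no depth or correspondence information is used anywhere: new views come purely from resampling the stored radiance samples, which is the point the abstract makes.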

4,426 citations

Book
30 Sep 2010
TL;DR: Computer Vision: Algorithms and Applications explores the variety of techniques commonly used to analyze and interpret images and takes a scientific approach to basic vision problems, formulating physical models of the imaging process before inverting them to produce descriptions of a scene.
Abstract: Humans perceive the three-dimensional structure of the world with apparent ease. However, despite all of the recent advances in computer vision research, the dream of having a computer interpret an image at the same level as a two-year-old remains elusive. Why is computer vision such a challenging problem, and what is the current state of the art? Computer Vision: Algorithms and Applications explores the variety of techniques commonly used to analyze and interpret images. It also describes challenging real-world applications where vision is being successfully used, both for specialized applications such as medical imaging, and for fun, consumer-level tasks such as image editing and stitching, which students can apply to their own personal photos and videos. More than just a source of recipes, this exceptionally authoritative and comprehensive textbook/reference also takes a scientific approach to basic vision problems, formulating physical models of the imaging process before inverting them to produce descriptions of a scene. These problems are also analyzed using statistical models and solved using rigorous engineering techniques. Topics and features: structured to support active curricula and project-oriented courses, with tips in the Introduction for using the book in a variety of customized courses; presents exercises at the end of each chapter with a heavy emphasis on testing algorithms and containing numerous suggestions for small mid-term projects; provides additional material and more detailed mathematical topics in the Appendices, which cover linear algebra, numerical techniques, and Bayesian estimation theory; suggests additional reading at the end of each chapter, including the latest research in each sub-field, in addition to a full Bibliography at the end of the book; supplies supplementary course material for students at the associated website, http://szeliski.org/Book/.
Suitable for an upper-level undergraduate or graduate-level course in computer science or engineering, this textbook focuses on basic techniques that work under real-world conditions and encourages students to push their creative boundaries. Its design and exposition also make it eminently suitable as a unique reference to the fundamental techniques and current research literature in computer vision.

4,146 citations

Journal ArticleDOI
TL;DR: This work refers the reader to the original survey for descriptions of potential applications, summaries of AR system characteristics, and an introduction to the crucial problem of registration, including sources of registration error and error-reduction strategies.
Abstract: In 1997, Azuma published a survey on augmented reality (AR). Our goal is to complement, rather than replace, the original survey by presenting representative examples of the new advances. We refer the reader to the original survey for descriptions of potential applications (such as medical visualization, maintenance and repair of complex equipment, annotation, and path planning); summaries of AR system characteristics (such as the advantages and disadvantages of optical and video approaches to blending virtual and real, problems in display focus and contrast, and system portability); and an introduction to the crucial problem of registration, including sources of registration error and error-reduction strategies.

3,624 citations

Proceedings ArticleDOI
26 Mar 2000
TL;DR: The problem space for facial expression analysis is described, which includes level of description, transitions among expressions, eliciting conditions, reliability and validity of training and test data, individual differences in subjects, head orientation and scene complexity, image characteristics, and relation to non-verbal behavior.
Abstract: Within the past decade, significant effort has occurred in developing methods of facial expression analysis. Because most investigators have used relatively limited data sets, the generalizability of these various methods remains unknown. We describe the problem space for facial expression analysis, which includes level of description, transitions among expressions, eliciting conditions, reliability and validity of training and test data, individual differences in subjects, head orientation and scene complexity, image characteristics, and relation to non-verbal behavior. We then present the CMU-Pittsburgh AU-Coded Face Expression Image Database, which currently includes 2105 digitized image sequences from 182 adult subjects of varying ethnicity, performing multiple tokens of most primary FACS action units. This database is the most comprehensive testbed to date for comparative studies of facial expression analysis.

2,705 citations

Proceedings ArticleDOI
17 Jun 2006
TL;DR: This paper first surveys multi-view stereo algorithms and compares them qualitatively using a taxonomy that differentiates their key properties, then describes the process for acquiring and calibrating multiview image datasets with high-accuracy ground truth and introduces the evaluation methodology.
Abstract: This paper presents a quantitative comparison of several multi-view stereo reconstruction algorithms. Until now, the lack of suitable calibrated multi-view image datasets with known ground truth (3D shape models) has prevented such direct comparisons. In this paper, we first survey multi-view stereo algorithms and compare them qualitatively using a taxonomy that differentiates their key properties. We then describe our process for acquiring and calibrating multiview image datasets with high-accuracy ground truth and introduce our evaluation methodology. Finally, we present the results of our quantitative comparison of state-of-the-art multi-view stereo reconstruction algorithms on six benchmark datasets. The datasets, evaluation details, and instructions for submitting new models are available online at http://vision.middlebury.edu/mview.
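Quantitative comparisons of this kind typically score a reconstruction on two axes: accuracy (how close reconstructed geometry is to the ground truth) and completeness (how much of the ground truth the reconstruction covers). The sketch below illustrates those two point-based measures under assumptions made here; the function names, the brute-force nearest-neighbour search, and the thresholds are illustrative, and the benchmark's exact protocol is documented on the project page linked above.

```python
import numpy as np

def accuracy(recon, gt, f=0.9):
    """Distance d such that a fraction f of reconstructed points lie
    within d of the nearest ground-truth point (smaller is better)."""
    d = np.min(np.linalg.norm(recon[:, None] - gt[None, :], axis=-1), axis=1)
    return np.quantile(d, f)

def completeness(recon, gt, tol=0.01):
    """Fraction of ground-truth points lying within tol of the
    reconstruction (larger is better)."""
    d = np.min(np.linalg.norm(gt[:, None] - recon[None, :], axis=-1), axis=1)
    return float(np.mean(d < tol))

# A reconstruction uniformly shifted by 5 mm, scored against a 1 cm
# completeness tolerance: accurate to ~5 mm and fully complete.
gt = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [2.0, 0.0, 0.0]])
recon = gt + np.array([0.005, 0.0, 0.0])
print(accuracy(recon, gt), completeness(recon, gt))
```

Reporting the two numbers separately matters: a sparse but precise model can score well on accuracy while failing completeness, and an over-filled model can do the reverse.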

2,556 citations