Home
/
Authors
/
Pratul P. Srinivasan

Author

Pratul P. Srinivasan

Other affiliations: Duke University, University of California, Google

Bio: Pratul P. Srinivasan is an academic researcher from University of California, Berkeley. The author has contributed to research in topics: Rendering (computer graphics) & View synthesis. The author has an hindex of 23, co-authored 48 publications receiving 2580 citations. Previous affiliations of Pratul P. Srinivasan include Duke University & University of California.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2015
2014
2013

Papers

PDF

Open Access

More filters

Posted Content•

NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis

[...]

Ben Mildenhall¹, Pratul P. Srinivasan¹, Matthew Tancik¹, Jonathan T. Barron², Ravi Ramamoorthi³, Ren Ng¹ - Show less +2 more•Institutions (3)

University of California, Berkeley¹, Google², University of California, San Diego³

19 Mar 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: This work describes how to effectively optimize neural radiance fields to render photorealistic novel views of scenes with complicated geometry and appearance, and demonstrates results that outperform prior work on neural rendering and view synthesis.

...read moreread less

Abstract: We present a method that achieves state-of-the-art results for synthesizing novel views of complex scenes by optimizing an underlying continuous volumetric scene function using a sparse set of input views. Our algorithm represents a scene using a fully-connected (non-convolutional) deep network, whose input is a single continuous 5D coordinate (spatial location $(x,y,z)$ and viewing direction $(\theta, \phi)$) and whose output is the volume density and view-dependent emitted radiance at that spatial location. We synthesize views by querying 5D coordinates along camera rays and use classic volume rendering techniques to project the output colors and densities into an image. Because volume rendering is naturally differentiable, the only input required to optimize our representation is a set of images with known camera poses. We describe how to effectively optimize neural radiance fields to render photorealistic novel views of scenes with complicated geometry and appearance, and demonstrate results that outperform prior work on neural rendering and view synthesis. View synthesis results are best viewed as videos, so we urge readers to view our supplementary video for convincing comparisons.

...read moreread less

2,435 citations

Book Chapter•DOI•

NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis

[...]

Ben Mildenhall¹, Pratul P. Srinivasan¹, Matthew Tancik¹, Jonathan T. Barron², Ravi Ramamoorthi³, Ren Ng¹ - Show less +2 more•Institutions (3)

University of California, Berkeley¹, Google², University of California, San Diego³

23 Aug 2020

TL;DR: In this article, a fully-connected (non-convolutional) deep network is used to synthesize novel views of complex scenes by optimizing an underlying continuous volumetric scene function using a sparse set of input views.

...read moreread less

Abstract: We present a method that achieves state-of-the-art results for synthesizing novel views of complex scenes by optimizing an underlying continuous volumetric scene function using a sparse set of input views. Our algorithm represents a scene using a fully-connected (non-convolutional) deep network, whose input is a single continuous 5D coordinate (spatial location (x, y, z) and viewing direction $(\theta ,\phi )$) and whose output is the volume density and view-dependent emitted radiance at that spatial location. We synthesize views by querying 5D coordinates along camera rays and use classic volume rendering techniques to project the output colors and densities into an image. Because volume rendering is naturally differentiable, the only input required to optimize our representation is a set of images with known camera poses. We describe how to effectively optimize neural radiance fields to render photorealistic novel views of scenes with complicated geometry and appearance, and demonstrate results that outperform prior work on neural rendering and view synthesis. View synthesis results are best viewed as videos, so we urge readers to view our supplementary video for convincing comparisons.

...read moreread less

951 citations

Posted Content•

Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains

[...]

Matthew Tancik¹, Pratul P. Srinivasan¹, Ben Mildenhall¹, Sara Fridovich-Keil¹, Nithin Raghavan, Utkarsh Singhal¹, Ravi Ramamoorthi², Jonathan T. Barron³, Ren Ng¹ - Show less +5 more•Institutions (3)

University of California, Berkeley¹, University of California, San Diego², Google³

18 Jun 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: An approach for selecting problem-specific Fourier features that greatly improves the performance of MLPs for low-dimensional regression tasks relevant to the computer vision and graphics communities is suggested.

...read moreread less

Abstract: We show that passing input points through a simple Fourier feature mapping enables a multilayer perceptron (MLP) to learn high-frequency functions in low-dimensional problem domains These results shed light on recent advances in computer vision and graphics that achieve state-of-the-art results by using MLPs to represent complex 3D objects and scenes Using tools from the neural tangent kernel (NTK) literature, we show that a standard MLP fails to learn high frequencies both in theory and in practice To overcome this spectral bias, we use a Fourier feature mapping to transform the effective NTK into a stationary kernel with a tunable bandwidth We suggest an approach for selecting problem-specific Fourier features that greatly improves the performance of MLPs for low-dimensional regression tasks relevant to the computer vision and graphics communities

...read moreread less

787 citations

Proceedings Article•DOI•

IBRNet: Learning Multi-View Image-Based Rendering

[...]

Qianqian Wang¹, Zhicheng Wang¹, Kyle Genova¹, Pratul P. Srinivasan¹, Howard Zhou¹, Jonathan T. Barron¹, Ricardo Martin-Brualla¹, Noah Snavely¹, Thomas Funkhouser¹ - Show less +5 more•Institutions (1)

Google¹

20 Jun 2021

TL;DR: A method that synthesizes novel views of complex scenes by interpolating a sparse set of nearby views using a network architecture that includes a multilayer perceptron and a ray transformer that estimates radiance and volume density at continuous 5D locations.

...read moreread less

Abstract: We present a method that synthesizes novel views of complex scenes by interpolating a sparse set of nearby views. The core of our method is a network architecture that includes a multilayer perceptron and a ray transformer that estimates radiance and volume density at continuous 5D locations (3D spatial locations and 2D viewing directions), drawing appearance information on the fly from multiple source views. By drawing on source views at render time, our method hearkens back to classic work on image-based rendering (IBR), and allows us to render high-resolution imagery. Unlike neural scene representation work that optimizes per-scene functions for rendering, we learn a generic view interpolation function that generalizes to novel scenes. We render images using classic volume rendering, which is fully differentiable and allows us to train using only multi-view posed images as supervision. Experiments show that our method outperforms recent novel view synthesis methods that also seek to generalize to novel scenes. Further, if fine-tuned on each scene, our method is competitive with state-of-the-art single-scene neural rendering methods.1

...read moreread less

402 citations

Journal Article•DOI•

Local light field fusion: practical view synthesis with prescriptive sampling guidelines

[...]

Ben Mildenhall¹, Pratul P. Srinivasan¹, Rodrigo Ortiz-Cayon, Nima Khademi Kalantari², Ravi Ramamoorthi¹, Ren Ng¹, Abhishek Kar - Show less +3 more•Institutions (2)

University of California¹, Texas A&M University²

12 Jul 2019-ACM Transactions on Graphics

TL;DR: An algorithm for view synthesis from an irregular grid of sampled views that first expands each sampled view into a local light field via a multiplane image (MPI) scene representation, then renders novel views by blending adjacent local light fields.

...read moreread less

Abstract: We present a practical and robust deep learning solution for capturing and rendering novel views of complex real world scenes for virtual exploration. Previous approaches either require intractably dense view sampling or provide little to no guidance for how users should sample views of a scene to reliably render high-quality novel views. Instead, we propose an algorithm for view synthesis from an irregular grid of sampled views that first expands each sampled view into a local light field via a multiplane image (MPI) scene representation, then renders novel views by blending adjacent local light fields. We extend traditional plenoptic sampling theory to derive a bound that specifies precisely how densely users should sample views of a given scene when using our algorithm. In practice, we apply this bound to capture and render views of real world scenes that achieve the perceptual quality of Nyquist rate view sampling while using up to 4000X fewer views. We demonstrate our approach's practicality with an augmented reality smart-phone app that guides users to capture input images of a scene and viewers that enable realtime virtual exploration on desktop and mobile platforms.

...read moreread less

400 citations

1
2
3
4
…
5
6
7
8
9
10
11
12

Collapse

Cited by

PDF

Open Access

More filters

Posted Content•

NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis

[...]

Ben Mildenhall¹, Pratul P. Srinivasan¹, Matthew Tancik¹, Jonathan T. Barron², Ravi Ramamoorthi³, Ren Ng¹ - Show less +2 more•Institutions (3)

University of California, Berkeley¹, Google², University of California, San Diego³

19 Mar 2020-arXiv: Computer Vision and Pattern Recognition

...read moreread less

2,435 citations

Journal Article•DOI•

Clinically applicable deep learning for diagnosis and referral in retinal disease

[...]

Jeffrey De Fauw, Joseph R. Ledsam, Bernardino Romera-Paredes, Stanislav Nikolov, Nenad Tomasev, Sam Blackwell, Harry Askham, Xavier Glorot, Brendan O'Donoghue, Daniel Visentin, George van den Driessche, Balaji Lakshminarayanan, Clemens Meyer, Faith Mackinder, Simon Bouton, Kareem Ayoub, Reena Chopra¹, Dominic King, Alan Karthikesalingam, Cian Hughes², Rosalind Raine², Julian Hughes¹, Dawn A Sim¹, Catherine A Egan¹, Adnan Tufail¹, Hugh Montgomery², Demis Hassabis, Geraint Rees², Trevor Back, Peng T. Khaw¹, Mustafa Suleyman, Julien Cornebise², Pearse A. Keane¹, Olaf Ronneberger - Show less +30 more•Institutions (2)

UCL Institute of Ophthalmology¹, University College London²

13 Aug 2018-Nature Medicine

TL;DR: A novel deep learning architecture performs device-independent tissue segmentation of clinical 3D retinal images followed by separate diagnostic classification that meets or exceeds human expert clinical diagnoses of retinal disease.

...read moreread less

Abstract: The volume and complexity of diagnostic imaging is increasing at a pace faster than the availability of human expertise to interpret it. Artificial intelligence has shown great promise in classifying two-dimensional photographs of some common diseases and typically relies on databases of millions of annotated images. Until now, the challenge of reaching the performance of expert clinicians in a real-world clinical pathway with three-dimensional diagnostic scans has remained unsolved. Here, we apply a novel deep learning architecture to a clinically heterogeneous set of three-dimensional optical coherence tomography scans from patients referred to a major eye hospital. We demonstrate performance in making a referral recommendation that reaches or exceeds that of experts on a range of sight-threatening retinal diseases after training on only 14,884 scans. Moreover, we demonstrate that the tissue segmentations produced by our architecture act as a device-independent representation; referral accuracy is maintained when using tissue segmentations from a different type of device. Our work removes previous barriers to wider clinical use without prohibitive training data requirements across multiple pathologies in a real-world setting.

...read moreread less

1,665 citations

Journal Article•

Optical Coherence Tomography of the Human Retina

[...]

Michael R. Hee, Joseph A. Izatt, Eric A. Swanson, David Huang, Joel S. Schuman, Charles P. Lin, Carmen A. Puliafito, James G. Fujimoto - Show less +4 more

01 Jan 2001-SPIE milestone series

TL;DR: In this article, optical coherence tomography is used for high-resolution, noninvasive imaging of the human retina, including the macula and optic nerve head in normal human subjects.

...read moreread less

Abstract: Objective: To demonstrate optical coherence tomography for high-resolution, noninvasive imaging of the human retina. Optical coherence tomography is a new imaging technique analogous to ultrasound B scan that can provide cross-sectional images of the retina with micrometer-scale resolution. Design: Survey optical coherence tomographic examination of the retina, including the macula and optic nerve head in normal human subjects. Settings Research laboratory. Participants: Convenience sample of normal human subjects. Main Outcome Measures: Correlation of optical coherence retinal tomographs with known normal retinal anatomy. Results: Optical coherence tomographs can discriminate the cross-sectional morphologic features of the fovea and optic disc, the layered structure of the retina, and normal anatomic variations in retinal and retinal nerve fiber layer thicknesses with 10- μm depth resolution. Conclusion: Optical coherence tomography is a potentially useful technique for high depth resolution, cross-sectional examination of the fundus.

...read moreread less

1,409 citations

Posted Content•

Score-Based Generative Modeling through Stochastic Differential Equations

[...]

Yang Song¹, Jascha Sohl-Dickstein², Diederik P. Kingma², Abhishek Kumar², Stefano Ermon¹, Ben Poole² - Show less +2 more•Institutions (2)

Stanford University¹, Google²

26 Nov 2020-arXiv: Learning

TL;DR: This work presents a stochastic differential equation (SDE) that smoothly transforms a complex data distribution to a known prior distribution by slowly injecting noise, and a corresponding reverse-time SDE that transforms the prior distribution back into the data distribution by Slowly removing the noise.

...read moreread less

Abstract: Creating noise from data is easy; creating data from noise is generative modeling. We present a stochastic differential equation (SDE) that smoothly transforms a complex data distribution to a known prior distribution by slowly injecting noise, and a corresponding reverse-time SDE that transforms the prior distribution back into the data distribution by slowly removing the noise. Crucially, the reverse-time SDE depends only on the time-dependent gradient field (\aka, score) of the perturbed data distribution. By leveraging advances in score-based generative modeling, we can accurately estimate these scores with neural networks, and use numerical SDE solvers to generate samples. We show that this framework encapsulates previous approaches in score-based generative modeling and diffusion probabilistic modeling, allowing for new sampling procedures and new modeling capabilities. In particular, we introduce a predictor-corrector framework to correct errors in the evolution of the discretized reverse-time SDE. We also derive an equivalent neural ODE that samples from the same distribution as the SDE, but additionally enables exact likelihood computation, and improved sampling efficiency. In addition, we provide a new way to solve inverse problems with score-based models, as demonstrated with experiments on class-conditional generation, image inpainting, and colorization. Combined with multiple architectural improvements, we achieve record-breaking performance for unconditional image generation on CIFAR-10 with an Inception score of 9.89 and FID of 2.20, a competitive likelihood of 2.99 bits/dim, and demonstrate high fidelity generation of 1024 x 1024 images for the first time from a score-based generative model.

...read moreread less

1,174 citations

Journal Article•DOI•

Physics-informed machine learning

[...]

George Em Karniadakis¹, Ioannis G. Kevrekidis², Lu Lu³, Paris Perdikaris⁴, Sifan Wang⁴, Liu Yang¹ - Show less +2 more•Institutions (4)

Brown University¹, Johns Hopkins University², Massachusetts Institute of Technology³, University of Pennsylvania⁴

01 Jun 2021

TL;DR: Some of the prevailing trends in embedding physics into machine learning are reviewed, some of the current capabilities and limitations are presented and diverse applications of physics-informed learning both for forward and inverse problems, including discovering hidden physics and tackling high-dimensional problems are discussed.

...read moreread less

Abstract: Despite great progress in simulating multiphysics problems using the numerical discretization of partial differential equations (PDEs), one still cannot seamlessly incorporate noisy data into existing algorithms, mesh generation remains complex, and high-dimensional problems governed by parameterized PDEs cannot be tackled. Moreover, solving inverse problems with hidden physics is often prohibitively expensive and requires different formulations and elaborate computer codes. Machine learning has emerged as a promising alternative, but training deep neural networks requires big data, not always available for scientific problems. Instead, such networks can be trained from additional information obtained by enforcing the physical laws (for example, at random points in the continuous space-time domain). Such physics-informed learning integrates (noisy) data and mathematical models, and implements them through neural networks or other kernel-based regression networks. Moreover, it may be possible to design specialized network architectures that automatically satisfy some of the physical invariants for better accuracy, faster training and improved generalization. Here, we review some of the prevailing trends in embedding physics into machine learning, present some of the current capabilities and limitations and discuss diverse applications of physics-informed learning both for forward and inverse problems, including discovering hidden physics and tackling high-dimensional problems. The rapidly developing field of physics-informed learning integrates data and mathematical models seamlessly, enabling accurate inference of realistic and high-dimensional multiphysics problems. This Review discusses the methodology and provides diverse examples and an outlook for further developments.

...read moreread less

1,114 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse