Proceedings ArticleDOI

A Unified Deep Learning Approach for Foveated Rendering & Novel View Synthesis from Sparse RGB-D Light Fields

TLDR
In this paper, an end-to-end convolutional neural network was designed to perform both foveated reconstruction and view synthesis using only 1.2% of the total light field data.
Abstract
Near-eye light field displays provide a solution to visual discomfort when using head-mounted displays by presenting accurate depth and focal cues. However, light field HMDs require rendering the scene from a large number of viewpoints. This paper tackles the computational challenge of rendering sharp imagery in the foveal region while reproducing the retinal defocus blur that correctly drives accommodation. We designed a novel end-to-end convolutional neural network that leverages properties of human vision to perform both foveated reconstruction and view synthesis using only 1.2% of the total light field data. The proposed architecture comprises a log-polar sampling scheme followed by an interpolation stage and a convolutional neural network. To the best of our knowledge, this is the first attempt to synthesize the entire light field from sparse RGB-D inputs while simultaneously addressing foveated rendering for computational displays. Our algorithm achieves high fidelity in the fovea without perceptible artifacts in the peripheral regions. The performance in the fovea is comparable to state-of-the-art view synthesis methods, despite using around 10x less light field data.
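To make the three stages named in the abstract concrete, the sketch below strings together a gaze-centred log-polar sampler, an interpolation step back to the display grid, and a small CNN. This is not the authors' architecture: the function names, sampling resolution (64x64), and network layers are illustrative assumptions only, using NumPy and PyTorch.

```python
# Hedged sketch (not the paper's code): log-polar sampling of an RGB-D view,
# interpolation to a Cartesian grid, and a toy CNN that refines the result.
import numpy as np
import torch
import torch.nn as nn
import torch.nn.functional as F


def log_polar_sample(rgbd, gaze_xy, n_rho=64, n_theta=64):
    """Sample an H x W x 4 RGB-D image on a log-polar grid centred at the gaze.

    Radial sample density falls off logarithmically with eccentricity,
    mimicking the acuity fall-off of the retina.
    """
    h, w, _ = rgbd.shape
    gx, gy = gaze_xy
    r_max = np.hypot(max(gx, w - gx), max(gy, h - gy))
    rho = np.exp(np.linspace(0.0, np.log(r_max), n_rho))      # log-spaced radii
    theta = np.linspace(0.0, 2.0 * np.pi, n_theta, endpoint=False)
    rr, tt = np.meshgrid(rho, theta, indexing="ij")
    xs = np.clip(gx + rr * np.cos(tt), 0, w - 1).astype(int)
    ys = np.clip(gy + rr * np.sin(tt), 0, h - 1).astype(int)
    return rgbd[ys, xs]  # nearest-pixel lookup for brevity -> (n_rho, n_theta, 4)


def cartesian_interpolate(lp_samples, out_hw):
    """Crude stand-in for the interpolation stage: bilinear resize to the display grid."""
    t = torch.from_numpy(lp_samples).float().permute(2, 0, 1).unsqueeze(0)
    return F.interpolate(t, size=out_hw, mode="bilinear", align_corners=False)


class FoveatedReconstructionNet(nn.Module):
    """Toy encoder-style CNN standing in for the paper's reconstruction network."""

    def __init__(self, in_ch=4, out_ch=3):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, 32, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, 32, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, out_ch, 3, padding=1),
        )

    def forward(self, x):
        return self.net(x)


if __name__ == "__main__":
    rgbd = np.random.rand(512, 512, 4).astype(np.float32)   # stand-in RGB-D view
    lp = log_polar_sample(rgbd, gaze_xy=(256, 256))          # 64x64 samples (~1.6% of pixels)
    coarse = cartesian_interpolate(lp, out_hw=(512, 512))
    view = FoveatedReconstructionNet()(coarse)               # refined RGB view
    print(view.shape)  # torch.Size([1, 3, 512, 512])
```

In the actual method this reconstruction would be repeated (or batched) across viewpoints to produce the full light field; the sketch shows a single view only.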


Citations
Journal ArticleDOI

An integrative view of foveated rendering

TL;DR: This survey presents an integrative view of foveated rendering, which adapts the image synthesis process to the user's gaze by exploiting the human visual system's limitations, in particular its reduced acuity in peripheral vision, to deliver high-quality visual experiences at greatly reduced computational, storage, and transmission costs.
Journal ArticleDOI

2T-UNET: A Two-Tower UNet with Depth Clues for Robust Stereo Depth Estimation

TL;DR: The depth estimation problem is revisited, avoiding the explicit stereo matching step by using a simple two-tower convolutional neural network. The proposed algorithm, entitled 2T-UNet, surpasses state-of-the-art monocular and stereo depth estimation methods on the challenging Scene flow dataset.
References
Journal ArticleDOI

Perceptually-guided foveation for light field displays

TL;DR: A content-adaptive importance model in the 4D ray space is formulated based on psychophysical experiments and a theoretical analysis of visual and display bandwidths, and is verified by building a prototype light field display that can render only 16%–30% of the rays without compromising perceptual quality.
Journal ArticleDOI

Fast gaze-contingent optimal decompositions for multifocal displays

TL;DR: An efficient algorithm for optimal decompositions is presented, incorporating insights from vision science, and it is shown that eye tracking can be used for adequate plane alignment through efficient image-based deformations that adjust for both eye rotation and head movement relative to the display.
Proceedings ArticleDOI

DeepFocus: learned image synthesis for computational display

TL;DR: DeepFocus is introduced, a generic, end-to-end trainable convolutional neural network designed to efficiently solve the full range of computational tasks for accommodation-supporting HMDs; it is demonstrated to accurately synthesize defocus blur, focal stacks, multilayer decompositions, and multiview imagery using commonly available RGB-D images.
Proceedings ArticleDOI

An introduction to the log-polar mapping [image sampling]

TL;DR: This paper introduces the log-polar mapping, a space-variant image sampling scheme modelled on the retino-cortical mapping of the human visual system, and reviews its main properties for image sampling.
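For readers unfamiliar with the scheme, one standard formulation of the log-polar mapping (with fixation point $(x_0, y_0)$ and logarithm base $b$ as free parameters, both assumptions of this note rather than values from the reference) is

$$
\rho = \log_b \sqrt{(x - x_0)^2 + (y - y_0)^2}, \qquad
\theta = \operatorname{atan2}\!\left(y - y_0,\; x - x_0\right),
$$

so that a uniform grid in $(\rho, \theta)$ concentrates samples near the fixation point and thins them out with eccentricity, which is the property the main paper's sampling stage exploits.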