Topic

View synthesis

About: View synthesis is a research topic. Over its lifetime, 1,701 publications have been published within this topic, receiving 42,333 citations.


Papers
Proceedings ArticleDOI
26 Sep 2001
TL;DR: This work addresses the application of computer vision to semi-immersive teleconferencing, and presents a prototype vision system synthesising a physically plausible video of a speaker to be displayed at a remote conferencing station.
Abstract: We address the application of computer vision to semi-immersive teleconferencing, and present a prototype vision system synthesising a physically plausible video of a speaker to be displayed at a remote conferencing station. The main system components are a hierarchical, efficient large-baseline disparity estimator and a view synthesis module. We illustrate and discuss results on a real-speaker sequence. We regard the development of such a system in the domain of advanced teleconferencing as the main contribution of this work.

18 citations
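The pipeline above hinges on a dense disparity map between the two camera views before any view is synthesised. As a rough illustration of that first stage only, here is a minimal brute-force block-matching sketch for a rectified grayscale pair; the function name, window size and SAD cost are illustrative assumptions and bear no relation to the paper's hierarchical large-baseline estimator.

```python
import numpy as np

def block_matching_disparity(left, right, max_disparity=64, window=5):
    """Minimal SAD block matching for a rectified grayscale stereo pair.

    Brute-force illustration only; assumes `left` and `right` are 2-D arrays
    of equal shape and that disparity is a leftward shift in the right image.
    """
    h, w = left.shape
    half = window // 2
    disparity = np.zeros((h, w), dtype=np.float32)
    for y in range(half, h - half):
        for x in range(half, w - half):
            patch = left[y - half:y + half + 1, x - half:x + half + 1].astype(np.float32)
            best_cost, best_d = np.inf, 0
            for d in range(min(max_disparity, x - half) + 1):
                cand = right[y - half:y + half + 1,
                             x - d - half:x - d + half + 1].astype(np.float32)
                cost = np.abs(patch - cand).sum()  # sum of absolute differences
                if cost < best_cost:
                    best_cost, best_d = cost, d
            disparity[y, x] = best_d
    return disparity
```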

Proceedings ArticleDOI
01 Oct 2007
TL;DR: This paper presents an efficient image-based rendering system capable of performing online stereo matching and view synthesis at high speed, completely on the graphics processing unit (GPU).
Abstract: This paper presents an efficient image-based rendering system capable of performing online stereo matching and view synthesis at high speed, completely on the graphics processing unit (GPU). Given two rectified stereo images, our algorithm first extracts the disparity map with a stream-centric dense depth estimation approach. For high-quality view synthesis, multi-label masks are then automatically generated to adaptively post-process occlusions and ambiguously estimated regions. To allow even faster interactive view generation, an alternative forward warping method is also integrated. The experiments show that our algorithm yields photorealistic intermediate views of high image quality. The optimized implementation also provides state-of-the-art stereo analysis and view synthesis speed, achieving over 47 fps with 450×375 stereo images and 60 disparity levels on an Nvidia GeForce 7900 graphics card.

18 citations
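The abstract mentions an alternative forward warping method for fast intermediate-view generation. The following hedged sketch shows the basic idea on the CPU: each left-image pixel is shifted by a fraction of its disparity into the virtual view, with nearer surfaces (larger disparity) winning on collisions. Holes are simply left empty here, whereas the paper post-processes them with multi-label masks on the GPU; the function name and the `alpha` parameter are illustrative assumptions.

```python
import numpy as np

def forward_warp_intermediate(left, disparity, alpha=0.5):
    """Forward-warp the left image to a virtual view at fraction `alpha`
    of the baseline (0 = left view, 1 = right view).

    `left` may be grayscale (H, W) or color (H, W, 3); `disparity` is (H, W).
    Unfilled pixels remain zero (holes / disocclusions).
    """
    h, w = disparity.shape
    synth = np.zeros_like(left)
    depth_buf = np.full((h, w), -1.0)  # keep the largest disparity (nearest surface)
    for y in range(h):
        for x in range(w):
            d = disparity[y, x]
            xt = int(round(x - alpha * d))  # shift pixel toward the virtual viewpoint
            if 0 <= xt < w and d > depth_buf[y, xt]:
                depth_buf[y, xt] = d
                synth[y, xt] = left[y, x]
    return synth
```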

Journal ArticleDOI
TL;DR: Experimental results demonstrate that the proposed depth view synthesis method provides high-quality depth images for the current view and the proposed VSP modes provide high coding gains, especially on the anchor frames.
Abstract: The view synthesis prediction (VSP) method exploits inter-view correlations by generating an additional reference frame in multiview video coding. This paper describes a multiview depth video coding scheme that incorporates depth view synthesis and additional prediction modes. In the proposed scheme, we exploit the reconstructed neighboring depth frame to generate an additional reference depth image for the current viewpoint to be coded, using the depth-image-based rendering technique. In order to generate high-quality reference depth images, we use pre-processing on depth, depth image warping, and two types of hole-filling methods depending on the number of available reference views. After synthesizing the additional depth image, we encode the depth video using the proposed additional prediction modes, named VSP modes, which refer to the synthesized depth image. In particular, the VSP_SKIP mode refers to the co-located block of the synthesized frame without coding motion vectors or residual data, which yields most of the coding gains. Experimental results demonstrate that the proposed depth view synthesis method provides high-quality depth images for the current view and that the proposed VSP modes provide high coding gains, especially on anchor frames.

18 citations
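As background for the depth image warping and hole filling steps mentioned above, here is a hedged sketch of a basic depth-image-based-rendering warp from a single horizontally displaced reference view: each depth sample is shifted by the disparity it implies, nearer surfaces win on collisions, and remaining holes are filled from the neighbouring background (farther) depth. The simple f·B/Z camera model, the single-reference hole-filling heuristic and all parameter names are assumptions for illustration, not the paper's method.

```python
import numpy as np

def dibr_depth_warp(depth, focal, baseline):
    """Warp a depth map (values > 0, in metres) to a horizontally shifted
    virtual viewpoint and fill holes from the background depth.
    """
    h, w = depth.shape
    warped = np.zeros((h, w), dtype=np.float32)
    for y in range(h):
        # 1) warp: shift each sample by its own disparity, keep nearest surface
        for x in range(w):
            z = float(depth[y, x])
            if z <= 0:
                continue
            xt = x + int(round(focal * baseline / z))  # disparity = f * B / Z
            if 0 <= xt < w and (warped[y, xt] == 0 or z < warped[y, xt]):
                warped[y, xt] = z
        # 2) hole filling: take the farther of the two nearest valid neighbours
        for x in range(w):
            if warped[y, x] == 0:
                left = warped[y, x - 1] if x > 0 else 0.0
                right_vals = warped[y, x + 1:]
                nz = right_vals[right_vals > 0]
                right = float(nz[0]) if nz.size else 0.0
                warped[y, x] = max(left, right)  # prefer background (larger Z)
    return warped
```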

Proceedings Article
01 Jan 2019
TL;DR: This work devises an approach that exploits known geometric properties of the scene (per-frame camera extrinsics and depth) in order to warp reference views into the new ones, and obtains images that are geometrically consistent with all the views in the scene camera system.
Abstract: Given a set of reference RGBD views of an indoor environment and a new viewpoint, our goal is to predict the view from that location. Prior work on new-view generation has predominantly focused on significantly constrained scenarios, typically involving artificially rendered views of isolated CAD models. Here we tackle a much more challenging version of the problem. We devise an approach that exploits known geometric properties of the scene (per-frame camera extrinsics and depth) in order to warp reference views into the new ones. The defects in the generated views are handled by a novel RGBD inpainting network, PerspectiveNet, that is fine-tuned for a given scene in order to obtain images that are geometrically consistent with all the views in the scene camera system. Experiments conducted on the ScanNet and SceneNet datasets reveal performance superior to strong baselines.

18 citations
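The geometric warping step described above (using per-frame camera extrinsics and depth) can be illustrated with a standard unproject-transform-reproject loop. The sketch below is a generic version under assumed pinhole intrinsics K and a 4x4 relative transform T_ref_to_new; disocclusions remain as holes, which is what the paper's PerspectiveNet inpaints. All names are illustrative, not the authors' code.

```python
import numpy as np

def reproject_rgbd(rgb, depth, K, T_ref_to_new):
    """Warp a reference RGBD view into a new camera.

    rgb: (H, W, 3), depth: (H, W) z-depth in the reference camera,
    K: 3x3 pinhole intrinsics, T_ref_to_new: 4x4 transform from the
    reference camera frame to the new camera frame.
    """
    h, w = depth.shape
    ys, xs = np.mgrid[0:h, 0:w]
    pix = np.stack([xs, ys, np.ones_like(xs)], axis=-1).reshape(-1, 3).astype(np.float64)
    rays = pix @ np.linalg.inv(K).T                      # unproject pixels to camera rays
    pts_ref = rays * depth.reshape(-1, 1)                # 3D points in the reference frame
    pts_h = np.concatenate([pts_ref, np.ones((pts_ref.shape[0], 1))], axis=1)
    pts_new = (pts_h @ T_ref_to_new.T)[:, :3]            # move points into the new frame
    proj = pts_new @ K.T                                 # project with the same intrinsics
    z = proj[:, 2:3]
    valid = z[:, 0] > 1e-6
    uv = np.round(proj[:, :2] / np.maximum(z, 1e-6)).astype(int)
    out = np.zeros_like(rgb)
    zbuf = np.full((h, w), np.inf)
    colors = rgb.reshape(-1, rgb.shape[-1])
    for (u, v), zz, c, ok in zip(uv, z[:, 0], colors, valid):
        if ok and 0 <= u < w and 0 <= v < h and zz < zbuf[v, u]:
            zbuf[v, u] = zz                              # nearest surface wins
            out[v, u] = c
    return out
```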

Book ChapterDOI
23 Aug 2020
TL;DR: A novel siamese network is introduced that employs a gating layer for better reconstruction of the latent volumetric representation and, consequently, better final visual results; a novel loss is also introduced to explicitly enforce consistency across generated views both in space and in time.
Abstract: Novel view video synthesis aims to synthesize videos from novel viewpoints, given input captures of a human performance taken from multiple reference viewpoints over consecutive time steps. Despite great advances in model-free novel view synthesis, existing methods present three limitations when applied to complex and time-varying human performance. First, these methods (and related datasets) mainly consider simple and symmetric objects. Second, they do not enforce explicit consistency across generated views. Third, they focus on static and non-moving objects. The fine-grained details of a human subject can therefore suffer from inconsistencies when synthesized across different viewpoints or time steps. To tackle these challenges, we introduce a human-specific framework that employs a learned 3D-aware representation. Specifically, we first introduce a novel siamese network that employs a gating layer for better reconstruction of the latent volumetric representation and, consequently, the final visual results. Moreover, features from consecutive time steps are shared inside the network to improve temporal consistency. Second, we introduce a novel loss to explicitly enforce consistency across generated views both in space and in time. Third, we present the Multi-View Human Action (MVHA) dataset, consisting of nearly 1,200 synthetic human performances captured from 54 viewpoints. Experiments on the MVHA, Pose-Varying Human Model and ShapeNet datasets show that our method outperforms state-of-the-art baselines both in view generation quality and in spatio-temporal consistency.

18 citations
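The gating layer named as the first contribution is not specified further in the abstract. As a rough illustration only, here is a generic gated-convolution pattern in PyTorch, in which a sigmoid gate branch modulates a feature branch element-wise so the network can suppress unreliable parts of a latent representation; this is an assumption about the general mechanism, not the paper's architecture.

```python
import torch
import torch.nn as nn

class GatedConv2d(nn.Module):
    """Generic gated convolution: features * sigmoid(gate), computed from the
    same input by two parallel convolutions."""

    def __init__(self, in_ch, out_ch, kernel_size=3, padding=1):
        super().__init__()
        self.feature = nn.Conv2d(in_ch, out_ch, kernel_size, padding=padding)
        self.gate = nn.Conv2d(in_ch, out_ch, kernel_size, padding=padding)

    def forward(self, x):
        # The gate (0..1) decides how much of each feature passes through.
        return torch.sigmoid(self.gate(x)) * torch.relu(self.feature(x))


# Example: gate a feature map extracted from one reference view.
x = torch.randn(1, 64, 32, 32)
y = GatedConv2d(64, 64)(x)   # same spatial size, gated features
```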


Network Information
Related Topics (5)
Image segmentation: 79.6K papers, 1.8M citations, 86% related
Feature (computer vision): 128.2K papers, 1.7M citations, 86% related
Object detection: 46.1K papers, 1.3M citations, 85% related
Convolutional neural network: 74.7K papers, 2M citations, 85% related
Feature extraction: 111.8K papers, 2.1M citations, 84% related
Performance
Metrics
No. of papers in the topic in previous years
Year    Papers
2023    54
2022    117
2021    189
2020    158
2019    114
2018    102