Topic

View synthesis

About: View synthesis is a research topic. Over its lifetime, 1,701 publications have been published within this topic, receiving 42,333 citations.


Papers
Proceedings ArticleDOI
01 Jan 2003
TL;DR: A multiple-view layered representation for tracking and segmenting multiple objects is proposed: a MAP solution estimates layer parameters that are consistent across views, and a persistent representation of occupancy is maintained despite occlusion without enforcing a particular parametric shape model.
Abstract: We propose a multiple view layered representation for tracking and segmentation of multiple objects in a scene. Existing layered approaches are dominated by the single view case and generally exploit only motion cues. We extend this to integrate static, dynamic and structural cues over a pair of views. The goal is to update coherent correspondence information sequentially, producing a multi-object tracker as a natural byproduct. We formulate a MAP solution for estimating layer parameters which are consistent across views, with the EM algorithm used to determine both the hidden segmentation labelling and motion parameters. A persistent representation of occupancy is maintained in spite of occlusion without enforcing a particular parametric shape model. An immediate application is dynamic novel view synthesis, for which our layered approach offers a direct and convenient representation.

23 citations
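The abstract above alternates an E-step (inferring the hidden segmentation labelling) with an M-step (re-estimating layer motion parameters). As a rough, generic illustration of that EM structure only, not the paper's actual two-view model with static, dynamic and structural cues, the following Python sketch soft-assigns pixels to motion layers from per-pixel flow vectors; the function name and the Gaussian motion likelihood are assumptions made here for illustration.

```python
# Toy EM sketch for layered motion segmentation (illustrative only; the
# paper's actual model integrates cues across two views and maintains a
# persistent occupancy representation, which is not reproduced here).
import numpy as np

def em_layers(flow, n_layers=2, n_iters=20, sigma=1.0):
    """flow: (N, 2) per-pixel motion vectors; returns soft labels and layer motions."""
    rng = np.random.default_rng(0)
    motions = flow[rng.choice(len(flow), n_layers, replace=False)]  # init layer motions
    for _ in range(n_iters):
        # E-step: responsibility of each layer for each pixel (Gaussian motion likelihood)
        d2 = ((flow[:, None, :] - motions[None, :, :]) ** 2).sum(-1)
        resp = np.exp(-d2 / (2 * sigma ** 2))
        resp /= resp.sum(axis=1, keepdims=True) + 1e-12
        # M-step: re-estimate each layer's motion as a responsibility-weighted mean
        motions = (resp[:, :, None] * flow[:, None, :]).sum(0) / (resp.sum(0)[:, None] + 1e-12)
    return resp, motions
```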

Book ChapterDOI
01 Jan 2003
TL;DR: Automatic recovery of camera motion and scene structure from video sequences has been a staple of computer vision research for over a decade and now represents one of the success stories of computer vision.
Abstract: The goal of automatic recovery of camera motion and scene structure from video sequences has been a staple of computer vision research for over a decade. As an area of endeavour, it has seen both steady and explosive progress over time, and now represents one of the success stories of computer vision. This task, automatic camera tracking or “matchmoving”, is the sine qua non of modern special effects, allowing the seamless insertion of computer generated objects onto live-action backgrounds (figure 2.1 shows an example). It has moved from a research problem for a small number of uncalibrated images to commercial software which can automatically track cameras through thousands of frames [1]. In addition, camera tracking is an important preprocess for many computer vision algorithms such as multiple-view shape reconstruction, novel view synthesis and autonomous vehicle navigation.

23 citations

Journal ArticleDOI
TL;DR: This paper considers the two DIBR algorithms used in the Moving Picture Experts Group view synthesis reference software, and develops a scheme for the encoder to estimate the distortion of the synthesized virtual view at the decoder when the reference texture and depth sequences experience transmission errors such as packet loss.
Abstract: Depth-image-based rendering (DIBR) is frequently used in multiview video applications such as free-viewpoint television. In this paper, we consider the two DIBR algorithms used in the Moving Picture Experts Group view synthesis reference software, and develop a scheme for the encoder to estimate the distortion of the synthesized virtual view at the decoder when the reference texture and depth sequences experience transmission errors such as packet loss. We first develop a graphical model to analyze how random errors in the reference depth image affect the synthesized virtual view. The warping competition rule adopted in the DIBR algorithms is explicitly represented by the graphical model. We then consider the case where packet loss occurs to both the encoded texture and depth images during transmission and develop a recursive optimal distribution estimation (RODE) method to calculate the per-pixel texture and depth probability distributions in each frame of the reference views. The RODE is then integrated with the graphical model method to estimate the distortion in the synthesized view caused by packet loss. Experimental results verify the accuracy of the graphical model method, the RODE, and the combined estimation scheme.

23 citations
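The RODE method above tracks per-pixel probability distributions of the texture and depth references under packet loss. As a much-simplified sketch in the same spirit (a ROPE-style first/second-moment recursion, not the paper's actual RODE, and without the DIBR warping-competition graphical model), the following Python illustrates how an encoder might estimate expected decoder-side distortion when a lost region is concealed from the previous frame; the function name and the concealment model are assumptions for illustration.

```python
# Simplified first/second-moment recursion for expected decoder distortion
# under packet loss (ROPE-style illustration; the paper's RODE additionally
# tracks full per-pixel texture and depth distributions and feeds them into
# a DIBR warping model, which is omitted here).
import numpy as np

def expected_distortion(frames, p_loss):
    """frames: list of 2-D arrays of original pixel values (intra-coded for simplicity).
    Concealment model: a lost region is replaced by the previous decoded pixel."""
    e1 = np.zeros_like(frames[0], dtype=float)   # E[decoded value]
    e2 = np.zeros_like(frames[0], dtype=float)   # E[decoded value^2]
    dist = []
    for x in frames:
        x = x.astype(float)
        # received with prob (1 - p): decoder shows x; lost with prob p: previous pixel
        e1, e2 = (1 - p_loss) * x + p_loss * e1, (1 - p_loss) * x ** 2 + p_loss * e2
        dist.append(np.mean(x ** 2 - 2 * x * e1 + e2))  # E[(x - decoded)^2] per frame
    return dist
```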

Proceedings ArticleDOI
TL;DR: A view selection method inspired by plenoptic sampling, followed by transform-based view coding and view synthesis prediction to code residual views, is introduced; the scheme has improved rate-distortion performance and better preserves the structure of the perceived light fields.
Abstract: Full parallax light field displays require high pixel density and huge amounts of data. Compression is a necessary tool used by 3D display systems to cope with the high bandwidth requirements. One of the formats adopted by MPEG for 3D video coding standards is the use of multiple views with associated depth maps. Depth maps enable the coding of a reduced number of views, and are used by compression and synthesis software to reconstruct the light field. However, most of the developed coding and synthesis tools target linearly arranged cameras with small baselines. Here we propose to use the 3D video coding format for full parallax light field coding. We introduce a view selection method inspired by plenoptic sampling followed by transform-based view coding and view synthesis prediction to code residual views. We determine the minimal requirements for view sub-sampling and present the rate-distortion performance of our proposal. We also compare our method with established video compression techniques, such as H.264/AVC, H.264/MVC, and the new 3D video coding algorithm, 3DV-ATM. Our results show that our method not only has improved rate-distortion performance but also better preserves the structure of the perceived light fields.

23 citations
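The pipeline above codes a sub-sampled set of anchor views and predicts the remaining views through view synthesis. A minimal sketch of the anchor-selection step on a full-parallax camera grid is shown below, assuming a simple regular stride; the actual sub-sampling interval in the paper is derived from plenoptic sampling theory, and the residual-view prediction is not reproduced here.

```python
# Minimal sketch of anchor-view selection on a full-parallax camera grid
# (illustrative; the paper derives the sub-sampling interval from plenoptic
# sampling and codes residual views with view synthesis prediction).
import numpy as np

def select_anchor_views(rows, cols, step):
    """Return a boolean mask of shape (rows, cols): True = anchor view coded directly,
    False = residual view to be predicted from synthesized neighbours."""
    mask = np.zeros((rows, cols), dtype=bool)
    mask[::step, ::step] = True
    return mask

anchors = select_anchor_views(rows=9, cols=9, step=4)
print(anchors.sum(), "anchor views out of", anchors.size)   # 9 anchor views out of 81
```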

Proceedings Article
01 Jan 2021
TL;DR: MINE predicts a 4-channel image (RGB and volume density) at arbitrary depth values to jointly reconstruct the camera frustum and fill in occluded contents, which can then be easily rendered into novel RGB or depth views using differentiable rendering.
Abstract: In this paper, we propose MINE to perform novel view synthesis and depth estimation via dense 3D reconstruction from a single image. Our approach is a continuous depth generalization of the Multiplane Images (MPI) by introducing the NEural radiance fields (NeRF). Given a single image as input, MINE predicts a 4-channel image (RGB and volume density) at arbitrary depth values to jointly reconstruct the camera frustum and fill in occluded contents. The reconstructed and inpainted frustum can then be easily rendered into novel RGB or depth views using differentiable rendering. Extensive experiments on RealEstate10K, KITTI and Flowers Light Fields show that our MINE outperforms state-of-the-art by a large margin in novel view synthesis. We also achieve competitive results in depth estimation on iBims-1 and NYU-v2 without annotated depth supervision. Our source code is available at this https URL

23 citations
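MINE predicts RGB and volume density at sampled depth values and renders novel views with differentiable rendering. The standard front-to-back compositing of such (RGB, density) planes, as used in MPI/NeRF-style renderers, looks roughly like the sketch below; this is the generic volume-rendering step only, not MINE's network or its homography warping, and the function name is made up for illustration.

```python
# Generic front-to-back compositing of fronto-parallel (RGB, density) planes,
# as in MPI / NeRF-style volume rendering (rendering step only; assumes at
# least two depth planes sorted from near to far).
import numpy as np

def composite_planes(rgb, sigma, depths):
    """rgb: (D, H, W, 3), sigma: (D, H, W) volume density, depths: (D,) sorted near->far.
    Returns the composited (H, W, 3) image."""
    deltas = np.diff(depths, append=depths[-1] + (depths[-1] - depths[-2]))
    alpha = 1.0 - np.exp(-sigma * deltas[:, None, None])      # per-plane opacity
    trans = np.cumprod(1.0 - alpha + 1e-10, axis=0)           # transmittance after each plane
    trans = np.concatenate([np.ones_like(trans[:1]), trans[:-1]], axis=0)  # before each plane
    weights = alpha * trans                                    # contribution of each plane
    return (weights[..., None] * rgb).sum(axis=0)
```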


Network Information
Related Topics (5)
Image segmentation: 79.6K papers, 1.8M citations (86% related)
Feature (computer vision): 128.2K papers, 1.7M citations (86% related)
Object detection: 46.1K papers, 1.3M citations (85% related)
Convolutional neural network: 74.7K papers, 2M citations (85% related)
Feature extraction: 111.8K papers, 2.1M citations (84% related)
Performance Metrics
No. of papers in the topic in previous years:

Year    Papers
2023    54
2022    117
2021    189
2020    158
2019    114
2018    102