Topic

2D to 3D conversion

About: 2D to 3D conversion is a research topic. Over the lifetime, 591 publications have been published within this topic receiving 7632 citations.

...read moreread less

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Book Chapter•DOI•

Deep3D: Fully Automatic 2D-to-3D Video Conversion with Deep Convolutional Neural Networks

[...]

Junyuan Xie¹, Ross Girshick¹, Ali Farhadi¹•Institutions (1)

University of Washington¹

08 Oct 2016

TL;DR: Deep3D as discussed by the authors uses deep neural networks to automatically convert 2D videos and images to a stereoscopic 3D format, which is trained end-to-end directly on stereo pairs extracted from existing 3D movies.

...read moreread less

Abstract: As 3D movie viewing becomes mainstream and the Virtual Reality (VR) market emerges, the demand for 3D contents is growing rapidly. Producing 3D videos, however, remains challenging. In this paper we propose to use deep neural networks to automatically convert 2D videos and images to a stereoscopic 3D format. In contrast to previous automatic 2D-to-3D conversion algorithms, which have separate stages and need ground truth depth map as supervision, our approach is trained end-to-end directly on stereo pairs extracted from existing 3D movies. This novel training scheme makes it possible to exploit orders of magnitude more data and significantly increases performance. Indeed, Deep3D outperforms baselines in both quantitative and human subject evaluations.

...read moreread less

435 citations

Journal Article•DOI•

Depth Transfer: Depth Extraction from Video Using Non-Parametric Sampling.

[...]

Kevin Karsch¹, Ce Liu², Sing Bing Kang²•Institutions (2)

Urbana University¹, Microsoft²

11 Apr 2014-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: The technique can be used to automatically convert a monoscopic video into stereo for 3D visualization, and is demonstrated through a variety of visually pleasing results for indoor and outdoor scenes, including results from the feature film Charade.

...read moreread less

Abstract: We describe a technique that automatically generates plausible depth maps from videos using non-parametric depth sampling. We demonstrate our technique in cases where past methods fail (non-translating cameras and dynamic scenes). Our technique is applicable to single images as well as videos. For videos, we use local motion cues to improve the inferred depth maps, while optical flow is used to ensure temporal depth consistency. For training and evaluation, we use a Kinect-based system to collect a large data set containing stereoscopic videos with known depths. We show that our depth estimation technique outperforms the state-of-the-art on benchmark databases. Our technique can be used to automatically convert a monoscopic video into stereo for 3D visualization, and we demonstrate this through a variety of visually pleasing results for indoor and outdoor scenes, including results from the feature film Charade.

...read moreread less

432 citations

Book Chapter•DOI•

Hand Pose Estimation via Latent 2.5D Heatmap Regression

[...]

Umar Iqbal¹, Umar Iqbal², Pavlo Molchanov², Thomas M. Breuel², Juergen Gall¹, Jan Kautz² - Show less +2 more•Institutions (2)

University of Bonn¹, Nvidia²

08 Sep 2018

TL;DR: This paper proposes a new method for 3D hand pose estimation from a monocular image through a novel 2.5D pose representation that implicitly learns depth maps and heatmap distributions with a novel CNN architecture.

...read moreread less

Abstract: Estimating the 3D pose of a hand is an essential part of human-computer interaction. Estimating 3D pose using depth or multi-view sensors has become easier with recent advances in computer vision, however, regressing pose from a single RGB image is much less straightforward. The main difficulty arises from the fact that 3D pose requires some form of depth estimates, which are ambiguous given only an RGB image. In this paper we propose a new method for 3D hand pose estimation from a monocular image through a novel 2.5D pose representation. Our new representation estimates pose up to a scaling factor, which can be estimated additionally if a prior of the hand size is given. We implicitly learn depth maps and heatmap distributions with a novel CNN architecture. Our system achieves state-of-the-art accuracy for 2D and 3D hand pose estimation on several challenging datasets in presence of severe occlusions.

...read moreread less

286 citations

Journal Article•DOI•

3D-TV Content Creation: Automatic 2D-to-3D Video Conversion

[...]

Liang Zhang, Carlos Vázquez, Sebastian Knorr

28 Mar 2011-IEEE Transactions on Broadcasting

TL;DR: An overview of automatic 2D-to-3D video conversion with a specific look at a number of approaches for both the extraction of depth information from monoscopic images and the generation of stereoscopic images is provided.

...read moreread less

Abstract: Three-dimensional television (3D-TV) is the next major revolution in television. A successful rollout of 3D-TV will require a backward-compatible transmission/distribution system, inexpensive 3D displays, and an adequate supply of high-quality 3D program material. With respect to the last factor, the conversion of 2D images/videos to 3D will play an important role. This paper provides an overview of automatic 2D-to-3D video conversion with a specific look at a number of approaches for both the extraction of depth information from monoscopic images and the generation of stereoscopic images. Some challenging issues for the success of automatic 2D-to-3D video conversion are pointed out as possible research topics for the future.

...read moreread less

161 citations

Proceedings Article•DOI•

Rapid 2D to 3D conversion

[...]

Philip Victor Harman, Julien Flack, Simon Richard Fox, Mark Robert Dowley

24 May 2002-electronic imaging

TL;DR: In this paper, the authors describe the application of MLAs to the generation of depth maps and presents the results of the commercial application of this approach, which is based upon the use of Machine Leaning Algorithm (MLAs).

...read moreread less

Abstract: The conversion of existing 2D images to 3D is proving commercially viable and fulfills the growing need for high quality stereoscopic images. This approach is particularly effective when creating content for the new generation of autostereoscopic displays that require multiple stereo images. The dominant technique for such content conversion is to develop a depth map for each frame of 2D material. The use of a depth map as part of the 2D to 3D conversion process has a number of desirable characteristics: 1. The resolution of the depth may be lower than that of the associated 2D image. 2. It can be highly compressed. 3. 2D compatibility is maintained. 4. Real time generation of stereo, or multiple stereo pairs, is possible. The main disadvantage has been the laborious nature of the manual conversion techniques used to create depth maps from existing 2D images, which results in a slow and costly process. An alternative, highly productive technique has been developed based upon the use of Machine Leaning Algorithm (MLAs). This paper describes the application of MLAs to the generation of depth maps and presents the results of the commercial application of this approach.

...read moreread less

148 citations

Collapse

Network Information

Performance

Metrics

595

Papers

8,157

Citations

No. of papers in the topic in previous years
Year	Papers
2023	3
2022	1
2021	8
2020	9
2019	10
2018	25

2D to 3D conversion

Papers published on a yearly basis

Papers

Trending Questions (2)

Network Information

Related Topics (5)

Performance

Metrics