Proceedings Article

Compression Of Stereo Images And The Evaluation Of Its Effects On 3-D Perception

30 Jan 1990-Vol. 1153, pp 522-530
TL;DR: The proposed approach to stereo image coding takes advantage of the singleness of vision property of the human visual system and shows that a stereo image pair, in which one of the images is low-pass filtered and subsampled, is perceived as a sharp 3-D image.
Abstract: The proposed approach to stereo image coding takes advantage of the singleness of vision property of the human visual system. Experiments show that a stereo image pair, in which one of the images is low-pass filtered and subsampled, is perceived as a sharp 3-D image. The depth information is perceived due to the stereopsis effect, and the sharpness is maintained due to the details in the non-filtered image. A methodology for the evaluation of the compression effects on the 3-D perception of stereo images is presented. It is based on measurements of response-time and accuracy of human subjects performing simple 3-D perception tasks.
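The mixed-resolution idea in the abstract can be sketched in a few lines. The abstract does not say which filter, which subsampling factor, or which eye's image is reduced, so the box (averaging) filter, the factor of 2, and the choice of the right image below are all assumptions made for illustration; the function names are hypothetical.

```python
def lowpass_and_subsample(image, factor=2):
    """Average each factor x factor block (a box low-pass filter) and keep
    one sample per block. The box filter is an assumption, not the paper's filter."""
    rows, cols = len(image), len(image[0])
    out = []
    for r in range(0, rows - rows % factor, factor):
        out_row = []
        for c in range(0, cols - cols % factor, factor):
            block = [image[r + i][c + j]
                     for i in range(factor) for j in range(factor)]
            out_row.append(sum(block) / len(block))
        out.append(out_row)
    return out


def make_mixed_resolution_pair(left, right, factor=2):
    """Keep the left image at full resolution and reduce the right one.
    Which image is reduced is an assumption for this sketch."""
    return left, lowpass_and_subsample(right, factor)


def sample_count(image):
    """Number of pixel samples in an image stored as a list of rows."""
    return sum(len(row) for row in image)


if __name__ == "__main__":
    left = [[1, 2, 3, 4]] * 4
    right = [[0, 2, 4, 6], [2, 4, 6, 8], [1, 1, 1, 1], [3, 3, 3, 3]]
    full, reduced = make_mixed_resolution_pair(left, right)
    print(sample_count(full), sample_count(reduced))
```

With a factor of 2 in each direction, the reduced image carries a quarter of the samples of the full-resolution one, which is where the saving comes from before any further coding is applied.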
Citations
Proceedings Article
18 Oct 1999
TL;DR: A novel object driven, block based algorithm for the compression of stereo image pairs that effectively combines the simplicity and adaptability of the existing block based stereo image compression techniques with an edge/contour based object extraction technique to determine appropriate compression strategy for various areas of the right image.
Abstract: In this paper, we propose a novel object driven, block based algorithm for the compression of stereo image pairs. The algorithm effectively combines the simplicity and adaptability of the existing block based stereo image compression techniques with an edge/contour based object extraction technique to determine appropriate compression strategy for various areas of the right image. Extensive experiments show that the proposed algorithm improves compression ratio by up to 20% compared with existing stereo image compression techniques, while the reconstructed image quality is maintained at an equivalent level in terms of PSNR values. In terms of visual quality, the right image reconstructed by the proposed algorithm does not incur any noticeable effect compared with the outputs of the best algorithms. The proposed algorithm performs object extraction and matching between the reconstructed left frame and the original right frame to identify those objects that match but are displaced by varying amounts due to binocular parallax. Different coding strategies are then applied separately to internal areas and the bounding areas for each identified object. Based on the mean squared matching error of the internal blocks and a selected threshold, a decision is made whether or not to encode the predictive errors inside these objects. The output bit stream includes entropy coding of object disparity, block disparity and possibly some errors, which fail to meet the threshold requirement in the proposed algorithm.

2 citations
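The threshold decision mentioned in the abstract above (mean squared matching error of a block compared against a selected threshold) can be illustrated with a minimal sketch. This is not the authors' algorithm: the exhaustive horizontal search, the search direction, the block size, and every function name here are assumptions, and the object extraction stage is omitted entirely.

```python
def get_block(image, top, left, size):
    """Cut a size x size block out of an image stored as a list of rows."""
    return [row[left:left + size] for row in image[top:top + size]]


def block_mse(a, b):
    """Mean squared error between two equally sized blocks."""
    n = sum(len(row) for row in a)
    return sum((x - y) ** 2
               for ra, rb in zip(a, b)
               for x, y in zip(ra, rb)) / n


def best_disparity(left_img, right_img, top, col, size, max_disp):
    """Search horizontal shifts 0..max_disp in the left image for the block
    that best predicts the right-image block at (top, col).
    Returns (disparity, mse). Assumes the block lies inside both images."""
    target = get_block(right_img, top, col, size)
    width = len(left_img[0])
    best = None
    for d in range(max_disp + 1):
        if col + d + size > width:
            break  # candidate block would fall outside the left image
        err = block_mse(target, get_block(left_img, top, col + d, size))
        if best is None or err < best[1]:
            best = (d, err)
    return best


def should_encode_residual(mse, threshold):
    """Assumed rule: code the prediction error only when matching is poor."""
    return mse > threshold


if __name__ == "__main__":
    left_img = [[0, 0, 10, 20, 30, 0, 0, 0]] * 2
    right_img = [[10, 20, 30, 0, 0, 0, 0, 0]] * 2
    d, err = best_disparity(left_img, right_img, 0, 0, 2, 4)
    print(d, err, should_encode_residual(err, 5.0))
```

In this toy case the right image is the left image shifted by two pixels, so the search finds a perfect match and the residual is skipped; a block with no good match would exceed the threshold and have its prediction error coded.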

Journal Article
TL;DR: Experimental studies conducted by the Communications Research Centre Canada within the last 15 years reflect empirical investigations on a wide range of topics from an applied vision research perspective: the prevalence of stereo-deficiency, the potential use of mixed-resolution binocular imaging for reducing transmission bandwidth, and the role of stereoscopic object motion on visual comfort.
Abstract: Stereoscopic three-dimensional (S3-D) visual entertainment has become a hot technology that is emerging very quickly in many new electronic devices and visual communication systems. Although this rapid growth of S3-D technology is relatively recent, visual scientists have been conducting research on fundamental issues of stereoscopic 3-D television (3-D TV) development and deployment for many years. To illustrate how human visual perception research has been applied to the study of stereoscopic 3-D TV related issues, selected experimental studies that were conducted by the Communications Research Centre Canada within the last 15 years are presented. The studies reflect empirical investigations on a wide range of topics from an applied vision research perspective: the prevalence of stereo-deficiency, the potential use of mixed-resolution binocular imaging for reducing transmission bandwidth, the role of stereoscopic object motion on visual comfort, and the use of surrogate depth maps for the conversion of standard video to S3-D format to increase the availability of S3-D program contents.

1 citation

Journal Article
TL;DR: The simulations show that the proposed solution combined with the state of the art view synthesis distortion outperforms the anchor MVC+D coding scheme by 14.5 % of dBR on average for the total coded bitrate and by 17 % for the synthesized views.
Abstract: 3D video is composed of two or more temporally synchronized 2D video streams acquired at different camera poses and accompanied by geometrical information. In a mixed resolution 3D video stream, a subset of views is coded at reduced resolution. It has been shown in the literature that subjective quality of mixed resolution 3D video is close to that of full resolution 3D video. In order to improve the coding gain in a mixed resolution coding scenario, we present a new depth encoding method called view upsampling optimization. A novel depth distortion metric based on the performance of the depth-based super resolution is also presented. Finally, to improve the quality of the decoded video, an improved depth-based super resolution method that uses view synthesis quality mapping is used for upsampling of low resolution views. The simulations, performed with the recently standardized MVC+D encoder, show that the proposed solution combined with the state of the art view synthesis distortion outperforms the anchor MVC+D coding scheme by 14.5 % of dBR on average for the total coded bitrate and by 17 % of dBR on average for the synthesized views.

1 citation


Cites background or methods from "Compression Of Stereo Images And Th..."

  • ...mixed resolution stereopairs has been assessed first time in [15]....

    [...]

  • ...[15] and later utilized for image sequences and named as mixed resolution coding by Perkins et al....

    [...]

Dissertation
01 Jan 2004
TL;DR: Seven wavelet-based stereo image compression algorithms are proposed, to take advantage of the higher data compaction capability and better flexibility of wavelets.
Abstract: With the standardization of JPEG-2000, wavelet-based image and video compression technologies are gradually replacing the popular DCT-based methods. In parallel to this, recent developments in autostereoscopic display technology are now threatening to revolutionize the way in which consumers are used to enjoying the traditional 2-D display based electronic media such as television, computer and movies. However, due to the two-fold bandwidth/storage space requirement of stereoscopic imaging, an essential requirement of a stereo imaging system is efficient data compression. In this thesis, seven wavelet-based stereo image compression algorithms are proposed, to take advantage of the higher data compaction capability and better flexibility of wavelets. [Continues.]
Patent
01 Dec 2020
TL;DR: In this paper, a segmented 3D multi-view image generator is proposed to generate fewer multiview view images for partitions having less salient features, based on a depth map and image processing of the input image.
Abstract: A segmented 3D multi-view image generator generates fewer multi-view view images for partitions having less salient features. Saliency values are calculated based on a depth map and image processing of the input image. The saliency values are compared to thresholds to map pixel locations to first, second, and third partitions. First, second, and third segmented images are created from the input image using a partition map. A multi-view generator uses the depth map and viewer eye locations to generate 28 view images from the first segmented image, 14 unique view images from the second segmented image that are replicated to 28 view images, and 7 unique view images from the third segmented image that are replicated to 28 view images. The view images for each segment are interlaced to generate interlaced segmented images that are then integrated together into a single 3D image that drives a 28-view autostereoscopic display.
References
Book
01 Jan 1978
TL;DR: In this paper, a unified experimental approach to the study of the mind based on experiments concerning the time course of human information processing is described, drawing systematically on studies of performance, subjective experience, and brain processes.
Abstract: In this widely cited volume, Professor Posner describes a unified experimental approach to the study of the mind based on experiments concerning the time course of human information processing. Drawing systematically on studies of performance, subjective experience, and brain processes, he develops relationships between cognitive psychology and neuroscience. Professor Posner has written a new preface for the paperback edition.

69 citations