scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Hyper-Trellis Decoding of Pixel-Domain Wyner–Ziv Video Coding

TL;DR: A new decoding algorithm based on decoding on a hyper-trellis, in which multiple states of the original code trellis are combined, which significantly improves performance without changing the complexity of the decoder.
Abstract: In this paper, we present a new decoding algorithm for the Wyner-Ziv (WZ) video coding scheme based on turbo codes. In this scheme, a video frame is encoded using a turbo code, and only a subset of the parity bits are sent to the decoder. At the decoder, the temporal correlation of the video sequence is exploited by using the previous frame as noisy side information (SI) for the current frame. However, there is a mismatch between the SI, which is available as pixel values, and the binary code bits. Previous implementations of the decoder use suboptimal approaches that convert pixel values to soft information for code bits. We present a new decoding algorithm for this application based on decoding on a hyper-trellis, in which multiple states of the original code trellis are combined. We show that this approach significantly improves performance without changing the complexity of the decoder. We also introduce a new technique for the WZ decoder to exploit the spatial correlation within a frame without requiring transform-domain encoding at the encoder, thereby reducing its complexity. Simulation results for fixed-rate transmission show a 9-10-dB improvement in the peak signal-to-noise ratio when compared to a WZ video codec that does bitwise decoding and utilizes only the temporal correlation.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
TL;DR: The higher the estimation granularity is, the better the rate-distortion performance is since the deeper the adaptation of the decoding process is to the video statistical characteristics, which means that the pixel and coefficient levels are the best performing for PDWZ and TDWZ solutions, respectively.
Abstract: In recent years, practical Wyner-Ziv (WZ) video coding solutions have been proposed with promising results. Most of the solutions available in the literature model the correlation noise (CN) between the original frame and its estimation made at the decoder, which is the so-called side information (SI), by a given distribution whose relevant parameters are estimated using an offline process, assuming that the SI is available at the encoder or the originals are available at the decoder. The major goal of this paper is to propose a more realistic WZ video coding approach by performing online estimation of the CN model parameters at the decoder, for pixel and transform domain WZ video codecs. In this context, several new techniques are proposed based on metrics which explore the temporal correlation between frames with different levels of granularity. For pixel-domain WZ (PDWZ) video coding, three levels of granularity are proposed: frame, block, and pixel levels. For transform-domain WZ (TDWZ) video coding, DCT bands and coefficients are the two granularity levels proposed. The higher the estimation granularity is, the better the rate-distortion performance is since the deeper the adaptation of the decoding process is to the video statistical characteristics, which means that the pixel and coefficient levels are the best performing for PDWZ and TDWZ solutions, respectively.

241 citations


Cites background from "Hyper-Trellis Decoding of Pixel-Dom..."

  • ..., [4], [7], and [20], and, therefore, it will be also adopted in this paper....

    [...]

Journal ArticleDOI
TL;DR: A surveillance video compression system with low-complexity encoder based on Wyner-Ziv coding principles and an error resilience scheme for BCAWZ to address the concern of reliable transmission in the backward-channel, which is essential to the quality of video data for real-time and reliable object detection and event analysis.
Abstract: Video surveillance has been widely used in recent years to enhance public safety and privacy protection. A video surveillance system that deals with content analysis and activity monitoring needs efficient transmission and storage of the surveillance video data. Video compression techniques can be used to achieve this goal by reducing the size of the video with no or small quality loss. State-of-the-art video compression methods such as H.264/AVC often lead to high computational complexity at the encoder, which is generally implemented in a video camera in a surveillance system. This can significantly increase the cost of a surveillance system, especially when a mass deployment of end cameras is needed. In this paper, we discuss the specific considerations for surveillance video compression. We present a surveillance video compression system with low-complexity encoder based on Wyner-Ziv coding principles to address the tradeoff between computational complexity and coding efficiency. In addition, we propose a backward-channel aware Wyner-Ziv (BCAWZ) video coding approach to improve the coding efficiency while maintaining low complexity at the encoder. The experimental results show that for surveillance video contents, BCAWZ can achieve significantly higher coding efficiency than H.264/AVC intra coding as well as existing Wyner-Ziv video coding methods and is close to H.264/AVC inter coding, while maintaining similar coding complexity with intra coding. This shows that the low motion characteristics of many surveillance video contents and the low-complexity encoding requirement make our scheme a particularly suitable candidate for surveillance video compression. We further propose an error resilience scheme for BCAWZ to address the concern of reliable transmission in the backward-channel, which is essential to the quality of video data for real-time and reliable object detection and event analysis.

49 citations

Journal ArticleDOI
TL;DR: This paper reviews the state-of-the-art DVC architectures with a focus on understanding their opportunities and gaps in addressing the operational requirements and application needs of WVSNs.
Abstract: Distributed video coding (DVC) is a relatively new video coding architecture originated from two fundamental theorems namely, Slepian–Wolf and Wyner–Ziv. Recent research developments have made DVC attractive for applications in the emerging domain of wireless video sensor networks (WVSNs). This paper reviews the state-of-the-art DVC architectures with a focus on understanding their opportunities and gaps in addressing the operational requirements and application needs of WVSNs.

18 citations


Cites methods from "Hyper-Trellis Decoding of Pixel-Dom..."

  • ...• Hyper-Trellis decoding for PDWZ video coding is proposed in (Avudainayagam et al. 2008) to optimize the approach for the reconstruction of WZ frames....

    [...]

Book ChapterDOI
17 Dec 2007
TL;DR: In the proposed scheme, Wyner-Ziv decoder compensates wrong blocks by side information using side matching and bi-directional searching and the noise reduction in side information allows the proposed algorithm to achieve coding improvements not only in bitrate but also in PSNR.
Abstract: To make an encoder extremely simple by eliminating motion prediction/compensation from encoder, source coding with side information has been investigated based on the Wyner-Ziv theorem as the basic coding principle. However, the frame interpolation at decoder which is essential for redundancy elimination makes erroneous side information when the basic assumption of linear motion between frames is not satisfied. In this paper, we propose a new Wyner-Ziv video coding scheme featuring side matching in the frame interpolation to improve the side information. In the proposed scheme, Wyner-Ziv decoder compensates wrong blocks by side information using side matching and bi-directional searching. The noise reduction in side information allows the proposed algorithm to achieve coding improvements not only in bitrate but also in PSNR. Results of our experiments show improvement of PSNR up to 0.4dB.

17 citations


Cites methods or result from "Hyper-Trellis Decoding of Pixel-Dom..."

  • ...i) Conventional: Hyper-trellis-based PDWZ with frame interpolation[ 5 ] ii) Proposed: Hyper-trellis-based PDWZ with frame interpolation using the proposed side matching The test conditions are as follow....

    [...]

  • ...In this respect, the independence assumption of bit planes taken by previously proposed bitplane-based turbo coding has a problem in calculating channel likelihood [ 5 ]....

    [...]

  • ...Table 1 shows average performance improvements which lead us to conclude that our proposed scheme is better than the conventional scheme[ 5 ], and as we mentioned above, performance improvements occur in both PSNR and bit-rate....

    [...]

  • ...Second, we use the hyper-trellis turbo coding [ 5 ] instead of turbo coding based on bit-plane....

    [...]

  • ...Table 1. Average performance improvements of proposed scheme over the conventional one [ 5 ]...

    [...]

Proceedings ArticleDOI
23 Sep 2010
TL;DR: The the major research challenges and objectives of image coding for WMSN are discussed, including Wyner-Ziv coding and collaborative coding, and some theoretical explorations are investigated and classified.
Abstract: There are a large number of video data have to be processed and transmitted in resource-constrained wireless multimedia sensor networks (WMSN). One possible way of achieving maximum utilization of those resources is to apply an adaptive image coding scheme, which must consider the trade-off between energy consumption and image quality. The the major research challenges and objectives of image coding for WMSN are discussed. Distributed image coding schemes especially designed for WMSN and some theoretical explorations are investigated and classified, including Wyner-Ziv coding and collaborative coding.

14 citations


Cites background from "Hyper-Trellis Decoding of Pixel-Dom..."

  • ...[21] also exploit the spatial correlation within a frame at decoder....

    [...]

References
More filters
Journal ArticleDOI
27 Jun 2005
TL;DR: The recent development of practical distributed video coding schemes is reviewed, finding that the rate-distortion performance is superior to conventional intraframe coding, but there is still a gap relative to conventional motion-compensated interframe coding.
Abstract: Distributed coding is a new paradigm for video compression, based on Slepian and Wolf's and Wyner and Ziv's information-theoretic results from the 1970s. This paper reviews the recent development of practical distributed video coding schemes. Wyner-Ziv coding, i.e., lossy compression with receiver side information, enables low-complexity video encoding where the bulk of the computation is shifted to the decoder. Since the interframe dependence of the video sequence is exploited only at the decoder, an intraframe encoder can be combined with an interframe decoder. The rate-distortion performance is superior to conventional intraframe coding, but there is still a gap relative to conventional motion-compensated interframe coding. Wyner-Ziv coding is naturally robust against transmission errors and can be used for joint source-channel coding. A Wyner-Ziv MPEG encoder that protects the video waveform rather than the compressed bit stream achieves graceful degradation under deteriorating channel conditions without a layered signal representation.

1,342 citations


"Hyper-Trellis Decoding of Pixel-Dom..." refers background or methods in this paper

  • ...The WZ codec in [2], [17] utilizes only the temporal correlation of the video sequence....

    [...]

  • ...The Wyner-Ziv codec of Aaron and Girod [2], [17] operates as follows....

    [...]

  • ...In [2], [17] the interleaving is performed on the pixels, although this is not strictly required based on the way that the decoder is implemented in [2], [17]....

    [...]

  • ...As inter-frame coding is not the focus of this paper, we follow the approach of [2], [17] and do not consider coding of the odd frames....

    [...]

  • ...The reconstruction function used in [2], [17] is given by...

    [...]

Journal ArticleDOI
TL;DR: This work addresses the problem of compressing correlated distributed sources, i.e., correlated sources which are not co-located or which cannot cooperate to directly exploit their correlation and provides a constructive practical framework based on algebraic trellis codes dubbed as DIstributed Source Coding Using Syndromes (DISCUS), that can be applicable in a variety of settings.
Abstract: We address the problem of compressing correlated distributed sources, i.e., correlated sources which are not co-located or which cannot cooperate to directly exploit their correlation. We consider the related problem of compressing a source which is correlated with another source that is available only at the decoder. This problem has been studied in the information theory literature under the name of the Slepian-Wolf (1973) source coding problem for the lossless coding case, and as "rate-distortion with side information" for the lossy coding case. We provide a constructive practical framework based on algebraic trellis codes dubbed as DIstributed Source Coding Using Syndromes (DISCUS), that can be applicable in a variety of settings. Simulation results are presented for source coding of independent and identically distributed (i.i.d.) Gaussian sources with side information available at the decoder in the form of a noisy version of the source to be coded. Our results reveal the promise of this approach: using trellis-based quantization and coset construction, the performance of the proposed approach is 2-5 dB from the Wyner-Ziv (1976) bound.

1,060 citations


"Hyper-Trellis Decoding of Pixel-Dom..." refers background in this paper

  • ...The first approach [6]–[9] is based on explicit random binning techniques [10], [11] that are typically used in achievability proofs in information theory....

    [...]

Book
07 Oct 2001
TL;DR: Wang et al. as discussed by the authors received the B.S. and M.S degrees in electrical engineering from Tsinghua University, Beijing, China, in 1983 and 1985, respectively, and the Ph.D. degree in electrical and computer engineering from the University of California at Santa Barbara in 1990.
Abstract: From the Publisher: Yao Wang received the B.S. and M.S. degrees in electrical engineering from Tsinghua University, Beijing, China, in 1983 and 1985, respectively, and the Ph.D. degree in electrical and computer engineering from the University of California at Santa Barbara in 1990. Since 1990, she has been with the Faculty of Electrical Engineering, Polytechnic University, Brooklyn, NY. Her research areas include video communications, multimedia signal processing, and medical imaging. She has authored and co-authored over 100 papers in journals and conference proceedings. She is a senior member of IEEE and has served as an Associate Editor for the IEEE Transactions on Circuits and Systems for Video Technology and the IEEE Transactions on Multimedia. She won the Mayor's Award of the City of New York for Excellence in Science and Technology in the Young Investigator category in 2000. Jvrn Ostermann studied electrical engineering and communications engineering at the University of Hannover and Imperial College London, respectively. He received Dipl.-Ing. and Dr.-Ing. from the University of Hannover in 1988 and 1994, respectively. He has been a staff member with Image Processing and Technology Research, AT&T Labs>Research since 1996, where he is engaged in research on video coding, shape coding, multi-modal human-computer interfaces with talking avatars, standardization, and image analysis. He is a German National Foundation scholar. In 1998, he received the AT&T Standards Recognition Award and the ISO award. He is a member of the IEEE, the IEEE Technical Committee on Multimedia Signal Processing, and chair of the IEEE CAS Visual Signal Processing and Communications (VSPC) TechnicalCommittee. Ya-Qin Zhang received the B.S. and M.S. degrees in electrical engineering from the University of Science and Technology of China (USTC) in 1983 and 1985, respectively, and the Ph.D. degree from George Washington University in 1989. He is currently the Managing Director of Microsoft Research in Beijing, after leaving his post as the Director of Multimedia Technology Laboratory at the Sarnoff Corporation in Princeton, NJ (formerly the David Sarnoff Research Center, and RCA Laboratories). He has been engaged in research and commercialization of MPEG2/DTV, MPEG4/VLBR, and multimedia information technologies. He has authored and co-authored over 200-refereed papers in leading international conference proceedings and journals. He has been granted over 40 U.S. patents in digital video, Internet, multimedia, wireless and satellite communications. He was the Editor-in-Chief of the IEEE Transactions on Circuits and Systems for Video Technology from 1997 to 1999. He is a Fellow of the IEEE.

809 citations


"Hyper-Trellis Decoding of Pixel-Dom..." refers background or methods in this paper

  • ...The performance of the hyper-trellis based decoders converges to zero-rate performance (MCI performance) as the transmission rate decreases....

    [...]

  • ...We refer to the SI generated using SMV-MCI as “good” SI since the mean-squared error (MSE) between the SI and the original frame is typically low....

    [...]

  • ...This technique of generating SI will be referred to as SMV-MCI....

    [...]

  • ...First, our approach to generating the SMV-MCI SI is not as good as that used in [3], as evidenced by the performance difference of approximately 3.5 dB at zero rate....

    [...]

  • ...In the first approach, SI for the even frames is generated by using motion compensated interpolation (MCI) [1] between two consecutive odd frames ( and )....

    [...]

Journal ArticleDOI
TL;DR: The first part of this paper consists of short summaries of recent work in five rather traditional areas of the Shannon theory, namely: source and channel coding theorems for new situations, and calculation of source rate and channel capacity.
Abstract: The first part of this paper consists of short summaries of recent work in five rather traditional areas of the Shannon theory, namely: 1) source and channel coding theorems for new situations; 2) calculation of source rate and channel capacity; 3) channel coding with feedback; 4) source coding; 5) universal coding. The second part of thc paper consists of a relatively detailed discussion of some aspects of the area that the author considers to be the most dynamic and exciting in the Shannon theory: multiple-user communication. The discussion here includes "multiple-access channels," "broadcast channels," and various source coding problems with multiple-user constraints.

662 citations


"Hyper-Trellis Decoding of Pixel-Dom..." refers background in this paper

  • ...The first approach [6]–[9] is based on explicit random binning techniques [10], [11] that are typically used in achievability proofs in information theory....

    [...]

Proceedings ArticleDOI
07 Jan 2004
TL;DR: This work proposes a transformdomain Wyner-Ziv coding scheme for motion video that uses intraframe encoding, but interframe decoding, and shows significant gains above DCT-based intraframe coding and improvements over the pixel-domain Wynev video coder.
Abstract: In current interframe video compression systems, the encoder performs predictive coding to exploit the similarities of successive frames. The Wyner-Ziv Theorem on source coding with side information available only at the decoder suggests that an asymmetric video codec, where individual frames are encoded separately, but decoded conditionally (given temporally adjacent frames) could achieve similar efficiency. We propose a transformdomain Wyner-Ziv coding scheme for motion video that uses intraframe encoding, but interframe decoding. In this system, the transform coefficients of a Wyner-Ziv frame are encoded independently using a scalar quantizer and turbo coder. The decoder uses previously reconstructed frames to generate side information to conditionally decode the Wyner-Ziv frames. Simulation results show significant gains above DCT-based intraframe coding and improvements over the pixel-domain Wyner-Ziv video coder.

469 citations