scispace - formally typeset
Search or ask a question
MonographDOI

H.264 and MPEG-4 Video Compression

TL;DR: This paper presents a meta-review of the MPEG-4 and H.264 standards for video quality and design, and some of the standards themselves have been revised and improved since their publication in 2009.
Abstract: About the Author.Foreword.Preface.Glossary.1. Introduction.2. Video Formats and Quality.3. Video Coding Concepts.4. The MPEG-4 and H.264 Standards.5. MPEG-4 Visual.6. H.264/MPEG-4 Part 10.7. Design and Performance.8. Applications and Directions.Bibliography.Index.
Citations
More filters
Journal ArticleDOI
TL;DR: The results of subjective tests for WVGA and HD sequences indicate that HEVC encoders can achieve equivalent subjective reproduction quality as encoder that conform to H.264/MPEG-4 AVC when using approximately 50% less bit rate on average.
Abstract: The compression capability of several generations of video coding standards is compared by means of peak signal-to-noise ratio (PSNR) and subjective testing results. A unified approach is applied to the analysis of designs, including H.262/MPEG-2 Video, H.263, MPEG-4 Visual, H.264/MPEG-4 Advanced Video Coding (AVC), and High Efficiency Video Coding (HEVC). The results of subjective tests for WVGA and HD sequences indicate that HEVC encoders can achieve equivalent subjective reproduction quality as encoders that conform to H.264/MPEG-4 AVC when using approximately 50% less bit rate on average. The HEVC design is shown to be especially effective for low bit rates, high-resolution video content, and low-delay communication applications. The measured subjective improvement somewhat exceeds the improvement measured by the PSNR metric.

1,279 citations

Patent
30 Mar 2007
TL;DR: In this article, a method of decoding a video signal is described, which includes obtaining identification information indicating whether a coded picture of a current NAL unit is an inter-view picture group, obtaining interview reference information of the interview picture groups, and constructing a reference picture list for interview prediction.
Abstract: A method of decoding a video signal is disclosed. The present invention includes obtaining identification information indicating whether a coded picture of a current NAL unit is an inter-view picture group, obtaining interview reference information of the inter-view picture group, and constructing a reference picture list for inter-view prediction using the inter-view reference information according to the identification information, wherein the inter-view reference information includes view identification information of an inter-view reference picture.

505 citations

Journal ArticleDOI
TL;DR: It is concluded that better dehazed performance with fewer artifacts and better coding efficiency is achieved when the dehazing is applied before compression.
Abstract: This paper makes an investigation of the dehazing effects on image and video coding for surveillance systems. The goal is to achieve good dehazed images and videos at the receiver while sustaining low bitrates (using compression) in the transmission pipeline. At first, this paper proposes a novel method for single-image dehazing, which is used for the investigation. It operates at a faster speed than current methods and can avoid halo effects by using the median operation. We then consider the dehazing effects in compression by investigating the coding artifacts and motion estimation in cases of applying any dehazing method before or after compression. We conclude that better dehazing performance with fewer artifacts and better coding efficiency is achieved when the dehazing is applied before compression. Simulations for Joint Photographers Expert Group images in addition to subjective and objective tests with H.264 compressed sequences validate our conclusion.

295 citations

Proceedings ArticleDOI
Yan Ye1, Marta Karczewicz1
12 Dec 2008
TL;DR: Together the improvements can bring on average 7% and 10% coding gain for CABAC and for CAVLC, respectively, with average coding gain of 12% for HD sequences.
Abstract: In this paper, a novel intra coding scheme is proposed. The proposed scheme improves H.264 intra coding from three aspects: 1) H.264 intra prediction is enhanced with additional bi-directional intra prediction modes; 2) H.264 integer transform is supplemented with directional transforms for some prediction modes; and 3) residual coefficient coding in CAVLC is improved. Compared to H.264, together the improvements can bring on average 7% and 10% coding gain for CABAC and for CAVLC, respectively, with average coding gain of 12% for HD sequences.

259 citations


Cites methods from "H.264 and MPEG-4 Video Compression"

  • ...Index Terms— H.264, intra coding, intra prediction, block transform, Variable Length Coding....

    [...]

Journal ArticleDOI
TL;DR: The methods described here complement existing compressive sensing approaches and may be applied to enhance computational imagers that rely on sequential correlation measurements, thereby helping to mitigate one of the main drawbacks of single-pixel imaging techniques.
Abstract: In contrast to conventional multipixel cameras, single-pixel cameras capture images using a single detector that measures the correlations between the scene and a set of patterns. However, these systems typically exhibit low frame rates, because to fully sample a scene in this way requires at least the same number of correlation measurements as the number of pixels in the reconstructed image. To mitigate this, a range of compressive sensing techniques have been developed which use a priori knowledge to reconstruct images from an undersampled measurement set. Here, we take a different approach and adopt a strategy inspired by the foveated vision found in the animal kingdom—a framework that exploits the spatiotemporal redundancy of many dynamic scenes. In our system, a high-resolution foveal region tracks motion within the scene, yet unlike a simple zoom, every frame delivers new spatial information from across the entire field of view. This strategy rapidly records the detail of quickly changing features in the scene while simultaneously accumulating detail of more slowly evolving regions over several consecutive frames. This architecture provides video streams in which both the resolution and exposure time spatially vary and adapt dynamically in response to the evolution of the scene. The degree of local frame rate enhancement is scene-dependent, but here, we demonstrate a factor of 4, thereby helping to mitigate one of the main drawbacks of single-pixel imaging techniques. The methods described here complement existing compressive sensing approaches and may be applied to enhance computational imagers that rely on sequential correlation measurements.

194 citations

References
More filters
Book
19 Dec 2003
TL;DR: In this article, the MPEG-4 and H.264 standards are discussed and an overview of the technologies involved in their development is presented. But the focus is on the performance and not the technical aspects.
Abstract: About the Author.Foreword.Preface.Glossary.1. Introduction.2. Video Formats and Quality.3. Video Coding Concepts.4. The MPEG-4 and H.264 Standards.5. MPEG-4 Visual.6. H.264/MPEG-4 Part 10.7. Design and Performance.8. Applications and Directions.Bibliography.Index.

2,491 citations

Book
20 Jul 2002
TL;DR: A comprehensive, targeted guide to the MPEG-4 standard—and its use in cutting-edge applications, Fernando Pereira and Touradj Ebrahimi demonstrate how MPEG- 4 addresses tomorrow's multimedia applications more successfully than any previous standard.
Abstract: From the Publisher: The most complete, focused guide to MPEG-4—the breakthrough standard for interactive multimedia. The comprehensive, focused, up-to-the-minute guide to MPEG-4 Practical solutions for next-generation multimedia applications In-depth coverage of natural and synthetic audiovisual object coding, description, composition and synchronization Binary and textual scene description Transport and storage of MPEG-4 content MPEG-4 profiles and levels; verification tests MPEG-4 represents a breakthrough in multimedia, delivering not just outstanding compression but also a fully interactive user experience. In The MPEG-4 Book, two leaders of the MPEG-4 standards community offer a comprehensive, targeted guide to the MPEG-4 standard—and its use in cutting-edge applications. Fernando Pereira and Touradj Ebrahimi, together with a unique collection of key MPEG experts, demonstrate how MPEG-4 addresses tomorrow's multimedia applications more successfully than any previous standard. They review every element of the standard to offer you a book that covers: Synthetic and natural audio and video object coding, description and synchronization BIFS—the MPEG-4 language for scene description and interaction The extensible MPEG-4 textual format XMT Transport and delivery of MPEG-4 content MPEG-J: using Java classes within MPEG-4 content A complete overview of MPEG-4 Profiles and Levels Verification tests The authors also walk through the MPEG-4 Systems Reference Software ?offering powerful real-world insights for every product developer, softwareprofessional, engineer, and researcher involved with MPEG-4 and state-of-the-art multimedia delivery. Part of the new IMSC Press Series from the Integrated Multimedia System Center at the University of Southern California, a federally funded center specializing in cutting-edge multimedia research.

363 citations