Journal ArticleDOI

The MPEG-4 video standard verification model

TL;DR: The scope of the MPEG-4 video standard is described and the structure of the video verification model under development is outlined, to provide a fully defined core video coding algorithm platform for the development of the standard.
Abstract: The MPEG-4 standardization phase has the mandate to develop algorithms for audio-visual coding allowing for interactivity, high compression, and/or universal accessibility and portability of audio and video content. In addition to the conventional "frame"-based functionalities of the MPEG-1 and MPEG-2 standards, the MPEG-4 video coding algorithm will also support access and manipulation of "objects" within video scenes. The January 1996 MPEG Video Group meeting witnessed the definition of the first version of the MPEG-4 video verification model, a milestone in the development of the MPEG-4 standard. The primary intent of the video verification model is to provide a fully defined core video coding algorithm platform for the development of the standard. As such, the structure of the MPEG-4 video verification model already gives some indication about the tools and algorithms that will be provided by the final MPEG-4 standard. The paper describes the scope of the MPEG-4 video standard and outlines the structure of the MPEG-4 video verification model under development.
Citations
Journal ArticleDOI
TL;DR: A detailed review of the existing 2D appearance models for visual object tracking can be found in this article, where the authors decompose the problem of appearance modeling into two different processing stages: visual representation and statistical modeling.
Abstract: Visual object tracking is a significant computer vision task which can be applied to many domains, such as visual surveillance, human computer interaction, and video compression. Despite extensive research on this topic, it still suffers from difficulties in handling complex object appearance changes caused by factors such as illumination variation, partial occlusion, shape deformation, and camera motion. Therefore, effective modeling of the 2D appearance of tracked objects is a key issue for the success of a visual tracker. In the literature, researchers have proposed a variety of 2D appearance models. To help readers swiftly learn the recent advances in 2D appearance models for visual object tracking, we contribute this survey, which provides a detailed review of the existing 2D appearance models. In particular, this survey takes a module-based architecture that enables readers to easily grasp the key points of visual object tracking. In this survey, we first decompose the problem of appearance modeling into two different processing stages: visual representation and statistical modeling. Then, different 2D appearance models are categorized and discussed with respect to their composition modules. Finally, we address several issues of interest as well as the remaining challenges for future research on this topic. The contributions of this survey are fourfold. First, we review the literature of visual representations according to their feature-construction mechanisms (i.e., local and global). Second, the existing statistical modeling schemes for tracking-by-detection are reviewed according to their model-construction mechanisms: generative, discriminative, and hybrid generative-discriminative. Third, each type of visual representations or statistical modeling techniques is analyzed and discussed from a theoretical or practical viewpoint. Fourth, the existing benchmark resources (e.g., source codes and video datasets) are examined in this survey.

653 citations

Posted Content
TL;DR: This survey provides a detailed review of the existing 2D appearance models for visual object tracking and takes a module-based architecture that enables readers to easily grasp the key points of visual object tracking.
Abstract: Visual object tracking is a significant computer vision task which can be applied to many domains such as visual surveillance, human computer interaction, and video compression. In the literature, researchers have proposed a variety of 2D appearance models. To help readers swiftly learn the recent advances in 2D appearance models for visual object tracking, we contribute this survey, which provides a detailed review of the existing 2D appearance models. In particular, this survey takes a module-based architecture that enables readers to easily grasp the key points of visual object tracking. In this survey, we first decompose the problem of appearance modeling into two different processing stages: visual representation and statistical modeling. Then, different 2D appearance models are categorized and discussed with respect to their composition modules. Finally, we address several issues of interest as well as the remaining challenges for future research on this topic. The contributions of this survey are four-fold. First, we review the literature of visual representations according to their feature-construction mechanisms (i.e., local and global). Second, the existing statistical modeling schemes for tracking-by-detection are reviewed according to their model-construction mechanisms: generative, discriminative, and hybrid generative-discriminative. Third, each type of visual representations or statistical modeling techniques is analyzed and discussed from a theoretical or practical viewpoint. Fourth, the existing benchmark resources (e.g., source code and video datasets) are examined in this survey.

605 citations

Journal ArticleDOI
TL;DR: The aim, methodologies, and broad details of the MPEG-7 standard development for visual content description are outlined.
Abstract: The MPEG-7 visual standard under development specifies content-based descriptors that allow users or agents (or search engines) to measure similarity in images or video based on visual criteria, and can be used to efficiently identify, filter, or browse images or video based on visual content. More specifically, MPEG-7 specifies color, texture, object shape, global motion, or object motion features for this purpose. This paper outlines the aim, methodologies, and broad details of the MPEG-7 standard development for visual content description.

561 citations


Cites methods from "The MPEG-4 video standard verificat..."

  • ...This procedure already proved successful in the course of the development of the MPEG-1, MPEG-2, and MPEG-4 Standards [1], [4]....

    [...]

Journal ArticleDOI
TL;DR: An efficient moving object segmentation algorithm suitable for real-time content-based multimedia communication systems is proposed and a processing speed of 25 QCIF fps can be achieved on a personal computer with a 450-MHz Pentium III processor.
Abstract: An efficient moving object segmentation algorithm suitable for real-time content-based multimedia communication systems is proposed in this paper. First, a background registration technique is used to construct a reliable background image from the accumulated frame difference information. The moving object region is then separated from the background region by comparing the current frame with the constructed background image. Finally, a post-processing step is applied on the obtained object mask to remove noise regions and to smooth the object boundary. In situations where object shadows appear in the background region, a pre-processing gradient filter is applied on the input image to reduce the shadow effect. In order to meet the real-time requirement, no computationally intensive operation is included in this method. Moreover, the implementation is optimized using parallel processing and a processing speed of 25 QCIF fps can be achieved on a personal computer with a 450-MHz Pentium III processor. Good segmentation performance is demonstrated by the simulation results.
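The pipeline described in this abstract (background registration from accumulated frame differences, then comparison of the current frame against the registered background) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names, the thresholds `diff_thresh`, `stable_count`, and `bg_thresh`, and the rule "register a pixel once it has stayed unchanged for several consecutive frames" are all assumptions made for illustration. The post-processing and shadow-reducing gradient filter steps are omitted.

```python
def register_background(frames, diff_thresh=10, stable_count=3):
    """Build a background image from pixels that stay still for several frames.

    `frames` is a list of grayscale images, each a list of rows of ints.
    Thresholds are illustrative assumptions, not values from the paper.
    """
    height, width = len(frames[0]), len(frames[0][0])
    still = [[0] * width for _ in range(height)]          # consecutive unchanged count
    background = [[None] * width for _ in range(height)]  # None = not yet registered
    for prev, cur in zip(frames, frames[1:]):
        for y in range(height):
            for x in range(width):
                if abs(cur[y][x] - prev[y][x]) <= diff_thresh:
                    still[y][x] += 1
                else:
                    still[y][x] = 0
                if still[y][x] >= stable_count:
                    background[y][x] = cur[y][x]
    return background


def object_mask(frame, background, bg_thresh=20):
    """Mark pixels (1) that differ from the registered background; 0 elsewhere."""
    return [
        [
            1 if bg is not None and abs(pix - bg) > bg_thresh else 0
            for pix, bg in zip(frame_row, bg_row)
        ]
        for frame_row, bg_row in zip(frame, background)
    ]


# Four static frames register the background; a bright pixel then appears.
frames = [[[50] * 3 for _ in range(3)] for _ in range(4)]
background = register_background(frames)
moving = [[50, 50, 50], [50, 200, 50], [50, 50, 50]]
mask = object_mask(moving, background)
```

Pixels whose background has not been registered yet are left out of the mask, which is one simple way to avoid false detections before enough frames have accumulated.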

441 citations


Cites background from "The MPEG-4 video standard verificat..."

  • ...Video segmentation, which extracts the shape information of moving object from the video sequence, is a key operation for content-based video coding [1], multimedia content description [2], [3], and intelligent signal processing....

    [...]

Journal ArticleDOI
TL;DR: An analytic solution for adaptive intra mode selection and joint source-channel rate control under time-varying wireless channel conditions is derived and significantly improves the end-to-end video quality in wireless video coding and transmission.
Abstract: We first develop a rate-distortion (R-D) model for DCT-based video coding incorporating the macroblock (MB) intra refreshing rate. For any given bit rate and intra refreshing rate, this model is capable of estimating the corresponding coding distortion even before a video frame is coded. We then present a theoretical analysis of the picture distortion caused by channel errors and the subsequent inter-frame propagation. Based on this analysis, we develop a statistical model to estimate such channel errors induced distortion for different channel conditions and encoder settings. The proposed analytic model mathematically describes the complex behavior of channel errors in a video coding and transmission system. Unlike other experimental approaches for distortion estimation reported in the literature, this analytic model has very low computational complexity and implementation cost, which are highly desirable in wireless video applications. Simulation results show that this model is able to accurately estimate the channel errors induced distortion with a minimum delay in processing. Based on the proposed source coding R-D model and the analytic channel-distortion estimation, we derive an analytic solution for adaptive intra mode selection and joint source-channel rate control under time-varying wireless channel conditions. Extensive experimental results demonstrate that this scheme significantly improves the end-to-end video quality in wireless video coding and transmission.

390 citations


Cites background from "The MPEG-4 video standard verificat..."

  • ...Due to the limited bandwidth of the wireless channels, video signals have to be highly compressed by efficient coding algorithms, such as H.263 [1] and MPEG-4 [2]....

    [...]

  • ...Let be the original value of pixel in the th video frame, and be the corresponding reconstruction value in the feedback loop at the encoder....

    [...]

References
01 Jan 1996

1,354 citations


Additional excerpts

  • ...The MPEG-4 content-based approach can thus be seen as a logical extension of the conventional MPEG-1 and MPEG-2 coding approach toward image input sequences of arbitrary shape....

    [...]

Journal ArticleDOI
TL;DR: The quality of the compressed video with the MPEG algorithm at about 1.5 Mbit/s has been compared to that of consumer grade VCR's and the prediction error signal is further compressed with spatial redundancy reduction (DCT).
Abstract: The video compression technique developed by MPEG covers many applications from interactive systems on CD-ROM to delivery of video information over telecommunications networks. The MPEG video compression algorithm relies on two basic techniques: block based motion compensation for the reduction of the temporal redundancy and transform domain based compression for the reduction of spatial redundancy. Motion compensation techniques are applied with both predictive and interpolative techniques. The prediction error signal is further compressed with spatial redundancy reduction (DCT). The quality of the compressed video with the MPEG algorithm at about 1.5 Mbit/s has been compared to that of consumer grade VCR's.
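The block-based motion compensation mentioned in this abstract can be sketched as a full-search block match followed by computing the prediction error. The sketch below is an illustration only, not the MPEG reference procedure: the function names, the sum-of-absolute-differences (SAD) cost, and the tiny block and search sizes are assumptions. The transform (DCT) stage that would code the residual is not shown.

```python
def sad(cur, ref, cy, cx, ry, rx, size):
    """Sum of absolute differences between a current block and a reference block."""
    return sum(
        abs(cur[cy + dy][cx + dx] - ref[ry + dy][rx + dx])
        for dy in range(size)
        for dx in range(size)
    )


def best_motion_vector(cur, ref, by, bx, size=2, search=1):
    """Full search over +/- `search` pixels; returns ((dy, dx), cost) of the best match."""
    height, width = len(ref), len(ref[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            ry, rx = by + dy, bx + dx
            # Skip candidates that would fall outside the reference frame.
            if 0 <= ry <= height - size and 0 <= rx <= width - size:
                cost = sad(cur, ref, by, bx, ry, rx, size)
                if best is None or cost < best[1]:
                    best = ((dy, dx), cost)
    return best


def residual_block(cur, ref, by, bx, mv, size=2):
    """Prediction error left after motion compensation (input to a transform stage)."""
    dy, dx = mv
    return [
        [cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x] for x in range(size)]
        for y in range(size)
    ]


# A 2x2 patch moves one pixel to the right between the reference and current frame.
ref = [[0, 0, 0, 0], [0, 9, 8, 0], [0, 7, 6, 0], [0, 0, 0, 0]]
cur = [[0, 0, 0, 0], [0, 0, 9, 8], [0, 0, 7, 6], [0, 0, 0, 0]]
mv, cost = best_motion_vector(cur, ref, by=1, bx=2)
residual = residual_block(cur, ref, 1, 2, mv)
```

When the match is exact, the residual block is all zeros, which is why removing temporal redundancy first leaves little for the spatial stage to encode.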

155 citations


"The MPEG-4 video standard verificat..." refers background in this paper

  • ...The primary intent of the Video Verification Model is to provide a fully defined core video coding algorithm platform for the development of the standard....

    [...]

  • ...The MPEG-4 content-based approach can thus be seen as a logical extension of the conventional MPEG-1 and MPEG-2 coding approach toward image input sequences of arbitrary shape....

    [...]

Journal ArticleDOI
01 Jun 1995
TL;DR: An overview of today's image and video coding standards and their role in video communications is provided, the different coding algorithms developed for each standard are reviewed and the commonalities between the standards are discussed.
Abstract: The efficient digital representation of image and video signals has been the subject of considerable research over the past 20 years. With the growing availability of digital transmission links, progress in signal processing, VLSI technology and image compression research, visual communications has become more feasible than ever. Digital video coding technology has developed into a mature field and a diversity of products has been developed, targeted for a wide range of emerging applications, such as video on demand, digital TV/HDTV broadcasting, and multimedia image/video database services. With the increased commercial interest in video communications the need for international image and video coding standards arose. Standardization of video coding algorithms holds the promise of large markets for video communication equipment. Interoperability of implementations from different vendors enables the consumer to access video from a wider range of services and VLSI implementations of coding algorithms conforming to international standards can be manufactured at considerably reduced costs. The purpose of this paper is to provide an overview of today's image and video coding standards and their role in video communications. The different coding algorithms developed for each standard are reviewed and the commonalities between the standards are discussed.

134 citations


"The MPEG-4 video standard verificat..." refers background in this paper

  • ...In addition, the coding of bidirectionally predicted VOP’s (B-VOP’s) is also supported....

    [...]

  • ...The primary intent of the Video Verification Model is to provide a fully defined core video coding algorithm platform for the development of the standard....

    [...]

  • ...…the MPEG group officially initiated a new MPEG-4 standardization phase in 1994—with the mandate to standardize algorithms for audiovisual coding in multimedia applications, allowing for interactivity, high compression, and/or universal accessibility and portability of audio and video content....

    [...]

  • ...The MPEG-4 content-based approach can thus be seen as a logical extension of the conventional MPEG-1 and MPEG-2 coding approach toward image input sequences of arbitrary shape....

    [...]

Journal ArticleDOI
01 Feb 1995
TL;DR: The paper presents the approach followed in the development of the MPEG-1 and MPEG-2 standards and explains some of the basic technical elements and an overview of the current MPEG work is given.
Abstract: These days companies from many industry segments are moving towards the adoption of digital technology for upgrading existing or introducing new services. This unstoppable movement prompted by technological progress could have produced the digital replica of the existing incompatible systems put in place in the analog world. It is the merit of MPEG to have provided an integrated audiovisual coding standard to implement this epochal conversion from one technology to another while supporting a universal syntax representing audiovisual information that makes the standard usable for a variety of interoperable applications. The paper presents the approach followed in the development of the MPEG-1 and MPEG-2 standards and explains some of the basic technical elements. An overview of the current MPEG work is also given.

88 citations


"The MPEG-4 video standard verificat..." refers background in this paper

  • ...The January 1996 MPEG Video group meeting witnessed the definition of the first version of the MPEG-4 Video Verification Model—a milestone in the development of the MPEG-4 standard....

    [...]

Journal ArticleDOI
TL;DR: The conditions under which the proposals to the MPEG-4 first round of video subjective tests have been evaluated are addressed and the most significant results of these tests are presented.
Abstract: In recent years, the technical developments in the area of audio-visual communications, notably in video coding, encouraged the emergence of new services which are already changing our everyday life. The convergence of the telecommunications, computer, and TV/film technologies is leading to the intermixture of elements formerly characteristic of each one of these fields, creating new needs and new requirements. Among the most important trends is the need to increase the interaction capabilities between the user and the audio-visual information, notably by considering the scene as a composition of objects (the content) according to a script that describes their spatial and temporal behavior and not just a set of pixels. MPEG-4 is a new audio-visual standard aiming to establish a universal, efficient coding of different forms of audio-visual data, called audio-visual objects. To reach this target, MPEG-4 has called for proposals on techniques that may be instrumental to efficiently represent visual information, allowing simultaneously high degrees of content-based interactivity and error resilience. This paper addresses the conditions under which the proposals to the MPEG-4 first round of video subjective tests have been evaluated. Moreover, the most significant results of these tests are also presented.

51 citations


Additional excerpts

  • ...The MPEG-4 content-based approach can thus be seen as a logical extension of the conventional MPEG-1 and MPEG-2 coding approach toward image input sequences of arbitrary shape....

    [...]