RBF based spatio-temporal representation technique for video compression

doi:10.1145/1924559.1924624

Home
/
Papers
/
RBF based spatio-temporal representation technique for video compression

Proceedings Article•DOI•

RBF based spatio-temporal representation technique for video compression

Santanu Chaudhury¹, Brejesh Lall¹, Mona Mathur², Kartik Mehta¹•Institutions (2)

Indian Institute of Technology Delhi¹, STMicroelectronics²

12 Dec 2010-pp 485-489

TL;DR: This paper performs Oct-Tree Decomposition on a video stack, followed by parameter extraction using Radial Basis Function Networks (RBFN) to achieve exceptionally high compression ratios, even higher than the state of art H.264 codec.

read less

Abstract: Parametric coding is a technique in which data is processed to extract meaningful information and then representing it compactly using appropriate parameters. Parametric Coding exploits redundancy in information to provide a very compact representation and thus achieves very high compression ratios. However, this is achieved at the cost of higher computation complexity. This disadvantage is now being offset by the availability of high speed processors, thus making it possible to exploit the high compression ratios of the parametric video coding techniques. In this paper a novel idea for efficient parametric representation of video is proposed. We perform Oct-Tree Decomposition on a video stack, followed by parameter extraction using Radial Basis Function Networks (RBFN) to achieve exceptionally high compression ratios, even higher than the state of art H.264 codec. The proposed technique exploits spatial-temporal redundancy and therefore inherently achieves multiframe prediction.

...read moreread less

Content maybe subject to copyright Report

References

PDF

Open Access

More filters

Journal Article•DOI•

Networks for approximation and learning

[...]

Tomaso Poggio¹, Federico Girosi¹•Institutions (1)

Massachusetts Institute of Technology¹

01 Sep 1990

TL;DR: Regularization networks are mathematically related to the radial basis functions, mainly used for strict interpolation tasks as mentioned in this paper, and two extensions of the regularization approach are presented, along with the approach's corrections to splines, regularization, Bayes formulation, and clustering.

...read moreread less

Abstract: The problem of the approximation of nonlinear mapping, (especially continuous mappings) is considered. Regularization theory and a theoretical framework for approximation (based on regularization techniques) that leads to a class of three-layer networks called regularization networks are discussed. Regularization networks are mathematically related to the radial basis functions, mainly used for strict interpolation tasks. Learning as approximation and learning as hypersurface reconstruction are discussed. Two extensions of the regularization approach are presented, along with the approach's corrections to splines, regularization, Bayes formulation, and clustering. The theory of regularization networks is generalized to a formulation that includes task-dependent clustering and dimensionality reduction. Applications of regularization networks are discussed. >

...read moreread less

3,595 citations

"RBF based spatio-temporal represent..." refers background in this paper

...RBFN are known to provide universal approximations on a compact subset of [[8]]....
[...]

Advanced video coding for generic audiovisual services

[...]

Itu-T and Iso Iec Jtc

01 Jan 2010

2,972 citations

Proceedings Article•DOI•

A feature-based algorithm for detecting and classifying scene breaks

[...]

Ramin Zabih¹, Justin Miller¹, Kevin Mai¹•Institutions (1)

Cornell University¹

01 Jan 1995

TL;DR: A new approach to the detection and classification of scene breaks in video sequences that can withstand compression artifacts such as those introduced by JPEG and MPEG, even at very high compression rates.

...read moreread less

Abstract: We describe a new approach to the detection and classification of scene breaks in video sequences. Our method can detect and classify a variety of scene breaks, including cuts, fades, dissolves and wipes, even in sequences involving significant motion. We detect the appearance of intensity edges that are distant from edges in the previous frame. A global motion computation is used to handle camera or object motion. The algorithms we propose withstand compression artifacts such as those introduced by JPEG and MPEG, even at very high compression rates. Experimental evidence demonstrates that our method can detect and classify scene breaks that are difficult to detect with previous approaches. An initial implementation runs at approximately 2 frames per second on a Sun workstation.

...read moreread less

582 citations

"RBF based spatio-temporal represent..." refers background in this paper

...Motion complexity is determined by the edge change ratio (ECR) proposed as a characteristic feature by Zabih et al [[6]]....
[...]

Journal Article•DOI•

Video Epitomes

[...]

Vincent Cheung¹, Brendan J. Frey¹, Nebojsa Jojic²•Institutions (2)

University of Toronto¹, Microsoft²

01 Feb 2008-International Journal of Computer Vision

TL;DR: It is described how epitomes can be used to model video data and significant computational speedups that can be incorporated into the epitome inference and learning algorithm are described.

...read moreread less

Abstract: Recently, "epitomes" were introduced as patch-based probability models that are learned by compiling together a large number of examples of patches from input images. In this paper, we describe how epitomes can be used to model video data and we describe significant computational speedups that can be incorporated into the epitome inference and learning algorithm. In the case of videos, epitomes are estimated so as to model most of the small space-time cubes from the input data. Then, the epitome can be used for various modeling and reconstruction tasks, of which we show results for video super-resolution, video interpolation, and object removal. Besides computational efficiency, an interesting advantage of the epitome as a representation is that it can be reliably estimated even from videos with large amounts of missing data. We illustrate this ability on the task of reconstructing the dropped frames in video broadcast using only the degraded video and also in denoising a severely corrupted video.

...read moreread less

128 citations

"RBF based spatio-temporal represent..." refers methods in this paper

...Irrespective of the method used, a model can either be a Geometric/Structural model like Mesh based model, or an Appearance model like Eigen-space or Probabilistic models [[3]],[[4]]....
[...]

Proceedings Article•DOI•

Video epitomes

[...]

Vincent Cheung¹, Brendan J. Frey¹, Nebojsa Jojic²•Institutions (2)

University of Toronto¹, Microsoft²

20 Jun 2005

TL;DR: This paper describes how epitomes can be used to model video data and it describes significant computational speedups that can be incorporated into the epitome inference and learning algorithm.

...read moreread less

116 citations