scispace - formally typeset
Open AccessProceedings ArticleDOI

Predictive perceptual compression for real time video communication

Reads0
Chats0
TLDR
This paper has developed an eye gaze-aware MPEG-2 transcoder that can perceptually re-encode a live video stream in real time and compensates the interim eye movements between the sampling and actual coding.
Abstract
Approximately 2 degrees in our 140 degree vision span has sharp vision. Many researchers have been fascinated by the idea of eye-tracking integrated perceptual compression of an image or video, yet any practical system has yet to emerge. The unique challenge presented by real time perceptual video streaming is how to handle the fast nature of the human eye and provide its integration with computationally intensive video transcoding scheme. The delay introduced by video transmission in the network presents a difficulty. This delay creates a problem when we try to use information about eye movements for perceptual encoding. In this paper we discuss a new approach to the eye-tracker based video compression. Rather than relying on the point of gaze, this novel scheme tracks a vicinity of interest and offers a prediction mechanism for eye movements. The described system compensates the interim eye movements between the sampling and actual coding. The proposed scheme can be applied to a large variety of today's video compression standards. We have developed an eye gaze-aware MPEG-2 transcoder that can perceptually re-encode a live video stream in real time. The experiments we have conducted illustrate the substantial impact this integrated prediction method has on perceptual video compression and bit-rate reduction.

read more

Citations
More filters
Journal ArticleDOI

Video SnapCut: robust video object cutout using localized classifiers

TL;DR: Video SnapCut is presented, a robust video object cutout system that significantly advances the state-of-the-art in segmentation and is completed with a novel coherent video matting technique.
Journal ArticleDOI

How late can you update gaze-contingent multiresolutional displays without detection?

TL;DR: It is found that image update delays as late as 60 ms after an eye movement did not significantly increase the detectability of image blur and/or motion transients due to the update, good news for designers ofGCMRDs, since 60 ms is ample time to update many GCMRDs after anEye movement without disrupting perception.
Patent

Person identification using ocular biometrics with liveness detection

TL;DR: In this article, a method of making a biometric assessment includes measuring eye movement of a subject, assessing characteristics from the measured eye movement, and assessing a state of the subject based on the assessed characteristics.
Proceedings Article

A Preliminary Investigation into Eye Gaze Data in a First Person Shooter Game

TL;DR: This work shows the design and implementation of a simple game and how the execution of the game can be synchronized with an eye tracking system and may allow for improvements in rendering and new compression algorithms to be created for an online FPS game.
Proceedings ArticleDOI

Design and evaluation of a foveated video streaming service for commodity client devices

TL;DR: A multi-resolution video coding approach that is scalable in that it is possible to pre-code the video in a small number of copies for a given set of resolutions and designed to match the error performance of an eye tracker built using commodity webcams.
References
More filters
Proceedings ArticleDOI

Real-time foveated multiresolution system for low-bandwidth video communication

TL;DR: This work has developed a foveated multiresolution pyramid video coder/decoder which runs in real-time on a general purpose computer and includes zero-tree coding.
Journal ArticleDOI

Foveated video compression with optimal rate control

TL;DR: A new optimal rate control algorithm for maximizing the FSNR is established using a Lagrange multiplier method defined on a curvilinear coordinate system and a piecewise R-D (rate-distortion)/R-Q ( rate-quantization) model is developed.
Proceedings ArticleDOI

Implementation of a foveated image coding system for image bandwidth reduction

TL;DR: A preliminary version of a foveated imaging system, implemented on a general purpose computer, which greatly reduces the transmission bandwidth of images, based on the fact that the spatial resolution of the human eye is space variant, decreasing with increasing eccentricity from the point of gaze.
Book ChapterDOI

Visual Memory Within and Across Fixations

TL;DR: The area that is seen most clearly, with the highest resolution, corresponds to that part of the world that falls on the fovea, but this includes only 3 or 4 square degrees out of the 25,000 available as discussed by the authors.
Proceedings ArticleDOI

User performance with gaze contingent multiresolutional displays

TL;DR: This paper summarizes results from a series of 6 studies investigating spatial, resolutional, and temporal parameters affecting perception and performance in such eye-contingent multi-resolutional displays.
Related Papers (5)