scispace - formally typeset
Journal ArticleDOI

Video scene segmentation using Markov chain Monte Carlo

Yun Zhai, +1 more
- 01 Aug 2006 - 
- Vol. 8, Iss: 4, pp 686-697
Reads0
Chats0
TLDR
A general framework for temporal scene segmentation in various video domains that is able to find the weak boundaries as well as the strong boundaries, i.e., it does not rely on the fixed threshold and can be applied to different video domains.
Abstract
Videos are composed of many shots that are caused by different camera operations, e.g., on/off operations and switching between cameras. One important goal in video analysis is to group the shots into temporal scenes, such that all the shots in a single scene are related to the same subject, which could be a particular physical setting, an ongoing action or a theme. In this paper, we present a general framework for temporal scene segmentation in various video domains. The proposed method is formulated in a statistical fashion and uses the Markov chain Monte Carlo (MCMC) technique to determine the boundaries between video scenes. In this approach, a set of arbitrary scene boundaries are initialized at random locations and are automatically updated using two types of updates: diffusion and jumps. Diffusion is the process of updating the boundaries between adjacent scenes. Jumps consist of two reversible operations: the merging of two scenes and the splitting of an existing scene. The posterior probability of the target distribution of the number of scenes and their corresponding boundary locations is computed based on the model priors and the data likelihood. The updates of the model parameters are controlled by the hypothesis ratio test in the MCMC process, and the samples are collected to generate the final scene boundaries. The major advantage of the proposed framework is two-fold: 1) it is able to find the weak boundaries as well as the strong boundaries, i.e., it does not rely on the fixed threshold; 2) it can be applied to different video domains. We have tested the proposed method on two video domains: home videos and feature films, and accurate results have been obtained

read more

Citations
More filters
Journal ArticleDOI

A Survey on Visual Content-Based Video Indexing and Retrieval

TL;DR: Methods for video structure analysis, including shot boundary detection, key frame extraction and scene segmentation, extraction of features including static key frame features, object features and motion features, video data mining, video annotation, and video retrieval including query interfaces are analyzed.
Book ChapterDOI

Movie/Script: Alignment and Parsing of Video and Text Transcription

TL;DR: A weakly supervised algorithm is presented that uses the screenplay and closed captions to parse a movie into a hierarchy of shots and scenes, and the recovered structure is used to improve character naming and retrieval of common actions in several episodes of popular TV series.
Journal ArticleDOI

Scene Detection in Videos Using Shot Clustering and Sequence Alignment

TL;DR: To clusters the shots into groups based only on their visual similarity and a label is assigned to each shot according to the group that it belongs to, a sequence alignment algorithm is applied to detect when the pattern of shot labels changes, providing the final scene segmentation result.
Journal ArticleDOI

Temporal Video Segmentation to Scenes Using High-Level Audiovisual Features

TL;DR: Improved performance of the proposed approach in comparison to other unimodal and multimodal techniques of the relevant literature is demonstrated and the contribution of high-level audiovisual features toward improved video segmentation to scenes is highlighted.
Posted Content

Learning visual groups from co-occurrences in space and time

TL;DR: A self-supervised framework that learns to group visual entities based on their rate of co-occurrence in space and time is proposed, and it is demonstrated that in each case the learned affinities uncover meaningful semantic groupings.
References
More filters
Journal ArticleDOI

Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images

TL;DR: The analogy between images and statistical mechanics systems is made and the analogous operation under the posterior distribution yields the maximum a posteriori (MAP) estimate of the image given the degraded observations, creating a highly parallel ``relaxation'' algorithm for MAP estimation.
Journal ArticleDOI

Monte Carlo Sampling Methods Using Markov Chains and Their Applications

TL;DR: A generalization of the sampling method introduced by Metropolis et al. as mentioned in this paper is presented along with an exposition of the relevant theory, techniques of application and methods and difficulties of assessing the error in Monte Carlo estimates.
BookDOI

Markov Chain Monte Carlo in Practice

TL;DR: The Markov Chain Monte Carlo Implementation Results Summary and Discussion MEDICAL MONITORING Introduction Modelling Medical Monitoring Computing Posterior Distributions Forecasting Model Criticism Illustrative Application Discussion MCMC for NONLINEAR HIERARCHICAL MODELS.
Journal ArticleDOI

Reversible jump Markov chain Monte Carlo computation and Bayesian model determination

Peter H.R. Green
- 01 Dec 1995 - 
TL;DR: In this article, the authors propose a new framework for the construction of reversible Markov chain samplers that jump between parameter subspaces of differing dimensionality, which is flexible and entirely constructive.
Journal ArticleDOI

Region competition: unifying snakes, region growing, and Bayes/MDL for multiband image segmentation

TL;DR: A novel statistical and variational approach to image segmentation based on a new algorithm, named region competition, derived by minimizing a generalized Bayes/minimum description length (MDL) criterion using the variational principle is presented.
Related Papers (5)