scispace - formally typeset
Book ChapterDOI

Story segmentation in news videos using visual and text cues

Reads0
Chats0
TLDR
The proposed framework for segmenting the news programs into different story topics has been tested on a widely used data set provided by NIST, which contains the ground truth of the story boundaries, and competitive evaluation results have been obtained.
Abstract
In this paper, we present a framework for segmenting the news programs into different story topics. The proposed method utilizes both visual and text information of the video. We represent the news video by a Shot Connectivity Graph (SCG), where the nodes in the graph represent the shots in the video, and the edges between nodes represent the transitions between shots. The cycles in the graph correspond to the story segments in the news program. We first detect the cycles in the graph by finding the anchor persons in the video. This provides us with the coarse segmentation of the news video. The initial segmentation is later refined by the detections of the weather and sporting news, and the merging of similar stories. For the weather detection, the global color information of the images and the motion of the shots are considered. We have used the text obtained from automatic speech recognition (ASR) for detecting the potential sporting shots to form the sport stories. Adjacent stories with similar semantic meanings are further merged based on the visual and text similarities. The proposed framework has been tested on a widely used data set provided by NIST, which contains the ground truth of the story boundaries, and competitive evaluation results have been obtained.

read more

Citations
More filters
Journal ArticleDOI

Image retrieval: Ideas, influences, and trends of the new age

TL;DR: Almost 300 key theoretical and empirical contributions in the current decade related to image retrieval and automatic image annotation are surveyed, and the spawning of related subfields are discussed, to discuss the adaptation of existing image retrieval techniques to build systems that can be useful in the real world.
Journal ArticleDOI

A Survey on Visual Content-Based Video Indexing and Retrieval

TL;DR: Methods for video structure analysis, including shot boundary detection, key frame extraction and scene segmentation, extraction of features including static key frame features, object features and motion features, video data mining, video annotation, and video retrieval including query interfaces are analyzed.
Journal ArticleDOI

Vlogging: A survey of videoblogging technology on the web

TL;DR: A comprehensive survey of videoblogging (vlogging for short) as a new technological trend is presented and several multimedia technologies are introduced to empower vlogging technology with better scalability, interactivity, searchability, and accessability.
Journal ArticleDOI

State-of-the-art and future challenges in video scene detection: a survey

TL;DR: This paper tries to make video scene detection approaches better assessable and comparable by making a categorization of the evaluation strategies used, including size and type of the dataset used as well as the evaluation metrics.
Proceedings ArticleDOI

Tracking news stories across different sources

TL;DR: The proposed semantic linking framework and the story ranking method have been tested on a set of 60 hours open-benchmark TRECVID video data, and very satisfactory results for both tasks have been obtained.
References
More filters
Journal ArticleDOI

Robust Real-Time Face Detection

TL;DR: In this paper, a face detection framework that is capable of processing images extremely rapidly while achieving high detection rates is described. But the detection performance is limited to 15 frames per second.
Proceedings ArticleDOI

Robust real-time face detection

TL;DR: A new image representation called the “Integral Image” is introduced which allows the features used by the detector to be computed very quickly and a method for combining classifiers in a “cascade” which allows background regions of the image to be quickly discarded while spending more computation on promising face-like regions.
Journal ArticleDOI

The LIMSI Broadcast News transcription system

TL;DR: Development work in moving from laboratory read speech data to real-world or `found' speech data in preparation for the DARPA evaluations on this task from 1996 to 1999 is described.
Journal ArticleDOI

Automated high-level movie segmentation for advanced video-retrieval systems

TL;DR: A newly developed strategy for automatically segmenting movies into logical story units, designed to work on MPEG-DC sequences, where it is taken into account that at least a partial decoding is required for performing content-based operations on MPEG compressed video streams.
Journal ArticleDOI

Segmentation of Video by Clustering and Graph Analysis

TL;DR: This paper proposes techniques and formulations to match and cluster video shots of similar visual contents, taking into account the visual characteristics and temporal dynamics of video, and extends the Scene Transition Graphrepresentation for the analysis of temporal structures extracted from video.
Related Papers (5)