Story segmentation in news videos using visual and text cues

doi:10.1007/11526346_13

Book ChapterDOI

Story segmentation in news videos using visual and text cues

Yun Zhai, +2 more

- pp 92-102

Chats0

TLDR

The proposed framework for segmenting the news programs into different story topics has been tested on a widely used data set provided by NIST, which contains the ground truth of the story boundaries, and competitive evaluation results have been obtained.

Abstract:

In this paper, we present a framework for segmenting the news programs into different story topics. The proposed method utilizes both visual and text information of the video. We represent the news video by a Shot Connectivity Graph (SCG), where the nodes in the graph represent the shots in the video, and the edges between nodes represent the transitions between shots. The cycles in the graph correspond to the story segments in the news program. We first detect the cycles in the graph by finding the anchor persons in the video. This provides us with the coarse segmentation of the news video. The initial segmentation is later refined by the detections of the weather and sporting news, and the merging of similar stories. For the weather detection, the global color information of the images and the motion of the shots are considered. We have used the text obtained from automatic speech recognition (ASR) for detecting the potential sporting shots to form the sport stories. Adjacent stories with similar semantic meanings are further merged based on the visual and text similarities. The proposed framework has been tested on a widely used data set provided by NIST, which contains the ground truth of the story boundaries, and competitive evaluation results have been obtained.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Image retrieval: Ideas, influences, and trends of the new age

Ritendra Datta, +3 more

- 08 May 2008 -

ACM Computing Surveys

TL;DR: Almost 300 key theoretical and empirical contributions in the current decade related to image retrieval and automatic image annotation are surveyed, and the spawning of related subfields are discussed, to discuss the adaptation of existing image retrieval techniques to build systems that can be useful in the real world.

...read moreread less

Journal ArticleDOI

A Survey on Visual Content-Based Video Indexing and Retrieval

Weiming Hu, +4 more

TL;DR: Methods for video structure analysis, including shot boundary detection, key frame extraction and scene segmentation, extraction of features including static key frame features, object features and motion features, video data mining, video annotation, and video retrieval including query interfaces are analyzed.

...read moreread less

Journal ArticleDOI

Vlogging: A survey of videoblogging technology on the web

Wen Gao, +3 more

- 23 Jun 2010 -

ACM Computing Surveys

TL;DR: A comprehensive survey of videoblogging (vlogging for short) as a new technological trend is presented and several multimedia technologies are introduced to empower vlogging technology with better scalability, interactivity, searchability, and accessability.

...read moreread less

Journal ArticleDOI

State-of-the-art and future challenges in video scene detection: a survey

Manfred Del Fabro, +1 more

- 01 Oct 2013 -

Multimedia Systems

TL;DR: This paper tries to make video scene detection approaches better assessable and comparable by making a categorization of the evaluation strategies used, including size and type of the dataset used as well as the evaluation metrics.

...read moreread less

Proceedings ArticleDOI

Tracking news stories across different sources

Yun Zhai, +1 more

TL;DR: The proposed semantic linking framework and the story ranking method have been tested on a set of 60 hours open-benchmark TRECVID video data, and very satisfactory results for both tasks have been obtained.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Robust Real-Time Face Detection

Paul A. Viola, +1 more

- 01 May 2004 -

International Journal of Computer Vision

TL;DR: In this paper, a face detection framework that is capable of processing images extremely rapidly while achieving high detection rates is described. But the detection performance is limited to 15 frames per second.

...read moreread less

Proceedings ArticleDOI

Robust real-time face detection

Paul A. Viola, +1 more

TL;DR: A new image representation called the “Integral Image” is introduced which allows the features used by the detector to be computed very quickly and a method for combining classifiers in a “cascade” which allows background regions of the image to be quickly discarded while spending more computation on promising face-like regions.

...read moreread less

Journal ArticleDOI

The LIMSI Broadcast News transcription system

Jean-Luc Gauvain, +2 more

- 01 May 2002 -

Speech Communication

TL;DR: Development work in moving from laboratory read speech data to real-world or `found' speech data in preparation for the DARPA evaluations on this task from 1996 to 1999 is described.

...read moreread less

Journal ArticleDOI

Automated high-level movie segmentation for advanced video-retrieval systems

Alan Hanjalic, +2 more

- 01 Jun 1999 -

IEEE Transactions on Circuits and System...

TL;DR: A newly developed strategy for automatically segmenting movies into logical story units, designed to work on MPEG-DC sequences, where it is taken into account that at least a partial decoding is required for performing content-based operations on MPEG compressed video streams.

...read moreread less

Journal ArticleDOI

Segmentation of Video by Clustering and Graph Analysis

Minerva M. Yeung, +2 more

- 01 Jul 1998 -

Computer Vision and Image Understanding

TL;DR: This paper proposes techniques and formulations to match and cluster video shots of similar visual contents, taking into account the visual characteristics and temporal dynamics of video, and extends the Scene Transition Graphrepresentation for the analysis of temporal structures extracted from video.

...read moreread less