Multimodal Person Identification in Movies

doi:10.1007/3-540-45479-9_19

Book Chapter

- pp 175-185

TLDR

Quantitative results show that WhoIsWho is successful in helping annotators identify movie characters, and employment of a user model enables evaluation of interactivity in WhoIsWho.

Abstract:

An important task for annotation of movies is finding out which characters are playing in a shot. Character identification is based on available information sources from various modalities. Fully automatic character identification is not feasible as the modalities are not semantically synchronized. As manual annotation is too time-consuming, an interactive tool assisting the annotator is needed. We propose the WhoIsWho function for our interactive i-Notation system. WhoIsWho relates visual content to names extracted from movie scripts, working in both directions. We present extensive evaluation of character identification on six hours of movies. Employment of a user model enables evaluation of interactivity in WhoIsWho. Quantitative results show that WhoIsWho is successful in helping annotators identify movie characters.

Citations

Book Chapter

Challenges of Image and Video Retrieval

Michael S. Lew, +2 more

TL;DR: The most frequently used image and video retrieval systems are typically oriented around text searches where manual annotation was already performed, which indicates that images and videos in large digital collections are being searched for through text searches.

Proceedings Article

Multi-modal Person Identification in a Smart Environment

Hazim Kemal Ekenel, +3 more

TL;DR: Experimental results obtained on the CLEAR 2007 evaluation corpus show that CRCM-based modality weighting improves the correct identification rates significantly, and the cumulative ratio of correct matches (CRCM) and distance-to-second-closest (DT2ND) measures are introduced.

Book Chapter

ISL person identification systems in the CLEAR evaluations

Hazim Kemal Ekenel, +1 more

- 06 Apr 2006 -

CLEaR

TL;DR: Three person identification systems that have been developed for the CLEAR evaluations are presented: two are based on single modalities, audio and video, whereas the third system uses both of these modalities.

Journal Article

Interactive adaptive movie annotation

Jeroen Vendrig, +1 more

- 01 Jul 2003 -

IEEE MultiMedia

TL;DR: In this paper, the authors present an interactive and adaptive i-Notation system, which describes actors' names, automatically processes multimodal information sources, and deals with available sources' varying quality.

Book Chapter

ISL Person Identification Systems in the CLEAR 2007 Evaluations

Hazim Kemal Ekenel, +3 more

TL;DR: The experimental results show that the face recognition system significantly outperforms the speaker identification system on the short-duration test segments, and that combining the individual systems improves performance further.
