Total Cluster: A person agnostic clustering method for broadcast videos

doi:10.1145/2683483.2683490

Proceedings ArticleDOI

Total Cluster: A person agnostic clustering method for broadcast videos

- pp 7

TLDR

The extent to which faces can be clustered automatically without making an error is explored, and an extension of the clustering method to entire episodes using exemplar SVMs based on the negative training data automatically harvested from the editing structure is proposed.

Abstract:

The goal of this paper is unsupervised face clustering in edited video material – where face tracks arising from different people are assigned to separate clusters, with one cluster for each person. In particular we explore the extent to which faces can be clustered automatically without making an error. This is a very challenging problem given the variation in pose, lighting and expressions that can occur, and the similarities between different people.The novelty we bring is three fold: first, we show that a form of weak supervision is available from the editing structure of the material – the shots, threads and scenes that are standard in edited video; second, we show that by first clustering within scenes the number of face tracks can be significantly reduced with almost no errors; third, we propose an extension of the clustering method to entire episodes using exemplar SVMs based on the negative training data automatically harvested from the editing structure.The method is demonstrated on multiple episodes from two very different TV series, Scrubs and Buffy. For both series it is shown that we move towards our goal, and also outperform a number of baselines from previous works.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Clustering Millions of Faces by Identity

Charles Otto, +2 more

- 01 Feb 2018 -

IEEE Transactions on Pattern Analysis an...

TL;DR: In this paper, an approximate Rank-Order clustering algorithm was proposed to address the challenges of run-time complexity and cluster quality, which achieves an F-measure of 0.87 on the LFW benchmark (13 K faces of 5,749 individuals).

...read moreread less

Proceedings ArticleDOI

Efficient Parameter-Free Clustering Using First Neighbor Relations

M. Saquib Sarfraz, +2 more

TL;DR: A new clustering method in the form of a single clustering equation that is able to directly discover groupings in the data that has a very low computational overhead, is easily scalable and applicable to large practical problems.

...read moreread less

Posted Content

Clustering Millions of Faces by Identity

Charles Otto, +2 more

- 04 Apr 2016 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: An approximate Rank-Order clustering algorithm is presented that performs better than popular clustering algorithms (k-Means and Spectral) and an internal per-cluster quality measure is developed to rank individual clusters for manual exploration of high quality clusters that are compact and isolated.

...read moreread less

Book ChapterDOI

Joint Face Representation Adaptation and Clustering in Videos

Zhanpeng Zhang, +3 more

TL;DR: Experiments demonstrate that the proposed joint face representation adaptation and clustering approach generates character clusters with high purity compared to existing video face clustering methods, which are either based on deep face representation (without adaptation) or carefully engineered features.

...read moreread less

Proceedings ArticleDOI

End-to-End Face Detection and Cast Grouping in Movies Using Erdös-Rényi Clustering

SouYoung Jin, +3 more

TL;DR: A novel clustering method is introduced, motivated by the classic graph theory results of Erdös and Rényi, based on the observations that large clusters can be fully connected by joining just a small fraction of their point pairs, while just a single connection between two different people can lead to poor clustering results.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004 -

International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

...read moreread less

Journal ArticleDOI

Object Detection with Discriminatively Trained Part-Based Models

Pedro F. Felzenszwalb, +3 more

- 01 Sep 2010 -

IEEE Transactions on Pattern Analysis an...

TL;DR: An object detection system based on mixtures of multiscale deformable part models that is able to represent highly variable object classes and achieves state-of-the-art results in the PASCAL object detection challenges is described.

...read moreread less

Proceedings ArticleDOI

Good features to track

Jianbo Shi, +1 more

TL;DR: A feature selection criterion that is optimal by construction because it is based on how the tracker works, and a feature monitoring method that can detect occlusions, disocclusions, and features that do not correspond to points in the world are proposed.

...read moreread less

Proceedings ArticleDOI

A discriminatively trained, multiscale, deformable part model

Pedro F. Felzenszwalb, +2 more

TL;DR: A discriminatively trained, multiscale, deformable part model for object detection, which achieves a two-fold improvement in average precision over the best performance in the 2006 PASCAL person detection challenge and outperforms the best results in the 2007 challenge in ten out of twenty categories.

...read moreread less

Proceedings ArticleDOI

Face recognition in unconstrained videos with matched background similarity

Lior Wolf, +2 more

TL;DR: A comprehensive database of labeled videos of faces in challenging, uncontrolled conditions, the ‘YouTube Faces’ database, along with benchmark, pair-matching tests are presented and a novel set-to-set similarity measure, the Matched Background Similarity (MBGS), is described.

...read moreread less

Collapse

Total Cluster: A person agnostic clustering method for broadcast videos

Citations

Clustering Millions of Faces by Identity

Efficient Parameter-Free Clustering Using First Neighbor Relations

Clustering Millions of Faces by Identity

Joint Face Representation Adaptation and Clustering in Videos

End-to-End Face Detection and Cast Grouping in Movies Using Erdös-Rényi Clustering

References

Distinctive Image Features from Scale-Invariant Keypoints

Object Detection with Discriminatively Trained Part-Based Models

Good features to track

A discriminatively trained, multiscale, deformable part model

Face recognition in unconstrained videos with matched background similarity

Related Papers (5)

Deep face recognition

FaceNet: A unified embedding for face recognition and clustering

Unsupervised metric learning for face identification in TV video

A Compact and Discriminative Face Track Descriptor

"Hello! My name is... Buffy" - Automatic Naming of Characters in TV Video