Violent Scenes Detection Using Mid-Level Violence Clustering

doi:10.5121/CSIT.2014.4224

Open AccessProceedings ArticleDOI

Violent Scenes Detection Using Mid-Level Violence Clustering

- pp 283-296

TLDR

This work proposes a novel system for Violent Scenes Detection, which is based on the combination of visual and audio features with machine learning at segment-level, and in particular, Mid-level Violence Clustering is proposed in order for mid-level concepts to be implicitly learned, without using manually tagged annotations.

Abstract:

This work proposes a novel system for Violent Scenes Detection, which is based on the combination of visual and audio features with machine learning at segment-level. Multiple Kernel Learning is applied so that multimodality of videos can be maximized. In particular, Mid-level Violence Clustering is proposed in order for mid-level concepts to be implicitly learned, without using manually tagged annotations. Finally a violence-score for each shot is calculated. The whole system is trained ona dataset from MediaEval 2013 Affect Task and evaluated by its official metric. The obtained results outperformed its best score.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Violent Interaction Detection in Video Based on Deep Learning

Peipei Zhou, +3 more

TL;DR: A new input modality, image acceleration field is proposed to better extract the motion attributes and experimental results demonstrate that the proposed model for violent interaction detection shows higher accuracy and better robustness.

...read moreread less

Journal ArticleDOI

Fast fight detection.

Ismael Serrano Gracia, +3 more

- 10 Apr 2015 -

PLOS ONE

TL;DR: This work proposes a novel method to detect violence sequences that is outperformed in accuracy by state of the art, it has a significantly faster computation time thus making it amenable for real-time applications.

...read moreread less

Book ChapterDOI

Violence Detection in Video by Using 3D Convolutional Neural Networks

Chunhui Ding, +4 more

TL;DR: A novel 3D ConvNets model for violence detection in video without using any prior knowledge is developed and results show that the method achieves superior performance without relying on handcrafted features.

...read moreread less

Journal ArticleDOI

Affect in Multimedia: Benchmarking Violent Scenes Detection

- 01 Jan 2022 -

IEEE Transactions on Affective Computing

TL;DR: In this paper , the authors report on the creation of a publicly available, common evaluation framework for violent scenes detection in Hollywood and YouTube videos, and propose a robust data set, the VSD96 dataset, with more than 96 hours of video of various genres, annotations at different levels of detail (e.g., shot-level, segment-level), annotations of mid-level concepts (i.e., blood, fire), various pre-computed multi-modal descriptors, and over 230 system output results as baselines.

...read moreread less

Journal ArticleDOI

Breaking down violence detection

Esra Acar, +2 more

- 05 Oct 2016 -

Neurocomputing

TL;DR: A solution which uses audio-visual features (MFCC-based audio and advanced motion features) and proposes to model violence by means of multiple (sub)concepts is presented and the potential of the proposed approach is demonstrated on the standardized datasets of the latest editions of the MediaEval Affect in Multimedia: Violent Scenes Detection task.

...read moreread less

References

PDF

Open Access

More filters

Proceedings ArticleDOI

k-means++: the advantages of careful seeding

David Arthur, +1 more

TL;DR: By augmenting k-means with a very simple, randomized seeding technique, this work obtains an algorithm that is Θ(logk)-competitive with the optimal clustering.

...read moreread less

Proceedings Article

Visual categorization with bags of keypoints

Gabriela Csurka

TL;DR: This bag of keypoints method is based on vector quantization of affine invariant descriptors of image patches and shows that it is simple, computationally efficient and intrinsically invariant.

...read moreread less

Proceedings ArticleDOI

Linear spatial pyramid matching using sparse coding for image classification

Jianchao Yang, +3 more

TL;DR: An extension of the SPM method is developed, by generalizing vector quantization to sparse coding followed by multi-scale spatial max pooling, and a linear SPM kernel based on SIFT sparse codes is proposed, leading to state-of-the-art performance on several benchmarks by using a single type of descriptors.

...read moreread less

Proceedings ArticleDOI

Action recognition by dense trajectories

Heng Wang, +3 more

TL;DR: This work introduces a novel descriptor based on motion boundary histograms, which is robust to camera motion and consistently outperforms other state-of-the-art descriptors, in particular in uncontrolled realistic videos.

...read moreread less

Proceedings ArticleDOI

Space-time interest points

Laptev, +1 more

TL;DR: This work builds on the idea of the Harris and Forstner interest point operators and detects local structures in space-time where the image values have significant local variations in both space and time to detect spatio-temporal events.

...read moreread less

Collapse

Multimedia Tools and Applications

Audio-Visual fusion for detecting violent scenes in videos

Theodoros Giannakopoulos, +4 more

Violent Scenes Detection Using Mid-Level Violence Clustering

Citations

Violent Interaction Detection in Video Based on Deep Learning

Fast fight detection.

Violence Detection in Video by Using 3D Convolutional Neural Networks

Affect in Multimedia: Benchmarking Violent Scenes Detection

Breaking down violence detection

References

k-means++: the advantages of careful seeding

Visual categorization with bags of keypoints

Linear spatial pyramid matching using sparse coding for image classification

Action recognition by dense trajectories

Space-time interest points

Related Papers (5)

A naive mid-level concept-based fusion approach to violence detection in Hollywood movies

The Vireo Team at MediaEval 2013: Violent Scenes Detection by Mid-level Concepts Learnt from Youtube

Violence detection in video using computer vision techniques

Evaluation of multiple features for violent scenes detection

Audio-Visual fusion for detecting violent scenes in videos