Bi-Level Semantic Representation Analysis for Multimedia Event Detection

doi:10.1109/TCYB.2016.2539546

Open AccessJournal ArticleDOI

Bi-Level Semantic Representation Analysis for Multimedia Event Detection

Xiaojun Chang, +4 more

- 01 May 2017 -

IEEE Transactions on Systems, Man, and C...

- Vol. 47, Iss: 5, pp 1180-1197

Chats0

TLDR

This work proposes a bi-level semantic representation analyzing method that learns weights of semantic representation attained from different multimedia archives, and restrains the negative influence of noisy or irrelevant concepts in the overall concept-level.

Abstract:

Multimedia event detection has been one of the major endeavors in video event analysis. A variety of approaches have been proposed recently to tackle this problem. Among others, using semantic representation has been accredited for its promising performance and desirable ability for human-understandable reasoning. To generate semantic representation, we usually utilize several external image/video archives and apply the concept detectors trained on them to the event videos. Due to the intrinsic difference of these archives, the resulted representation is presumable to have different predicting capabilities for a certain event. Notwithstanding, not much work is available for assessing the efficacy of semantic representation from the source-level. On the other hand, it is plausible to perceive that some concepts are noisy for detecting a specific event. Motivated by these two shortcomings, we propose a bi-level semantic representation analyzing method. Regarding source-level, our method learns weights of semantic representation attained from different multimedia archives. Meanwhile, it restrains the negative influence of noisy or irrelevant concepts in the overall concept-level. In addition, we particularly focus on efficient multimedia event detection with few positive examples, which is highly appreciated in the real-world scenario. We perform extensive experiments on the challenging TRECVID MED 2013 and 2014 datasets with encouraging results that validate the efficacy of our proposed approach.

Bi-Level Semantic Representation Analysis for Multimedia Event Detection

Citations

Effective android malware detection with a hybrid model based on deep autoencoder and convolutional neural network

Feature Interaction Augmented Sparse Learning for Fast Kinect Motion Detection

A Real-Time Chinese Traffic Sign Detection Algorithm Based on Modified YOLOv2

A Survey of Collaborative Filtering-Based Recommender Systems: From Traditional Methods to Hybrid Methods Based on Social Networks

An overview of Internet of Things (IoT): Architectural aspects, challenges, and protocols

References

The Elements of Statistical Learning

Introduction to Linear Regression Analysis

Large-scale Video Classiﬁcation with Convolutional Neural Networks

Large-Scale Video Classification with Convolutional Neural Networks

UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild

Related Papers (5)

Semantic Pooling for Complex Event Analysis in Untrimmed Videos

Semisupervised Feature Analysis by Mining Correlations Among Multiple Tasks

ImageNet Classification with Deep Convolutional Neural Networks

Deep Residual Learning for Image Recognition

An Adaptive Semisupervised Feature Analysis for Video Semantic Recognition