scispace - formally typeset
Open AccessJournal ArticleDOI

Bi-Level Semantic Representation Analysis for Multimedia Event Detection

Reads0
Chats0
TLDR
This work proposes a bi-level semantic representation analyzing method that learns weights of semantic representation attained from different multimedia archives, and restrains the negative influence of noisy or irrelevant concepts in the overall concept-level.
Abstract
Multimedia event detection has been one of the major endeavors in video event analysis. A variety of approaches have been proposed recently to tackle this problem. Among others, using semantic representation has been accredited for its promising performance and desirable ability for human-understandable reasoning. To generate semantic representation, we usually utilize several external image/video archives and apply the concept detectors trained on them to the event videos. Due to the intrinsic difference of these archives, the resulted representation is presumable to have different predicting capabilities for a certain event. Notwithstanding, not much work is available for assessing the efficacy of semantic representation from the source-level. On the other hand, it is plausible to perceive that some concepts are noisy for detecting a specific event. Motivated by these two shortcomings, we propose a bi-level semantic representation analyzing method. Regarding source-level, our method learns weights of semantic representation attained from different multimedia archives. Meanwhile, it restrains the negative influence of noisy or irrelevant concepts in the overall concept-level. In addition, we particularly focus on efficient multimedia event detection with few positive examples, which is highly appreciated in the real-world scenario. We perform extensive experiments on the challenging TRECVID MED 2013 and 2014 datasets with encouraging results that validate the efficacy of our proposed approach.

read more

Citations
More filters
Journal ArticleDOI

Effective android malware detection with a hybrid model based on deep autoencoder and convolutional neural network

TL;DR: This work reconstructs the high-dimensional features of Android applications (apps) and employ multiple CNN to detect Android malware and proposes a hybrid model based on deep autoencoder (DAE) and convolutional neural network (CNN), which shows powerful ability in feature extraction and malware detection.
Journal ArticleDOI

Feature Interaction Augmented Sparse Learning for Fast Kinect Motion Detection

TL;DR: The results show that the proposed linear algorithm, which is augmented by feature interaction, has its advantage for motion detection in a real-time Kinect entertaining environment.
Journal ArticleDOI

A Real-Time Chinese Traffic Sign Detection Algorithm Based on Modified YOLOv2

TL;DR: This paper proposes an end-to-end convolutional network inspired by YOLOv2 to achieve real-time Chinese traffic sign detection and demonstrates that the proposed method is the faster and more robust.
Journal ArticleDOI

A Survey of Collaborative Filtering-Based Recommender Systems: From Traditional Methods to Hybrid Methods Based on Social Networks

TL;DR: The recent hybrid CF-based recommendation techniques fusing social networks to solve data sparsity and high dimensionality are introduced and provide a novel point of view to improve the performance of RS, thereby presenting a useful resource in the state-of-the-art research result for future researchers.
Journal ArticleDOI

An overview of Internet of Things (IoT): Architectural aspects, challenges, and protocols

TL;DR: This comprehensive survey focuses on the security architecture of IoT and provides a detailed taxonomy of major challenges associated with the field and the key technologies, including Radio Frequency Identification and Wireless Sensor Networks, that are enabling factors in the development of IoT.
References
More filters
Book

Introduction to Linear Regression Analysis

TL;DR: In this paper, the authors propose a simple linear regression model with variable selection and multicollinearity for robust regression, and validate the model using regression analysis and validation of regression models.
Proceedings ArticleDOI

Large-Scale Video Classification with Convolutional Neural Networks

TL;DR: This work studies multiple approaches for extending the connectivity of a CNN in time domain to take advantage of local spatio-temporal information and suggests a multiresolution, foveated architecture as a promising way of speeding up the training.
Posted Content

UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild

TL;DR: This work introduces UCF101 which is currently the largest dataset of human actions and provides baseline action recognition results on this new dataset using standard bag of words approach with overall performance of 44.5%.
Related Papers (5)