scispace - formally typeset
Open AccessPosted Content

Multi-View Task-Driven Recognition in Visual Sensor Networks

Reads0
Chats0
TLDR
In this paper, a multi-view task-driven learning for visual sensor network (MT-VSN) is proposed to obtain a compact representation of high-dimensional visual data using sensor fusion techniques.
Abstract
Nowadays, distributed smart cameras are deployed for a wide set of tasks in several application scenarios, ranging from object recognition, image retrieval, and forensic applications. Due to limited bandwidth in distributed systems, efficient coding of local visual features has in fact been an active topic of research. In this paper, we propose a novel approach to obtain a compact representation of high-dimensional visual data using sensor fusion techniques. We convert the problem of visual analysis in resource-limited scenarios to a multi-view representation learning, and we show that the key to finding properly compressed representation is to exploit the position of cameras with respect to each other as a norm-based regularization in the particular signal representation of sparse coding. Learning the representation of each camera is viewed as an individual task and a multi-task learning with joint sparsity for all nodes is employed. The proposed representation learning scheme is referred to as the multi-view task-driven learning for visual sensor network (MT-VSN). We demonstrate that MT-VSN outperforms state-of-the-art in various surveillance recognition tasks.

read more

Citations
More filters
Journal ArticleDOI

Model-Agnostic Metalearning-Based Text-Driven Visual Navigation Model for Unfamiliar Tasks

TL;DR: This article proposes a model-agnostic metalearning based text-driven visual navigation model to achieve generalization to untrained tasks based on meta-reinforcement learning approach, and introduces fully convolutional instance-aware semantic segmentation and Word2vec into the DRL network to improve learning efficiency and accuracy.

Learning Multimodal Structures in Computer Vision

Ali Taalimi
TL;DR: The goal is to extract a discriminative representation of the multimodal data that leads to easily finding its essential characteristics in the subsequent analysis step, e.g., regression and classification, in a decomposition coefficient vector that is favorable towards the maximal discriminatory power.
Proceedings ArticleDOI

Memory-Based Parameterized Skills Learning for Mapless Visual Navigation

TL;DR: A Memory-based Parameterized Skills Learning (MPSL) model for mapless visual navigation that aims to capture more discriminative features by using a scene-specific layer and generalization ability to un-trained tasks.

Wide-Area Control Schemes to Improve Small Signal Stability in Power Systems

TL;DR: A state feedback formulation is proposed that aims to simultaneously optimize a standard Linear Quadratic Regulator cost criterion and induce a pre-defined communication structure and a group-sparse regularization to be added to the optimization cost function.
References
More filters
Journal ArticleDOI

Regularization and variable selection via the elastic net

TL;DR: It is shown that the elastic net often outperforms the lasso, while enjoying a similar sparsity of representation, and an algorithm called LARS‐EN is proposed for computing elastic net regularization paths efficiently, much like algorithm LARS does for the lamba.
Journal ArticleDOI

Robust Face Recognition via Sparse Representation

TL;DR: This work considers the problem of automatically recognizing human faces from frontal views with varying expression and illumination, as well as occlusion and disguise, and proposes a general classification algorithm for (image-based) object recognition based on a sparse representation computed by C1-minimization.
Proceedings ArticleDOI

Non-local sparse models for image restoration

TL;DR: Experimental results in image denoising and demosaicking tasks with synthetic and real noise show that the proposed method outperforms the state of the art, making it possible to effectively restore raw images from digital cameras at a reasonable speed and memory cost.
Journal ArticleDOI

Label Consistent K-SVD: Learning a Discriminative Dictionary for Recognition

TL;DR: A label consistent K-SVD (LC-KSVD) algorithm to learn a discriminative dictionary for sparse coding and introduces a new label consistency constraint called "discriminative sparse-code error" to enforce discriminability in sparse codes during the dictionary learning process.
Journal ArticleDOI

Task-Driven Dictionary Learning

TL;DR: This paper presents a general formulation for supervised dictionary learning adapted to a wide variety of tasks, and presents an efficient algorithm for solving the corresponding optimization problem.
Related Papers (5)