Multi-View Task-Driven Recognition in Visual Sensor Networks

Open AccessPosted Content

Multi-View Task-Driven Recognition in Visual Sensor Networks

Ali Taalimi, +3 more

- 30 May 2017 -

arXiv: Computer Vision and Pattern Recog...

Chats0

TLDR

In this paper, a multi-view task-driven learning for visual sensor network (MT-VSN) is proposed to obtain a compact representation of high-dimensional visual data using sensor fusion techniques.

Abstract:

Nowadays, distributed smart cameras are deployed for a wide set of tasks in several application scenarios, ranging from object recognition, image retrieval, and forensic applications. Due to limited bandwidth in distributed systems, efficient coding of local visual features has in fact been an active topic of research. In this paper, we propose a novel approach to obtain a compact representation of high-dimensional visual data using sensor fusion techniques. We convert the problem of visual analysis in resource-limited scenarios to a multi-view representation learning, and we show that the key to finding properly compressed representation is to exploit the position of cameras with respect to each other as a norm-based regularization in the particular signal representation of sparse coding. Learning the representation of each camera is viewed as an individual task and a multi-task learning with joint sparsity for all nodes is employed. The proposed representation learning scheme is referred to as the multi-view task-driven learning for visual sensor network (MT-VSN). We demonstrate that MT-VSN outperforms state-of-the-art in various surveillance recognition tasks.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Model-Agnostic Metalearning-Based Text-Driven Visual Navigation Model for Unfamiliar Tasks

Tianfang Xue, +1 more

- 09 Sep 2020 -

IEEE Access

TL;DR: This article proposes a model-agnostic metalearning based text-driven visual navigation model to achieve generalization to untrained tasks based on meta-reinforcement learning approach, and introduces fully convolutional instance-aware semantic segmentation and Word2vec into the DRL network to improve learning efficiency and accuracy.

...read moreread less

Learning Multimodal Structures in Computer Vision

Ali Taalimi

TL;DR: The goal is to extract a discriminative representation of the multimodal data that leads to easily finding its essential characteristics in the subsequent analysis step, e.g., regression and classification, in a decomposition coefficient vector that is favorable towards the maximal discriminatory power.

...read moreread less

Proceedings ArticleDOI

Memory-Based Parameterized Skills Learning for Mapless Visual Navigation

Yuyang Liu, +2 more

TL;DR: A Memory-based Parameterized Skills Learning (MPSL) model for mapless visual navigation that aims to capture more discriminative features by using a scene-specific layer and generalization ability to un-trained tasks.

...read moreread less

Wide-Area Control Schemes to Improve Small Signal Stability in Power Systems

Meimanat Mahmoudi

TL;DR: A state feedback formulation is proposed that aims to simultaneously optimize a standard Linear Quadratic Regulator cost criterion and induce a pre-defined communication structure and a group-sparse regularization to be added to the optimization cost function.

...read moreread less

References

PDF

Open Access

More filters

Journal ArticleDOI

Regularization and variable selection via the elastic net

Hui Zou, +1 more

- 01 Apr 2005 -

Journal of The Royal Statistical Society...

TL;DR: It is shown that the elastic net often outperforms the lasso, while enjoying a similar sparsity of representation, and an algorithm called LARS‐EN is proposed for computing elastic net regularization paths efficiently, much like algorithm LARS does for the lamba.

...read moreread less

Journal ArticleDOI

Robust Face Recognition via Sparse Representation

John Wright, +4 more

- 01 Feb 2009 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This work considers the problem of automatically recognizing human faces from frontal views with varying expression and illumination, as well as occlusion and disguise, and proposes a general classification algorithm for (image-based) object recognition based on a sparse representation computed by C1-minimization.

...read moreread less

Proceedings ArticleDOI

Non-local sparse models for image restoration

Julien Mairal, +4 more

TL;DR: Experimental results in image denoising and demosaicking tasks with synthetic and real noise show that the proposed method outperforms the state of the art, making it possible to effectively restore raw images from digital cameras at a reasonable speed and memory cost.

...read moreread less

Journal ArticleDOI

Label Consistent K-SVD: Learning a Discriminative Dictionary for Recognition

Zhuolin Jiang, +2 more

- 01 Nov 2013 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A label consistent K-SVD (LC-KSVD) algorithm to learn a discriminative dictionary for sparse coding and introduces a new label consistency constraint called "discriminative sparse-code error" to enforce discriminability in sparse codes during the dictionary learning process.

...read moreread less

Journal ArticleDOI

Task-Driven Dictionary Learning

Julien Mairal, +2 more

- 01 Apr 2012 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This paper presents a general formulation for supervised dictionary learning adapted to a wide variety of tasks, and presents an efficient algorithm for solving the corresponding optimization problem.

...read moreread less