scispace - formally typeset
Book ChapterDOI

Actions in stillweb images: visualization, detection and retrieval

Piji Li, +2 more
- pp 302-313
TLDR
A framework for human action retrieval in still web images by verb queries, for instance "phoning" is described, which builds a group of visual discriminative instances for each action class, called "Exemplarlets", and employs Multiple Kernel Learning to learn an optimal combination of histogram intersection kernels.
Abstract
We describe a framework for human action retrieval in still web images by verb queries, for instance "phoning". Firstly, we build a group of visual discriminative instances for each action class, called "Exemplarlets". Thereafter we employ Multiple Kernel Learning (MKL) to learn an optimal combination of histogram intersection kernels, each of which captures a state-of-the-art feature channel. Our features include the distribution of edges, dense visual words and feature descriptors at different levels of spatial pyramid. For a new image we can detect the hot-region using a sliding-window detector learnt via MKL. The hotregion can imply latent actions in the image. After the hot-region has been detected, we build a inverted index in the visual search path, which we called Visual Inverted Index (VII). Finally, fusing the visual search path and the text search path, we can get the accurate results either relevant to text or to visual information. We show both the detection and retrieval results on our newly collected dataset of six actions as well as demonstrate improved performance over existing methods.

read more

Citations
More filters
Journal ArticleDOI

A survey on still image based human action recognition

TL;DR: A detailed overview of the state-of-the-art methods for still image-based action recognition is presented, and various high-level cues and low-level features for action analysis in still images are described.
Journal ArticleDOI

Deep Ensemble Learning for Human Action Recognition in Still Images

TL;DR: This research investigates human action recognition in still images and utilizes deep ensemble learning to automatically decompose the body pose and perceive its background information and proposes an end-to-endDeep ensemble learning based on the weight optimization (DELWO) model that contributes to fusing the deep information derived from multiple models automatically from the data.
Proceedings ArticleDOI

Image classification with Bag-of-Words model based on improved SIFT algorithm

TL;DR: A new image classification method with Bag-of-Words model based on improved SIFT algorithm that shows higher classification accuracy and comparison of experimental results shows that, the method presented in this paper showsHigher classification accuracy.
Journal ArticleDOI

Identification of peach leaf disease infected by Xanthomonas campestris with deep learning

TL;DR: CNN is superior to the state-of-the-art in identifying diseased peach leaves and transfer learning was used to fine-tune AlexNet.
Journal ArticleDOI

A novel biologically inspired ELM-based network for image recognition

TL;DR: A novel biologically inspired network for image recognition that combines the Hierarchical model and X model and the extreme learning machine (ELM), to construct a five-layer feed-forward network: S1-C1-S2-C2-H.
References
More filters
Journal ArticleDOI

LIBSVM: A library for support vector machines

TL;DR: Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail.
Proceedings ArticleDOI

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Journal ArticleDOI

The Pascal Visual Object Classes (VOC) Challenge

TL;DR: The state-of-the-art in evaluated methods for both classification and detection are reviewed, whether the methods are statistically different, what they are learning from the images, and what the methods find easy or confuse.
Journal ArticleDOI

Object Detection with Discriminatively Trained Part-Based Models

TL;DR: An object detection system based on mixtures of multiscale deformable part models that is able to represent highly variable object classes and achieves state-of-the-art results in the PASCAL object detection challenges is described.
Book

Modern Information Retrieval

TL;DR: In this article, the authors present a rigorous and complete textbook for a first course on information retrieval from the computer science (as opposed to a user-centred) perspective, which provides an up-to-date student oriented treatment of the subject.
Related Papers (5)