Book ChapterDOI
Actions in stillweb images: visualization, detection and retrieval
TLDR
A framework for human action retrieval in still web images by verb queries, for instance "phoning" is described, which builds a group of visual discriminative instances for each action class, called "Exemplarlets", and employs Multiple Kernel Learning to learn an optimal combination of histogram intersection kernels.Abstract:
We describe a framework for human action retrieval in still web images by verb queries, for instance "phoning". Firstly, we build a group of visual discriminative instances for each action class, called "Exemplarlets". Thereafter we employ Multiple Kernel Learning (MKL) to learn an optimal combination of histogram intersection kernels, each of which captures a state-of-the-art feature channel. Our features include the distribution of edges, dense visual words and feature descriptors at different levels of spatial pyramid. For a new image we can detect the hot-region using a sliding-window detector learnt via MKL. The hotregion can imply latent actions in the image. After the hot-region has been detected, we build a inverted index in the visual search path, which we called Visual Inverted Index (VII). Finally, fusing the visual search path and the text search path, we can get the accurate results either relevant to text or to visual information. We show both the detection and retrieval results on our newly collected dataset of six actions as well as demonstrate improved performance over existing methods.read more
Citations
More filters
Journal ArticleDOI
A survey on still image based human action recognition
Guodong Guo,Alice Lai +1 more
TL;DR: A detailed overview of the state-of-the-art methods for still image-based action recognition is presented, and various high-level cues and low-level features for action analysis in still images are described.
Journal ArticleDOI
Deep Ensemble Learning for Human Action Recognition in Still Images
TL;DR: This research investigates human action recognition in still images and utilizes deep ensemble learning to automatically decompose the body pose and perceive its background information and proposes an end-to-endDeep ensemble learning based on the weight optimization (DELWO) model that contributes to fusing the deep information derived from multiple models automatically from the data.
Proceedings ArticleDOI
Image classification with Bag-of-Words model based on improved SIFT algorithm
TL;DR: A new image classification method with Bag-of-Words model based on improved SIFT algorithm that shows higher classification accuracy and comparison of experimental results shows that, the method presented in this paper showsHigher classification accuracy.
Journal ArticleDOI
Identification of peach leaf disease infected by Xanthomonas campestris with deep learning
TL;DR: CNN is superior to the state-of-the-art in identifying diseased peach leaves and transfer learning was used to fine-tune AlexNet.
Journal ArticleDOI
A novel biologically inspired ELM-based network for image recognition
Yu Zhang,Lin Zhang,Ping Li +2 more
TL;DR: A novel biologically inspired network for image recognition that combines the Hierarchical model and X model and the extreme learning machine (ELM), to construct a five-layer feed-forward network: S1-C1-S2-C2-H.
References
More filters
Journal ArticleDOI
LIBSVM: A library for support vector machines
Chih-Chung Chang,Chih-Jen Lin +1 more
TL;DR: Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail.
Proceedings ArticleDOI
Histograms of oriented gradients for human detection
Navneet Dalal,Bill Triggs +1 more
TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Journal ArticleDOI
The Pascal Visual Object Classes (VOC) Challenge
TL;DR: The state-of-the-art in evaluated methods for both classification and detection are reviewed, whether the methods are statistically different, what they are learning from the images, and what the methods find easy or confuse.
Journal ArticleDOI
Object Detection with Discriminatively Trained Part-Based Models
TL;DR: An object detection system based on mixtures of multiscale deformable part models that is able to represent highly variable object classes and achieves state-of-the-art results in the PASCAL object detection challenges is described.
Book
Modern Information Retrieval
TL;DR: In this article, the authors present a rigorous and complete textbook for a first course on information retrieval from the computer science (as opposed to a user-centred) perspective, which provides an up-to-date student oriented treatment of the subject.
Related Papers (5)
Leveraging High Level Visual Information for Matching Images and Captions
Fei Yan,Krystian Mikolajczyk +1 more