scispace - formally typeset
Journal ArticleDOI

Discovering Discriminative Graphlets for Aerial Image Categories Recognition

TLDR
Experimental results indicate that this method outperforms several state-of-the-art object/scene recognition models, and the visualized graphlets indicate that the discriminative patterns are discovered by the proposed approach.
Abstract: 
Recognizing aerial image categories is useful for scene annotation and surveillance. Local features have been demonstrated to be robust to image transformations, including occlusions and clutters. However, the geometric property of an aerial image (i.e., the topology and relative displacement of local features), which is key to discriminating aerial image categories, cannot be effectively represented by state-of-the-art generic visual descriptors. To solve this problem, we propose a recognition model that mines graphlets from aerial images, where graphlets are small connected subgraphs reflecting both the geometric property and color/texture distribution of an aerial image. More specifically, each aerial image is decomposed into a set of basic components (e.g., road and playground) and a region adjacency graph (RAG) is accordingly constructed to model their spatial interactions. Aerial image categories recognition can subsequently be casted as RAG-to-RAG matching. Based on graph theory, RAG-to-RAG matching is conducted by comparing all their respective graphlets. Because the number of graphlets is huge, we derive a manifold embedding algorithm to measure different-sized graphlets, after which we select graphlets that have highly discriminative and low redundancy topologies. Through quantizing the selected graphlets from each aerial image into a feature vector, we use support vector machine to discriminate aerial image categories. Experimental results indicate that our method outperforms several state-of-the-art object/scene recognition models, and the visualized graphlets indicate that the discriminative patterns are discovered by our proposed approach.

read more

Citations
More filters
Proceedings ArticleDOI

Efficient Graphlet Counting for Large Networks

TL;DR: This paper proposes a fast, efficient, and parallel algorithm for counting graphlets of size k={3,4}-nodes that take only a fraction of the time to compute when compared with the current methods used, and is on average 460x faster than current methods.
Journal ArticleDOI

Towards unsupervised physical activity recognition using smartphone accelerometers

TL;DR: This work employs an unsupervised method for recognizing physical activities using smartphone accelerometers, extracted from the raw acceleration data collected by smartphones, and finds the method outperforms other existing methods.
Journal ArticleDOI

From Deterministic to Generative: Multimodal Stochastic RNNs for Video Captioning

TL;DR: Wang et al. as mentioned in this paper proposed a generative approach, referred to as multimodal stochastic recurrent neural networks (MS-RNNs), which models the uncertainty observed in the data using latent variables.
Journal ArticleDOI

Fusion of Multichannel Local and Global Structural Cues for Photo Aesthetics Evaluation

TL;DR: A new photo aesthetics evaluation framework is proposed, focusing on learning the image descriptors that characterize local and global structural aesthetics from multiple visual channels, which significantly outperforms state-of-the-art algorithms in photo aesthetics prediction.
Journal ArticleDOI

A Fine-Grained Image Categorization System by Cellet-Encoded Spatial Pyramid Modeling

TL;DR: A new fine-grained image categorization system that improves spatial pyramid matching is developed that outperforms the state of the art and can be conducted with a trained linear support vector machine.
References
More filters
Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Proceedings ArticleDOI

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Journal ArticleDOI

Normalized cuts and image segmentation

TL;DR: This work treats image segmentation as a graph partitioning problem and proposes a novel global criterion, the normalized cut, for segmenting the graph, which measures both the total dissimilarity between the different groups as well as the total similarity within the groups.
Proceedings ArticleDOI

Normalized cuts and image segmentation

TL;DR: This work treats image segmentation as a graph partitioning problem and proposes a novel global criterion, the normalized cut, for segmenting the graph, which measures both the total dissimilarity between the different groups as well as the total similarity within the groups.
Related Papers (5)