scispace - formally typeset
Journal ArticleDOI

A Deep Scene Representation for Aerial Scene Classification

TLDR
A deep scene representation to achieve the invariance of CNN features and further enhance the discriminative power is proposed and, even with a simple linear classifier, can achieve the state-of-the-art performance.
Abstract
As a fundamental problem in earth observation, aerial scene classification tries to assign a specific semantic label to an aerial image. In recent years, the deep convolutional neural networks (CNNs) have shown advanced performances in aerial scene classification. The successful pretrained CNNs can be transferable to aerial images. However, global CNN activations may lack geometric invariance and, therefore, limit the improvement of aerial scene classification. To address this problem, this paper proposes a deep scene representation to achieve the invariance of CNN features and further enhance the discriminative power. The proposed method: 1) extracts CNN activations from the last convolutional layer of pretrained CNN; 2) performs multiscale pooling (MSP) on these activations; and 3) builds a holistic representation by the Fisher vector method. MSP is a simple and effective multiscale strategy, which enriches multiscale spatial information in affordable computational time. The proposed representation is particularly suited at aerial scenes and consistently outperforms global CNN activations without requiring feature adaptation. Extensive experiments on five aerial scene data sets indicate that the proposed method, even with a simple linear classifier, can achieve the state-of-the-art performance.

read more

Citations
More filters
Journal ArticleDOI

Remote Sensing Image Scene Classification Meets Deep Learning: Challenges, Methods, Benchmarks, and Opportunities

TL;DR: This article provides a systematic survey of deep learning methods for remote sensing image scene classification by covering more than 160 papers and discusses the main challenges of remote sensing images classification and survey.
Journal ArticleDOI

Spectral–Spatial Attention Network for Hyperspectral Image Classification

TL;DR: A spectral–spatial attention network (SSAN) is proposed to capture discriminative spectral-spatial features from attention areas of HSI cubes to outperforms several state-of-the-art methods.
Journal ArticleDOI

HSI-BERT: Hyperspectral Image Classification Using the Bidirectional Encoder Representation From Transformers

TL;DR: Quantitative and qualitative results demonstrate that HSI-BERT outperforms any other CNN-based model in terms of both classification accuracy and computational time and achieves state-of-the-art performance on three widely used hyperspectral image data sets.
Journal ArticleDOI

Remote Sensing Scene Classification by Gated Bidirectional Network

TL;DR: A gated bidirectional network is proposed to integrate the hierarchical feature aggregation and the interference information elimination into an end-to-end network and can compete with the state-of-the-art methods on four RS scene classification data sets.
Journal ArticleDOI

Multisource Compensation Network for Remote Sensing Cross-Domain Scene Classification

TL;DR: A multisource compensation network (MSCN) is proposed to tackle the challenges: distribution discrepancy and category incompleteness in cross-domain scene classification by using multiple complementary source domains to form the categories of target domain.
References
More filters
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.
Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Proceedings ArticleDOI

Going deeper with convolutions

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Related Papers (5)