scispace - formally typeset
Open AccessJournal ArticleDOI

Remote sensing image scene classification using CNN-CapsNet

Wei Zhang, +2 more
- 01 Feb 2019 - 
- Vol. 11, Iss: 5, pp 494
Reads0
Chats0
TLDR
An effective remote sensing image scene classification architecture named CNN-CapsNet is proposed to make full use of the merits of these two models: CNN and CapsNet to lead to a competitive classification performance compared with the state-of-the-art methods.
Abstract
Remote sensing image scene classification is one of the most challenging problems in understanding high-resolution remote sensing images. Deep learning techniques, especially the convolutional neural network (CNN), have improved the performance of remote sensing image scene classification due to the powerful perspective of feature learning and reasoning. However, several fully connected layers are always added to the end of CNN models, which is not efficient in capturing the hierarchical structure of the entities in the images and does not fully consider the spatial information that is important to classification. Fortunately, capsule network (CapsNet), which is a novel network architecture that uses a group of neurons as a capsule or vector to replace the neuron in the traditional neural network and can encode the properties and spatial information of features in an image to achieve equivariance, has become an active area in the classification field in the past two years. Motivated by this idea, this paper proposes an effective remote sensing image scene classification architecture named CNN-CapsNet to make full use of the merits of these two models: CNN and CapsNet. First, a CNN without fully connected layers is used as an initial feature maps extractor. In detail, a pretrained deep CNN model that was fully trained on the ImageNet dataset is selected as a feature extractor in this paper. Then, the initial feature maps are fed into a newly designed CapsNet to obtain the final classification result. The proposed architecture is extensively evaluated on three public challenging benchmark remote sensing image datasets: the UC Merced Land-Use dataset with 21 scene categories, AID dataset with 30 scene categories, and the NWPU-RESISC45 dataset with 45 challenging scene categories. The experimental results demonstrate that the proposed method can lead to a competitive classification performance compared with the state-of-the-art methods.

read more

Citations
More filters
Journal ArticleDOI

Remote Sensing Image Scene Classification Meets Deep Learning: Challenges, Methods, Benchmarks, and Opportunities

TL;DR: This article provides a systematic survey of deep learning methods for remote sensing image scene classification by covering more than 160 papers and discusses the main challenges of remote sensing images classification and survey.
Journal ArticleDOI

Capsule Networks – A survey

TL;DR: A comprehensive review of the state of the art architectures, tools and methodologies in existing implementations of capsule networks highlights the successes, failures and opportunities for further research to serve as a motivation to researchers and industry players to exploit the full potential of this new field.
Journal ArticleDOI

Classification of Remote Sensing Images Using EfficientNet-B3 CNN Model With Attention

TL;DR: Wang et al. as discussed by the authors proposed a deep attention convolutional neural network (CNN) for scene classification in remote sensing, which computes a new feature map as a weighted average of these original feature maps.
Journal ArticleDOI

Channel-Attention-Based DenseNet Network for Remote Sensing Image Scene Classification

TL;DR: This study proposes a channel-attention-based DenseNet (CAD) network for scene classification and demonstrates that the CAD network can achieve performance comparable to those of other state-of-the-art methods.
Journal ArticleDOI

Detecting Pneumonia Using Convolutions and Dynamic Capsule Routing for Chest X-ray Images

TL;DR: A combination of convolutions and capsules is used to obtain two models that outperform all models previously proposed and detect pneumonia from chest X-ray (CXR) images with test accuracy of 95.33% and 95.90%, respectively.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.
Proceedings ArticleDOI

Going deeper with convolutions

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Journal ArticleDOI

ImageNet Large Scale Visual Recognition Challenge

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.
Related Papers (5)