Remote sensing image scene classification using CNN-CapsNet

doi:10.3390/RS11050494

Open Access, Journal Article

Wei Zhang, +2 more

Remote Sensing, Vol. 11, Iss. 5, p. 494, 01 Feb 2019

TLDR

An effective remote sensing image scene classification architecture named CNN-CapsNet is proposed. It combines the merits of two models, CNN and CapsNet, and achieves classification performance competitive with state-of-the-art methods.

Abstract:

Remote sensing image scene classification is one of the most challenging problems in understanding high-resolution remote sensing images. Deep learning techniques, especially the convolutional neural network (CNN), have improved the performance of remote sensing image scene classification thanks to their powerful feature learning and reasoning capabilities. However, CNN models typically end with several fully connected layers, which are inefficient at capturing the hierarchical structure of the entities in an image and do not fully consider the spatial information that is important for classification. The capsule network (CapsNet) is a novel architecture that replaces the single neuron of a traditional neural network with a capsule, a group of neurons treated as a vector, and can encode the properties and spatial information of image features to achieve equivariance; it has become an active research topic in classification over the past two years. Motivated by this idea, this paper proposes an effective remote sensing image scene classification architecture named CNN-CapsNet that combines the merits of both models. First, a CNN without fully connected layers, specifically a deep CNN model pretrained on the ImageNet dataset, is used as an initial feature map extractor. Then, the initial feature maps are fed into a newly designed CapsNet to obtain the final classification result. The proposed architecture is extensively evaluated on three challenging public benchmark remote sensing image datasets: the UC Merced Land-Use dataset with 21 scene categories, the AID dataset with 30 scene categories, and the NWPU-RESISC45 dataset with 45 scene categories. The experimental results demonstrate that the proposed method achieves classification performance competitive with state-of-the-art methods.
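The abstract does not give the internals of the "newly designed CapsNet", so the following is only a minimal sketch of the capsule idea it refers to. It assumes the commonly used CapsNet formulation: a squash nonlinearity that keeps each capsule's vector length below 1, and routing-by-agreement between input capsules and class capsules. The function names (`squash`, `route`, `classify`) and the toy prediction vectors are hypothetical and are not taken from the paper. In the full architecture, the prediction vectors would be derived from the feature maps of the pretrained CNN rather than written by hand.

```python
import math

def squash(s, eps=1e-9):
    """Shrink vector s so its length lies in [0, 1), keeping its direction."""
    sq_norm = sum(x * x for x in s)
    scale = sq_norm / (1.0 + sq_norm) / math.sqrt(sq_norm + eps)
    return [scale * x for x in s]

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def length(v):
    """Euclidean length of a vector; read as the class score of a capsule."""
    return math.sqrt(sum(x * x for x in v))

def route(u_hat, num_iters=3):
    """Routing-by-agreement (assumed formulation, not confirmed by the abstract).

    u_hat[i][j] is the prediction vector that input capsule i makes for
    class capsule j. Returns one output vector per class capsule.
    """
    n_in, n_out, dim = len(u_hat), len(u_hat[0]), len(u_hat[0][0])
    b = [[0.0] * n_out for _ in range(n_in)]  # routing logits, start uniform
    v = []
    for _ in range(num_iters):
        # Each input capsule distributes its vote across the class capsules.
        c = [softmax(row) for row in b]
        v = []
        for j in range(n_out):
            s = [sum(c[i][j] * u_hat[i][j][d] for i in range(n_in))
                 for d in range(dim)]
            v.append(squash(s))
        # Raise the logit where a prediction agrees with the capsule output.
        for i in range(n_in):
            for j in range(n_out):
                b[i][j] += sum(u_hat[i][j][d] * v[j][d] for d in range(dim))
    return v

def classify(u_hat):
    """Pick the class whose capsule output vector is longest."""
    lengths = [length(vj) for vj in route(u_hat)]
    return lengths.index(max(lengths)), lengths

# Toy input: three input capsules, two class capsules, 2-D vectors.
# All inputs agree on class 0; their predictions for class 1 conflict.
toy_u_hat = [
    [[2.0, 0.0], [1.0, 0.0]],
    [[2.0, 0.0], [-1.0, 0.0]],
    [[2.0, 0.0], [0.0, 1.0]],
]
label, scores = classify(toy_u_hat)
print(label, scores)
```

The point of the sketch is the contrast drawn in the abstract: a fully connected layer reduces features to scalars, whereas a capsule keeps a vector whose direction can encode properties of an entity and whose length acts as the class score.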
