Scene Classification Using Hierarchical Wasserstein CNN

doi:10.1109/TGRS.2018.2873966

Journal ArticleDOI

Scene Classification Using Hierarchical Wasserstein CNN

Yishu Liu, +3 more

- 01 May 2019 -

IEEE Transactions on Geoscience and Remo...

- Vol. 57, Iss: 5, pp 2494-2509

TLDR

This paper finds that for two distributions in hierarchically organized data space, WD has a closed-form solution, which is called “hierarchical WD (HWD),” and uses this theory to construct novel loss functions that overcome the shortcomings of CE loss.

Abstract:

In multiclass classification, convolutional neural network (CNN) is generally coupled with the cross-entropy (CE) loss, which only penalizes the predicted probability corresponding to a ground truth class and ignores the interclass relationship. We argue that CNN can be improved by using a better loss function. On the other hand, the Wasserstein distance (WD) is a well-known metric used to measure the distance between two distributions. Directly solving the WD problem requires a prohibitively large amount of computation time, whereas the cheaper iterative algorithms have a variety of shortcomings such as computational instability and difficulty in selecting parameters. In this paper, we address these issues by giving an analytical solution to the WD problem—for the first time, we find that for two distributions in hierarchically organized data space, WD has a closed-form solution, which we call “hierarchical WD (HWD).” We use this theory to construct novel loss functions that overcome the shortcomings of CE loss. To this end, multi-CNN information fusion that provides the basis for building category hierarchies is carried out first. Then, the semantic relationship among classes is modeled as a binary tree. Then, CNN coupled with an HWD-based loss, i.e., hierarchical Wasserstein CNN (HW-CNN), is trained to learn deep features. In this way, prior knowledge about the interclass relationship is embedded into HW-CNN, and information from several CNNs provides guidance in the process of training individual HW-CNNs. We conducted extensive experiments over two publicly available remote sensing data sets and achieved a state-of-the-art performance in scene classification tasks.

Scene Classification Using Hierarchical Wasserstein CNN

Citations

Remote Sensing Image Scene Classification Meets Deep Learning: Challenges, Methods, Benchmarks, and Opportunities

Review of Image Classification Algorithms Based on Convolutional Neural Networks

Similarity-Based Unsupervised Deep Transfer Learning for Remote Sensing Image Retrieval

High-Resolution Remote Sensing Image Scene Classification via Key Filter Bank Based on Convolutional Neural Network

Combing Triple-Part Features of Convolutional Neural Networks for Scene Classification in Remote Sensing

References

ImageNet Classification with Deep Convolutional Neural Networks

Very Deep Convolutional Networks for Large-Scale Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

Going deeper with convolutions

Normalized cuts and image segmentation

Related Papers (5)

AID: A Benchmark Data Set for Performance Evaluation of Aerial Scene Classification

When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs

Going deeper with convolutions

Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery

Deep Residual Learning for Image Recognition