Journal ArticleDOI

More Diverse Means Better: Multimodal Deep Learning Meets Remote-Sensing Imagery Classification

TL;DR: In this article, a general multimodal deep learning (MDL) framework is proposed for geoscience and remote sensing (RS) applications; it is not limited to pixel-wise classification tasks but is also applicable to spatial information modeling with CNNs.
Abstract: Classification and identification of the materials lying over or beneath the earth's surface have long been a fundamental but challenging research topic in geoscience and remote sensing (RS) and have garnered growing attention owing to recent advances in deep learning. Although deep networks have been successfully applied in single-modality-dominated classification tasks, their performance inevitably hits a bottleneck in complex scenes that need to be finely classified, due to the limited diversity of information. In this work, we provide a baseline solution to this difficulty by developing a general multimodal deep learning (MDL) framework. In particular, we also investigate a special case of multimodality learning (MML), cross-modality learning (CML), which exists widely in RS image classification applications. By focusing on "what," "where," and "how" to fuse, we show different fusion strategies as well as how to train deep networks and build the network architecture. Specifically, five fusion architectures are introduced and developed, and further unified in our MDL framework. More significantly, our framework is not limited to pixel-wise classification tasks but is also applicable to spatial information modeling with convolutional neural networks (CNNs). To validate the effectiveness and superiority of the MDL framework, extensive experiments related to the settings of MML and CML are conducted on two different multimodal RS data sets. Furthermore, the codes and data sets will be available at https://github.com/danfenghong/IEEE_TGRS_MDL-RS, contributing to the RS community.
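The abstract's five fusion architectures are not reproduced on this page, but the core "where/how to fuse" idea can be sketched in a few lines. The PyTorch snippet below is a minimal illustration of one strategy, concatenation-based middle (feature-level) fusion of two modality encoders; the layer sizes, class names, and the concatenation choice are assumptions for illustration, not the authors' exact MDL-RS design.

```python
# Minimal sketch of feature-level (middle) fusion for two RS modalities,
# e.g., hyperspectral + LiDAR pixel vectors. Sizes and names are
# illustrative, not the published MDL-RS architecture.
import torch
import torch.nn as nn

class MiddleFusionNet(nn.Module):
    def __init__(self, dim_a, dim_b, n_classes, hidden=128):
        super().__init__()
        # One encoder per modality; "where" to fuse = after these blocks.
        self.enc_a = nn.Sequential(nn.Linear(dim_a, hidden), nn.ReLU())
        self.enc_b = nn.Sequential(nn.Linear(dim_b, hidden), nn.ReLU())
        # "How" to fuse: here, simple concatenation of the two feature sets.
        self.head = nn.Sequential(
            nn.Linear(2 * hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, n_classes),
        )

    def forward(self, x_a, x_b):
        z = torch.cat([self.enc_a(x_a), self.enc_b(x_b)], dim=-1)
        return self.head(z)

net = MiddleFusionNet(dim_a=144, dim_b=21, n_classes=15)
logits = net(torch.randn(8, 144), torch.randn(8, 21))  # batch of 8 pixels
```

Early fusion would instead concatenate the raw inputs before any encoder, and late fusion would combine per-modality logits; the "where" in the abstract is exactly this placement choice.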
Citations
Journal ArticleDOI
TL;DR: This article presents an attention-based adaptive spectral–spatial kernel improved residual network (A²S²K-ResNet) with spectral attention to capture discriminative spectral–spatial features for HSI classification in an end-to-end training fashion.
Abstract: Hyperspectral images (HSIs) provide rich spectral–spatial information across hundreds of stacked contiguous narrow bands. Due to the existence of noise and band correlation, the selection of informative spectral–spatial kernel features poses a challenge. This is often addressed by using convolutional neural networks (CNNs) with receptive fields (RFs) of fixed size. However, these solutions cannot enable neurons to effectively adjust RF sizes and cross-channel dependencies when forward and backward propagation is used to optimize the network. In this article, we present an attention-based adaptive spectral–spatial kernel improved residual network (A²S²K-ResNet) with spectral attention to capture discriminative spectral–spatial features for HSI classification in an end-to-end training fashion. In particular, the proposed network learns selective 3-D convolutional kernels to jointly extract spectral–spatial features using improved 3-D ResBlocks and adopts an efficient feature recalibration (EFR) mechanism to boost the classification performance. Extensive experiments are performed on three well-known hyperspectral data sets, i.e., IP, KSC, and UP, and the proposed A²S²K-ResNet provides better classification results in terms of overall accuracy (OA), average accuracy (AA), and Kappa compared with the existing methods investigated. The source code will be made available at https://github.com/suvojit-0x55aa/A2S2K-ResNet.

185 citations
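The "efficient feature recalibration (EFR)" described above reads like channel-wise attention in the squeeze-and-excitation family. As a hedged sketch (the class name and reduction ratio are assumptions, not the published module), recalibration of a feature map can be written as:

```python
# Hedged sketch of channel-wise feature recalibration in the spirit of the
# paper's EFR mechanism (squeeze-and-excitation style); not the exact module.
import torch
import torch.nn as nn

class Recalibrate(nn.Module):
    def __init__(self, channels, reduction=4):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction), nn.ReLU(),
            nn.Linear(channels // reduction, channels), nn.Sigmoid(),
        )

    def forward(self, x):                   # x: (B, C, H, W) feature maps
        w = self.fc(x.mean(dim=(2, 3)))     # squeeze: global average pool
        return x * w[:, :, None, None]      # excite: reweight each channel

x = torch.randn(2, 64, 9, 9)
y = Recalibrate(64)(x)                      # same shape, recalibrated
```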

Journal ArticleDOI
TL;DR: In this paper, the authors propose a concept consisting of connected vehicles that exploit vehicular ad hoc network (VANET) communication, an embedded system integrated with sensors that acquire the static and dynamic parameters of the electric vehicle, and cloud integration with big data analytics tools.
Abstract: Testing and implementation of integrated and intelligent transport systems (IITS) for an electric vehicle need many high-performance and high-precision subsystems. The existing systems confine themselves to limited features and suffer from driving-range anxiety, charging and discharging time issues, and inter- and intra-vehicle communication problems. These issues are the critical barriers to the penetration of EVs into a smart grid. This paper proposes a concept that consists of connected vehicles exploiting vehicular ad hoc network (VANET) communication, an embedded system integrated with sensors that acquire the static and dynamic parameters of the electric vehicle, and cloud integration with big data analytics tools. Vehicle control information is generated by machine learning-based control systems. This paper also focuses on improving the overall performance (discharge time and cycle life) of a lithium-ion battery, increasing the range of the electric vehicle, enhancing the safety of the battery, acquiring the static and dynamic parameters and driving pattern of the electric vehicle, establishing VANET communication, and handling and analyzing the acquired data with the help of various big data analytics techniques.

173 citations

Journal ArticleDOI
TL;DR: Compared to convex models, nonconvex modeling, which is capable of characterizing more complex real scenes and providing model interpretability technically and theoretically, has proven to be a feasible solution that reduces the gap between challenging HS vision tasks and currently advanced intelligent data processing models.
Abstract: Hyperspectral (HS) imaging, also known as image spectrometry, is a landmark technique in geoscience and remote sensing (RS). In the past decade, enormous efforts have been made to process and analyze these HS products, mainly by seasoned experts. However, with an ever-growing volume of data, the bulk of costs in manpower and material resources poses new challenges for reducing the burden of manual labor and improving efficiency. For this reason, it is urgent that more intelligent and automatic approaches for various HS RS applications be developed. Machine learning (ML) tools with convex optimization have successfully undertaken the tasks of numerous artificial intelligence (AI)-related applications; however, their ability to handle complex practical problems remains limited, particularly for HS data, due to the effects of various spectral variabilities in the process of HS imaging and the complexity and redundancy of higher-dimensional HS signals. Compared to convex models, nonconvex modeling, which is capable of characterizing more complex real scenes and providing model interpretability technically and theoretically, has proven to be a feasible solution that reduces the gap between challenging HS vision tasks and currently advanced intelligent data processing models.

168 citations
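To make the survey's convex-versus-nonconvex contrast concrete, a toy sparse-regression example helps: the proximal step for the convex l1 penalty is soft-thresholding, while the nonconvex l0 penalty yields hard-thresholding. The sketch below uses synthetic data and is only an illustration of that modeling gap, not any method from the paper.

```python
# Toy contrast between convex and nonconvex sparse modeling: solve
# min_x ||Ax - b||^2 + penalty(x) by proximal gradient steps. The l1 prox
# is soft-thresholding (convex); the l0 prox is hard-thresholding
# (nonconvex). Data are synthetic, for illustration only.
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((50, 100))
x_true = np.zeros(100)
x_true[[3, 40, 77]] = [1.5, -2.0, 1.0]           # a 3-sparse ground truth
b = A @ x_true

def prox_grad(prox, lam, steps=500):
    x = np.zeros(100)
    t = 1.0 / np.linalg.norm(A, 2) ** 2          # step size 1 / L
    for _ in range(steps):
        x = prox(x - t * A.T @ (A @ x - b), lam * t)
    return x

soft = lambda v, s: np.sign(v) * np.maximum(np.abs(v) - s, 0.0)  # l1 prox
hard = lambda v, s: np.where(v * v > 2.0 * s, v, 0.0)            # l0 prox

x_l1 = prox_grad(soft, lam=0.1)
x_l0 = prox_grad(hard, lam=0.1)
print(np.flatnonzero(x_l0))                      # recovered support
```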

Journal ArticleDOI
TL;DR: Extensive experiments demonstrate the superiority and advancement of the S2FL model in the task of land cover classification in comparison with previously proposed state-of-the-art baselines.
Abstract: As remote sensing (RS) data obtained from different sensors become widely and openly available, multimodal data processing and analysis techniques have been garnering increasing interest in the RS and geoscience community. However, due to the gap between different modalities in terms of imaging sensors, resolutions, and contents, embedding their complementary information into a consistent, compact, accurate, and discriminative representation remains, to a great extent, challenging. To this end, we propose a shared and specific feature learning (S2FL) model. S2FL is capable of decomposing multimodal RS data into modality-shared and modality-specific components, enabling the blending of information from multiple modalities more effectively, particularly for heterogeneous data sources. Moreover, to better assess multimodal baselines and the newly proposed S2FL model, three multimodal RS benchmark data sets, i.e., Houston2013 (hyperspectral and multispectral data), Berlin (hyperspectral and synthetic aperture radar (SAR) data), and Augsburg (hyperspectral, SAR, and digital surface model (DSM) data), are released and used for land cover classification. Extensive experiments conducted on the three data sets demonstrate the superiority and advancement of our S2FL model in the task of land cover classification in comparison with previously proposed state-of-the-art baselines. Furthermore, the baseline codes and data sets used in this paper will be made freely available at https://github.com/danfenghong/ISPRS_S2FL.

117 citations
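A minimal sketch of the shared/specific decomposition idea follows: project each modality into a shared subspace plus a modality-specific one, and encourage the shared codes of co-registered pixels to agree. All names, dimensions, and loss choices here are assumptions for illustration, not the published S2FL formulation.

```python
# Hedged sketch of shared + specific feature learning for two modalities.
import torch
import torch.nn as nn

class SharedSpecific(nn.Module):
    def __init__(self, dim_a, dim_b, d_shared=32, d_spec=16):
        super().__init__()
        self.shared_a = nn.Linear(dim_a, d_shared)
        self.shared_b = nn.Linear(dim_b, d_shared)
        self.spec_a = nn.Linear(dim_a, d_spec)
        self.spec_b = nn.Linear(dim_b, d_spec)

    def forward(self, x_a, x_b):
        s_a, s_b = self.shared_a(x_a), self.shared_b(x_b)
        # Alignment loss: shared codes of the same pixel should match.
        align = (s_a - s_b).pow(2).mean()
        feats = torch.cat([s_a + s_b, self.spec_a(x_a), self.spec_b(x_b)], -1)
        return feats, align   # feats feed a classifier; align joins the loss

net = SharedSpecific(dim_a=244, dim_b=4)
feats, align = net(torch.randn(8, 244), torch.randn(8, 4))
```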

Journal ArticleDOI
TL;DR: In this article, the authors propose an endmember-guided unmixing network (EGU-Net), a two-stream Siamese deep network that learns an additional network from pure or nearly pure endmembers to correct the weights of another unmixing network by sharing network parameters and adding spectrally meaningful constraints.
Abstract: Over the past decades, enormous efforts have been made to improve the performance of linear or nonlinear mixing models for hyperspectral unmixing (HU), yet their ability to simultaneously generalize across various spectral variabilities (SVs) and extract physically meaningful endmembers remains limited, due to weak data fitting and reconstruction and sensitivity to the various SVs. Inspired by the powerful learning ability of deep learning (DL), we attempt to develop a general DL approach for HU that fully considers the properties of endmembers extracted from the hyperspectral imagery, called the endmember-guided unmixing network (EGU-Net). Going beyond a standalone autoencoder-like architecture, EGU-Net is a two-stream Siamese deep network that learns an additional network from pure or nearly pure endmembers to correct the weights of another unmixing network by sharing network parameters and adding spectrally meaningful constraints (e.g., nonnegativity and sum-to-one) toward a more accurate and interpretable unmixing solution. Furthermore, the resulting general framework is not limited to pixel-wise spectral unmixing but is also applicable to spatial information modeling with convolutional operators for spatial–spectral unmixing. Experimental results on three different data sets with ground-truth abundance maps for each material demonstrate the effectiveness and superiority of EGU-Net over state-of-the-art unmixing algorithms. The codes will be available from the website: https://github.com/danfenghong/IEEE_TNNLS_EGU-Net.

96 citations
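One standard way to realize the nonnegativity and sum-to-one abundance constraints mentioned above is a softmax output over an autoencoder bottleneck; the sketch below shows only that mechanism and is not the two-stream EGU-Net itself (band counts and layer sizes are assumptions).

```python
# Hedged sketch: a softmax bottleneck enforces abundances >= 0 that sum to
# one; the linear decoder's weight rows then act like endmember spectra.
import torch
import torch.nn as nn

n_bands, n_endmembers = 200, 5
encoder = nn.Sequential(nn.Linear(n_bands, 64), nn.ReLU(),
                        nn.Linear(64, n_endmembers))
decoder = nn.Linear(n_endmembers, n_bands, bias=False)

pixels = torch.rand(16, n_bands)
abund = torch.softmax(encoder(pixels), dim=-1)  # nonnegative, sums to one
recon = decoder(abund)
loss = (recon - pixels).pow(2).mean()           # reconstruction objective
```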

References
Journal ArticleDOI
Jacob Cohen
TL;DR: In this article, the author presents a procedure for having two or more judges independently categorize a sample of units so that the degree and significance of their agreement can be determined, since it is important to establish the extent to which such nominal-scale judgments are reproducible, i.e., reliable.
Abstract: CONSIDER Table 1. It represents in its formal characteristics a situation which arises in the clinical-social-personality areas of psychology, where it frequently occurs that the only useful level of measurement obtainable is nominal scaling (Stevens, 1951, pp. 25–26), i.e., placement in a set of k unordered categories. Because the categorizing of the units is a consequence of some complex judgment process performed by a "two-legged meter" (Stevens, 1958), it becomes important to determine the extent to which these judgments are reproducible, i.e., reliable. The procedure which suggests itself is that of having two (or more) judges independently categorize a sample of units and determine the degree, significance, and…

34,965 citations
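The agreement statistic this paper introduces, Cohen's kappa, corrects observed agreement p_o for chance agreement p_e via kappa = (p_o - p_e) / (1 - p_e). A from-scratch computation on toy labels (the rater data here are invented for illustration):

```python
# Cohen's kappa for two raters: kappa = (p_o - p_e) / (1 - p_e), where p_o
# is observed agreement and p_e is chance agreement from the marginals.
import numpy as np

rater1 = np.array([0, 1, 1, 2, 2, 2, 0, 1, 2, 0])
rater2 = np.array([0, 1, 2, 2, 2, 1, 0, 1, 2, 0])

k = max(rater1.max(), rater2.max()) + 1
conf = np.zeros((k, k))
for a, b in zip(rater1, rater2):
    conf[a, b] += 1                          # joint frequency table
conf /= conf.sum()

p_o = np.trace(conf)                         # observed agreement
p_e = conf.sum(axis=1) @ conf.sum(axis=0)    # chance agreement
kappa = (p_o - p_e) / (1 - p_e)
print(round(kappa, 3))                       # 0.697 for these toy labels
```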

Proceedings Article
Sergey Ioffe, Christian Szegedy
06 Jul 2015
TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.
Abstract: Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates and careful parameter initialization, and makes it notoriously hard to train models with saturating nonlinearities. We refer to this phenomenon as internal covariate shift, and address the problem by normalizing layer inputs. Our method draws its strength from making normalization a part of the model architecture and performing the normalization for each training mini-batch. Batch Normalization allows us to use much higher learning rates and be less careful about initialization, and in some cases eliminates the need for Dropout. Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin. Using an ensemble of batch-normalized networks, we improve upon the best published result on ImageNet classification: reaching 4.82% top-5 test error, exceeding the accuracy of human raters.

30,843 citations
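The transform itself is short: normalize each feature over the mini-batch, then scale and shift with learned parameters gamma and beta. Below is a minimal NumPy sketch of the training-time forward pass only; at inference the paper replaces mini-batch statistics with running averages, which this sketch omits.

```python
# Minimal sketch of the batch normalization transform described above.
import numpy as np

def batch_norm(x, gamma, beta, eps=1e-5):
    mu = x.mean(axis=0)                  # per-feature mini-batch mean
    var = x.var(axis=0)                  # per-feature mini-batch variance
    x_hat = (x - mu) / np.sqrt(var + eps)
    return gamma * x_hat + beta          # learned scale and shift

x = np.random.randn(32, 4) * 10 + 3      # mini-batch of 32, 4 features
y = batch_norm(x, gamma=np.ones(4), beta=np.zeros(4))
print(y.mean(axis=0).round(3), y.std(axis=0).round(3))  # ~0 and ~1
```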

Posted Content
Sergey Ioffe, Christian Szegedy
TL;DR: Batch Normalization as mentioned in this paper normalizes layer inputs for each training mini-batch to reduce the internal covariate shift in deep neural networks, and achieves state-of-the-art performance on ImageNet.
Abstract: Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates and careful parameter initialization, and makes it notoriously hard to train models with saturating nonlinearities. We refer to this phenomenon as internal covariate shift, and address the problem by normalizing layer inputs. Our method draws its strength from making normalization a part of the model architecture and performing the normalization for each training mini-batch. Batch Normalization allows us to use much higher learning rates and be less careful about initialization. It also acts as a regularizer, in some cases eliminating the need for Dropout. Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin. Using an ensemble of batch-normalized networks, we improve upon the best published result on ImageNet classification: reaching 4.9% top-5 validation error (and 4.8% test error), exceeding the accuracy of human raters.

17,184 citations

Posted Content
TL;DR: This work proposes a Parametric Rectified Linear Unit (PReLU) that generalizes the traditional rectified unit and derives a robust initialization method that particularly considers the rectifier nonlinearities.
Abstract: Rectified activation units (rectifiers) are essential for state-of-the-art neural networks. In this work, we study rectifier neural networks for image classification from two aspects. First, we propose a Parametric Rectified Linear Unit (PReLU) that generalizes the traditional rectified unit. PReLU improves model fitting with nearly zero extra computational cost and little overfitting risk. Second, we derive a robust initialization method that particularly considers the rectifier nonlinearities. This method enables us to train extremely deep rectified models directly from scratch and to investigate deeper or wider network architectures. Based on our PReLU networks (PReLU-nets), we achieve 4.94% top-5 test error on the ImageNet 2012 classification dataset. This is a 26% relative improvement over the ILSVRC 2014 winner (GoogLeNet, 6.66%). To our knowledge, our result is the first to surpass human-level performance (5.1%, Russakovsky et al.) on this visual recognition challenge.

11,866 citations

Proceedings ArticleDOI
07 Dec 2015
TL;DR: In this paper, a Parametric Rectified Linear Unit (PReLU) was proposed to improve model fitting with nearly zero extra computational cost and little overfitting risk, achieving a 4.94% top-5 test error on the ImageNet 2012 classification dataset.
Abstract: Rectified activation units (rectifiers) are essential for state-of-the-art neural networks. In this work, we study rectifier neural networks for image classification from two aspects. First, we propose a Parametric Rectified Linear Unit (PReLU) that generalizes the traditional rectified unit. PReLU improves model fitting with nearly zero extra computational cost and little overfitting risk. Second, we derive a robust initialization method that particularly considers the rectifier nonlinearities. This method enables us to train extremely deep rectified models directly from scratch and to investigate deeper or wider network architectures. Based on the learnable activation and advanced initialization, we achieve 4.94% top-5 test error on the ImageNet 2012 classification dataset. This is a 26% relative improvement over the ILSVRC 2014 winner (GoogLeNet, 6.66% [33]). To our knowledge, our result is the first to surpass the reported human-level performance (5.1%, [26]) on this dataset.

11,732 citations
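Both ideas in this paper fit in a few lines: PReLU adds a learnable slope for negative inputs, and He initialization scales weights to account for the rectifier. The sketch below shows the plain-ReLU variant of the initialization (std = sqrt(2 / fan_in)); PyTorch ships both pieces as nn.PReLU and nn.init.kaiming_normal_.

```python
# From-scratch sketch of PReLU and He ("Kaiming") initialization.
import torch

def prelu(x, a):                  # a: learnable slope for negative inputs
    return torch.where(x >= 0, x, a * x)

def he_init(fan_in, fan_out):     # weights ~ N(0, 2 / fan_in)
    return torch.randn(fan_out, fan_in) * (2.0 / fan_in) ** 0.5

W = he_init(256, 128)
a = torch.full((128,), 0.25, requires_grad=True)  # 0.25: the paper's start
x = torch.randn(8, 256)
y = prelu(x @ W.T, a)             # one rectified layer, (8, 128) output
```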