Revisiting Co-Saliency Detection: A Novel Approach Based on Two-Stage Multi-View Spectral Rotation Co-clustering (2017) | Xiwen Yao

Citations

Journal Article•DOI•

When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs

Gong Cheng¹, Ceyuan Yang¹, Xiwen Yao¹, Lei Guo¹, Junwei Han¹•Institutions (1)

Northwestern Polytechnical University¹

09 Jan 2018-IEEE Transactions on Geoscience and Remote Sensing

TL;DR: This paper proposes a simple but effective method to learn discriminative CNNs (D-CNNs) to boost the performance of remote sensing image scene classification and comprehensively evaluates the proposed method on three publicly available benchmark data sets using three off-the-shelf CNN models.

Abstract: Remote sensing image scene classification is an active and challenging task driven by many applications. More recently, with the advances of deep learning models, especially convolutional neural networks (CNNs), the performance of remote sensing image scene classification has been significantly improved due to the powerful feature representations learnt through CNNs. Although great success has been obtained so far, the problems of within-class diversity and between-class similarity are still two big challenges. To address these problems, in this paper, we propose a simple but effective method to learn discriminative CNNs (D-CNNs) to boost the performance of remote sensing image scene classification. Different from the traditional CNN models that minimize only the cross entropy loss, our proposed D-CNN models are trained by optimizing a new discriminative objective function. To this end, apart from minimizing the classification error, we also explicitly impose a metric learning regularization term on the CNN features. The metric learning regularization forces the D-CNN models to be more discriminative so that, in the new D-CNN feature spaces, the images from the same scene class are mapped close to each other and the images of different classes are mapped as far apart as possible. In the experiments, we comprehensively evaluate the proposed method on three publicly available benchmark data sets using three off-the-shelf CNN models. Experimental results demonstrate that our proposed D-CNN methods outperform the existing baseline methods and achieve state-of-the-art results on all three data sets.
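
The objective described in this abstract (cross entropy plus a metric learning regularization term on the features) can be illustrated with a minimal sketch. The abstract does not give the exact formula, so the contrastive-style hinge form, the `margin` and `weight` parameters, and all function names below are assumptions chosen for illustration, not the authors' formulation.

```python
import math

def cross_entropy(logits, label):
    # Numerically stable softmax cross entropy for one sample.
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]

def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def metric_regularizer(features, labels, margin=1.0):
    # Assumed contrastive form: pull same-class pairs together,
    # push different-class pairs at least `margin` apart.
    total, pairs = 0.0, 0
    n = len(features)
    for i in range(n):
        for j in range(i + 1, n):
            d2 = squared_distance(features[i], features[j])
            if labels[i] == labels[j]:
                total += d2
            else:
                total += max(0.0, margin - math.sqrt(d2)) ** 2
            pairs += 1
    return total / pairs if pairs else 0.0

def discriminative_loss(logits_batch, features, labels, weight=0.05, margin=1.0):
    # Classification error plus the weighted metric term (weight is hypothetical).
    ce = sum(cross_entropy(l, y) for l, y in zip(logits_batch, labels)) / len(labels)
    return ce + weight * metric_regularizer(features, labels, margin)
```

With this form the regularizer is zero when same-class features coincide and different-class features are at least `margin` apart, which matches the qualitative behaviour the abstract describes.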

1,001 citations

Journal Article•DOI•

Deep Visual Attention Prediction

Wenguan Wang¹, Jianbing Shen¹•Institutions (1)

Beijing Institute of Technology¹

01 May 2018-IEEE Transactions on Image Processing

TL;DR: This paper proposes a skip-layer network structure that predicts human attention from multiple convolutional layers with various receptive fields, which significantly reduces the redundancy of previous approaches that learn multiple network streams with different input scales.

Abstract: In this paper, we aim to predict human eye fixation with view-free scenes based on an end-to-end deep learning architecture. Although convolutional neural networks (CNNs) have made substantial improvement on human attention prediction, CNN-based attention models still need to be improved by efficiently leveraging multi-scale features. Our visual attention network is proposed to capture hierarchical saliency information from deep, coarse layers with global saliency information to shallow, fine layers with local saliency response. Our model is based on a skip-layer network structure, which predicts human attention from multiple convolutional layers with various receptive fields. Final saliency prediction is achieved via the cooperation of those global and local predictions. Our model is learned in a deep supervision manner, where supervision is directly fed into multi-level layers, instead of previous approaches of providing supervision only at the output layer and propagating this supervision back to earlier layers. Our model thus incorporates multi-level saliency predictions within a single network, which significantly decreases the redundancy of previous approaches of learning multiple network streams with different input scales. Extensive experimental analysis on various challenging benchmark data sets demonstrates that our method yields state-of-the-art performance with competitive inference time. Our source code is available at https://github.com/wenguanwang/deepattention.
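
The deep supervision idea in this abstract (a loss attached to every level rather than only to the final output) can be sketched as follows. This is an illustrative toy, not the released code: it assumes each side output has already been resized to the ground-truth resolution and flattened to a list of probabilities, it uses binary cross entropy as the per-level loss, and it fuses levels by a plain average, all of which are assumptions.

```python
import math

def bce(pred, target, eps=1e-7):
    # Mean binary cross entropy between a predicted and a target saliency map.
    total = 0.0
    for p, t in zip(pred, target):
        p = min(max(p, eps), 1.0 - eps)
        total -= t * math.log(p) + (1.0 - t) * math.log(1.0 - p)
    return total / len(pred)

def deep_supervision_loss(side_outputs, target):
    # Every level is supervised directly, so the losses are summed over levels.
    return sum(bce(pred, target) for pred in side_outputs)

def fuse(side_outputs):
    # Assumed fusion: average the global (deep) and local (shallow) predictions.
    n = len(side_outputs)
    return [sum(vals) / n for vals in zip(*side_outputs)]
```

In a trained network the fusion would typically be learned rather than a fixed average; the average only stands in for "cooperation of global and local predictions".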

532 citations

Journal Article•DOI•

Rotation-Insensitive and Context-Augmented Object Detection in Remote Sensing Images

Ke Li, Gong Cheng¹, Shuhui Bu¹, Xiong You•Institutions (1)

Northwestern Polytechnical University¹

01 Apr 2018-IEEE Transactions on Geoscience and Remote Sensing

TL;DR: This paper proposes a novel deep-learning-based object detection framework including region proposal network (RPN) and local-contextual feature fusion network designed for remote sensing images that can deal with the multiangle and multiscale characteristics of geospatial objects.

Abstract: Most existing deep-learning-based methods struggle to deal effectively with the challenges of geospatial object detection, such as rotation variations and appearance ambiguity. To address these problems, this paper proposes a novel deep-learning-based object detection framework including region proposal network (RPN) and local-contextual feature fusion network designed for remote sensing images. Specifically, the RPN includes additional multiangle anchors besides the conventional multiscale and multiaspect-ratio ones, and thus can deal with the multiangle and multiscale characteristics of geospatial objects. To address the appearance ambiguity problem, we propose a double-channel feature fusion network that can learn local and contextual properties along two independent pathways. The two kinds of features are later combined in the final layers of processing in order to form a powerful joint representation. Comprehensive evaluations on a publicly available ten-class object detection data set demonstrate the effectiveness of the proposed method.
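
The multiangle anchor idea can be illustrated by enumerating anchors over scales, aspect ratios, and angles at one location. The sketch below is an assumption-laden illustration: the `(cx, cy, w, h, angle)` representation, the function name, and the example scale, ratio, and angle values are hypothetical and are not taken from the paper.

```python
import math
from itertools import product

def make_anchors(cx, cy, scales, ratios, angles_deg):
    # One anchor per (scale, aspect ratio, angle) combination.
    # ratio is width / height; the area stays scale ** 2 for every ratio.
    anchors = []
    for s, r, a in product(scales, ratios, angles_deg):
        w = s * math.sqrt(r)
        h = s / math.sqrt(r)
        anchors.append((cx, cy, w, h, a))
    return anchors

# Hypothetical settings: 2 scales x 3 ratios x 6 angles = 36 anchors per location.
anchors = make_anchors(0, 0, [32, 64], [0.5, 1.0, 2.0], [0, 30, 60, 90, 120, 150])
```

Adding the angle axis multiplies the number of anchors per location, which is the cost paid for covering rotated objects.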

296 citations

Journal Article•DOI•

Remote Sensing Image Scene Classification Using Bag of Convolutional Features

Gong Cheng¹, Zhenpeng Li¹, Xiwen Yao¹, Lei Guo¹, Zhongliang Wei²•Institutions (2)

Northwestern Polytechnical University¹, Anhui University of Science and Technology²

11 Aug 2017-IEEE Geoscience and Remote Sensing Letters

TL;DR: This letter proposes a novel feature representation method for scene classification, named bag of convolutional features (BoCF). Unlike traditional bag-of-visual-words methods, in which the visual words are usually obtained from handcrafted feature descriptors, BoCF generates visual words from deep convolutional features using off-the-shelf convolutional neural networks.

Abstract: More recently, remote sensing image classification has been moving from pixel-level interpretation to scene-level semantic understanding, which aims to label each scene image with a specific semantic class. While significant efforts have been made in developing various methods for remote sensing image scene classification, most of them rely on handcrafted features. In this letter, we propose a novel feature representation method for scene classification, named bag of convolutional features (BoCF). Different from the traditional bag of visual words-based methods in which the visual words are usually obtained by using handcrafted feature descriptors, the proposed BoCF generates visual words from deep convolutional features using off-the-shelf convolutional neural networks. Extensive evaluations on a publicly available remote sensing image scene classification benchmark and comparison with the state-of-the-art methods demonstrate the effectiveness of the proposed BoCF method for remote sensing image scene classification.
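
The encoding step of a bag-of-features representation can be sketched as follows: each local convolutional descriptor is assigned to its nearest visual word, and the image is represented by the normalized word histogram. The sketch assumes the codebook has already been learned (for example by clustering descriptors, which is an assumption here) and the function names are hypothetical.

```python
def nearest_word(descriptor, codebook):
    # Index of the visual word closest to the descriptor (squared Euclidean distance).
    return min(
        range(len(codebook)),
        key=lambda k: sum((d - c) ** 2 for d, c in zip(descriptor, codebook[k])),
    )

def bocf_histogram(descriptors, codebook):
    # Count word assignments and normalize so the histogram sums to 1.
    counts = [0] * len(codebook)
    for d in descriptors:
        counts[nearest_word(d, codebook)] += 1
    total = sum(counts)
    return [c / total for c in counts] if total else [0.0] * len(codebook)

# Toy example: two visual words, four local descriptors.
codebook = [[0.0, 0.0], [10.0, 10.0]]
descriptors = [[1.0, 0.0], [0.0, 1.0], [9.0, 9.0], [11.0, 10.0]]
histogram = bocf_histogram(descriptors, codebook)
```

In practice the descriptors would come from a convolutional feature map, one vector per spatial position, and the histogram would be passed to a classifier.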

276 citations

Cites background from "Revisiting Co-Saliency Detection: A..."

...More recently, various deep learning algorithms, especially convolutional neural networks (CNNs), have shown their much stronger feature representation power in the field of computer vision [26]–[30]....

Journal Article•DOI•

Spectral–Spatial Unified Networks for Hyperspectral Image Classification

Yonghao Xu¹, Liangpei Zhang¹, Bo Du¹, Fan Zhang¹•Institutions (1)

Wuhan University¹

09 May 2018-IEEE Transactions on Geoscience and Remote Sensing

TL;DR: A band grouping-based long short-term memory model and a multiscale convolutional neural network are proposed as the spectral and spatial feature extractors, respectively, for the hyperspectral image (HSI) classification.

Abstract: In this paper, we propose a spectral–spatial unified network (SSUN) with an end-to-end architecture for the hyperspectral image (HSI) classification. Different from traditional spectral–spatial classification frameworks where the spectral feature extraction (FE), spatial FE, and classifier training are separated, these processes are integrated into a unified network in our model. In this way, both FE and classifier training will share a uniform objective function and all the parameters in the network can be optimized at the same time. In the implementation of the SSUN, we propose a band grouping-based long short-term memory model and a multiscale convolutional neural network as the spectral and spatial feature extractors, respectively. In the experiments, three benchmark HSIs are utilized to evaluate the performance of the proposed method. The experimental results demonstrate that the SSUN can yield a competitive performance compared with existing methods.
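
Band grouping turns one pixel's spectrum into a short sequence that a recurrent model can consume step by step. The abstract does not describe the grouping strategy, so the consecutive split below is only an assumption, and the function name is hypothetical.

```python
def group_bands(spectrum, num_groups):
    # Split a spectral vector into num_groups consecutive groups,
    # one group per time step; earlier groups absorb any remainder.
    size, rem = divmod(len(spectrum), num_groups)
    groups, start = [], 0
    for g in range(num_groups):
        end = start + size + (1 if g < rem else 0)
        groups.append(spectrum[start:end])
        start = end
    return groups

# Toy spectrum with 10 bands split into 3 time steps.
steps = group_bands(list(range(10)), 3)
```

When the band count is not divisible by the number of groups, the groups differ in length, so a real pipeline would need padding or a divisor-friendly group count before feeding them to a long short-term memory model.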

259 citations

Cites background from "Revisiting Co-Saliency Detection: A..."

...vision and artificial intelligence [28]–[31], a promising way to extract deep features for hyperspectral data has become...