Journal ArticleDOI

Salient Object Detection via Structured Matrix Decomposition

TL;DR: A novel structured matrix decomposition model is proposed with two structural regularizations: a tree-structured sparsity-inducing regularization that captures the image structure and enforces patches from the same object to have similar saliency values, and a Laplacian regularization that enlarges the gaps between salient objects and the background in feature space.
Abstract: Low-rank recovery models have shown potential for salient object detection, where a matrix is decomposed into a low-rank matrix representing image background and a sparse matrix identifying salient objects. Two deficiencies, however, still exist. First, previous work typically assumes the elements in the sparse matrix are mutually independent, ignoring the spatial and pattern relations of image regions. Second, when the low-rank and sparse matrices are relatively coherent, e.g., when there are similarities between the salient objects and background or when the background is complicated, it is difficult for previous models to disentangle them. To address these problems, we propose a novel structured matrix decomposition model with two structural regularizations: (1) a tree-structured sparsity-inducing regularization that captures the image structure and enforces patches from the same object to have similar saliency values, and (2) a Laplacian regularization that enlarges the gaps between salient objects and the background in feature space. Furthermore, high-level priors are integrated to guide the matrix decomposition and boost the detection. We evaluate our model for salient object detection on five challenging datasets including single object, multiple objects and complex scene images, and show competitive results as compared with 24 state-of-the-art methods in terms of seven performance metrics.
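Schematically, the decomposition described in the abstract can be written as follows; the symbols and weights are illustrative, not the paper's exact notation:

```latex
% F : feature matrix of image patches; L : low-rank background part;
% S : structured-sparse salient part; alpha, beta : trade-off weights.
\min_{L,S}\ \|L\|_{*}
  \;+\; \alpha\,\Omega(S)      % tree-structured sparsity-inducing regularization
  \;+\; \beta\,\Theta(L,S)     % Laplacian regularization separating L and S
\quad \text{s.t.}\quad F = L + S
```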
Citations
Posted Content
TL;DR: This paper reviews deep SOD algorithms from different perspectives, including network architecture, level of supervision, learning paradigm, and object-/instance-level detection, and looks into the generalization and difficulty of existing SOD datasets.
Abstract: As an essential problem in computer vision, salient object detection (SOD) has attracted an increasing amount of research attention over the years. Recent advances in SOD are predominantly led by deep learning-based solutions (named deep SOD). To enable in-depth understanding of deep SOD, in this paper, we provide a comprehensive survey covering various aspects, ranging from algorithm taxonomy to unsolved issues. In particular, we first review deep SOD algorithms from different perspectives, including network architecture, level of supervision, learning paradigm, and object-/instance-level detection. Following that, we summarize and analyze existing SOD datasets and evaluation metrics. Then, we benchmark a large group of representative SOD models, and provide detailed analyses of the comparison results. Moreover, we study the performance of SOD algorithms under different attribute settings, which has not been thoroughly explored previously, by constructing a novel SOD dataset with rich attribute annotations covering various salient object types, challenging factors, and scene categories. We further analyze, for the first time in the field, the robustness of SOD models to random input perturbations and adversarial attacks. We also look into the generalization and difficulty of existing SOD datasets. Finally, we discuss several open issues of SOD and outline future research directions.

428 citations

Journal ArticleDOI
TL;DR: Zhang et al. review different types of saliency detection algorithms, summarize the important issues of existing methods, and discuss open problems and future work; experimental analysis and discussion provide a holistic overview of different saliency detection methods.
Abstract: The visual saliency detection model simulates the human visual system to perceive the scene and has been widely used in many vision tasks. With the development of acquisition technology, more comprehensive information, such as depth cue, inter-image correspondence, or temporal relationship, is available to extend image saliency detection to RGBD saliency detection, co-saliency detection, or video saliency detection. The RGBD saliency detection model focuses on extracting the salient regions from RGBD images by combining the depth information. The co-saliency detection model introduces the inter-image correspondence constraint to discover the common salient object in an image group. The goal of the video saliency detection model is to locate the motion-related salient object in video sequences, which considers the motion cue and spatiotemporal constraint jointly. In this paper, we review different types of saliency detection algorithms, summarize the important issues of the existing methods, and discuss the open problems and future work. Moreover, the evaluation datasets and quantitative measurements are briefly introduced, and the experimental analysis and discussion are conducted to provide a holistic overview of different saliency detection methods.

328 citations

Journal ArticleDOI
TL;DR: Wang et al. propose a weakly supervised color transfer method to correct color distortion, which relaxes the need for paired underwater images for training and allows the underwater images to be taken in unknown locations.
Abstract: Underwater vision suffers from severe degradation due to selective attenuation and scattering when light propagates through water. Such degradation not only affects the quality of underwater images but also limits the ability of vision tasks. Different from existing methods that either ignore the wavelength dependence of the attenuation or assume a specific spectral profile, we tackle the color distortion problem of underwater images from a new view. In this letter, we propose a weakly supervised color transfer method to correct color distortion. The proposed method relaxes the need for paired underwater images for training and allows the underwater images to be taken in unknown locations. Inspired by cycle-consistent adversarial networks, we design a multiterm loss function including an adversarial loss, a cycle consistency loss, and a structural similarity index measure (SSIM) loss, which keeps the content and structure of the outputs the same as the inputs while making the color similar to images taken without the effect of water. Experiments on underwater images captured in diverse scenes show that our method produces visually pleasing results and even outperforms state-of-the-art methods. Besides, our method can improve the performance of vision tasks.
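The multiterm loss can be sketched in NumPy with a simplified global SSIM (no sliding window) and a least-squares adversarial term; `d_score` stands in for a discriminator output and all weights are illustrative, not the authors' implementation:

```python
import numpy as np

def ssim_global(x, y, c1=0.01**2, c2=0.03**2):
    # Simplified global SSIM (no sliding window); inputs assumed in [0, 1].
    mx, my = x.mean(), y.mean()
    vx, vy = x.var(), y.var()
    cov = ((x - mx) * (y - my)).mean()
    return ((2 * mx * my + c1) * (2 * cov + c2)) / \
           ((mx**2 + my**2 + c1) * (vx + vy + c2))

def multiterm_loss(x, x_cycled, y_fake, d_score,
                   w_adv=1.0, w_cyc=10.0, w_ssim=1.0):
    adv = (d_score - 1.0) ** 2             # least-squares adversarial term
    cyc = np.abs(x - x_cycled).mean()      # L1 cycle-consistency term
    ssim_l = 1.0 - ssim_global(x, y_fake)  # structural similarity term
    return w_adv * adv + w_cyc * cyc + w_ssim * ssim_l
```

An identity mapping with a fully fooled discriminator drives all three terms to zero, which is the intended optimum of this kind of objective.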

308 citations

Journal ArticleDOI
TL;DR: An attention steered interweave fusion network (ASIF-Net) is proposed to detect salient objects, which progressively integrates cross-modal and cross-level complementarity from the RGB image and corresponding depth map via steering of an attention mechanism.
Abstract: Salient object detection from RGB-D images is an important yet challenging vision task, which aims at detecting the most distinctive objects in a scene by combining color information and depth constraints. Unlike prior fusion manners, we propose an attention steered interweave fusion network (ASIF-Net) to detect salient objects, which progressively integrates cross-modal and cross-level complementarity from the RGB image and corresponding depth map via steering of an attention mechanism. Specifically, the complementary features from RGB-D images are jointly extracted and hierarchically fused in a dense and interweaved manner. Such a manner breaks down the barriers of inconsistency existing in the cross-modal data and also sufficiently captures the complementarity. Meanwhile, an attention mechanism is introduced to locate the potential salient regions in an attention-weighted fashion, which advances in highlighting the salient objects and suppressing the cluttered background regions. Instead of focusing only on pixelwise saliency, we also ensure that the detected salient objects have the objectness characteristics (e.g., complete structure and sharp boundary) by incorporating the adversarial learning that provides a global semantic constraint for RGB-D salient object detection. Quantitative and qualitative experiments demonstrate that the proposed method performs favorably against 17 state-of-the-art saliency detectors on four publicly available RGB-D salient object detection datasets. The code and results of our method are available at https://github.com/Li-Chongyi/ASIF-Net .

188 citations


Cites background or methods from "Salient Object Detection via Struct..."

  • ...[13] formulated saliency detection as a structured matrix decomposition problem guided by high-level priors....


  • ...For example, the salient objects (e.g., the Athena sculpture in the seventh image, the white cat in the fourth last image, and the cake in the second last image) are not effectively detected by the SMD [13] and RCRR [14] methods....


  • ...We extensively compare the proposed method with 17 state-of-the-art methods on four datasets, including four unsupervised RGB saliency detection methods (DSG [11], MILPS [12], SMD [13], and RCRR [14]), three deep-learning-based RGB saliency detection methods (DCL [15], DSS [16], and R3Net [19]), three unsupervised RGB-D saliency detection methods (ACSD [56], DCMC [58], and MBP [55]), and seven deep-learning-based RGB-D saliency detection methods (DF [63], CTMF [47], PCFN [64], MMCI [65], CPFP [67], DMRA [68], and TANet [66])....


Journal ArticleDOI
TL;DR: A cascaded R-CNN obtains multiscale features in pyramids to reduce missed and false detections of traffic signs, and a data augmentation method expands the German traffic sign training dataset by simulating complex environmental changes.
Abstract: In recent years, deep learning has been applied to traffic sign detection and achieves excellent performance. However, two main challenges in traffic sign detection remain to be solved urgently. For one thing, traffic signs of small size are more difficult to detect than those of large size, so small traffic signs often go undetected. For another, false signs are often detected because of interference caused by illumination variation, bad weather, and signs similar to true traffic signs. Therefore, to reduce missed and false detections, we first propose a cascaded R-CNN to obtain multiscale features in pyramids. Each layer of the cascaded network except the first fuses the output bounding boxes of the previous layer for joint training. This method contributes to traffic sign detection. Then, we propose a multiscale attention method that obtains weighted multiscale features by dot-product and softmax; the weighted features are summed to refine the features, highlight the traffic sign features, and improve the accuracy of traffic sign detection. Finally, we increase the number of difficult negative samples for dataset balance and data augmentation in training, to mitigate interference from complex environments and similar false traffic signs. The data augmentation method expands the German traffic sign training dataset by simulating complex environmental changes. We conduct numerous experiments to verify the effectiveness of the proposed algorithm. The accuracy and recall rate of our method are 98.7% and 90.5% on GTSDB, 99.7% and 83.62% on CCTSDB, and 98.9% and 85.6% on the LISA dataset, respectively.
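The weighted-multiscale-feature idea (dot-product scores, softmax, then a weighted sum) can be sketched generically; the shapes and the guidance vector below are illustrative assumptions, not the paper's architecture:

```python
import numpy as np

def softmax(z, axis=0):
    # Numerically stable softmax.
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def multiscale_attention(features, query):
    # features: (S, C), one C-dim descriptor per scale; query: (C,) guidance vector.
    scores = features @ query    # dot-product similarity per scale
    weights = softmax(scores)    # attention weights over scales, summing to 1
    return weights @ features    # attention-weighted sum of scale features
```

When one scale's score dominates, the output approaches that scale's feature vector; with equal scores it reduces to a plain average.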

182 citations


Cites result from "Salient Object Detection via Struct..."

  • ...[17], HOG+SVM [30], CCNN [32], RBD [48], SMD [49], and SRM [50], the results are shown in Table 3, and the specific experimental details are shown in Figure 10....


References
Journal ArticleDOI
TL;DR: A new hypothesis about the role of focused attention is proposed, which offers a new set of criteria for distinguishing separable from integral features and a new rationale for predicting which tasks will show attention limits and which will not.

11,452 citations


"Salient Object Detection via Struct..." refers background in this paper

  • ...The foundation of most saliency detection algorithms can be traced back to the theories of center-surround difference [38] and multiple feature integration [39]....


Journal ArticleDOI
TL;DR: In this article, a visual attention system inspired by the behavior and the neuronal architecture of the early primate visual system is presented, where multiscale image features are combined into a single topographical saliency map.
Abstract: A visual attention system, inspired by the behavior and the neuronal architecture of the early primate visual system, is presented. Multiscale image features are combined into a single topographical saliency map. A dynamical neural network then selects attended locations in order of decreasing saliency. The system breaks down the complex problem of scene understanding by rapidly selecting, in a computationally efficient manner, conspicuous locations to be analyzed in detail.
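The center-surround mechanism described here is commonly approximated with a difference of Gaussians; a minimal single-channel sketch (not Itti et al.'s full multiscale pyramid, and with illustrative sigma values) might look like:

```python
import numpy as np

def gaussian_blur(img, sigma):
    # Separable Gaussian blur via two passes of 1-D convolution.
    radius = int(3 * sigma)
    x = np.arange(-radius, radius + 1)
    k = np.exp(-x**2 / (2 * sigma**2))
    k /= k.sum()
    rows = np.apply_along_axis(lambda r: np.convolve(r, k, mode="same"), 1, img)
    return np.apply_along_axis(lambda c: np.convolve(c, k, mode="same"), 0, rows)

def center_surround_saliency(img, sigma_c=1.0, sigma_s=4.0):
    # Difference of Gaussians: fine (center) minus coarse (surround) response,
    # normalized to [0, 1].
    dog = np.abs(gaussian_blur(img, sigma_c) - gaussian_blur(img, sigma_s))
    return (dog - dog.min()) / (dog.max() - dog.min() + 1e-12)
```

A small bright blob on a flat background produces its strongest response at and around the blob, which is the conspicuity behavior the model relies on.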

10,525 citations

01 Jan 1998
TL;DR: A visual attention system, inspired by the behavior and the neuronal architecture of the early primate visual system, is presented, which breaks down the complex problem of scene understanding by rapidly selecting conspicuous locations to be analyzed in detail.

8,566 citations


"Salient Object Detection via Struct..." refers background in this paper

  • ...[11], who derive saliency from the difference of Gaussians on multiple feature maps....


  • ...Bottom-up models [7], [11]–[17] are stimulus-driven and essentially based upon local and/or global center-surround difference, using low-level features, such as color, texture and location....


Journal ArticleDOI
TL;DR: A new superpixel algorithm is introduced, simple linear iterative clustering (SLIC), which adapts a k-means clustering approach to efficiently generate superpixels and is faster and more memory efficient, improves segmentation performance, and is straightforward to extend to supervoxel generation.
Abstract: Computer vision applications have come to rely increasingly on superpixels in recent years, but it is not always clear what constitutes a good superpixel algorithm. In an effort to understand the benefits and drawbacks of existing methods, we empirically compare five state-of-the-art superpixel algorithms for their ability to adhere to image boundaries, speed, memory efficiency, and their impact on segmentation performance. We then introduce a new superpixel algorithm, simple linear iterative clustering (SLIC), which adapts a k-means clustering approach to efficiently generate superpixels. Despite its simplicity, SLIC adheres to boundaries as well as or better than previous methods. At the same time, it is faster and more memory efficient, improves segmentation performance, and is straightforward to extend to supervoxel generation.
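A minimal SLIC-flavored sketch, reduced to plain k-means over joint (position, intensity) features for a grayscale image; the real algorithm restricts the search to a local window around each center and works in CIELAB color, so this is only an illustration of the idea:

```python
import numpy as np

def slic_like(img, n_segments=16, compactness=0.1, n_iter=10):
    # SLIC-style superpixels: k-means in joint (y, x, intensity) space.
    h, w = img.shape
    yy, xx = np.mgrid[0:h, 0:w]
    # Spatial coordinates scaled by compactness, stacked with intensity.
    feats = np.stack([compactness * yy.ravel(),
                      compactness * xx.ravel(),
                      img.ravel()], axis=1).astype(float)
    # Initialize cluster centers on a regular grid, as SLIC does.
    step = max(1, int((h * w / n_segments) ** 0.5))
    mask = ((yy % step == step // 2) & (xx % step == step // 2)).ravel()
    centers = feats[mask].copy()
    for _ in range(n_iter):
        d = ((feats[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
        labels = d.argmin(axis=1)
        for k in range(len(centers)):
            if np.any(labels == k):
                centers[k] = feats[labels == k].mean(axis=0)
    return labels.reshape(h, w)
```

The `compactness` factor trades spatial regularity against boundary adherence, mirroring the role of the compactness parameter in SLIC proper.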

7,849 citations


"Salient Object Detection via Struct..." refers methods in this paper

  • ...Then, we perform the simple linear iterative clustering (SLIC) algorithm [81] to over-segment the image into N atom patches (superpixels) P = {P1, P2, · · · , PN}....


Journal ArticleDOI
TL;DR: In this paper, the authors prove that under some suitable assumptions, it is possible to recover both the low-rank and the sparse components exactly by solving a very convenient convex program called Principal Component Pursuit; among all feasible decompositions, simply minimize a weighted combination of the nuclear norm and of the ℓ1 norm.
Abstract: This article is about a curious phenomenon. Suppose we have a data matrix, which is the superposition of a low-rank component and a sparse component. Can we recover each component individually? We prove that under some suitable assumptions, it is possible to recover both the low-rank and the sparse components exactly by solving a very convenient convex program called Principal Component Pursuit; among all feasible decompositions, simply minimize a weighted combination of the nuclear norm and of the ℓ1 norm. This suggests the possibility of a principled approach to robust principal component analysis, since our methodology and results assert that one can recover the principal components of a data matrix even though a positive fraction of its entries are arbitrarily corrupted. This extends to the situation where a fraction of the entries are missing as well. We discuss an algorithm for solving this optimization problem, and present applications in the area of video surveillance, where our methodology allows for the detection of objects in a cluttered background, and in the area of face recognition, where it offers a principled way of removing shadows and specularities in images of faces.
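The Principal Component Pursuit program above can be sketched with a simple ADMM scheme in NumPy; the fixed penalty `mu`, the default `lam`, and the iteration count are illustrative choices, not the paper's exact algorithm:

```python
import numpy as np

def shrink(x, tau):
    # Soft-thresholding: the proximal operator of the l1 norm.
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)

def rpca_pcp(M, lam=None, mu=None, n_iter=200):
    # Principal Component Pursuit via ADMM:
    #   min ||L||_* + lam * ||S||_1   s.t.   M = L + S
    m, n = M.shape
    if lam is None:
        lam = 1.0 / np.sqrt(max(m, n))
    if mu is None:
        mu = 0.25 * m * n / (np.abs(M).sum() + 1e-12)
    L = np.zeros_like(M); S = np.zeros_like(M); Y = np.zeros_like(M)
    for _ in range(n_iter):
        # Singular value thresholding step for the low-rank part.
        U, sig, Vt = np.linalg.svd(M - S + Y / mu, full_matrices=False)
        L = U @ np.diag(shrink(sig, 1.0 / mu)) @ Vt
        # Soft-thresholding step for the sparse part.
        S = shrink(M - L + Y / mu, lam / mu)
        # Dual ascent on the equality constraint M = L + S.
        Y = Y + mu * (M - L - S)
    return L, S
```

On a small synthetic matrix built as a rank-2 component plus sparse gross corruption, this scheme separates the two parts to good accuracy, which is the behavior the saliency models above exploit (background as L, salient objects as S).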

6,783 citations
