Proceedings ArticleDOI

Ablation-CAM: Visual Explanations for Deep Convolutional Network via Gradient-free Localization

01 Mar 2020 - pp. 983-991
TL;DR: This approach, Ablation-based Class Activation Mapping (Ablation-CAM), uses ablation analysis to determine the importance of individual feature map units w.r.t. a class and produces a coarse localization map highlighting the regions of the image that are important for predicting the concept.
Abstract: In response to recent criticism of gradient-based visualization techniques, we propose a new methodology to generate visual explanations for deep Convolutional Neural Network (CNN)-based models. Our approach, Ablation-based Class Activation Mapping (Ablation-CAM), uses ablation analysis to determine the importance (weights) of individual feature map units w.r.t. a class. This is then used to produce a coarse localization map highlighting the important regions in the image for predicting the concept. Our objective and subjective evaluations show that this gradient-free approach works better than the state-of-the-art Grad-CAM technique. Further experiments show that Ablation-CAM is class discriminative and can also be used to evaluate trust in a model.
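
To make the mechanics concrete, here is a minimal sketch of the ablation idea described above, written in PyTorch. It assumes the network has been split into a convolutional feature extractor `features` and a classifier head `head` (both placeholder names), and it uses the relative drop in the class score when a feature map is zeroed out as that map's weight; this is an illustration of the idea, not the authors' reference implementation.

```python
# Minimal Ablation-CAM sketch (assumes a PyTorch model split into
# `features` -> activations of shape (1, K, H, W) and `head` -> class scores).
import torch
import torch.nn.functional as F

def ablation_cam(features, head, image, class_idx):
    """Coarse localization map from ablation-based feature-map weights."""
    with torch.no_grad():
        acts = features(image)                  # (1, K, H, W) feature maps
        base_score = head(acts)[0, class_idx]   # class score with all maps intact
        weights = acts.new_zeros(acts.shape[1])
        for k in range(acts.shape[1]):          # one head forward pass per map
            ablated = acts.clone()
            ablated[:, k] = 0.0                 # "remove" feature map k
            score_k = head(ablated)[0, class_idx]
            # importance = relative drop in the class score when map k is ablated
            weights[k] = (base_score - score_k) / (base_score + 1e-8)
        cam = torch.relu((weights.view(1, -1, 1, 1) * acts).sum(dim=1, keepdim=True))
        cam = F.interpolate(cam, size=image.shape[-2:], mode="bilinear",
                            align_corners=False)[0, 0]
        return (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)
```

Note that each feature map needs its own forward pass through the head, which is the per-image cost the XGrad-CAM authors later flag as time-consuming.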

Citations
Journal ArticleDOI
TL;DR: In this article, deep learning research is grouped into three main categories: explainable deep learning, efficient deep learning via model compression and acceleration, and robustness and stability in deep learning.

101 citations

Posted Content
TL;DR: This paper introduces two axioms -- Conservation and Sensitivity -- to the visualization paradigm of CAM methods and proposes a dedicated Axiom-based Grad-CAM (XGrad-CAM) that achieves better visualization performance while remaining class-discriminative and easy to implement compared with Grad-CAM++ and Ablation-CAM.
Abstract: To gain a better understanding and usage of Convolutional Neural Networks (CNNs), the visualization and interpretation of CNNs has attracted increasing attention in recent years. In particular, several Class Activation Mapping (CAM) methods have been proposed to discover the connection between a CNN's decision and image regions. In spite of their reasonable visualizations, a lack of clear and sufficient theoretical support is the main limitation of these methods. In this paper, we introduce two axioms -- Conservation and Sensitivity -- to the visualization paradigm of the CAM methods. Meanwhile, a dedicated Axiom-based Grad-CAM (XGrad-CAM) is proposed to satisfy these axioms as much as possible. Experiments demonstrate that XGrad-CAM is an enhanced version of Grad-CAM in terms of conservation and sensitivity. It is able to achieve better visualization performance than Grad-CAM, while also being class-discriminative and easy to implement compared with Grad-CAM++ and Ablation-CAM. The code is available at this https URL.
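
For comparison, a minimal sketch of an XGrad-CAM-style weighting is shown below, again in PyTorch with placeholder `features`/`head` modules. The weight of each feature map is taken as the sum of its gradients, weighted by the activation normalized over spatial positions, which is one reading of the axiom-motivated formulation; it is a sketch, not the code released at the paper's URL.

```python
# Minimal XGrad-CAM-style sketch: activation-normalized gradient weighting.
import torch
import torch.nn.functional as F

def xgrad_cam(features, head, image, class_idx):
    acts = features(image)                      # (1, K, H, W), part of the graph
    acts.retain_grad()                          # keep gradients for this non-leaf tensor
    score = head(acts)[0, class_idx]
    score.backward()
    grads = acts.grad                           # dS_c / dA, same shape as acts
    # w_k = sum_ij [ A_k(i,j) / sum_ij A_k(i,j) ] * dS_c/dA_k(i,j)
    denom = acts.sum(dim=(2, 3), keepdim=True) + 1e-8
    weights = ((acts / denom) * grads).sum(dim=(2, 3), keepdim=True)
    cam = torch.relu((weights * acts).sum(dim=1, keepdim=True)).detach()
    cam = F.interpolate(cam, size=image.shape[-2:], mode="bilinear",
                        align_corners=False)[0, 0]
    return (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)
```

Unlike the ablation sketch above, this needs only one forward and one backward pass per image.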

85 citations


Cites background or methods from "Ablation-CAM: Visual Explanations f..."

  • ...Besides, they also break the axiom of implementation invariance since they are layer sensitive [4]....

  • ..., Grad-CAM [23], Grad-CAM++ [3] and Ablation-CAM [4])....

  • ...[4] proposed Ablation-CAM to remove the dependence on gradients but this method is quite time-consuming since it has to run forward propagation for hundreds of times per image....

  • ...Note that the original weight of each feature map in Ablation-CAM [4] is defined as (S_c(F) − S_c(F \ F_lk)) / ||F_lk||....

  • ...This definition is inspired by CAM [32] and further improved by other works, such as Grad-CAM++ [3] and Ablation-CAM [4]....

Journal ArticleDOI
TL;DR: In this paper, a review of deep learning in electron microscopy is presented, with a focus on hardware and software needed to get started with deep learning and interface with electron microscopes.
Abstract: Deep learning is transforming most areas of science and technology, including electron microscopy. This review paper offers a practical perspective aimed at developers with limited familiarity. For context, we review popular applications of deep learning in electron microscopy. Following, we discuss hardware and software needed to get started with deep learning and interface with electron microscopes. We then review neural network components, popular architectures, and their optimization. Finally, we discuss future directions of deep learning in electron microscopy.

59 citations

Posted Content
TL;DR: Results indicate that several deep learning models, in particular WILDCAT and deep MIL, can provide a high level of classification accuracy, although pixel-wise localization of cancer regions remains an issue for such images.
Abstract: Using state-of-the-art deep learning models for cancer diagnosis presents several challenges related to the nature and availability of labeled histology images. In particular, cancer grading and localization in these images normally rely on both image- and pixel-level labels, the latter requiring a costly annotation process. In this survey, deep weakly-supervised learning (WSL) models are investigated to identify and locate diseases in histology images, without the need for pixel-level annotations. Given training data with global image-level labels, these models make it possible to simultaneously classify histology images and yield pixel-wise localization scores, thereby identifying the corresponding regions of interest (ROI). Since relevant WSL models have mainly been investigated within the computer vision community, and validated on natural scene images, we assess the extent to which they apply to histology images, which have challenging properties, e.g. very large size, similarity between foreground/background, highly unstructured regions, stain heterogeneity, and noisy/ambiguous labels. The most relevant models for deep WSL are compared experimentally in terms of accuracy (classification and pixel-wise localization) on several public benchmark histology datasets for breast and colon cancer -- BACH ICIAR 2018, BreaKHis, CAMELYON16, and GlaS. Furthermore, for large-scale evaluation of WSL models on histology images, we propose a protocol to construct WSL datasets from Whole Slide Imaging. Results indicate that several deep learning models can provide a high level of classification accuracy, although accurate pixel-wise localization of cancer regions remains an issue for such images. Code is publicly available.
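
To illustrate how a model trained only with image-level labels can still produce pixel-wise localization scores, here is a minimal CAM-style sketch in PyTorch. The `backbone`, `fc`, and channel count are placeholders, and this is the generic CAM recipe rather than any particular WSL model benchmarked in the survey.

```python
# Minimal CAM-style weak localization: train with image-level labels only,
# read off pixel-wise class evidence from the last conv layer at test time.
import torch
import torch.nn as nn

class WeaklySupervisedCAM(nn.Module):
    def __init__(self, backbone, num_classes, feat_channels):
        super().__init__()
        self.backbone = backbone                    # conv features: (B, C, H, W)
        self.fc = nn.Linear(feat_channels, num_classes)

    def forward(self, x):
        feats = self.backbone(x)                    # (B, C, H, W)
        pooled = feats.mean(dim=(2, 3))             # global average pooling
        logits = self.fc(pooled)                    # image-level prediction
        # pixel-wise localization scores: project features with the fc weights
        cams = torch.einsum("kc,bchw->bkhw", self.fc.weight, feats)
        return logits, cams                         # cams: (B, num_classes, H, W)
```

Because pooling and the linear layer commute, the class score equals the spatial average of the corresponding CAM, so the localization map comes "for free" from the classifier.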

48 citations

Posted Content
TL;DR: This paper introduces an enhanced visual explanation in terms of visual sharpness, called SS-CAM, which produces centralized localization of object features within an image through a smoothing operation and outperforms Score-CAM on both faithfulness and localization tasks.
Abstract: Interpretation of the underlying mechanisms of Deep Convolutional Neural Networks has become an important aspect of research in the field of deep learning due to their applications in high-risk environments. To explain these black-box architectures, many methods have been applied so that the internal decisions can be analyzed and understood. In this paper, built on top of Score-CAM, we introduce an enhanced visual explanation in terms of visual sharpness called SS-CAM, which produces centralized localization of object features within an image through a smoothing operation. Evaluated on the ILSVRC 2012 validation dataset, our method outperforms Score-CAM on both faithfulness and localization tasks.
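
A rough sketch of the general idea, Score-CAM-style weighting combined with SmoothGrad-style averaging over noisy inputs, is given below in PyTorch. The function and parameter names are placeholders and the exact placement of the smoothing step is an assumption, not the paper's precise procedure.

```python
# Sketch of Score-CAM-style weights averaged over noise-perturbed inputs
# (a reading of the general SS-CAM idea, not the paper's exact algorithm).
import torch
import torch.nn.functional as F

def smoothed_score_cam(model, features, image, class_idx, n_samples=8, sigma=0.1):
    with torch.no_grad():
        acts = features(image)                               # (1, K, H, W)
        maps = F.interpolate(acts, size=image.shape[-2:],
                             mode="bilinear", align_corners=False)
        lo = maps.amin(dim=(2, 3), keepdim=True)
        hi = maps.amax(dim=(2, 3), keepdim=True)
        maps = (maps - lo) / (hi - lo + 1e-8)                # normalize each map to [0, 1]
        weights = acts.new_zeros(acts.shape[1])
        for _ in range(n_samples):                           # average over noisy copies
            noisy = image + sigma * torch.randn_like(image)
            for k in range(acts.shape[1]):
                masked = noisy * maps[:, k:k + 1]            # keep only region k
                weights[k] += model(masked)[0, class_idx] / n_samples
        cam = torch.relu((weights.view(1, -1, 1, 1) * acts).sum(dim=1, keepdim=True))
        cam = F.interpolate(cam, size=image.shape[-2:], mode="bilinear",
                            align_corners=False)[0, 0]
        return (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)
```

Real implementations batch the masked inputs; written naively as above, the cost is n_samples forward passes per feature map.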

37 citations


Cites background from "Ablation-CAM: Visual Explanations f..."

  • ...They can be divided into two branches, one is gradient-based CAMs [2], [15], which represent the linear weights corresponding to internal activation maps by gradient information....

  • ...As the output layer is a non-linear function, gradient-based CAMs tend to diminish the backpropagating gradients which cause gradient saturation thereby making it difficult to provide concrete explanations....

  • ...These categories are known as Class Activation Maps (CAMs)....

  • ...The other is gradient-free CAMs [4], [23] which capture the importance of each activation map by the target score in forward propagation....

  • ...The generalisation of CAMs take place with Grad-CAM [15]....

References
Posted Content
TL;DR: The task of free-form and open-ended Visual Question Answering (VQA) is proposed: given an image and a natural language question about the image, the task is to provide an accurate natural language answer.
Abstract: We propose the task of free-form and open-ended Visual Question Answering (VQA). Given an image and a natural language question about the image, the task is to provide an accurate natural language answer. Mirroring real-world scenarios, such as helping the visually impaired, both the questions and answers are open-ended. Visual questions selectively target different areas of an image, including background details and underlying context. As a result, a system that succeeds at VQA typically needs a more detailed understanding of the image and complex reasoning than a system producing generic image captions. Moreover, VQA is amenable to automatic evaluation, since many open-ended answers contain only a few words or a closed set of answers that can be provided in a multiple-choice format. We provide a dataset containing ~0.25M images, ~0.76M questions, and ~10M answers (this http URL), and discuss the information it provides. Numerous baselines and methods for VQA are provided and compared with human performance. Our VQA demo is available on CloudCV (this http URL).

2,365 citations


"Ablation-CAM: Visual Explanations f..." refers background in this paper

  • ...Convolutional Neural Networks (CNNs) are known to show near human-level performance on various computer vision tasks such as image classification [8], object detection [5], semantic segmentation [10] and have performed well on tasks such as image captioning [19] and visual question answering [2]....

Posted Content
TL;DR: This paper seeks to refine the discourse on interpretability in machine learning, examining the motivations for interpretability and the model properties and techniques thought to confer it, and identifying transparency and post-hoc explanation as competing notions.
Abstract: Supervised machine learning models boast remarkable predictive capabilities. But can you trust your model? Will it work in deployment? What else can it tell you about the world? We want models to be not only good, but interpretable. And yet the task of interpretation appears underspecified. Papers provide diverse and sometimes non-overlapping motivations for interpretability, and offer myriad notions of what attributes render models interpretable. Despite this ambiguity, many papers proclaim interpretability axiomatically, absent further explanation. In this paper, we seek to refine the discourse on interpretability. First, we examine the motivations underlying interest in interpretability, finding them to be diverse and occasionally discordant. Then, we address model properties and techniques thought to confer interpretability, identifying transparency to humans and post-hoc explanations as competing notions. Throughout, we discuss the feasibility and desirability of different notions, and question the oft-made assertions that linear models are interpretable and that deep neural networks are not.

1,423 citations


"Ablation-CAM: Visual Explanations f..." refers background in this paper

  • ...[9] emphasized the need for interpretable and trustworthy networks....

Journal ArticleDOI
TL;DR: In machine learning, the concept of interpretability is both important and slippery, and the motivations behind it and the properties thought to confer it deserve closer examination.
Abstract: Supervised machine-learning models boast remarkable predictive capabilities. But can you trust your model? Will it work in deployment? What else can it tell you about the world?

1,307 citations

Posted Content
TL;DR: The problem of attributing the prediction of a deep network to its input features, previously studied by several other works, is revisited, and two fundamental axioms -- Sensitivity and Implementation Invariance -- that attribution methods ought to satisfy are identified.
Abstract: We study the problem of attributing the prediction of a deep network to its input features, a problem previously studied by several other works. We identify two fundamental axioms---Sensitivity and Implementation Invariance that attribution methods ought to satisfy. We show that they are not satisfied by most known attribution methods, which we consider to be a fundamental weakness of those methods. We use the axioms to guide the design of a new attribution method called Integrated Gradients. Our method requires no modification to the original network and is extremely simple to implement; it just needs a few calls to the standard gradient operator. We apply this method to a couple of image models, a couple of text models and a chemistry model, demonstrating its ability to debug networks, to extract rules from a network, and to enable users to engage with models better.
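
Because the abstract emphasizes that the method needs only a few calls to the standard gradient operator, a minimal integrated-gradients sketch is easy to give; the PyTorch version below uses the common black-image baseline and a Riemann-sum approximation of the path integral, with `model` as a placeholder classifier.

```python
# Minimal integrated-gradients sketch: average gradients along the straight
# path from a baseline to the input, then scale by (input - baseline).
import torch

def integrated_gradients(model, image, class_idx, baseline=None, steps=50):
    if baseline is None:
        baseline = torch.zeros_like(image)        # common choice: black image
    total_grads = torch.zeros_like(image)
    for alpha in torch.linspace(0.0, 1.0, steps):
        point = baseline + alpha * (image - baseline)
        point.requires_grad_(True)
        score = model(point)[0, class_idx]
        grad, = torch.autograd.grad(score, point)
        total_grads += grad
    # Riemann approximation of the path integral of gradients
    return (image - baseline) * total_grads / steps
```

No modification to the network is required; the attribution typically stabilizes for step counts in the tens to low hundreds.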

1,282 citations


"Ablation-CAM: Visual Explanations f..." refers methods or result in this paper

  • ...[17] used an axiomatic approach for evaluating attribution methods....

  • ...Our approach is similar to the Integrated gradients approach [17], which also attacks the gradient saturation problem....

  • ...[17] used integrated gradients to attribute the prediction of CNN to input pixels....

Posted Content
TL;DR: It is shown that some existing saliency methods are independent both of the model and of the data generating process, and methods that fail the proposed tests are inadequate for tasks that are sensitive to either data or model.
Abstract: Saliency methods have emerged as a popular tool to highlight features in an input deemed relevant for the prediction of a learned model. Several saliency methods have been proposed, often guided by visual appeal on image data. In this work, we propose an actionable methodology to evaluate what kinds of explanations a given method can and cannot provide. We find that reliance, solely, on visual assessment can be misleading. Through extensive experiments we show that some existing saliency methods are independent both of the model and of the data generating process. Consequently, methods that fail the proposed tests are inadequate for tasks that are sensitive to either data or model, such as, finding outliers in the data, explaining the relationship between inputs and outputs that the model learned, and debugging the model. We interpret our findings through an analogy with edge detection in images, a technique that requires neither training data nor model. Theory in the case of a linear model and a single-layer convolutional neural network supports our experimental findings.
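
As an illustration of the model-randomization test described above, the sketch below compares a simple gradient saliency map from a trained model with one from a randomly re-initialized copy and reports their rank correlation; plain gradient saliency and Spearman correlation are illustrative choices, not the paper's full protocol.

```python
# Sketch of a model-randomization sanity check: a saliency method that is
# truly model-dependent should produce very different maps once the model's
# learned weights are replaced with random ones.
import copy
import torch
from scipy.stats import spearmanr

def gradient_saliency(model, image, class_idx):
    image = image.detach().clone().requires_grad_(True)
    model(image)[0, class_idx].backward()
    return image.grad.abs().amax(dim=1)           # (1, H, W): max over channels

def model_randomization_check(model, image, class_idx):
    trained_map = gradient_saliency(model, image, class_idx)
    randomized = copy.deepcopy(model)
    for p in randomized.parameters():             # destroy the learned weights
        torch.nn.init.normal_(p, std=0.02)
    random_map = gradient_saliency(randomized, image, class_idx)
    # A high rank correlation means the explanation barely depends on the model.
    rho, _ = spearmanr(trained_map.flatten().detach().cpu().numpy(),
                       random_map.flatten().detach().cpu().numpy())
    return rho
```

Methods whose maps remain highly correlated after randomization fail the test in the sense described above.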

927 citations