scispace - formally typeset
Open AccessJournal ArticleDOI

Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)

Amina Adadi, +1 more
- 17 Sep 2018 - 
- Vol. 6, pp 52138-52160
Reads0
Chats0
TLDR
This survey provides an entry point for interested researchers and practitioners to learn key aspects of the young and rapidly growing body of research related to XAI, and review the existing approaches regarding the topic, discuss trends surrounding its sphere, and present major research trajectories.
Abstract
At the dawn of the fourth industrial revolution, we are witnessing a fast and widespread adoption of artificial intelligence (AI) in our daily life, which contributes to accelerating the shift towards a more algorithmic society. However, even with such unprecedented advancements, a key impediment to the use of AI-based systems is that they often lack transparency. Indeed, the black-box nature of these systems allows powerful predictions, but it cannot be directly explained. This issue has triggered a new debate on explainable AI (XAI). A research field holds substantial promise for improving trust and transparency of AI-based systems. It is recognized as the sine qua non for AI to continue making steady progress without disruption. This survey provides an entry point for interested researchers and practitioners to learn key aspects of the young and rapidly growing body of research related to XAI. Through the lens of the literature, we review the existing approaches regarding the topic, discuss trends surrounding its sphere, and present major research trajectories.

read more

Citations
More filters
Posted Content

Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI.

TL;DR: Previous efforts to define explainability in Machine Learning are summarized, establishing a novel definition that covers prior conceptual propositions with a major focus on the audience for which explainability is sought, and a taxonomy of recent contributions related to the explainability of different Machine Learning models are proposed.
Journal ArticleDOI

Machine Learning Interpretability: A Survey on Methods and Metrics

TL;DR: A review of the current state of the research field on machine learning interpretability while focusing on the societal impact and on the developed methods and metrics is provided.
Journal ArticleDOI

Artificial Intelligence (AI) : Multidisciplinary perspectives on emerging challenges, opportunities, and agenda for research, practice and policy

TL;DR: This research offers significant and timely insight to AI technology and its impact on the future of industry and society in general, whilst recognising the societal and industrial influence on pace and direction of AI development.
Journal ArticleDOI

Explainable AI: A Review of Machine Learning Interpretability Methods

TL;DR: In this paper, a literature review and taxonomy of machine learning interpretability methods are presented, as well as links to their programming implementations, in the hope that this survey would serve as a reference point for both theorists and practitioners.
References
More filters
Proceedings Article

Interpretable Deep Models for ICU Outcome Prediction.

TL;DR: This paper introduces a simple yet powerful knowledge-distillation approach called interpretable mimic learning, which uses gradient boosting trees to learn interpretable models and at the same time achieves strong prediction performance as deep learning models.
Posted Content

Causal Effect Inference with Deep Latent-Variable Models

TL;DR: In this article, the authors use Variational Autoencoders (VAE) to learn individual-level causal effects from observational data, such as inferring the most effective medication for a specific patient.
Proceedings Article

Learning how to explain neural networks: PatternNet and PatternAttribution

TL;DR: This work argues that explanation methods for neural nets should work reliably in the limit of simplicity, the linear models, and proposes a generalization that yields two explanation techniques (PatternNet and PatternAttribution) that are theoretically sound for linear models and produce improved explanations for deep networks.
Posted Content

Learning how to explain neural networks: PatternNet and PatternAttribution

TL;DR: In this article, the authors argue that explanation methods for neural networks should work reliably in the limit of simplicity, the linear models, and propose a generalization that yields two explanation techniques (PatternNet and PatternAttribution) that are theoretically sound for linear models and produce improved explanations for deep networks.
Posted Content

Real Time Image Saliency for Black Box Classifiers

TL;DR: In this article, a fast saliency detection method that can be applied to any differentiable image classifier is proposed, which requires only a single forward pass to detect saliency.
Related Papers (5)