Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)

doi:10.1109/ACCESS.2018.2870052

Open AccessJournal ArticleDOI

Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)

Amina Adadi, +1 more

- 17 Sep 2018 -

IEEE Access

- Vol. 6, pp 52138-52160

Chats0

TLDR

This survey provides an entry point for interested researchers and practitioners to learn key aspects of the young and rapidly growing body of research related to XAI, and review the existing approaches regarding the topic, discuss trends surrounding its sphere, and present major research trajectories.

Abstract:

At the dawn of the fourth industrial revolution, we are witnessing a fast and widespread adoption of artificial intelligence (AI) in our daily life, which contributes to accelerating the shift towards a more algorithmic society. However, even with such unprecedented advancements, a key impediment to the use of AI-based systems is that they often lack transparency. Indeed, the black-box nature of these systems allows powerful predictions, but it cannot be directly explained. This issue has triggered a new debate on explainable AI (XAI). A research field holds substantial promise for improving trust and transparency of AI-based systems. It is recognized as the sine qua non for AI to continue making steady progress without disruption. This survey provides an entry point for interested researchers and practitioners to learn key aspects of the young and rapidly growing body of research related to XAI. Through the lens of the literature, we review the existing approaches regarding the topic, discuss trends surrounding its sphere, and present major research trajectories.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

ExplAIn Yourself! Transparency for Positive UX in Autonomous Driving

Tobias Schneider, +5 more

TL;DR: In this article, an initial guideline for autonomous driving experience design, bringing together the areas of user experience, explainable artificial intelligence and autonomous driving, was proposed, and the AVAM questionnaire, UEQ-S and interviews show that explanations during or after the ride help turn a negative user experience into a neutral one.

...read moreread less

Journal ArticleDOI

Cartesian genetic programming for diagnosis of Parkinson disease through handwriting analysis: Performance vs. interpretability issues.

Antonio Parziale, +4 more

- 01 Jan 2021 -

Artificial Intelligence in Medicine

TL;DR: A thorough comparison of different machine learning (ML) techniques, whose classification results are characterized by different levels of interpretability shows that the Cartesian Genetic Programming outperforms the white-box methods in accuracy and the black-box ones in interpretability.

...read moreread less

Journal ArticleDOI

Interpretability of Input Representations for Gait Classification in Patients after Total Hip Arthroplasty.

Carlo Dindorf, +4 more

- 06 Aug 2020 -

Sensors

TL;DR: It is shown that the type of input representation crucially determines interpretability as well as clinical relevance in a trained model using XAI methods, and a combined approach using different forms of representations seems advantageous.

...read moreread less

Proceedings ArticleDOI

A Review of Trust in Artificial Intelligence: Challenges, Vulnerabilities and Future Directions

Steven Lockey, +3 more

TL;DR: A literature review of what is known about the antecedents of trust in AI is taken, a concept matrix identifying the key vulnerabilities to stakeholders raised by each of the challenges is developed, and a multi-stakeholder approach is proposed.

...read moreread less

Posted Content

Principles to Practices for Responsible AI: Closing the Gap.

Daniel Schiff, +4 more

- 08 Jun 2020 -

arXiv: Computers and Society

TL;DR: It is argued that an impact assessment framework which is broad, operationalizable, flexible, iterative, guided, and participatory is a promising approach to close the principles-to-practices gap.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov, +4 more

TL;DR: This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.

...read moreread less

Posted Content

Distilling the Knowledge in a Neural Network

Geoffrey E. Hinton, +2 more

- 09 Mar 2015 -

arXiv: Machine Learning

TL;DR: This work shows that it can significantly improve the acoustic model of a heavily used commercial system by distilling the knowledge in an ensemble of models into a single model and introduces a new type of ensemble composed of one or more full models and many specialist models which learn to distinguish fine-grained classes that the full models confuse.

...read moreread less

Book ChapterDOI

Visualizing and Understanding Convolutional Networks

Matthew D. Zeiler, +1 more

TL;DR: A novel visualization technique is introduced that gives insight into the function of intermediate feature layers and the operation of the classifier in large Convolutional Network models, used in a diagnostic role to find model architectures that outperform Krizhevsky et al on the ImageNet classification benchmark.

...read moreread less

Proceedings ArticleDOI

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

Marco Tulio Ribeiro, +2 more

TL;DR: In this article, the authors propose LIME, a method to explain models by presenting representative individual predictions and their explanations in a non-redundant way, framing the task as a submodular optimization problem.

...read moreread less

Journal ArticleDOI

Mastering the game of Go without human knowledge

David Silver, +16 more

- 19 Oct 2017 -

Nature

TL;DR: An algorithm based solely on reinforcement learning is introduced, without human data, guidance or domain knowledge beyond game rules, that achieves superhuman performance, winning 100–0 against the previously published, champion-defeating AlphaGo.

...read moreread less

Collapse

Nature Machine Intelligence

On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation.

Sebastian Bach, +5 more

- 10 Jul 2015 -

PLOS ONE

Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)

Citations

ExplAIn Yourself! Transparency for Positive UX in Autonomous Driving

Cartesian genetic programming for diagnosis of Parkinson disease through handwriting analysis: Performance vs. interpretability issues.

Interpretability of Input Representations for Gait Classification in Patients after Total Hip Arthroplasty.

A Review of Trust in Artificial Intelligence: Challenges, Vulnerabilities and Future Directions

Principles to Practices for Responsible AI: Closing the Gap.

References

Distributed Representations of Words and Phrases and their Compositionality

Distilling the Knowledge in a Neural Network

Visualizing and Understanding Convolutional Networks

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

Mastering the game of Go without human knowledge

Related Papers (5)

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI

A unified approach to interpreting model predictions

Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead

On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation.