Open Access · Posted Content

Evaluating the Correctness of Explainable AI Algorithms for Classification.

TL;DR
This article develops a method for quantitatively evaluating the correctness of XAI algorithms by creating binary classification datasets with known explanation ground truth, and finds that classification accuracy is positively correlated with explanation accuracy.
Abstract
Explainable AI has attracted much research attention in recent years, with feature attribution algorithms, which compute "feature importance" in predictions, becoming increasingly popular. However, there is little analysis of the validity of these algorithms, as existing datasets contain no "ground truth" against which their correctness can be validated. In this work, we develop a method to quantitatively evaluate the correctness of XAI algorithms by creating datasets with known explanation ground truth. To this end, we focus on binary classification problems. String datasets are constructed using a formal language derived from a grammar. A string is positive if and only if a certain property is fulfilled. Symbols in a positive string serve as the explanation ground truth: a symbol is part of the explanation if and only if it contributes to fulfilling the property. Two popular feature attribution explainers, Local Interpretable Model-agnostic Explanations (LIME) and SHapley Additive exPlanations (SHAP), are used in our experiments. We show that: (1) classification accuracy is positively correlated with explanation accuracy; (2) SHAP provides more accurate explanations than LIME; and (3) explanation accuracy is negatively correlated with dataset complexity.
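The abstract does not spell out the authors' grammar or their exact explanation-accuracy metric, so the following is only a minimal sketch of the general idea under simplified assumptions: a hypothetical property (a string is positive if and only if it contains the substring "ab"), ground-truth symbols marked wherever that substring occurs, and a simple top-k scoring rule for comparing an attribution vector against the ground truth. The function names and the scoring rule are illustrative, not the paper's definitions.

```python
# Illustrative sketch only: the property, dataset, and scoring rule below are
# simplified assumptions, not the grammar or metric used by the authors.
import random


def make_string(length=10, alphabet="ab"):
    """Sample a random string over a small alphabet."""
    return "".join(random.choice(alphabet) for _ in range(length))


def label_and_ground_truth(s, pattern="ab"):
    """A string is positive iff it contains `pattern`; the symbols of every
    occurrence of `pattern` form the explanation ground truth."""
    truth = [False] * len(s)
    positive = False
    for i in range(len(s) - len(pattern) + 1):
        if s[i:i + len(pattern)] == pattern:
            positive = True
            for j in range(i, i + len(pattern)):
                truth[j] = True
    return positive, truth


def explanation_accuracy(attributions, truth):
    """Fraction of the top-k attributed positions that are ground-truth symbols,
    with k equal to the number of ground-truth symbols."""
    k = sum(truth)
    if k == 0:
        return 0.0
    top = sorted(range(len(attributions)),
                 key=lambda i: attributions[i], reverse=True)[:k]
    return sum(truth[i] for i in top) / k


# Build a small labelled dataset with per-symbol ground truth.
dataset = [(s, *label_and_ground_truth(s)) for s in (make_string() for _ in range(1000))]

# Score a hand-made "perfect" attribution on one positive string: it gets 1.0.
s, positive, truth = next(x for x in dataset if x[1])
perfect_attr = [1.0 if t else 0.0 for t in truth]
print(s, explanation_accuracy(perfect_attr, truth))
```

In the paper's setting, the attribution vector would come from LIME or SHAP applied to a trained classifier rather than being constructed by hand.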


Citations
Posted Content

Order in the Court: Explainable AI Methods Prone to Disagreement.

TL;DR: The authors compare LIME, Integrated Gradients, DeepLIFT, Grad-SHAP, Deep-SHAP, and attention-based explanations, applied to two neural architectures trained on single and pair-sequence language tasks.
Book Chapter

An Initial Study of Machine Learning Underspecification Using Feature Attribution Explainable AI Algorithms: A COVID-19 Virus Transmission Case Study

TL;DR: In this article, the authors propose identifying machine learning underspecification using feature attribution algorithms developed in Explainable AI; their study is limited to a single dataset and does not consider unseen data.
Journal Article

A Perspective on Explanations of Molecular Prediction Models

TL;DR: In this article, explainable artificial intelligence (XAI) is applied to deep learning models that predict solubility, blood-brain barrier permeability, and the scent of molecules.
Posted Content

Developing a Fidelity Evaluation Approach for Interpretable Machine Learning.

TL;DR: In this paper, a three-phase approach is proposed to evaluate the fidelity of explanations to the underlying black box for tabular data, and two popular explanation methods are assessed using this evaluation approach.
References
Proceedings Article

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

TL;DR: In this article, the authors propose LIME, a method to explain models by presenting representative individual predictions and their explanations in a non-redundant way, framing the task as a submodular optimization problem.
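As a rough illustration of how LIME is typically applied to string inputs like those described above, the sketch below explains a toy character-level classifier; the scikit-learn pipeline, the char_level setting, and the class names are assumptions made for this example, not the configuration used in the paper.

```python
# Hedged sketch: explaining a toy character-level string classifier with LIME.
# The pipeline, hyperparameters, and char_level choice are assumptions for
# illustration, not the setup evaluated in the paper above.
from lime.lime_text import LimeTextExplainer
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Toy data standing in for the string dataset described in the abstract.
texts = ["abba", "baba", "aaaa", "bbbb", "abab", "baab"]
labels = [1, 1, 0, 0, 1, 1]   # positive iff the string contains "ab"

model = make_pipeline(
    CountVectorizer(analyzer="char", ngram_range=(1, 2)),
    LogisticRegression(),
)
model.fit(texts, labels)

# char_level=True makes each character a feature to be attributed.
explainer = LimeTextExplainer(class_names=["negative", "positive"], char_level=True)
exp = explainer.explain_instance("abba", model.predict_proba, num_features=4)
print(exp.as_list())   # (character, weight) pairs: LIME's local attributions
```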
Proceedings Article

A unified approach to interpreting model predictions

TL;DR: In this article, a unified framework for interpreting predictions, SHAP (SHapley Additive exPlanations), is presented, which assigns each feature an importance value for a particular prediction.
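Similarly, here is a hedged sketch of SHAP's model-agnostic KernelExplainer; the random forest, the binary tabular stand-in for encoded strings, and the sample sizes are illustrative assumptions rather than the paper's setup.

```python
# Hedged sketch: model-agnostic SHAP values via KernelExplainer.
# The random forest and the binary tabular stand-in for encoded strings are
# assumptions for illustration, not the paper's configuration.
import numpy as np
import shap
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
X = rng.integers(0, 2, size=(200, 10)).astype(float)
y = (X[:, 2] * X[:, 3]).astype(int)   # label depends only on features 2 and 3

model = RandomForestClassifier(random_state=0).fit(X, y)

# KernelExplainer needs only a prediction function and a background sample.
background = shap.sample(X, 50)
explainer = shap.KernelExplainer(model.predict_proba, background)
shap_values = explainer.shap_values(X[:5], nsamples=200)

# Depending on the shap version this is a list per class or a 3-D array;
# either way, each row assigns one importance value per feature.
print(np.shape(shap_values))
```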
Book

An Introduction to Kolmogorov Complexity and Its Applications

TL;DR: This book presents a thorough treatment of Kolmogorov complexity with a wide range of illustrative applications, such as the randomness of finite objects or infinite sequences, Martin-Löf tests for randomness, information theory, computational learning theory, the complexity of algorithms, and the thermodynamics of computing.
Journal Article

Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead

TL;DR: This Perspective clarifies the chasm between explaining black boxes and using inherently interpretable models, outlines several key reasons why explainable black boxes should be avoided in high-stakes decisions, identifies challenges to interpretable machine learning, and provides several example applications where interpretable models could potentially replace black box models in criminal justice, healthcare, and computer vision.
Trending Questions (1)
What is accuracy in evaluating algorithms?

In this context, accuracy refers to the degree of correctness of an algorithm's outputs: classification accuracy is the fraction of predictions that match the true labels, while explanation accuracy measures how closely an explainer's feature attributions match the known explanation ground truth.