Evaluating the Quality of Machine Learning Explanations: A Survey on Methods and Metrics
TLDR
A comprehensive overview of methods proposed in the current literature for the evaluation of ML explanations is presented, finding that quantitative metrics for both model-based and example-based explanations are primarily used to evaluate the parsimony/simplicity of interpretability, and that subjective measures have been embraced as the focal point for the human-centered evaluation of explainable systems.
Abstract
The most successful Machine Learning (ML) systems remain complex black boxes to end-users, and even experts are often unable to understand the rationale behind their decisions. The lack of transparency of such systems can have severe consequences, such as the poor use of limited, valuable resources in medical diagnosis, financial decision-making, and other high-stakes domains. The issue of ML explanation has therefore experienced a surge in interest, from the research community through to application domains. While numerous explanation methods have been explored, evaluations are needed to quantify the quality of explanation methods: to determine whether, and to what extent, the offered explainability achieves the defined objective, and to compare the available explanation methods so that the best one can be suggested for a specific task. This survey paper presents a comprehensive overview of methods proposed in the current literature for the evaluation of ML explanations. We identify properties of explainability from a review of definitions of explainability, and these properties are used as the objectives that evaluation metrics should achieve. The survey found that quantitative metrics for both model-based and example-based explanations are primarily used to evaluate the parsimony/simplicity of interpretability, while quantitative metrics for attribution-based explanations are primarily used to evaluate the soundness of the fidelity of explainability. The survey also showed that subjective measures, such as trust and confidence, have been embraced as the focal point for the human-centered evaluation of explainable systems. The paper concludes that the evaluation of ML explanations is a multidisciplinary research topic, and that it is not possible to define a single implementation of evaluation metrics that can be applied to all explanation methods.
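As a concrete illustration of the kind of quantitative fidelity metric the survey discusses for attribution-based explanations, one approach common in the literature is a deletion test: progressively replace the features an explanation ranks as most important with a baseline value and watch how far the model's score falls. The sketch below is not from the survey; `deletion_fidelity` and the toy linear model are hypothetical stand-ins, chosen because a linear model's weighted inputs are exact attributions.

```python
import numpy as np

def deletion_fidelity(predict, x, attributions, baseline=0.0, steps=5):
    """Record the model's score as the most-attributed features of x
    are progressively replaced by a baseline value. A faithful
    attribution should produce a steep change early in the curve."""
    order = np.argsort(-np.abs(attributions))  # most important first
    scores = [predict(x)]
    x_perturbed = x.copy()
    k = max(1, len(x) // steps)                # features removed per step
    for i in range(0, len(x), k):
        x_perturbed[order[i:i + k]] = baseline
        scores.append(predict(x_perturbed))
    return np.array(scores)

# Toy linear black box: score = w . x, so w * x are exact attributions.
w = np.array([3.0, -1.0, 0.5, 0.0])
x = np.array([1.0, 1.0, 1.0, 1.0])
curve = deletion_fidelity(lambda v: float(w @ v), x, w * x)
```

Summarizing the resulting curve (for example, by its area) yields a single score on which attribution methods can be compared.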
Citations
Journal Article (DOI)
Current Challenges and Future Opportunities for XAI in Machine Learning-Based Clinical Decision Support Systems: A Systematic Review
Anna Markella Antoniadi, Yuhan Du, Yasmine Guendouz, Lan Wei, Claudia Mazo, Brett A. Becker, Catherine Mooney
TL;DR: The review finds an overall distinct lack of application of XAI in the context of CDSS and, in particular, a lack of user studies exploring the needs of clinicians.
Journal Article
The Disagreement Problem in Explainable Machine Learning: A Practitioner's Perspective
Satyapriya Krishna, Tessa Han, Alex Gu, Javin Pombra, Shahin Jabbari, Steven C. Wu, Himabindu Lakkaraju
TL;DR: This work introduces and studies the disagreement problem in explainable machine learning, formalizes the notion of disagreement between explanations, and analyzes how often such disagreements occur in practice and how practitioners resolve them.
Journal Article (DOI)
Research and Application of Machine Learning for Additive Manufacturing
Jian Qin, Fu Hu, Yang Liu, Paul Witherell, Charlie C. L. Wang, David W. Rosen, T. E. Simpson, Yuan Lu, Qian Tang
TL;DR: In this article, the authors employ a systematic literature review method to identify, assess, and analyse published literature on machine learning for additive manufacturing, including design for additive manufacturing (DfAM), material analytics, in situ monitoring and defect detection, property prediction, and sustainability.
Posted Content
Transparency of Deep Neural Networks for Medical Image Analysis: A Review of Interpretability Methods.
TL;DR: In this article, a narrative review of interpretability methods for deep learning models in medical image analysis is presented, organized by the type of generated explanations and by technical similarities.
Journal Article (DOI)
Transparency of deep neural networks for medical image analysis: A review of interpretability methods
TL;DR: In this paper, the authors identify nine different interpretability methods that have been used for understanding deep learning models in medical image analysis, based on the type of generated explanations and technical similarities.
References
Posted Content
Distilling the Knowledge in a Neural Network
TL;DR: This work shows that knowledge distillation can significantly improve the acoustic model of a heavily used commercial system by distilling the knowledge in an ensemble of models into a single model, and introduces a new type of ensemble composed of one or more full models and many specialist models that learn to distinguish fine-grained classes the full models confuse.
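The central mechanism described above, training a small model on the temperature-softened outputs of a large one, can be sketched in a few lines. This is an illustrative reconstruction of the soft-target loss term only (names and values are hypothetical), not the paper's implementation, which also blends in a hard-label term:

```python
import numpy as np

def softmax(logits, temperature=1.0):
    """Softmax with a temperature; higher T gives softer distributions."""
    z = logits / temperature
    z = z - z.max()                  # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """Cross-entropy between the teacher's softened distribution and
    the student's softened distribution (soft-target term only)."""
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    return float(-(p_teacher * np.log(p_student + 1e-12)).sum())

# Hypothetical logits for a 3-class problem.
teacher = np.array([4.0, 1.0, 0.5])
student = np.array([3.5, 1.2, 0.3])
loss = distillation_loss(student, teacher)
```

The loss is minimized when the student's softened distribution matches the teacher's, which is what lets the "dark knowledge" in the teacher's small output probabilities transfer to the student.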
Proceedings Article (DOI)
"Why Should I Trust You?": Explaining the Predictions of Any Classifier
TL;DR: In this article, the authors propose LIME, a method to explain models by presenting representative individual predictions and their explanations in a non-redundant way, framing the task as a submodular optimization problem.
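The local-surrogate idea behind LIME (sample perturbations around one instance, weight them by proximity, fit a weighted linear model) can be sketched in plain NumPy. This is an illustrative reimplementation of the idea under simplifying assumptions (Gaussian perturbations, no sparsity constraint, and none of the submodular pick step the TL;DR mentions), not the authors' code; `lime_sketch` and the toy black box are hypothetical names:

```python
import numpy as np

def lime_sketch(predict, x, n_samples=500, kernel_width=0.75, seed=0):
    """Fit a local linear surrogate around x: perturb the instance,
    weight samples by proximity, and solve weighted least squares.
    Returns one coefficient per feature (local attributions)."""
    rng = np.random.default_rng(seed)
    X = x + rng.normal(scale=0.5, size=(n_samples, len(x)))
    y = np.array([predict(row) for row in X])
    dist = np.linalg.norm(X - x, axis=1)
    w = np.exp(-(dist ** 2) / kernel_width ** 2)     # proximity kernel
    # Weighted least squares via row scaling; intercept column appended.
    A = np.hstack([X, np.ones((n_samples, 1))]) * np.sqrt(w)[:, None]
    b = y * np.sqrt(w)
    coef, *_ = np.linalg.lstsq(A, b, rcond=None)
    return coef[:-1]                                 # drop the intercept

# Toy black box in which only the first feature matters.
coefs = lime_sketch(lambda v: 2.0 * v[0], np.array([1.0, 0.0, 0.0]))
```

Because the toy black box is itself linear, the surrogate recovers its coefficients exactly; for a real model the coefficients approximate its local behavior around `x`.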
Journal Article (DOI)
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
TL;DR: This Perspective clarifies the chasm between explaining black boxes and using inherently interpretable models, outlines several key reasons why explainable black boxes should be avoided in high-stakes decisions, identifies challenges to interpretable machine learning, and provides several example applications where interpretable models could potentially replace black box models in criminal justice, healthcare and computer vision.
Journal Article (DOI)
A Survey of Methods for Explaining Black Box Models
Riccardo Guidotti, Anna Monreale, Salvatore Ruggieri, Franco Turini, Fosca Giannotti, Dino Pedreschi
TL;DR: In this paper, the authors provide a classification of the main problems addressed in the literature with respect to the notion of explanation and the type of black box decision support system. Given a problem definition, a black box type, and a desired explanation, this survey should help researchers find the proposals most useful for their own work.
Posted Content
Towards A Rigorous Science of Interpretable Machine Learning
Finale Doshi-Velez, Been Kim
TL;DR: This position paper defines interpretability, describes when interpretability is needed (and when it is not), suggests a taxonomy for rigorous evaluation, and exposes open questions towards a more rigorous science of interpretable machine learning.
Related Papers (5)
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
Amina Adadi, Mohammed Berrada