Explainability of a Machine Learning Granting Scoring Model in Peer-to-Peer Lending

doi:10.1109/ACCESS.2020.2984412

Open Access Journal Article

Miller Janny Ariza-Garzon, +3 more

IEEE Access, Vol. 8, pp. 64873-64890, 01 Jan 2020

TLDR

This work assesses the well-known logistic regression model and several machine learning algorithms for granting scoring in P2P lending and reveals that the machine learning alternative is superior in terms of not only classification performance but also explainability.

Abstract:

Peer-to-peer (P2P) lending demands effective and explainable credit risk models. Typical machine learning algorithms offer high prediction performance, but most of them lack explanatory power. However, this deficiency can be addressed with the help of the explainability tools proposed in the last few years, such as SHAP values. In this work, we assess the well-known logistic regression model and several machine learning algorithms for granting scoring in P2P lending. The comparison reveals that the machine learning alternative is superior in terms of not only classification performance but also explainability. More precisely, the SHAP values reveal that machine learning algorithms can reflect dispersion, nonlinearity and structural breaks in the relationships between each feature and the target variable. Our results demonstrate that it is possible to have machine learning credit scoring models that are both accurate and transparent. Such models provide the trust that the industry, regulators and end-users demand in P2P lending and may lead to a wider adoption of machine learning in this and other risk assessment applications where explainability is required.
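To make the idea behind SHAP values concrete, the following is a minimal sketch of an exact Shapley attribution for a single applicant. It is not the authors' pipeline or the SHAP library: the function `toy_score`, its three features (income, debt ratio, credit inquiries), their coefficients, and the baseline applicant are all hypothetical, chosen only so that the score contains a linear term, a structural break, and a nonlinear term. "Absent" features are replaced by the baseline applicant's values, which is one common simplification.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley values for one instance.

    Features outside a coalition take the baseline's values. The cost grows
    exponentially with the number of features, so this only suits tiny examples.
    """
    n = len(x)

    def value(coalition):
        # Mix the instance and the baseline according to the coalition.
        z = [x[i] if i in coalition else baseline[i] for i in range(n)]
        return predict(z)

    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Shapley weight for coalitions of this size that exclude feature i.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def toy_score(z):
    """Hypothetical risk score, NOT a model from the paper."""
    income, debt_ratio, inquiries = z
    score = 0.005 * debt_ratio       # linear effect
    if income < 30:                  # structural break at an income threshold
        score += 0.25
    score += 0.01 * inquiries ** 2   # nonlinear effect
    return score


applicant = [25, 40, 3]   # hypothetical applicant
reference = [50, 20, 1]   # hypothetical baseline applicant

phi = shapley_values(toy_score, applicant, reference)
for name, contribution in zip(["income", "debt_ratio", "inquiries"], phi):
    print(f"{name}: {contribution:+.3f}")
```

The contributions sum to the difference between the applicant's score and the baseline's score, which is the additivity property that makes such attributions readable per feature. For real models with many features, exact enumeration is infeasible and approximations such as those in the SHAP library are used instead.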

Citations

Counterfactuals and causability in explainable artificial intelligence: Theory, algorithms, and applications

Computational approaches and data analytics in financial services: A literature review

LINDA-BN: An interpretable probabilistic approach for demystifying black-box predictive models

SHAP and LIME: An Evaluation of Discriminative Power in Credit Risk.

References

Random Forests

Scikit-learn: Machine Learning in Python

Classification and Regression Trees.

Greedy function approximation: A gradient boosting machine.

Classification and regression trees

Related Papers (5)

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

A unified approach to interpreting model predictions

A Value for n-person Games

Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead

Heterogeneous Ensemble for Default Prediction of Peer-to-Peer Lending in China