Improving Credit Risk Prediction in Online Peer-to-Peer (P2P) Lending Using Imbalanced Learning Techniques

doi:10.1109/ICTAI.2017.00037

Proceedings ArticleDOI

Improving Credit Risk Prediction in Online Peer-to-Peer (P2P) Lending Using Imbalanced Learning Techniques

Luís Ferreira, +3 more

- pp 175-181

Chats0

TLDR

This work wrangle a real-world P2P lending data set from Lending Club, containing a large amount of data gathered from 2007 up to 2016, and analysis how supervised classification models and techniques to handle class imbalance impact creditworthiness prediction rates shows that sampling techniques outperform ensembles and cost sensitive approaches.

Abstract:

Peer-to-peer (P2P) lending is a global trend of financial markets that allow individuals to obtain and concede loans without having financial institutions as a strong proxy. As many real-world applications, P2P lending presents an imbalanced characteristic, where the number of creditworthy loan requests is much larger than the number of non-creditworthy ones. In this work, we wrangle a real-world P2P lending data set from Lending Club, containing a large amount of data gathered from 2007 up to 2016. We analyze how supervised classification models and techniques to handle class imbalance impact creditworthiness prediction rates. Ensembles, cost-sensitive and sampling methods are combined and evaluated along logistic regression, decision tree, and bayesian learning schemes. Results show that, in average, sampling techniques outperform ensembles and cost sensitive approaches.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Explainability of a Machine Learning Granting Scoring Model in Peer-to-Peer Lending

Miller Janny Ariza-Garzon, +3 more

- 01 Jan 2020 -

IEEE Access

TL;DR: This work assesses the well-known logistic regression model and several machine learning algorithms for granting scoring in P2P lending and reveals that the machine learning alternative is superior in terms of not only classification performance but also explainability.

...read moreread less

Journal ArticleDOI

Peer to Peer (P2P) Lending Problems and Potential Solutions: A Systematic Literature Review

Ryan Randy Suryono, +2 more

- 01 Jan 2019 -

Procedia Computer Science

TL;DR: This study aims to identify problems in P2P Lending and present alternative technical and non-technical solutions to the problems and finds a rich picture, creates a table of problem identification and alternative solutions.

...read moreread less

Journal ArticleDOI

Resample-Based Ensemble Framework for Drifting Imbalanced Data Streams

Hang Zhang, +4 more

- 06 May 2019 -

IEEE Access

TL;DR: This paper proposes a Resample-based Ensemble Framework for Drifting Imbalanced Stream (RE-DI), which consists of a long-term static classifier to handle gradual and multiple dynamic classifiers to handle sudden concept drift.

...read moreread less

Proceedings ArticleDOI

Metric Learning from Imbalanced Data

Léo Gautheron, +3 more

TL;DR: In this paper, a new Mahalanobis metric learning algorithm (IML) is proposed to deal with class imbalance in the metric learning problem, where the number of positive examples is much smaller than the negatives.

...read moreread less

Journal ArticleDOI

Risk-Return modelling in the P2P lending market: Trends, Gaps, Recommendations and future directions

Miller Janny Ariza-Garzon, +3 more

- 01 Sep 2021 -

Electronic Commerce Research and Applica...

TL;DR: A bibliometric and systematic analysis is performed on the academic literature published during the last decade on P2P lending to identify the main research trends and find potential gaps that limit stakeholders' use of research proposals.

...read moreread less

References

PDF

Open Access

More filters

Journal ArticleDOI

Random Forests

Leo Breiman

TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.

...read moreread less

Journal ArticleDOI

SMOTE: synthetic minority over-sampling technique

Nitesh V. Chawla, +3 more

- 01 Jan 2002 -

Journal of Artificial Intelligence Resea...

TL;DR: In this article, a method of over-sampling the minority class involves creating synthetic minority class examples, which is evaluated using the area under the Receiver Operating Characteristic curve (AUC) and the ROC convex hull strategy.

...read moreread less

Journal ArticleDOI

Bagging predictors

Leo Breiman

TL;DR: Tests on real and simulated data sets using classification and regression trees and subset selection in linear regression show that bagging can give substantial gains in accuracy.

...read moreread less

Journal ArticleDOI

A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting

Yoav Freund, +1 more

TL;DR: The model studied can be interpreted as a broad, abstract extension of the well-studied on-line prediction model to a general decision-theoretic setting, and it is shown that the multiplicative weight-update Littlestone?Warmuth rule can be adapted to this model, yielding bounds that are slightly weaker in some cases, but applicable to a considerably more general class of learning problems.

...read moreread less

Journal ArticleDOI

SMOTE: Synthetic Minority Over-sampling Technique

Nitesh V. Chawla, +3 more

- 09 Jun 2011 -

arXiv: Artificial Intelligence

TL;DR: In this article, a method of over-sampling the minority class involves creating synthetic minority class examples, which is evaluated using the area under the Receiver Operating Characteristic curve (AUC) and the ROC convex hull strategy.

...read moreread less