Implementation of Credit Card Fraud Detection System with Concept Drifts Adaptation

doi:10.1007/978-981-10-7245-1_46

Home
/
Papers
/
Implementation of Credit Card Fraud Detection System with Concept Drifts Adaptation

Book Chapter•DOI•

Implementation of Credit Card Fraud Detection System with Concept Drifts Adaptation

Anita Jog¹, Anjali Chandavale¹•Institutions (1)

Massachusetts Institute of Technology¹

01 Jan 2018-pp 467-477

TL;DR: The developed algorithm detects credit card fraud, filters 80% fraudulent transactions and acts as a support system for the society at a large.

read less

Abstract: There is a large number of credit card payments take place that is targeted by fraudulent activities. Companies which are responsible for the processing of electronic transactions need to efficiently detect the fraudulent activity to maintain customers’ trust and the continuity of their own business. In this paper, the developed algorithm detects credit card fraud. Prediction of any algorithm is based on certain attribute like customer’s buying behavior, a network of merchants that customer usually deals with, the location of the transaction, amount of transaction, etc. But these attribute changes over time. So, the algorithmic model needs to be updated periodically to reduce this kind of errors. Proposed System provides two solutions for handling concept drift. One is an Active solution and another one is Passive. Active solution refers to triggering mechanisms by explicitly detecting a change in statistics. Passive solution suggests updating the model continuously in order to consider newly added records. The proposed and developed system filters 80% fraudulent transactions and acts as a support system for the society at a large.

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

An Intelligent Approach to Credit Card Fraud Detection Using an Optimized Light Gradient Boosting Machine

[...]

Altyeb Altaher Taha¹, Sharaf Jameel Malebary¹•Institutions (1)

King Abdulaziz University¹

03 Feb 2020-IEEE Access

TL;DR: The proposed intelligent approach for detecting fraud in credit card transactions using an optimized light gradient boosting machine (OLightGBM) outperformed the other approaches and achieved the highest performance in terms of accuracy.

...read moreread less

Abstract: New advances in electronic commerce systems and communication technologies have made the credit card the potentially most popular method of payment for both regular and online purchases; thus, there is significantly increased fraud associated with such transactions. Fraudulent credit card transactions cost firms and consumers large financial losses every year, and fraudsters continuously attempt to find new technologies and methods for committing fraudulent transactions. The detection of fraudulent transactions has become a significant factor affecting the greater utilization of electronic payment. Thus, there is a need for efficient and effective approaches for detecting fraud in credit card transactions. This paper proposes an intelligent approach for detecting fraud in credit card transactions using an optimized light gradient boosting machine (OLightGBM). In the proposed approach, a Bayesian-based hyperparameter optimization algorithm is intelligently integrated to tune the parameters of a light gradient boosting machine (LightGBM). To demonstrate the effectiveness of our proposed OLightGBM for detecting fraud in credit card transactions, experiments were performed using two real-world public credit card transaction data sets consisting of fraudulent transactions and legitimate ones. Based on a comparison with other approaches using the two data sets, the proposed approach outperformed the other approaches and achieved the highest performance in terms of accuracy (98.40%), Area under receiver operating characteristic curve (AUC) (92.88%), Precision (97.34%) and F1-score (56.95%).

...read moreread less

134 citations

Cites background from "Implementation of Credit Card Fraud..."

...The proposed approach obtained the highest Accuracy (98.40%), while the Concept Drifts Adaptation [31] achieved the lowest Accuracy (80%)....
[...]
...40%), while the Concept Drifts Adaptation [31] achieved the lowest Accuracy (80%)....
[...]

Journal Article•DOI•

Credit card fraud detection using asexual reproduction optimization

[...]

Anahita Farhang Ghahfarokhi, Taha Mansouri, Mohammad Reza Sadeghi Moghaddam, Nila Bahrambeik, Ramin Yavari, Mohammadreza Fani Sani - Show less +2 more

27 Jul 2021-Kybernetes

TL;DR: This paper used asexual reproduction optimization (ARO) algorithm to classify the bank transactions into fraud and legitimate, and achieved a high precision value indicating that if ARO detects a record as a fraud, with a high probability, it is a fraud one.

...read moreread less

Abstract: Purpose – The best algorithm that was implemented on this Brazilian dataset was artificial immune system (AIS) algorithm. But the time and cost of this algorithm are high. Using asexual reproduction optimization (ARO) algorithm, the authors achieved better results in less time. So the authors achieved less cost in a shorter time. Their framework addressed the problems such as high costs and training time in credit card fraud detection. This simple and effective approach has achieved better results than the best techniques implemented on our dataset so far. The purpose of this paper is to detect credit card fraud using ARO. Design/methodology/approach – In this paper, the authors used ARO algorithm to classify the bank transactions into fraud and legitimate. ARO is taken from asexual reproduction. Asexual reproduction refers to a kind of production in which one parent produces offspring identical to herself. In ARO algorithm, an individual is shown by a vector of variables. Each variable is considered as a chromosome. A binary string represents a chromosome consisted of genes. It is supposed that every generated answer exists in the environment, and because of limited resources, only the best solution can remain alive. The algorithm starts with a random individual in the answer scope. This parent reproduces the offspring named bud. Either the parent or the offspring can survive. In this competition, the one which outperforms in fitness function remains alive. If the offspring has suitable performance,it will be the next parent, and the current parent becomes obsolete.Otherwise, the offspring perishes, and the present parent survives. The algorithm recurs until the stop condition occurs. Findings – Results showed that ARO had increased the AUC (i.e. area under a receiver operating characteristic (ROC) curve), sensitivity, precision, specificity and accuracy by 13%, 25%, 56%, 3% and 3%, in comparison with AIS, respectively. The authors achieved a high precision value indicating that if ARO detects a record as a fraud, with a high probability, it is a fraud one. Supporting a real-time fraud detection system is another vital issue. ARO outperforms AIS not only in the mentioned criteria, but also decreases the training time by 75% in comparison with the AIS, which is a significant figure. Originality/value – In this paper, the authors implemented the ARO in credit card fraud detection. The authors compared the results with those of the AIS, which was one of the best methods ever implemented on the benchmark dataset. The chief focus of the fraud detection studies is finding the algorithms that can detect legal transactions from the fraudulent ones with high detection accuracy in the shortest time and at a low cost. That ARO meets all these demands.

...read moreread less

4 citations

Journal Article•DOI•

Internet of Things resource monitoring through proactive fault prediction

[...]

Abishi Chowdhury, Shital A. Raut, Amrit Pal

01 May 2022-Computers & Industrial Engineering

TL;DR: In this article , an incremental deep neural network (DNN) is employed as a predictive model to predict the maintenance notification as instances of data are incrementally added to the existing pool of data.

...read moreread less

1 citations

Posted Content•DOI•

Framework for Credit Card Fraud Detection Using Benefit-Based Learning and Periodic Features

[...]

14 Mar 2023

TL;DR: In this article , the authors proposed a technique for identifying credit card fraud that first accounts for customer spending patterns by aggregating transactions to creative new features based on periodic data, and then, they considered benefits and costs when training an XGBoost classifier in order to achieve maximum benefits.

...read moreread less

Abstract: Abstract Online credit card fraud is an ongoing problem and with the recent COVID-19 pandemic, there has been a surge of merchants moving their businesses online. It is therefore crucial to identify fraudulent activities before it causes loss to both the bank and its customers. Due to the dynamic nature of fraudsters as well as customer spending behavior, machine learning algorithms are appropriate for this task. However, credit card fraud data is typically imbalanced, favoring the positive class (legitimate transactions), causing traditional machine learning algorithms to err on the side of this majority class; since they consider equal costs and benefits for different decision outcomes when training. Nevertheless, it is more beneficial to correctly identify fraudulent transactions. Therefore, in this paper, we propose a technique for identifying credit card fraud that first accounts for customer spending patterns by aggregating transactions to creative new features based on periodic data. Then, we consider benefits and costs when training an XGBoost classifier in order to achieve maximum benefits. We also evaluate the performance of the classifier using benefits and costs. We demonstrate the effectiveness of our approach using data provided by a bank.

...read moreread less

Journal Article•DOI•

Identification of Fraudulent Online Transactions and Protection: State-of-art Techniques

[...]

Akshat Gaurav, Brij B. Gupta

30 Nov 2022-ESIC Digital economy & innovation journal

TL;DR: A survey of the latest frameworks and techniques proposed by the researchers for the identification of fraudulent transactions and securing online transactions is presented in this article , where the authors present a survey of their work.

...read moreread less

Abstract: In this digital era, the trend of online transactions for E-commerce sites and banking services is increasing. By using different online transaction methods users can make payments directly from their bank accounts. But along with the increase of online transactions, there is an increase in fraudulent transactions. These fraudulent transitions have identical features and characteristics of online transactions, so there is a need for the development of frameworks or technologies to detect fraudulent transactions. In this context, this paper represents a survey of the latest frameworks and techniques proposed by the researchers for the identification of fraudulent transactions and securing online transactions.

...read moreread less

References

PDF

Open Access

More filters

Journal Article•DOI•

Learned lessons in credit card fraud detection from a practitioner perspective

[...]

Andrea Dal Pozzolo¹, Olivier Caelen, Yann-Aël Le Borgne¹, Serge Waterschoot, Gianluca Bontempi¹ - Show less +1 more•Institutions (1)

Université libre de Bruxelles¹

01 Aug 2014-Expert Systems With Applications

TL;DR: This paper provides some answers from the practitioner’s perspective by focusing on three crucial issues: unbalancedness, non-stationarity and assessment in fraud detection algorithms.

...read moreread less

Abstract: Billions of dollars of loss are caused every year due to fraudulent credit card transactions. The design of efficient fraud detection algorithms is key for reducing these losses, and more algorithms rely on advanced machine learning techniques to assist fraud investigators. The design of fraud detection algorithms is however particularly challenging due to non-stationary distribution of the data, highly imbalanced classes distributions and continuous streams of transactions. At the same time public data are scarcely available for confidentiality issues, leaving unanswered many questions about which is the best strategy to deal with them. In this paper we provide some answers from the practitioner’s perspective by focusing on three crucial issues: unbalancedness, non-stationarity and assessment. The analysis is made possible by a real credit card dataset provided by our industrial partner.

...read moreread less

357 citations

Journal Article•DOI•

A cost-sensitive decision tree approach for fraud detection

[...]

Yusuf Sahin¹, Serol Bulkan¹, Ekrem Duman²•Institutions (2)

Marmara University¹, Özyeğin University²

01 Nov 2013-Expert Systems With Applications

TL;DR: A new cost-sensitive decision tree approach which minimizes the sum of misclassification costs while selecting the splitting attribute at each non-terminal node is developed and the performance of this approach is compared with the well-known traditional classification models on a real world credit card data set.

...read moreread less

Abstract: With the developments in the information technology, fraud is spreading all over the world, resulting in huge financial losses. Though fraud prevention mechanisms such as CHIP&PIN are developed for credit card systems, these mechanisms do not prevent the most common fraud types such as fraudulent credit card usages over virtual POS (Point Of Sale) terminals or mail orders so called online credit card fraud. As a result, fraud detection becomes the essential tool and probably the best way to stop such fraud types. In this study, a new cost-sensitive decision tree approach which minimizes the sum of misclassification costs while selecting the splitting attribute at each non-terminal node is developed and the performance of this approach is compared with the well-known traditional classification models on a real world credit card data set. In this approach, misclassification costs are taken as varying. The results show that this cost-sensitive decision tree algorithm outperforms the existing well-known methods on the given problem set with respect to the well-known performance metrics such as accuracy and true positive rate, but also a newly defined cost-sensitive metric specific to credit card fraud detection domain. Accordingly, financial losses due to fraudulent transactions can be decreased more by the implementation of this approach in fraud detection systems.

...read moreread less

289 citations

Journal Article•DOI•

APATE: A novel approach for automated credit card transaction fraud detection using network-based extensions

[...]

Véronique Van Vlasselaer¹, Cristián Bravo², Olivier Caelen, Tina Eliassi-Rad³, Leman Akoglu⁴, Monique Snoeck¹, Bart Baesens¹, Bart Baesens⁵ - Show less +4 more•Institutions (5)

Katholieke Universiteit Leuven¹, University of Talca², Rutgers University³, Stony Brook University⁴, University of Southampton⁵

01 Jul 2015

TL;DR: APATE is proposed, a novel approach to detect fraudulent credit card transactions conducted in online stores that combines intrinsic features derived from the characteristics of incoming transactions and the customer spending history using the fundamentals of RFM.

...read moreread less

Abstract: In the last decade, the ease of online payment has opened up many new opportunities for e-commerce, lowering the geographical boundaries for retail While e-commerce is still gaining popularity, it is also the playground of fraudsters who try to misuse the transparency of online purchases and the transfer of credit card records This paper proposes APATE, a novel approach to detect fraudulent credit card transactions conducted in online stores Our approach combines (1) intrinsic features derived from the characteristics of incoming transactions and the customer spending history using the fundamentals of RFM (Recency–Frequency–Monetary); and (2) network-based features by exploiting the network of credit card holders and merchants and deriving a time-dependent suspiciousness score for each network object Our results show that both intrinsic and network-based features are two strongly intertwined sides of the same picture The combination of these two types of features leads to the best performing models which reach AUC-scores higher than 098

...read moreread less

273 citations

Proceedings Article•DOI•

Credit card fraud detection and concept-drift adaptation with delayed supervised information

[...]

Andrea Dal Pozzolo¹, Giacomo Boracchi², Olivier Caelen, Cesare Alippi², Gianluca Bontempi¹ - Show less +1 more•Institutions (2)

Université libre de Bruxelles¹, Polytechnic University of Milan²

12 Jul 2015

TL;DR: This paper designs two FDSs on the basis of an ensemble and a sliding-window approach and shows that the winning strategy consists in training two separate classifiers (on feedbacks and delayed labels, respectively), and then aggregating the outcomes.

...read moreread less

Abstract: Most fraud-detection systems (FDSs) monitor streams of credit card transactions by means of classifiers returning alerts for the riskiest payments. Fraud detection is notably a challenging problem because of concept drift (i.e. customers' habits evolve) and class unbalance (i.e. genuine transactions far outnumber frauds). Also, FDSs differ from conventional classification because, in a first phase, only a small set of supervised samples is provided by human investigators who have time to assess only a reduced number of alerts. Labels of the vast majority of transactions are made available only several days later, when customers have possibly reported unauthorized transactions. The delay in obtaining accurate labels and the interaction between alerts and supervised information have to be carefully taken into consideration when learning in a concept-drifting environment. In this paper we address a realistic fraud-detection setting and we show that investigator's feedbacks and delayed labels have to be handled separately. We design two FDSs on the basis of an ensemble and a sliding-window approach and we show that the winning strategy consists in training two separate classifiers (on feedbacks and delayed labels, respectively), and then aggregating the outcomes. Experiments on large dataset of real-world transactions show that the alert precision, which is the primary concern of investigators, can be substantially improved by the proposed approach.

...read moreread less

101 citations

Journal Article•DOI•

Automated Feature Design for Numeric Sequence Classification by Genetic Programming

[...]

Dustin Harvey¹, Michael D. Todd¹•Institutions (1)

University of California, San Diego¹

01 Aug 2015-IEEE Transactions on Evolutionary Computation

TL;DR: This paper proposes a novel genetic programming (GP) approach to automated feature design called Autofead, a GP variant evolves a population of candidate features built from a library of sequence-handling functions to leverage the power of both numerical optimization and standard pattern recognition algorithms.

...read moreread less

Abstract: Pattern recognition methods rely on maximum-information, minimum-dimension feature sets to reliably perform classification and regression tasks. Many methods exist to reduce feature set dimensionality and construct improved features from an initial set; however, there are few general approaches for the design of features from numeric sequences. Any information lost in preprocessing or feature measurement cannot be recreated during pattern recognition. General approaches are needed to extend pattern recognition to include feature design and selection for numeric sequences, such as time series, within the learning process itself. This paper proposes a novel genetic programming (GP) approach to automated feature design called Autofead. In this method, a GP variant evolves a population of candidate features built from a library of sequence-handling functions. Numerical optimization methods, included through a hybrid approach, ensure that the fitness of candidate algorithms is measured using optimal parameter values. Autofead represents the first automated feature design system for numeric sequences to leverage the power and efficiency of both numerical optimization and standard pattern recognition algorithms. Potential applications include the monitoring of electrocardiogram signals for indications of heart failure, network traffic analysis for intrusion detection systems, vibration measurement for bearing condition determination in rotating machinery, and credit card activity for fraud detection.

...read moreread less

63 citations