Improved Accuracy of Naive Bayes Classifier for Determination of Customer Churn Uses SMOTE and Genetic Algorithms

doi:10.52465/JOSCEX.V1I1.5

Open Access, Proceedings Article

Vol. 1, Iss. 1, pp. 70-75

TLDR

This study aims to improve the accuracy of Naive Bayes for customer classification by using SMOTE to handle class imbalance and a genetic algorithm for attribute selection.

Abstract:

With increasing competition in the business world, many companies use data mining techniques to determine the level of customer loyalty. The customer data used in this study is the German Credit dataset obtained from the UCI Machine Learning Repository. The data have a class imbalance problem because the loyal class contains more records than the churn class. In addition, some attributes are irrelevant to customer classification, so attribute selection is needed to obtain more accurate results. One classification algorithm is Naive Bayes, which has been used as an effective classifier for years because it is easy to build and treats attributes as independent of one another. The purpose of this study is to improve the accuracy of Naive Bayes for customer classification by applying SMOTE and a genetic algorithm: SMOTE handles the class imbalance problem, while the genetic algorithm performs attribute selection. The accuracy of Naive Bayes alone is 47.10%, the mean accuracy of Naive Bayes with SMOTE is 78.15%, and the accuracy of Naive Bayes with SMOTE and the genetic algorithm is 78.46%.
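The oversampling step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it shows only the core SMOTE idea of interpolating between a minority-class point and one of its nearest minority-class neighbours. The function name `smote`, its parameters, and the toy data are assumptions made for illustration, and the sketch handles numeric attributes only, so categorical attributes would need to be encoded first.

```python
import random


def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic rows from a list of numeric minority-class rows.

    Simplification (assumption): the base point is drawn at random for each
    synthetic row instead of looping over every minority row in turn.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority rows by squared Euclidean distance to base.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbour = rng.choice(others[:k])  # one of the k nearest neighbours
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append([a + gap * (b - a) for a, b in zip(base, neighbour)])
    return synthetic


# Made-up toy data: 6 "loyal" rows versus 3 "churn" rows.
loyal = [[1.0, 2.0], [1.2, 1.9], [0.8, 2.1], [1.1, 2.2], [0.9, 1.8], [1.0, 2.3]]
churn = [[3.0, 0.5], [3.2, 0.7], [2.9, 0.4]]

new_churn = smote(churn, n_new=len(loyal) - len(churn), k=2)
balanced_churn = churn + new_churn
print(len(loyal), len(balanced_churn))
```

Because each synthetic row lies on a line segment between two existing churn rows, the new rows stay inside the region already occupied by the minority class rather than duplicating records exactly. In a fuller pipeline the balanced data would then be passed to the classifier, and oversampling would normally be applied to the training split only.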

Citations

Journal Article

Company bankruptcy prediction framework based on the most influential features using XGBoost and stacking ensemble learning

Much Aziz Muslim, +1 more

- 01 Dec 2021 -

International Journal of Electrical and ...

TL;DR: This study aims to find the best predictive model or method to predict company bankruptcy using the dataset from Polish companies bankruptcy and uses the best feature selection and ensemble learning.

Journal Article

Improved logistics service quality for goods quality delivery services of companies using analytical hierarchy process

Popy Riliandini, +3 more

TL;DR: The main dimensions of logistics service quality for improving service are ordering condition, time, and information quality, which can be the basis of decision making for companies in choosing alternative criteria priorities.

Proceedings Article

SMOTE Classification and Random Oversampling Naive Bayes in Imbalanced Data : (Case Study of Early Detection of Cervical Cancer in Indonesia)

Nur Silviyah Rahmi, +4 more

TL;DR: In this article, the authors used SMOTE and Random Oversampling (ROS) sampling techniques to overcome imbalanced data, combined with the Naive Bayes classification method, for early detection of cervical cancer in Indonesia.

Journal Article

Optimize Naïve Bayes Classifier Using Chi Square and Term Frequency Inverse Document Frequency For Amazon Review Sentiment Analysis

Anisa Falasari, +1 more

- 30 Mar 2022 -

Journal of Soft Computing Exploration

TL;DR: In this study, using a sentiment-labelled dataset (field amazon_labelled) obtained from the UCI Machine Learning Repository, the accuracy of the naïve Bayes classifier in Amazon review sentiment analysis was 82%, and the accuracy after applying chi-square and TF-IDF was 83%.

Journal Article

Recommendation of Yogyakarta tourism based on simple additive weighting under fuzziness

Eko Yunanto Utomo

TL;DR: The results of this study obtained the best 2 packages recommended for tourists to choose, namely the Triangular Fuzzy Number and the Simple Additive Weighting method.

References

Journal Article

SMOTE: synthetic minority over-sampling technique

Nitesh V. Chawla, +3 more

- 01 Jan 2002 -

Journal of Artificial Intelligence Resea...

TL;DR: In this article, a method of over-sampling the minority class involves creating synthetic minority class examples, which is evaluated using the area under the Receiver Operating Characteristic curve (AUC) and the ROC convex hull strategy.

Journal Article

SMOTE: Synthetic Minority Over-sampling Technique

Nitesh V. Chawla, +3 more

- 09 Jun 2011 -

arXiv: Artificial Intelligence

TL;DR: In this article, a method of over-sampling the minority class involves creating synthetic minority class examples, which is evaluated using the area under the Receiver Operating Characteristic curve (AUC) and the ROC convex hull strategy.

Journal Article

Comparative Study of Chronic Kidney Disease Prediction using KNN and SVM

Parul Sinha, +1 more

- 30 Dec 2015 -

International journal of engineering res...

TL;DR: The aim of this work is to compare the performance of Support Vector Machine (SVM) and K-Nearest Neighbour (KNN) classifiers on the basis of their accuracy, precision, and execution time for CKD prediction.

Predicting Customer Churn in Mobile Telephony Industry Using Probabilistic Classifiers in Data Mining

Clement Kirui, +3 more

TL;DR: A new set of features is proposed with the aim of improving the recognition rates of possible churners, derived from call details and customer profiles and categorized as contract-related, call pattern description, and call pattern changes description features.

Customer Relationship Management and Its Relationship to the Marketing Performance

Hisham Sayed Soliman

TL;DR: In this paper, the theoretical foundations of customer relationship management and its relationship to marketing performance were explored from several perspectives, and the study found a positive relationship between CRM and marketing performance.

Related Papers (5)

Local neighbourhood extension of SMOTE for mining imbalanced data

A decision tree-based attribute weighting filter for naive Bayes

Integrating Global and Local Application of Naive Bayes Classifier

An Improved Learning Algorithm for Augmented Naive Bayes

Sequential Feature Selection in Customer Churn Prediction Based on Naive Bayes