Improved Accuracy of Naive Bayes Classifier for Determination of Customer Churn Uses SMOTE and Genetic Algorithms
Afifah Ratna Safitri, Much Aziz Muslim +1 more
Vol. 1, Iss. 1, pp. 70-75
TLDR
The purpose of this study is to improve the accuracy of Naive Bayes for customer classification by using SMOTE to handle class imbalance and a genetic algorithm for attribute selection.
Abstract:
With increasing competition in the business world, many companies use data mining techniques to determine the level of customer loyalty. The customer data used in this study is the German Credit dataset obtained from UCI. This data has a class imbalance problem because the loyal class contains more records than the churn class. In addition, some attributes are irrelevant for customer classification, so attribute selection is needed to obtain more accurate classification results. One classification algorithm is Naive Bayes. Naive Bayes has been used as an effective classifier for years because it is easy to build and assumes that attributes are independent. The purpose of this study is to improve the accuracy of Naive Bayes for customer classification, using SMOTE and a genetic algorithm. SMOTE is used to handle the class imbalance problem, while the genetic algorithm is used for attribute selection. The accuracy of Naive Bayes alone is 47.10%, the mean accuracy of Naive Bayes with SMOTE is 78.15%, and the accuracy of Naive Bayes with SMOTE and the genetic algorithm is 78.46%.
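The oversampling step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it is a plain-Python rendering of the basic SMOTE idea (interpolating between a minority sample and one of its nearest minority-class neighbours), and the function name `smote`, its parameters, and the toy data are all assumptions made for illustration. It also assumes purely numeric features, whereas the German Credit dataset contains categorical attributes that would need encoding or a categorical-aware variant.

```python
import random


def smote(minority, n_synthetic, k=5, seed=0):
    """Create n_synthetic points by interpolating between minority samples
    and their k nearest minority-class neighbours (numeric features only)."""
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    if k < 1:
        raise ValueError("need at least two minority samples")

    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    synthetic = []
    for _ in range(n_synthetic):
        base = rng.choice(minority)
        # k nearest neighbours of `base` inside the minority class, excluding itself
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: sq_dist(base, p),
        )[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Hypothetical toy data: 6 "loyal" rows vs. 3 "churn" rows, two numeric features.
loyal = [[1.0, 2.0], [1.2, 1.9], [0.9, 2.1], [1.1, 2.2], [1.3, 2.0], [1.0, 1.8]]
churn = [[3.0, 0.5], [3.2, 0.7], [2.9, 0.4]]

# Oversample the churn class until both classes have the same size.
new_churn = smote(churn, n_synthetic=len(loyal) - len(churn), k=2)
balanced_churn = churn + new_churn
print(len(loyal), len(balanced_churn))
```

In a pipeline like the one the abstract describes, oversampling of this kind would normally be applied to the training split only, so that synthetic rows never appear in the data used to measure accuracy.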