Data Mining Approach to Predict Success of Secondary School Students: A Saudi Arabian Case Study
Amnah Saeed Alghamdi,Atta Rahman +1 more
TLDR
In this article , three models were constructed using different algorithms: Naïve Bayes (NB), Random Forest (RF), and J48, which achieved a prediction accuracy of 99.34%.Abstract:
A problem that pervades throughout students’ careers is their poor performance in high school. Predicting students’ academic performance helps educational institutions in many ways. Knowing and identifying the factors that can affect the academic performance of students at the beginning of the thread can help educational institutions achieve their educational goals by providing support to students earlier. The aim of this study was to predict the achievement of early secondary students. Two sets of data were used for high school students who graduated from the Al-Baha region in the Kingdom of Saudi Arabia. In this study, three models were constructed using different algorithms: Naïve Bayes (NB), Random Forest (RF), and J48. Moreover, the Synthetic Minority Oversampling Technique (SMOTE) technique was applied to balance the data and extract features using the correlation coefficient. The performance of the prediction models has also been validated using 10-fold cross-validation and direct partition in addition to various performance evaluation metrics: accuracy curve, true positive (TP) rate, false positive (FP) rate, accuracy, recall, F-Measurement, and receiver operating characteristic (ROC) curve. The NB model achieved a prediction accuracy of 99.34%, followed by the RF model with 98.7%.read more
Citations
More filters
Journal ArticleDOI
Single vs. Multi-Label: The Issues, Challenges and Insights of Contemporary Classification Schemes
Naseer Ahmed Sajid,Atta Rahman,Munir Ahmad,Dhiaa Musleh,Mohammed Imran Basheer Ahmed,Reem A. Alassaf,Sghaier Chabani,Mohammed Salih Ahmed,Asiya Abdus Salam,Dania Alkhulaifi +9 more
TL;DR: In this article , the authors highlight the issues for single-label and multi-label classification by using either metadata or content of the documents and why metadatabased approaches are better than content-based approaches in terms of feasibility.
Journal ArticleDOI
Intrusion Detection System Using Feature Extraction with Machine Learning Algorithms in IoT
TL;DR: In this paper , a study on ML-based IDS in IoT, considering different feature extraction algorithms with several ML models is presented, and the results showed that VGG-16 combined with stacking resulted in the highest accuracy of 98.3%.
Journal ArticleDOI
SUNFIT: A Machine Learning-Based Sustainable University Field Training Framework for Higher Education
Mohammed Gollapalli,Atta Rahman,Mariam Alkharraa,Linah Saraireh,Dania Alkhulaifi,Asiya Abdus Salam,Gomathi Krishnasamy,Mehwash Farooqui,Maqsood Mahmud +8 more
TL;DR: In this article , a sustainable university field training (SUNFIT) framework is introduced, which is a pedagogical approach towards mining the educational data using machine learning to integrate and measure the field training programs against the internationally recognized accreditation standards such as Accreditation Board for Engineering and Technology.
Journal ArticleDOI
Transfer Learning Approach to Seed Taxonomy: A Wild Plant Case Study
Nehad M. Abdel Rahman Ibrahim,Dalia G. Gabr,Atta Rahman,Dhiaa Musleh,Dania Alkhulaifi,Mariam Alkharraa +5 more
TL;DR: In this article , transfer learning techniques were used to analyze images of plants and extract features that can be used to cluster the species hierarchically using the k-means clustering algorithm.
Journal ArticleDOI
Anomaly Detection for Hydraulic Power Units - A Case Study
TL;DR: In this paper , the authors present the real-world implementation of an anomaly detection system of a hydraulic power unit, which involved the Internet of Things approach and a detailed description of the system architecture is provided.
References
More filters
Journal ArticleDOI
The WEKA data mining software: an update
TL;DR: This paper provides an introduction to the WEKA workbench, reviews the history of the project, and, in light of the recent 3.6 stable release, briefly discusses what has been added since the last stable version (Weka 3.4) released in 2003.
Book ChapterDOI
Ensemble Methods in Machine Learning
TL;DR: Some previous studies comparing ensemble methods are reviewed, and some new experiments are presented to uncover the reasons that Adaboost does not overfit rapidly.
Journal ArticleDOI
Random Erasing Data Augmentation
TL;DR: Random Erasing as mentioned in this paper randomly selects a rectangle region in an image and erases its pixels with random values, which reduces the risk of overfitting and makes the model robust to occlusion.
Posted Content
Random Erasing Data Augmentation
TL;DR: In training, Random Erasing randomly selects a rectangle region in an image and erases its pixels with random values and yields consistent improvement over strong baselines in image classification, object detection and person re-identification.
Journal ArticleDOI
Feature Selection: A literature Review
Vipin Kumar,Sonajharia Minz +1 more
TL;DR: The concepts of feature relevance, general procedures, evaluation criteria, and the characteristics of feature selection are introduced and guidelines are provided for user to select a feature selection algorithm without knowing the information of each algorithm.