scispace - formally typeset
Open AccessJournal ArticleDOI

Data Mining Approach to Predict Success of Secondary School Students: A Saudi Arabian Case Study

Amnah Saeed Alghamdi, +1 more
- 09 Mar 2023 - 
- Vol. 13, Iss: 3, pp 293-293
TLDR
In this article , three models were constructed using different algorithms: Naïve Bayes (NB), Random Forest (RF), and J48, which achieved a prediction accuracy of 99.34%.
Abstract
A problem that pervades throughout students’ careers is their poor performance in high school. Predicting students’ academic performance helps educational institutions in many ways. Knowing and identifying the factors that can affect the academic performance of students at the beginning of the thread can help educational institutions achieve their educational goals by providing support to students earlier. The aim of this study was to predict the achievement of early secondary students. Two sets of data were used for high school students who graduated from the Al-Baha region in the Kingdom of Saudi Arabia. In this study, three models were constructed using different algorithms: Naïve Bayes (NB), Random Forest (RF), and J48. Moreover, the Synthetic Minority Oversampling Technique (SMOTE) technique was applied to balance the data and extract features using the correlation coefficient. The performance of the prediction models has also been validated using 10-fold cross-validation and direct partition in addition to various performance evaluation metrics: accuracy curve, true positive (TP) rate, false positive (FP) rate, accuracy, recall, F-Measurement, and receiver operating characteristic (ROC) curve. The NB model achieved a prediction accuracy of 99.34%, followed by the RF model with 98.7%.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Single vs. Multi-Label: The Issues, Challenges and Insights of Contemporary Classification Schemes

TL;DR: In this article , the authors highlight the issues for single-label and multi-label classification by using either metadata or content of the documents and why metadatabased approaches are better than content-based approaches in terms of feasibility.
Journal ArticleDOI

Intrusion Detection System Using Feature Extraction with Machine Learning Algorithms in IoT

TL;DR: In this paper , a study on ML-based IDS in IoT, considering different feature extraction algorithms with several ML models is presented, and the results showed that VGG-16 combined with stacking resulted in the highest accuracy of 98.3%.
Journal ArticleDOI

SUNFIT: A Machine Learning-Based Sustainable University Field Training Framework for Higher Education

TL;DR: In this article , a sustainable university field training (SUNFIT) framework is introduced, which is a pedagogical approach towards mining the educational data using machine learning to integrate and measure the field training programs against the internationally recognized accreditation standards such as Accreditation Board for Engineering and Technology.
Journal ArticleDOI

Transfer Learning Approach to Seed Taxonomy: A Wild Plant Case Study

TL;DR: In this article , transfer learning techniques were used to analyze images of plants and extract features that can be used to cluster the species hierarchically using the k-means clustering algorithm.
Journal ArticleDOI

Anomaly Detection for Hydraulic Power Units - A Case Study

TL;DR: In this paper , the authors present the real-world implementation of an anomaly detection system of a hydraulic power unit, which involved the Internet of Things approach and a detailed description of the system architecture is provided.
References
More filters
Journal ArticleDOI

The WEKA data mining software: an update

TL;DR: This paper provides an introduction to the WEKA workbench, reviews the history of the project, and, in light of the recent 3.6 stable release, briefly discusses what has been added since the last stable version (Weka 3.4) released in 2003.
Book ChapterDOI

Ensemble Methods in Machine Learning

TL;DR: Some previous studies comparing ensemble methods are reviewed, and some new experiments are presented to uncover the reasons that Adaboost does not overfit rapidly.
Journal ArticleDOI

Random Erasing Data Augmentation

TL;DR: Random Erasing as mentioned in this paper randomly selects a rectangle region in an image and erases its pixels with random values, which reduces the risk of overfitting and makes the model robust to occlusion.
Posted Content

Random Erasing Data Augmentation

TL;DR: In training, Random Erasing randomly selects a rectangle region in an image and erases its pixels with random values and yields consistent improvement over strong baselines in image classification, object detection and person re-identification.
Journal ArticleDOI

Feature Selection: A literature Review

TL;DR: The concepts of feature relevance, general procedures, evaluation criteria, and the characteristics of feature selection are introduced and guidelines are provided for user to select a feature selection algorithm without knowing the information of each algorithm.
Related Papers (5)