scispace - formally typeset
Proceedings ArticleDOI

Implementing WEKA for medical data classification and early disease prediction

Reads0
Chats0
TLDR
This research work comprehensively compared different data classification techniques and their prediction accuracy for chronic kidney disease dataset using performance measures like ROC, kappa statistics, RMSE and MAE using WEKA tool.
Abstract
In recent years, the advent of latest web and data technologies has encouraged massive data growth in almost every sector. Businesses and leading industries are viewing these huge data repositories as a tool to design future strategies, prediction models by analyzing patterns and gaining knowledge from this unstructured data by applying different data mining techniques. Medical domain has now become richer in term of maintaining digital records of patients related to their diagnosis and treatment. These huge data repositories can range from patient personnel data, diagnosis, treatment histories, test diagnosis, images and various scans. This terabytes of medical data is quantity rich but weaker in information in terms of knowledge and robust tools to identify hidden patterns of knowledge specifically in medical sector. Data Mining as a field of research has already well proven capabilities of identifying hidden patterns, analysis and knowledge applied on different research domains, now gaining popularity day by day among researchers and scientist towards generating novel and deep insights of these large biomedical datasets also. Uncovering new biomedical and healthcare related knowledge in order to support clinical decision making, is another dimension of data mining. Through massive literature survey, it is found that early disease prediction is the most demanded area of research in health care sector. As health care domain is bit wider domain and having different disease characteristics, different techniques have their own prediction efficiencies, which can be enhanced and changed in order to get into most optimize way. In this research work, authors have comprehensively compared different data classification techniques and their prediction accuracy for chronic kidney disease. Authors have compared J48, Naive Bayes, Random Forest, SVM and k-NN classifiers using performance measures like ROC, kappa statistics, RMSE and MAE using WEKA tool. Authors have also compared these classifiers on various accuracy measures like TP rate, FP rate, precision, recall and f-measure by implementing on WEKA. Experimental result shows that random forest classifier has better classification accuracy over others for chronic kidney disease dataset.

read more

Citations
More filters
Journal ArticleDOI

A Decisive Metaheuristic Attribute Selector Enabled Combined Unsupervised-Supervised Model for Chronic Disease Risk Assessment

TL;DR: It is demonstrated that chronic disease diagnosis can be significantly improved by heuristic-based attribute selection coupled with clustering followed by classification, and can be used to develop a decision support system to assist medical experts in the effective analysis of chronic diseases in a cost-effective manner.
Journal ArticleDOI

A diagnostic prediction model for chronic kidney disease in internet of things platform

TL;DR: A diagnostic prediction model for CKD and its severity is proposed that applies IoT multimedia data and the applied dataset with the proposed selected features produces 97% accuracy, 99% sensitivity and 95% specificity via applying decision tree (J48) classifier in comparison to Support Vector Machine (SVM), Multi-Layer Perception (MLP) and Naïve Bayes classifiers.
Journal ArticleDOI

Predicting clinically significant motor function improvement after contemporary task-oriented interventions using machine learning approaches.

TL;DR: In this paper, a secondary analysis of data using two common machine learning approaches, which were the k-nearest neighbor (KNN) and artificial neural network (ANN), was conducted to determine the accuracy and performance of machine learning to predict clinically significant motor function improvements after contemporary task-oriented intervention in chronic stroke patients and identify important predictors for building machine learning prediction models.
Proceedings ArticleDOI

Classification of Diabetes Dataset with Data Mining Techniques by Using WEKA Approach

TL;DR: This study proposed to classify diabetes by using data mining techniques and obtained results indicated that k-NN performed the highest accuracy with 98.07% and this algorithm is the best method to identify and classify diabetes diseases on studies dataset.
Journal ArticleDOI

COVID-19 World Vaccination Progress Using Machine Learning Classification Algorithms

TL;DR: This study will focus on the COVID-19 World Vaccination Progress using Machine learning classification Algorithms, and it was discovered that the Decision Tree outperforms other algorithms in terms of time and accuracy.
References
More filters
Journal ArticleDOI

From Data Mining to Knowledge Discovery in Databases

TL;DR: An overview of this emerging field is provided, clarifying how data mining and knowledge discovery in databases are related both to each other and to related fields, such as machine learning, statistics, and databases.
Journal ArticleDOI

A comparison of linear genetic programming and neural networks in medical data mining

TL;DR: An efficient algorithm that eliminates intron code and a demetic approach to virtually parallelize the system on a single processor are discussed, which show that GP performs comparably in classification and generalization.
Journal ArticleDOI

Improved J48 Classification Algorithm for the Prediction of Diabetes

TL;DR: The modified J48 classifier is used to increase the accuracy rate of the data mining procedure and Experimental results showed a significant improvement over the existing J-48 algorithm.
Proceedings Article

Using decision tree for diagnosing heart disease patients

TL;DR: This research proposes a model that outperforms J4.8 Decision Tree and Bagging algorithm in the diagnosis of heart disease patients and investigates applying a range of techniques to different types of Decision Trees seeking better performance in heart disease diagnosis.
Proceedings ArticleDOI

Combination Data Mining Methods with New Medical Data to Predicting Outcome of Coronary Heart Disease

TL;DR: The comparative study of multiple prediction models for survival of CHD patients along with a 10-fold cross-validation provided us with an insight into the relative prediction ability of different data.
Related Papers (5)