scispace - formally typeset
Journal ArticleDOI

Intelligible Support Vector Machines for Diagnosis of Diabetes Mellitus

Reads0
Chats0
TLDR
In this article, support vector machines (SVM) have been used for the diagnosis of type 2 diabetes using an additional explanation module, which turns the "black box" model of an SVM into an intelligible representation of the SVM's diagnostic decision.
Abstract
Diabetes mellitus is a chronic disease and a major public health challenge worldwide. According to the International Diabetes Federation, there are currently 246 million diabetic people worldwide, and this number is expected to rise to 380 million by 2025. Furthermore, 3.8 million deaths are attributable to diabetes complications each year. It has been shown that 80% of type 2 diabetes complications can be prevented or delayed by early identification of people at risk. In this context, several data mining and machine learning methods have been used for the diagnosis, prognosis, and management of diabetes. In this paper, we propose utilizing support vector machines (SVMs) for the diagnosis of diabetes. In particular, we use an additional explanation module, which turns the “black box” model of an SVM into an intelligible representation of the SVM's diagnostic (classification) decision. Results on a real-life diabetes dataset show that intelligible SVMs provide a promising tool for the prediction of diabetes, where a comprehensible ruleset have been generated, with prediction accuracy of 94%, sensitivity of 93%, and specificity of 94%. Furthermore, the extracted rules are medically sound and agree with the outcome of relevant medical studies.

read more

Citations
More filters
Journal ArticleDOI

Feature selection and classification systems for chronic disease prediction: A review

TL;DR: This work presents a comprehensive overview of various feature selection methods and their inherent pros and cons, and analyzes adaptive classification systems and parallel classification systems for chronic disease prediction.
Journal ArticleDOI

Review: Knowledge discovery in medicine: Current issue and future trend

TL;DR: The main idea in this paper is to describe key papers and provide some guidelines to help medical practitioners to explore previous works and identify interesting areas for future research.
Journal ArticleDOI

Big Data in Public Health: Terminology, Machine Learning, and Privacy.

TL;DR: The ethical implications of the big data revolution with particular emphasis on maintaining appropriate care for privacy in a world in which technology is rapidly changing social norms regarding the need for (and even the meaning of) privacy are considered.
Journal ArticleDOI

Performance Analysis of Data Mining Classification Techniques to Predict Diabetes

TL;DR: Overall performance of adaboost ensemble method is better than bagging as well as standalone J48 decision tree as a base learner along with standalone data mining technique J48 to classify patients with diabetes mellitus using diabetes risk factors.
Journal ArticleDOI

The ethics of AI in health care: A mapping review.

TL;DR: A mapping review of the literature concerning the ethics of artificial intelligence (AI) in health care finds that ethical issues can be epistemic, normative or traceability-related and at the relevant level of abstraction.
References
More filters
Journal ArticleDOI

SMOTE: synthetic minority over-sampling technique

TL;DR: In this article, a method of over-sampling the minority class involves creating synthetic minority class examples, which is evaluated using the area under the Receiver Operating Characteristic curve (AUC) and the ROC convex hull strategy.
Book

Classification and regression trees

Leo Breiman
TL;DR: The methodology used to construct tree structured rules is the focus of a monograph as mentioned in this paper, covering the use of trees as a data analysis method, and in a more mathematical framework, proving some of their fundamental properties.
Book

Data Mining

Ian Witten
TL;DR: In this paper, generalized estimating equations (GEE) with computing using PROC GENMOD in SAS and multilevel analysis of clustered binary data using generalized linear mixed-effects models with PROC LOGISTIC are discussed.
Book

Introduction to Data Mining

TL;DR: This book discusses data mining through the lens of cluster analysis, which examines the relationships between data, clusters, and algorithms, and some of the techniques used to solve these problems.
Related Papers (5)