Machine Learning and Data Mining Methods in Diabetes Research.
Ioannis Kavakiotis,O. Tsave,Athanasios Salifoglou,Nicos Maglaveras,Ioannis Vlahavas,Ioanna Chouvarda +5 more
Reads0
Chats0
TLDR
A systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular.Abstract:
The remarkable advances in biotechnology and health sciences have led to a significant production of data, such as high throughput genetic data and clinical information, generated from large Electronic Health Records (EHRs). To this end, application of machine learning and data mining methods in biosciences is presently, more than ever before, vital and indispensable in efforts to transform intelligently all available information into valuable knowledge. Diabetes mellitus (DM) is defined as a group of metabolic disorders exerting significant pressure on human health worldwide. Extensive research in all aspects of diabetes (diagnosis, etiopathophysiology, therapy, etc.) has led to the generation of huge amounts of data. The aim of the present study is to conduct a systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular. A wide range of machine learning algorithms were employed. In general, 85% of those used were characterized by supervised learning approaches and 15% by unsupervised ones, and more specifically, association rules. Support vector machines (SVM) arise as the most successful and widely used algorithm. Concerning the type of data, clinical datasets were mainly used. The title applications in the selected articles project the usefulness of extracting valuable knowledge leading to new hypotheses targeting deeper understanding and further investigation in DM.read more
Citations
More filters
Journal ArticleDOI
Environmental exposures in machine learning and data mining approaches to diabetes etiology: A scoping review
TL;DR: The use of machine learning and data mining methods to elucidate environmental triggers of diabetes was largely limited to well-established risk factors identified using easily explainable and interpretable models as mentioned in this paper .
Book ChapterDOI
Diabetes Classification Using ML Algorithms
TL;DR: In this article , a method for predicting diabetes using a number of ML algorithms, such as the support vector classifier (SVC), decision tree classifiers (DTC), K-neighbors classifier(KNC), logistic regression (LR), random forest classifier and AdaBoost classifier, was presented.
Proceedings ArticleDOI
A Framework for T2D Management & Knowledge Discovery of Complications in the Context of Chinese Culture: From Triggers to Causalities
TL;DR: Wang et al. as mentioned in this paper proposed a system that can collect real-time, patient-generated data which will unveil the patterns about the triggers and causalities for better decision-making and, in the long run, provide an overarching understanding of the progression and evolution of T2D and its complications.
Book ChapterDOI
An Analysis of Comorbidities’ Role in Diabetes Mellitus and Its Data-Intensive Technology-Based Prediction to Reduce Risk and Diagnostic Costs
TL;DR: The main purpose of study is to identify the impact of data-driven comorbidity effects in diabetes patients by predicting their risk status through data-intensive technology (Big Data) as uncovered problem domain in computer application to enhance the research potentials on the killer disease, diabetes mellitus.
Proceedings ArticleDOI
Performance Evaluation of Data Mining and Neural Network Based Models For Diabetes Prediction
TL;DR: In this paper , the authors examined the relative merits among several ML and DL approaches to the problem of early diabetic illness prediction, including Random Forest (RF), Logistic Regression (LR), Support Vector Machine (SVM), Artificial Neural Network (ANN), Multilayer Perceptron (MLP), and Decision Tree, Gradient Boost (GB), XGBoost (GB).
References
More filters
Journal ArticleDOI
Random Forests
TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
Book
Data Mining: Concepts and Techniques
TL;DR: This book presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects, and provides a comprehensive, practical look at the concepts and techniques you need to get the most out of real business data.
Book
Data Mining: Practical Machine Learning Tools and Techniques
TL;DR: This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.
Book
Artificial Intelligence: A Modern Approach
Stuart Russell,Peter Norvig +1 more
TL;DR: In this article, the authors present a comprehensive introduction to the theory and practice of artificial intelligence for modern applications, including game playing, planning and acting, and reinforcement learning with neural networks.
Proceedings ArticleDOI
Mining association rules between sets of items in large databases
TL;DR: An efficient algorithm is presented that generates all significant association rules between items in the database of customer transactions and incorporates buffer management and novel estimation and pruning techniques.