scispace - formally typeset
Open AccessJournal ArticleDOI

Machine Learning and Data Mining Methods in Diabetes Research.

Reads0
Chats0
TLDR
A systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular.
Abstract
The remarkable advances in biotechnology and health sciences have led to a significant production of data, such as high throughput genetic data and clinical information, generated from large Electronic Health Records (EHRs). To this end, application of machine learning and data mining methods in biosciences is presently, more than ever before, vital and indispensable in efforts to transform intelligently all available information into valuable knowledge. Diabetes mellitus (DM) is defined as a group of metabolic disorders exerting significant pressure on human health worldwide. Extensive research in all aspects of diabetes (diagnosis, etiopathophysiology, therapy, etc.) has led to the generation of huge amounts of data. The aim of the present study is to conduct a systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular. A wide range of machine learning algorithms were employed. In general, 85% of those used were characterized by supervised learning approaches and 15% by unsupervised ones, and more specifically, association rules. Support vector machines (SVM) arise as the most successful and widely used algorithm. Concerning the type of data, clinical datasets were mainly used. The title applications in the selected articles project the usefulness of extracting valuable knowledge leading to new hypotheses targeting deeper understanding and further investigation in DM.

read more

Citations
More filters
Journal ArticleDOI

A Comparative Analysis on Blockchain versus Centralized Authentication Architectures for IoT-Enabled Smart Devices in Smart Cities: A Comprehensive Review, Recent Advances, and Future Research Directions

TL;DR: An updated review of authentication mechanisms by categorizing centralized and distributed architectures, and the security issues regarding the authentication of these IoT-enabled smart devices are discussed.
Journal ArticleDOI

Mapping of machine learning approaches for description, prediction, and causal inference in the social and health sciences

TL;DR: This paper provides a comprehensive, systematic meta-mapping of research questions in the social and health sciences to appropriate ML approaches by incorporating the necessary requirements to statistical analysis in these disciplines.
Proceedings ArticleDOI

Prediction Models for Risk of Type-2 Diabetes Using Health Claims

TL;DR: Investigating whether prediction accuracy can be improved by utilizing lab test data obtained from health checkups and incorporating health claim text data such as medically diagnosed diseases with ICD10 codes and pharmacy information and confirmed that onset of type-2 diabetes can be predicted with a high degree of accuracy when the XGBoost model is used.
Posted ContentDOI

Acceptability, Appropriateness, and Feasibility of Automated Screening Approaches and Family Communication Methods for Identification of Familial Hypercholesterolemia: Stakeholder Engagement Results from the IMPACT-FH study

TL;DR: In this paper, the authors explored the acceptability, appropriateness, and feasibility of automated screening approaches utilizing existing health data to identify those who require subsequent diagnostic evaluation for familial hypercholesterolemia (FH) and family communication methods including chatbots and direct contact to communicate information about inherited risk for FH.
Journal ArticleDOI

Acceptability, Appropriateness, and Feasibility of Automated Screening Approaches and Family Communication Methods for Identification of Familial Hypercholesterolemia: Stakeholder Engagement Results from the IMPACT-FH Study.

TL;DR: In this paper, the authors explored the acceptability, appropriateness, and feasibility of automated screening approaches utilizing existing health data to identify those who require subsequent diagnostic evaluation for familial hypercholesterolemia (FH) and family communication methods including chatbots and direct contact to communicate information about inherited risk for FH.
References
More filters
Journal ArticleDOI

Random Forests

TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
Book

Data Mining: Concepts and Techniques

TL;DR: This book presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects, and provides a comprehensive, practical look at the concepts and techniques you need to get the most out of real business data.
Book

Data Mining: Practical Machine Learning Tools and Techniques

TL;DR: This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.
Book

Artificial Intelligence: A Modern Approach

TL;DR: In this article, the authors present a comprehensive introduction to the theory and practice of artificial intelligence for modern applications, including game playing, planning and acting, and reinforcement learning with neural networks.
Proceedings ArticleDOI

Mining association rules between sets of items in large databases

TL;DR: An efficient algorithm is presented that generates all significant association rules between items in the database of customer transactions and incorporates buffer management and novel estimation and pruning techniques.
Related Papers (5)