scispace - formally typeset
Open AccessJournal ArticleDOI

Machine Learning and Data Mining Methods in Diabetes Research.

Reads0
Chats0
TLDR
A systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular.
Abstract
The remarkable advances in biotechnology and health sciences have led to a significant production of data, such as high throughput genetic data and clinical information, generated from large Electronic Health Records (EHRs). To this end, application of machine learning and data mining methods in biosciences is presently, more than ever before, vital and indispensable in efforts to transform intelligently all available information into valuable knowledge. Diabetes mellitus (DM) is defined as a group of metabolic disorders exerting significant pressure on human health worldwide. Extensive research in all aspects of diabetes (diagnosis, etiopathophysiology, therapy, etc.) has led to the generation of huge amounts of data. The aim of the present study is to conduct a systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular. A wide range of machine learning algorithms were employed. In general, 85% of those used were characterized by supervised learning approaches and 15% by unsupervised ones, and more specifically, association rules. Support vector machines (SVM) arise as the most successful and widely used algorithm. Concerning the type of data, clinical datasets were mainly used. The title applications in the selected articles project the usefulness of extracting valuable knowledge leading to new hypotheses targeting deeper understanding and further investigation in DM.

read more

Citations
More filters
Journal ArticleDOI

Machine learning

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
Journal ArticleDOI

Comparing different supervised machine learning algorithms for disease prediction

TL;DR: It is found that the Support Vector Machine (SVM) algorithm is applied most frequently (in 29 studies) followed by the Naïve Bayes algorithm (in 23 studies), however, the Random Forest algorithm showed superior accuracy comparatively.
Journal ArticleDOI

Applications of machine learning and artificial intelligence for Covid-19 (SARS-CoV-2) pandemic: A review

TL;DR: The ongoing development in AI and ML has significantly improved treatment, medication, screening, prediction, forecasting, contact tracing, and drug/vaccine development process for the Covid-19 pandemic and reduce the human intervention in medical practice.
Journal ArticleDOI

Predicting Diabetes Mellitus With Machine Learning Techniques.

TL;DR: The results showed that prediction with random forest could reach the highest accuracy (ACC = 0.8084) when all the attributes were used and principal component analysis (PCA) and minimum redundancy maximum relevance (mRMR) was used to reduce the dimensionality.
References
More filters
Journal ArticleDOI

Exudate-based diabetic macular edema detection in fundus images using publicly available datasets.

TL;DR: A new methodology for diagnosis of DME using a novel set of features based on colour, wavelet decomposition and automatic lesion segmentation is introduced, able to achieve diagnosis performance comparable to retina experts on the MESSIDOR with cross-dataset testing.
Journal ArticleDOI

Classifier ensemble construction with rotation forest to improve medical diagnosis performance of machine learning algorithms

TL;DR: In this article, a rotation forest (RF) ensemble classifiers of 30 machine learning algorithms was constructed to evaluate their classification performance using Parkinson's, diabetes and heart diseases from literature, which achieved average accuracies of 74.47%, 80.49% and 87.13% for respective diseases.
Journal ArticleDOI

Comparison of three data mining models for predicting diabetes or prediabetes by risk factors.

TL;DR: The decision tree model (C5.0) had the best classification accuracy, followed by the logistic regression model, and the ANN gave the lowest accuracy.
Journal ArticleDOI

Probability machines: consistent probability estimation using nonparametric learning machines.

TL;DR: Random forest algorithms as well as nearest neighbor approaches are valid machine learning methods for estimating individual probabilities for binary responses.
Journal ArticleDOI

Population-Level Prediction of Type 2 Diabetes From Claims Data and Analysis of Risk Factors.

TL;DR: In this paper, a new approach to population health, in which data-driven predictive models are learned for outcomes such as type 2 diabetes, is presented, which enables risk assessment from readily available electronic claims data on large populations, without additional screening cost.
Related Papers (5)