Machine Learning and Data Mining Methods in Diabetes Research.
Ioannis Kavakiotis,O. Tsave,Athanasios Salifoglou,Nicos Maglaveras,Ioannis Vlahavas,Ioanna Chouvarda +5 more
Reads0
Chats0
TLDR
A systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular.Abstract:
The remarkable advances in biotechnology and health sciences have led to a significant production of data, such as high throughput genetic data and clinical information, generated from large Electronic Health Records (EHRs). To this end, application of machine learning and data mining methods in biosciences is presently, more than ever before, vital and indispensable in efforts to transform intelligently all available information into valuable knowledge. Diabetes mellitus (DM) is defined as a group of metabolic disorders exerting significant pressure on human health worldwide. Extensive research in all aspects of diabetes (diagnosis, etiopathophysiology, therapy, etc.) has led to the generation of huge amounts of data. The aim of the present study is to conduct a systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular. A wide range of machine learning algorithms were employed. In general, 85% of those used were characterized by supervised learning approaches and 15% by unsupervised ones, and more specifically, association rules. Support vector machines (SVM) arise as the most successful and widely used algorithm. Concerning the type of data, clinical datasets were mainly used. The title applications in the selected articles project the usefulness of extracting valuable knowledge leading to new hypotheses targeting deeper understanding and further investigation in DM.read more
Citations
More filters
Journal ArticleDOI
A Comparative Analysis of Machine Learning Models: A Case Study in Predicting Chronic Kidney Disease
Hasnain Iftikhar,Murad Khan Khan,Zardad Khan,Faridoon Khan,Huda Mohammed H Alshanbari,Zubair Ahmad +5 more
TL;DR: In this article , the authors predict chronic kidney disease using machine learning models, including logistic, probit, random forest, decision tree, k-nearest neighbor, and support vector machine with four kernel functions (linear, Laplacian, Bessel, and radial basis kernels).
Proceedings ArticleDOI
Efficient Feature Selection for Prediction of Diabetic Using LASSO
TL;DR: The main aim of this paper is to present a novel method for selecting efficient features for predicting diabetes using the Least Absolute Shrinkage and Selection Operator (LASSO) method.
Journal ArticleDOI
A Distributed Snapshot Protocol for Efficient Artificial Intelligence Computation in Cloud Computing Environments
TL;DR: The proposed snapshot protocol is based on a distributed algorithm to run interconnected multiple nodes in a scalable fashion and is able to deal with artificial intelligence applications, in which a large number of computing nodes are running.
Book
Predicting Diabetes Mellitus and Analysing Risk-Factors Correlation
Md. Faisal Faruque,Asaduzzaman Asaduzzaman,Syed Md. Minhaz Hossain,Md. Hasan Furhad,Iqbal H. Sarker +4 more
TL;DR: There is a positive correlation for predicting kidney complications (Nephropathy) and blood pressure (Hypertension) complications and a negative correlation at predicting hearing loss and skin complications (diabetes dermopathy) from diabetic patients.
Book ChapterDOI
Study and impact analysis of COVID-19 pandemic clinical data on infection spreading
TL;DR: In this paper , the authors proposed a decision tree approach for controlling the spread of COVID-19 outbreak in clinical data of India through a case study, where the data is a time series data, and analyzed the data from March 1, 2020 to April 15, 2020.
References
More filters
Journal ArticleDOI
Random Forests
TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
Book
Data Mining: Concepts and Techniques
TL;DR: This book presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects, and provides a comprehensive, practical look at the concepts and techniques you need to get the most out of real business data.
Book
Data Mining: Practical Machine Learning Tools and Techniques
TL;DR: This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.
Book
Artificial Intelligence: A Modern Approach
Stuart Russell,Peter Norvig +1 more
TL;DR: In this article, the authors present a comprehensive introduction to the theory and practice of artificial intelligence for modern applications, including game playing, planning and acting, and reinforcement learning with neural networks.
Proceedings ArticleDOI
Mining association rules between sets of items in large databases
TL;DR: An efficient algorithm is presented that generates all significant association rules between items in the database of customer transactions and incorporates buffer management and novel estimation and pruning techniques.