scispace - formally typeset
Open AccessJournal ArticleDOI

Predicting the Presence of Heart Diseases using Comparative Data Mining and Machine Learning Algorithms

Daniel Ananey-Obiri, +1 more
- 15 Apr 2020 - 
- Vol. 176, Iss: 11, pp 17-21
TLDR
This study will use data exploratory and mining techniques to extract hidden patterns using python to seek better performance in predicting heart diseases to reduce the number of tests require for the diagnosis of heart diseases.
Abstract
Heart disease, an example of cardiovascular diseases is the number one notable reason for the death of many people in the world. Of recent, studies have concentrated on using alternative efficient techniques such as data mining and machine learning in the diagnosis of diseases based on certain features of an individual. This study will use data exploratory and mining techniques to extract hidden patterns using python. By this, machine learning algorithms (logistic linear regression, decision tree classifier, Gaussian Naïve Bayes models) will be developed to predict the presence of heart diseases in patients. This will try to seek better performance in predicting heart diseases to reduce the number of tests require for the diagnosis of heart diseases. The k-fold cross validation approach will be used in assessing the resulting models for receiver operating characteristic (ROC) curves (sensitivity against specificity). The dataset was collected from UCI machine learning repository which contains information on patients with heart disease. The dataset has 14 attributes and measured on 303 individuals. General Terms Algorithms, pattern recognition, supervised learning, machine learning, heart disease.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Heart Disease Risk Prediction Using Machine Learning Classifiers with Attribute Evaluators

TL;DR: The SMO classifier stood as the best prediction method compared to other techniques, and IBk achieved an 8.25% accuracy improvement by tuning the hyperparameter ‘k’ to 9 with the chi-squared attribute set.
Journal ArticleDOI

Identification of Crop Diseases and Insect Pests Based on Deep Learning

Bo Wang
TL;DR: The experimental analysis of the proposed model based on the constructed data set shows that the average recognition accuracy and recognition time of fragrant pear diseases and insect pests are better than other comparison models.
Journal ArticleDOI

Word2vec neural model-based technique to generate protein vectors for combating COVID-19: a machine learning approach

TL;DR: In this article , the authors developed models that can accurately classify COVID-19 viral sequences rapidly using protein vectors generated by neural word embedding technique using five machine learning models; K nearest neighbor regression (KNN), support vector machine (SVM), random forest (RF), Linear discriminant analysis (LDA), and Logistic regression were developed using datasets from the National Center for Biotechnology.
Journal ArticleDOI

Incorporating CNN Features for Optimizing Performance of Ensemble Classifier for Cardiovascular Disease Prediction

TL;DR: This study proposes the novel use of feature extraction from a convolutional neural network (CNN) using a CNN model to enlarge the feature set to train linear models including stochastic gradient descent classifier, logistic regression, and support vector machine that comprise the soft-voting based ensemble model.
Journal ArticleDOI

Comparison of various machine learning approaches uses in heart ailments prediction

TL;DR: By analyzing hundreds of healthcare data and other semantics, machine learning algorithms can analyze related cases with diseases and health conditions and generate predictions through different models from patient data.
References
More filters

An empirical study of the naive Bayes classifier

Irina Rish
TL;DR: This work analyzes the impact of the distribution entropy on the classification error, showing that low-entropy feature distributions yield good performance of naive Bayes and demonstrates that naive Baye works well for certain nearlyfunctional feature dependencies.
Proceedings Article

The Optimality of Naive Bayes.

TL;DR: A sufficient condition for the optimality of naive Bayes is presented and proved, in which the dependence between attributes do exist, and evidence that dependence among attributes may cancel out each other is provided.
Journal ArticleDOI

Prognostic modeling with logistic regression analysis: in search of a sensible strategy in small data sets.

TL;DR: A sensible strategy in small data sets is to apply shrinkage methods in full models that include well-coded predictors that are selected based on external information, such as full models including all available covariables.
Proceedings Article

Decision Tree Analysis on J48 Algorithm for Data Mining

TL;DR: This research is focussed on J48 algorithm which is used to create Univariate Decision Trees and discusses about the idea of multivariate decision tree with process of classify instance by using more than one attribute at each internal node.
Journal ArticleDOI

Improved Study of Heart Disease Prediction System using Data Mining Classification Techniques

TL;DR: This paper has analysed prediction systems for Heart disease using more number of input attributes and shows that out of these three classification models Neural Networks predicts Heart disease with highest accuracy.
Related Papers (5)