scispace - formally typeset
Journal ArticleDOI

Machine Learning Based Big Data Processing Framework for Cancer Diagnosis Using Hidden Markov Model and GM Clustering

TLDR
A Bayesian hidden Markov model (HMM) with Gaussian Mixture (GM) Clustering approach is used to model the DNA copy number change across the genome and is compared with various existing approaches such as Pruned Exact Linear Time method, binary segmentation method and segment neighborhood method.
Abstract
The change in the DNA is a form of genetic variation in the human genome. In addition, the DNA copy number change is also linked with the progression of many emerging diseases. Array-based Comparative Genomic Hybridization (CGH) is considered as a major task when measuring the DNA copy number change across the genome. Moreover, DNA copy number change is an essential measure to diagnose the cancer disease. Next generation sequencing is an important method for studying the spread of infectious disease qualitatively and quantitatively. CGH is widely used in continuous monitoring of copy number of thousands of genes throughout the genome. In recent years, the size of the DNA sequence data is very large. Hence, there is a need to use a scalable machine learning approach to overcome the various issues in DNA copy number change detection. In this paper, we use a Bayesian hidden Markov model (HMM) with Gaussian Mixture (GM) Clustering approach to model the DNA copy number change across the genome. The proposed Bayesian HMM with GM Clustering approach is compared with various existing approaches such as Pruned Exact Linear Time method, binary segmentation method and segment neighborhood method. Experimental results demonstrate the effectiveness of our proposed change detection algorithm.

read more

Citations
More filters
Journal ArticleDOI

A new architecture of Internet of Things and big data ecosystem for secured smart healthcare monitoring and alerting system

TL;DR: A new architecture for the implementation of IoT to store and process scalable sensor data (big data) for health care applications and uses MapReduce based prediction model to predict the heart diseases is proposed.
Journal ArticleDOI

Hybrid Recommendation System for Heart Disease Diagnosis based on Multiple Kernel Learning with Adaptive Neuro-Fuzzy Inference System

TL;DR: The proposed MKL with ANFIS based deep learning method follows two-fold approach and has produced high sensitivity, high specificity and less Mean Square Error for the for the KEGG Metabolic Reaction Network dataset.
Journal ArticleDOI

An ontology-driven personalized food recommendation in IoT-based healthcare system

TL;DR: The ProTrip RS is a health-centric RS which is capable of suggesting the food availability through considering climate attributes based on user’s personal choice and nutritive value, and the developed food recommendation approach is evaluated for the real-time IoT-based healthcare support system.
Journal ArticleDOI

IOT based wearable sensor for diseases prediction and symptom analysis in healthcare sector

TL;DR: Wearable sensor which is connected to Internet of things (IoT) based big data i.e. data mining analysis in healthcare is proposed and Regularization _ Genome wide association study (GWAS) is used to predict the diseases.
Journal ArticleDOI

An optimized feature selection based on genetic approach and support vector machine for heart disease

TL;DR: This paper proposes an optimization function on the basis of support vector machine (SVM) that is used in the genetic algorithm (GA) for selecting the more significant features to get heart disease.
References
More filters
Proceedings Article

A density-based algorithm for discovering clusters a density-based algorithm for discovering clusters in large spatial databases with noise

TL;DR: In this paper, a density-based notion of clusters is proposed to discover clusters of arbitrary shape, which can be used for class identification in large spatial databases and is shown to be more efficient than the well-known algorithm CLAR-ANS.
Proceedings Article

A density-based algorithm for discovering clusters in large spatial Databases with Noise

TL;DR: DBSCAN, a new clustering algorithm relying on a density-based notion of clusters which is designed to discover clusters of arbitrary shape, is presented which requires only one input parameter and supports the user in determining an appropriate value for it.
Journal ArticleDOI

A Cluster Analysis Method for Grouping Means in the Analysis of Variance

A. J. Scott, +1 more
- 01 Sep 1974 - 
TL;DR: In this paper, the authors used the techniques of cluster analysis to split the treatments into reasonably homogeneous groups and developed a likelihood ratio test for judging the significance of differences among the resulting groups.
Journal ArticleDOI

Hidden markov models in computational biology: applications to protein modeling

TL;DR: The results suggest the presence of an EF-hand calcium binding motif in a highly conserved and evolutionary preserved putative intracellular region of 155 residues in the alpha-1 subunit of L-type calcium channels which play an important role in excitation-contraction coupling.
Journal ArticleDOI

PennCNV: An integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data

TL;DR: PennCNV, a hidden Markov model (HMM) based approach, is presented for kilobase-resolution detection of CNVs from Illumina high-density SNP genotyping data, demonstrating the feasibility of whole-genome fine-mapping ofCNVs via high- density SNP genotypesing.
Related Papers (5)