Clustering Algorithms: Their Application to Gene Expression Data

doi:10.4137/BBI.S38316

Open AccessJournal ArticleDOI

Clustering Algorithms: Their Application to Gene Expression Data

Jelili Oyelade, +7 more

- 30 Nov 2016 -

Bioinformatics and Biology Insights

- Vol. 10, Iss: 10, pp 237-253

Chats0

TLDR

This review examines the various clustering algorithms applicable to the gene expression data in order to discover and provide useful knowledge of the appropriate clustering technique that will guarantee stability and high degree of accuracy in its analysis procedure.

Abstract:

Gene expression data hide vital information required to understand the biological process that takes place in a particular organism in relation to its environment. Deciphering the hidden patterns in gene expression data proffers a prodigious preference to strengthen the understanding of functional genomics. The complexity of biological networks and the volume of genes present increase the challenges of comprehending and interpretation of the resulting mass of data, which consists of millions of measurements; these data also inhibit vagueness, imprecision, and noise. Therefore, the use of clustering techniques is a first step toward addressing these challenges, which is essential in the data mining process to reveal natural structures and identify interesting patterns in the underlying data. The clustering of gene expression data has been proven to be useful in making known the natural structure inherent in gene expression data, understanding gene functions, cellular processes, and subtypes of cells, mining useful information from noisy data, and understanding gene regulation. The other benefit of clustering gene expression data is the identification of homology, which is very important in vaccine design. This review examines the various clustering algorithms applicable to the gene expression data in order to discover and provide useful knowledge of the appropriate clustering technique that will guarantee stability and high degree of accuracy in its analysis procedure.

Clustering Algorithms: Their Application to Gene Expression Data

Citations

Interactive dynamics of matrix adhesion and reaction-diffusion predict diverse multiscale strategies of cancer cell invasion

UMAP guided topological analysis of transcriptomic data for cancer subtyping

Application of K-Means Clustering to Identify Similar Gene Expression Patterns during Erythroid Development

Interactive dynamics of reaction-diffusion and adhesion predict diverse invasion strategies of cancer cells in matrix-like microenvironments

Microarray in bioinformatics

References

Maximum likelihood from incomplete data via the EM algorithm

Scikit-learn: Machine Learning in Python

Scikit-learn: Machine Learning in Python

Some methods for classification and analysis of multivariate observations

A density-based algorithm for discovering clusters a density-based algorithm for discovering clusters in large spatial databases with noise

Related Papers (5)

Some methods for classification and analysis of multivariate observations

Silhouettes: a graphical aid to the interpretation and validation of cluster analysis

Cluster analysis and display of genome-wide expression patterns

Visualizing Data using t-SNE

A density-based algorithm for discovering clusters in large spatial Databases with Noise

Trending Questions (1)