scispace - formally typeset
Book ChapterDOI

Integration of Gene Expression and Ontology for Clustering Functionally Similar Genes

TLDR
In this work, it has been shown that incorporation of integrated dissimilarity measure increases the functional similarity of cluster of genes as compared to the methods that are based on either type of dissimilarities measure.
Abstract
Clustering functionally similar genes helps in understanding the mechanism of a biological pathway. It also provides information of those genes whose biological importance is previously not known. Clustering of genes is highly dependent on the similarity or dissimilarity criterion. Usually, microarray gene expression data is used to cluster genes. However, a microarray data may contain noise that may lead to undesired results. Therefore, incorporating gene ontology information may improve the clustering solutions. In this regard, an integrated dissimilarity measure is introduced for grouping functionally similar genes. It is comprised of city block distance and gene ontology based semantic dissimilarity. While, the city block distance is used to compute distance between two gene expression vectors, gene ontology based semantic dissimilarity measure is used for incorporating biological knowledge. The importance of the integrated dissimilarity measure is shown by incorporating it in different c-means clustering algorithms including rough-fuzzy clustering algorithms. In this work it has been shown that incorporation of integrated dissimilarity measure increases the functional similarity of cluster of genes as compared to the methods that are based on either type of dissimilarity measure. It is also observed that the rough-fuzzy clustering algorithm performs better with the new dissimilarity measure compared to different c-means clustering algorithms.

read more

Citations
More filters
Proceedings ArticleDOI

Multi-View Gene Clustering using Gene Ontology and Expression-based Similarities

TL;DR: A multi-view multi-objective clustering approach where Euclidean distances between the gene expression values and GO-based multi-factored gene-gene semantic similarity are considered as two complementary views and a consensus partitioning is obtained that satisfies both the aspects or views.
Proceedings ArticleDOI

Comparing Dissimilarity Metrics for Clustering Gene into Functional Modules using Machine Learning

TL;DR: 1-abs(Pearson correlation) works the best in two test cases for identifying genes involved in ethanol metabolism and galactose metabolism and is proposed to be used for future studies of clustering of genes based on expression level.
References
More filters
Journal ArticleDOI

Cluster analysis and display of genome-wide expression patterns

TL;DR: A system of cluster analysis for genome-wide expression data from DNA microarray hybridization is described that uses standard statistical algorithms to arrange genes according to similarity in pattern of gene expression, finding in the budding yeast Saccharomyces cerevisiae that clustering gene expression data groups together efficiently genes of known similar function.
Journal ArticleDOI

A Cluster Separation Measure

TL;DR: A measure is presented which indicates the similarity of clusters which are assumed to have a data density which is a decreasing function of distance from a vector characteristic of the cluster which can be used to infer the appropriateness of data partitions.
Proceedings Article

An Information-Theoretic Definition of Similarity

Dekang Lin
TL;DR: This work presents an informationtheoretic definition of similarity that is applicable as long as there is a probabilistic model and demonstrates how this definition can be used to measure the similarity in a number of different domains.
Posted Content

Using Information Content to Evaluate Semantic Similarity in a Taxonomy

TL;DR: In this article, a new measure of semantic similarity in an IS-A taxonomy based on the notion of information content is presented, and experimental evaluation suggests that the measure performs encouragingly well (a correlation of r = 0.79 with a benchmark set of human similarity judgments, with an upper bound of r < 0.90 for human subjects performing the same task).
Journal ArticleDOI

A Genome-Wide Transcriptional Analysis of the Mitotic Cell Cycle

TL;DR: The genome-wide characterization of mRNA transcript levels during the cell cycle of the budding yeast S. cerevisiae indicates a mechanism for local chromosomal organization in global mRNA regulation and links a range of human genes to cell cycle period-specific biological functions.
Related Papers (5)