scispace - formally typeset
Journal ArticleDOI

Noise-robust soft clustering of gene expression time-course data

TLDR
To overcome the limitations of hard clustering, this work applied soft clustering which offers several advantages for researchers, including more noise robust and a priori pre-filtering of genes can be avoided.
Abstract
Clustering is an important tool in microarray data analysis. This unsupervised learning technique is commonly used to reveal structures hidden in large gene expression data sets. The vast majority of clustering algorithms applied so far produce hard partitions of the data, i.e. each gene is assigned exactly to one cluster. Hard clustering is favourable if clusters are well separated. However, this is generally not the case for microarray time-course data, where gene clusters frequently overlap. Additionally, hard clustering algorithms are often highly sensitive to noise. To overcome the limitations of hard clustering, we applied soft clustering which offers several advantages for researchers. First, it generates accessible internal cluster structures, i.e. it indicates how well corresponding clusters represent genes. This can be used for the more targeted search for regulatory elements. Second, the overall relation between clusters, and thus a global clustering structure, can be defined. Additionally, soft clustering is more noise robust and a priori pre-filtering of genes can be avoided. This prevents the exclusion of biologically relevant genes from the data analysis. Soft clustering was implemented here using the fuzzy c-means algorithm. Procedures to find optimal clustering parameters were developed. A software package for soft clustering has been developed based on the open-source statistical language R. The package called Mfuzz is freely available.

read more

Citations
More filters
Journal ArticleDOI

Integrative proteomics and metabolomics approach to identify the key roles of icariin-mediated protective effects against cyclophosphamide-induced spermatogenesis dysfunction in mice

TL;DR: In this paper , the authors investigated the role of ICA in preventing spermatogenesis dysfunction caused by cyclophosphamide (CP) and found that ICA effectively reduces testis injury and may have a role in male infertility preservation.
Journal ArticleDOI

Seiðr: Efficient calculation of robust ensemble gene networks

TL;DR: Seidr as discussed by the authors is a software toolkit designed to assist scientists in gene regulatory and gene co-expression network inference, which creates community networks to reduce algorithmic bias and utilizes noise corrected network backboning to prune noisy edges in the networks.
Journal ArticleDOI

Identification of the Novel Gene Markers Based on the Gene Profile among Different Severity of Obstructive Sleep Apnea

TL;DR: In this paper , a set of marker genes that can detect the severity of OSA were screened by bioinformatics methods, which could be jointly used with the traditional OSA diagnostic index to achieve a more reliable OSA severity evaluation.
Journal ArticleDOI

Screening core genes and signaling pathways after SFTSV infection by integrated transcriptome profiling analysis

TL;DR: Wang et al. as mentioned in this paper found that severe fever with thrombocytopenia syndrome virus (SFTSV) infection induced the expression of genes responsible for numerous cytokine-related pathways.
Proceedings ArticleDOI

Adapting Standard External Clustering Metrics for Repetitive, Noisy Observations

TL;DR: It is demonstrated that (external) clustering metrics that explicitly treat noise are more robust than standard ( External Clustering metrics in the presence of noise.
References
More filters
Journal ArticleDOI

Cluster analysis and display of genome-wide expression patterns

TL;DR: A system of cluster analysis for genome-wide expression data from DNA microarray hybridization is described that uses standard statistical algorithms to arrange genes according to similarity in pattern of gene expression, finding in the budding yeast Saccharomyces cerevisiae that clustering gene expression data groups together efficiently genes of known similar function.
Book

Pattern Recognition with Fuzzy Objective Function Algorithms

TL;DR: Books, as a source that may involve the facts, opinion, literature, religion, and many others are the great friends to join with, becomes what you need to get.
Related Papers (5)