scispace - formally typeset
Journal ArticleDOI

Noise-robust soft clustering of gene expression time-course data

TLDR
To overcome the limitations of hard clustering, this work applied soft clustering which offers several advantages for researchers, including more noise robust and a priori pre-filtering of genes can be avoided.
Abstract
Clustering is an important tool in microarray data analysis. This unsupervised learning technique is commonly used to reveal structures hidden in large gene expression data sets. The vast majority of clustering algorithms applied so far produce hard partitions of the data, i.e. each gene is assigned exactly to one cluster. Hard clustering is favourable if clusters are well separated. However, this is generally not the case for microarray time-course data, where gene clusters frequently overlap. Additionally, hard clustering algorithms are often highly sensitive to noise. To overcome the limitations of hard clustering, we applied soft clustering which offers several advantages for researchers. First, it generates accessible internal cluster structures, i.e. it indicates how well corresponding clusters represent genes. This can be used for the more targeted search for regulatory elements. Second, the overall relation between clusters, and thus a global clustering structure, can be defined. Additionally, soft clustering is more noise robust and a priori pre-filtering of genes can be avoided. This prevents the exclusion of biologically relevant genes from the data analysis. Soft clustering was implemented here using the fuzzy c-means algorithm. Procedures to find optimal clustering parameters were developed. A software package for soft clustering has been developed based on the open-source statistical language R. The package called Mfuzz is freely available.

read more

Citations
More filters
Posted ContentDOI

ADAR1-dependent miR-3144-3p editing simultaneously induces MSI2 and suppresses SLC38A4 in liver cancer

TL;DR: In this paper , a multi-step hepatocellular carcinogenesis transcriptome data analysis, together with publicly available data, indicated that ADAR1 is the most dysregulated gene among the RNA editing enzyme families in liver cancer.
Posted ContentDOI

Global and Gene-specific Transcriptional Responses to Acute Stress

TL;DR: In this article, a two-component response to acute stress, one focused on the +1 nucleosome, the second on RNA polymerase II (Pol2), was uncovered.
Journal ArticleDOI

Measuring the Competitive Advantage of Countries Along the Belt and Road From the Perspective of Complex Social Networks

TL;DR: Based on the complex network theory, the authors constructs a complex network model of global value chain (GVC) division of labor system by using the Multi-Regional Input-output (MRIO) table, and reveals the variation trend of competitiveness of industrial sectors and economies on the GVC network by the National Competitive Advantage Index.
Journal ArticleDOI

Effects of dietary fibers or probiotics on functional constipation symptoms and roles of gut microbiota: a double-blinded randomized placebo trial

TL;DR: In this paper , the effects of formulas with dietary fibers or probiotics on functional constipation symptoms, and to identify modulations of gut microbiota of relevance, were evaluated in a 4-week double-blinded randomized placebo-controlled trial.
Posted ContentDOI

Adaptive evolution of proteins expressed in late development and adult in animals

TL;DR: There is a strong and consistent signal that positive selection mainly affects genes and pathways expressed in late development and adult, which implies that evolution of embryogenesis is mostly conservative, with most adaptive evolution affecting post-embryonic gene expression, and thus post-Embryonic phenotypes.
References
More filters
Journal ArticleDOI

Cluster analysis and display of genome-wide expression patterns

TL;DR: A system of cluster analysis for genome-wide expression data from DNA microarray hybridization is described that uses standard statistical algorithms to arrange genes according to similarity in pattern of gene expression, finding in the budding yeast Saccharomyces cerevisiae that clustering gene expression data groups together efficiently genes of known similar function.
Book

Pattern Recognition with Fuzzy Objective Function Algorithms

TL;DR: Books, as a source that may involve the facts, opinion, literature, religion, and many others are the great friends to join with, becomes what you need to get.
Related Papers (5)