scispace - formally typeset
Journal ArticleDOI

Biclustering Algorithms for Biological Data Analysis: A Survey

Reads0
Chats0
TLDR
In this comprehensive survey, a large number of existing approaches to biclustering are analyzed, and they are classified in accordance with the type of biclusters they can find, the patterns of bIClusters that are discovered, the methods used to perform the search, the approaches used to evaluate the solution, and the target applications.
Abstract
A large number of clustering approaches have been proposed for the analysis of gene expression data obtained from microarray experiments. However, the results from the application of standard clustering methods to genes are limited. This limitation is imposed by the existence of a number of experimental conditions where the activity of genes is uncorrelated. A similar limitation exists when clustering of conditions is performed. For this reason, a number of algorithms that perform simultaneous clustering on the row and column dimensions of the data matrix has been proposed. The goal is to find submatrices, that is, subgroups of genes and subgroups of conditions, where the genes exhibit highly correlated activities for every condition. In this paper, we refer to this class of algorithms as biclustering. Biclustering is also referred in the literature as coclustering and direct clustering, among others names, and has also been used in fields such as information retrieval and data mining. In this comprehensive survey, we analyze a large number of existing approaches to biclustering, and classify them in accordance with the type of biclusters they can find, the patterns of biclusters that are discovered, the methods used to perform the search, the approaches used to evaluate the solution, and the target applications.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Discovery of microRNA–mRNA modules via population-based probabilistic learning

TL;DR: A probabilistic learning method to identify synergistic miRNAs involving regulation of their condition-specific target genes (mRNAs) from multiple information sources, i.e. computationally predicted target genes of miRNA and target sets presumed to constitute closely related parts of gene regulatory pathways.
Journal ArticleDOI

Finding large average submatrices in high dimensional data

TL;DR: In this article, a statistically motivated biclustering procedure (LAS) is proposed to find large average submatrices within a given real-valued data matrix, and the procedure operates in an iterative-residual fashion, and is driven by a Bonferroni-based significance score that effectively trades off between submatrix size and average value.
Journal IssueDOI

Seriation and matrix reordering methods: An historical overview

TL;DR: Seriation is an exploratory combinatorial data analysis technique to reorder objects into a sequence along a one-dimensional continuum so that it best reveals regularity and patterning among the whole series.
Journal ArticleDOI

Rate-optimal graphon estimation

TL;DR: In this paper, the authors established the optimal rate of convergence for graphon estimation for the stochastic block model with k-clusters and showed that the optimal convergence rate is n −1/log k+k −2/n −2 +k−2 +n−2.
Journal ArticleDOI

Clustering Network Layers with the Strata Multilayer Stochastic Block Model

TL;DR: An algorithm for separating layers into their appropriate strata and an inference technique for estimating the SBM parameters for each stratum are described, which demonstrate the method using synthetic networks and a multilayer network inferred from data collected in the Human Microbiome Project.
References
More filters
Book

Introduction to Algorithms

TL;DR: The updated new edition of the classic Introduction to Algorithms is intended primarily for use in undergraduate or graduate courses in algorithms or data structures and presents a rich variety of algorithms and covers them in considerable depth while making their design and analysis accessible to all levels of readers.
Journal ArticleDOI

Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.

TL;DR: A generic approach to cancer classification based on gene expression monitoring by DNA microarrays is described and applied to human acute leukemias as a test case and suggests a general strategy for discovering and predicting cancer classes for other types of cancer, independent of previous biological knowledge.
Journal ArticleDOI

Comprehensive Identification of Cell Cycle–regulated Genes of the Yeast Saccharomyces cerevisiae by Microarray Hybridization

TL;DR: A comprehensive catalog of yeast genes whose transcript levels vary periodically within the cell cycle is created, and it is found that the mRNA levels of more than half of these 800 genes respond to one or both of these cyclins.
Related Papers (5)