scispace - formally typeset
Journal ArticleDOI

Biclustering Algorithms for Biological Data Analysis: A Survey

Reads0
Chats0
TLDR
In this comprehensive survey, a large number of existing approaches to biclustering are analyzed, and they are classified in accordance with the type of biclusters they can find, the patterns of bIClusters that are discovered, the methods used to perform the search, the approaches used to evaluate the solution, and the target applications.
Abstract
A large number of clustering approaches have been proposed for the analysis of gene expression data obtained from microarray experiments. However, the results from the application of standard clustering methods to genes are limited. This limitation is imposed by the existence of a number of experimental conditions where the activity of genes is uncorrelated. A similar limitation exists when clustering of conditions is performed. For this reason, a number of algorithms that perform simultaneous clustering on the row and column dimensions of the data matrix has been proposed. The goal is to find submatrices, that is, subgroups of genes and subgroups of conditions, where the genes exhibit highly correlated activities for every condition. In this paper, we refer to this class of algorithms as biclustering. Biclustering is also referred in the literature as coclustering and direct clustering, among others names, and has also been used in fields such as information retrieval and data mining. In this comprehensive survey, we analyze a large number of existing approaches to biclustering, and classify them in accordance with the type of biclusters they can find, the patterns of biclusters that are discovered, the methods used to perform the search, the approaches used to evaluate the solution, and the target applications.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Scalable biclustering - the future of big data exploration?

TL;DR: The caveats of bic Lustering are discussed and its current challenges and guidelines for practitioners are presented and it is explained why biclustering may soon become one of the standards for big data analytics.
Journal ArticleDOI

On context-aware co-clustering with metadata support

TL;DR: Three alternative strategies for embedding available contextual knowledge into the co-clustering process are proposed and Experimental results show that it is possible to leverage the available metadata in discovering contextually-relevant co- clusters, without significant overheads in terms of information theoretical co-Cluster quality or execution cost.
Journal ArticleDOI

Design Exploration of Geometric Biclustering for Microarray Data Analysis in Data Mining

TL;DR: This paper designs a multi-threaded software running on a server grade multi-core CPU system, a CUDA program for GPU to accelerate the GBC algorithm, and a novel parameterizable and scalable hardware architecture implemented on an FPGA to accelerate this G BC algorithm.
Dissertation

Graph-based genomic signatures

Amrita Pati
Journal ArticleDOI

Finding checkerboard patterns via fractional 0–1 programming

TL;DR: A new mathematical programming formulation for unsupervised biclustering is provided, which involves the solution of a fractional 0–1 programming problem and a linear-mixed 0-1 reformulation as well as two heuristic-based approaches are developed.
References
More filters
Book

Introduction to Algorithms

TL;DR: The updated new edition of the classic Introduction to Algorithms is intended primarily for use in undergraduate or graduate courses in algorithms or data structures and presents a rich variety of algorithms and covers them in considerable depth while making their design and analysis accessible to all levels of readers.
Journal ArticleDOI

Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.

TL;DR: A generic approach to cancer classification based on gene expression monitoring by DNA microarrays is described and applied to human acute leukemias as a test case and suggests a general strategy for discovering and predicting cancer classes for other types of cancer, independent of previous biological knowledge.
Journal ArticleDOI

Comprehensive Identification of Cell Cycle–regulated Genes of the Yeast Saccharomyces cerevisiae by Microarray Hybridization

TL;DR: A comprehensive catalog of yeast genes whose transcript levels vary periodically within the cell cycle is created, and it is found that the mRNA levels of more than half of these 800 genes respond to one or both of these cyclins.
Related Papers (5)