scispace - formally typeset
Open AccessJournal ArticleDOI

Information theory applications for biological sequence analysis

Susana Vinga
- 01 May 2014 - 
- Vol. 15, Iss: 3, pp 376-389
Reads0
Chats0
TLDR
This review covers several aspects of IT applications, ranging from genome global analysis and comparison, including block-entropy estimation and resolution-free metrics based on iterative maps, to local analysis, comprising the classification of motifs, prediction of transcription factor binding sites and sequence characterization based on linguistic complexity and entropic profiles.
Abstract
Information theory (IT) addresses the analysis of communication systems and has been widely applied in molecular biology. In particular, alignment-free sequence analysis and comparison greatly benefited from concepts derived from IT, such as entropy and mutual information. This review covers several aspects of IT applications, ranging from genome global analysis and comparison, including block-entropy estimation and resolution-free metrics based on iterative maps, to local analysis, comprising the classification of motifs, prediction of transcription factor binding sites and sequence characterization based on linguistic complexity and entropic profiles. IT has also been applied to high-level correlations that combine DNA, RNA or protein features with sequence-independent properties, such as gene mapping and phenotype analysis, and has also provided models based on communication systems theory to describe information transmission channels at the cell level and also during evolutionary processes. While not exhaustive, this review attempts to categorize existing methods and to indicate their relation with broader transversal topics such as genomic signatures, data compression and complexity, time series analysis and phylogenetic classification, providing a resource for future developments in this promising area.

read more

Citations
More filters
Journal ArticleDOI

Alignment-free sequence comparison: benefits, applications, and tools

TL;DR: This work provides a guide to the currently available alignment-free sequence analysis tools and addresses questions about how these methods work, how they compare to alignment-based methods, and what their potential is for use for their research.
Journal ArticleDOI

Emerging Computational Methods for the Rational Discovery of Allosteric Drugs.

TL;DR: Algorithms that predict allosteric sites based on sequence data and molecular dynamics simulations are reviewed, tools that assess the druggability of these pockets are described, and how Markov state models and topology analyses provide insight into the relationship between protein dynamics andallosteric drug binding are discussed.
Journal ArticleDOI

A Tutorial for Information Theory in Neuroscience

TL;DR: This article walks through the mathematics of information theory along with common logistical problems associated with data type, data binning, data quantity requirements, bias, and significance testing, and provides a free MATLAB software package that can be applied to a wide range of data from neuroscience experiments, as well as from other fields of study.
Journal ArticleDOI

Alignment-Free Sequence Analysis and Applications

TL;DR: A review of word-count based approaches for alignment-free sequence analysis can be found in this article, where the authors provide an updated review of these applications and other related developments of word count-based approaches.
References
More filters
Journal ArticleDOI

A mathematical theory of communication

TL;DR: This final installment of the paper considers the case where the signals or the messages or both are continuously variable, in contrast with the discrete nature assumed until now.
Book

Elements of information theory

TL;DR: The author examines the role of entropy, inequality, and randomness in the design of codes and the construction of codes in the rapidly changing environment.
Related Papers (5)