scispace - formally typeset
Journal ArticleDOI

Genomic signal processing

D. Anastassiou
- 01 Jul 2001 - 
- Vol. 18, Iss: 4, pp 8-20
Reads0
Chats0
TLDR
Digital signal processing provides a set of novel and useful tools for solving highly relevant problems in genomic information science and technology, in the form of local texture, color spectrograms visually provide significant information about biomolecular sequences which facilitates understanding of local nature, structure, and function.
Abstract
Genomics is a highly cross-disciplinary field that creates paradigm shifts in such diverse areas as medicine and agriculture. It is believed that many significant scientific and technological endeavors in the 21st century will be related to the processing and interpretation of the vast information that is currently revealed from sequencing the genomes of many living organisms, including humans. Genomic information is digital in a very real sense; it is represented in the form of sequences of which each element can be one out of a finite number of entities. Such sequences, like DNA and proteins, have been mathematically represented by character strings, in which each character is a letter of an alphabet. In the case of DNA, the alphabet is size 4 and consists of the letters A, T, C and G; in the case of proteins, the size of the corresponding alphabet is 20. As the list of references shows, biomolecular sequence analysis has already been a major research topic among computer scientists, physicists, and mathematicians. The main reason that the field of signal processing does not yet have significant impact in the field is because it deals with numerical sequences rather than character strings. However, if we properly map a character string into, one or more numerical sequences, then digital signal processing (DSP) provides a set of novel and useful tools for solving highly relevant problems. For example, in the form of local texture, color spectrograms visually provide significant information about biomolecular sequences which facilitates understanding of local nature, structure, and function. Furthermore, both the magnitude and the phase of properly defined Fourier transforms can be used to predict important features like the location and certain properties of protein coding regions in DNA. Even the process of mapping DNA into proteins and the interdependence of the two kinds of sequences can be analyzed using simulations based on digital filtering. These and other DSP-based approaches result in alternative mathematical formulations and may provide improved computational techniques for the solution of useful problems in genomic information science and technology.

read more

Citations
More filters
Journal ArticleDOI

The role of signal-processing concepts in genomics and proteomics

TL;DR: The role of digital filtering techniques in gene identification and the topic of long-range correlation between base pairs in DNA sequences, which corresponds to a 1/f type of power spectrum are reviewed.
Journal ArticleDOI

Signal Processing in Sequence Analysis: Advances in Eukaryotic Gene Prediction

TL;DR: A new technique for the recognition of acceptor splice sites is proposed, which combines signal processing-based gene and exon prediction methods with an existing data-driven statistical method, and reveals a consistent reduction in false positives at different levels of sensitivity.

Manipulation, analysis and retrieval systems for audio signals

TL;DR: A general multi-feature audio texture segmentation methodology, feature extraction from mp3 compressed data, automatic beat detection and analysis based on the Discrete Wavelet Transform and musical genre classification combining timbral, rhythmic and harmonic features are described.
Journal ArticleDOI

Understanding Long-range Correlations in DNA Sequences

TL;DR: A review of the literature on statistical long-range correlation in DNA sequences can be found in this paper, where the authors conclude that a mixture of many length scales (including some relatively long ones) is responsible for the observed 1/f-like spectral component.
Journal ArticleDOI

Autoregressive modeling and feature analysis of DNA sequences

TL;DR: A parametric signal processing approach for DNA sequence analysis based on autoregressive (AR) modeling is presented, indicating a high specificity of coding DNA sequences, while AR feature-based analysis helps distinguish between coding and noncoding DNA sequences.
References
More filters
Journal ArticleDOI

Recognition of protein coding regions in DNA sequences

TL;DR: The test has been thoroughly proven on 400,000 bases of sequence data: it misclassifies 5% of the regions tested and gives an answer of "No Opinion" one fifth of the time.
Journal ArticleDOI

Evolution of long-range fractal correlations and 1/f noise in DNA base sequences.

TL;DR: Spectral density measurements of individual base positions demonstrate the ubiquity of low-frequency 1/f β noise and long-range fractal correlations as well as prominent short-range periodicities.
Journal ArticleDOI

Prediction of probable genes by Fourier analysis of genomic sequences

TL;DR: The aim is to use Fourier techniques to analyse this periodicity, and thereby to develop a tool to recognize coding regions in genomic DNA, and find that the relative-height of the peak at f = 1/3 in the Fourier spectrum is a good discriminator of coding potential.
Journal ArticleDOI

Assessment of protein coding measures

TL;DR: This paper reviews and synthesizes the underlying coding measures from published algorithms and concludes that a very simple and obvious measure--counting oligomers--is more effective than any of the more sophisticated measures.
Journal ArticleDOI

Characterizing long-range correlations in DNA sequences from wavelet analysis.

TL;DR: The wavelet transform microscope is demonstrated and quantify the existence of long-range correlations in genes containing introns and noncoding regions, and the fluctuations in the patchy landscapes of DNA walks are found to be homogeneous with Gaussian statistics.
Related Papers (5)