scispace - formally typeset
Open AccessJournal ArticleDOI

Hierarchical structure of cascade of primary and secondary periodicities in Fourier power spectrum of alphoid higher order repeats.

TLDR
DFT provides a robust detection method for higher order periodicity and is robust with respect to monomer insertions and deletions, random sequence insertions etc.
Abstract
Background Identification of approximate tandem repeats is an important task of broad significance and still remains a challenging problem of computational genomics. Often there is no single best approach to periodicity detection and a combination of different methods may improve the prediction accuracy. Discrete Fourier transform (DFT) has been extensively used to study primary periodicities in DNA sequences. Here we investigate the application of DFT method to identify and study alphoid higher order repeats.

read more

Content maybe subject to copyright    Report

Citations
More filters
BookDOI

Data Mining Techniques for the Life Sciences

TL;DR: "Data Mining Techniques for the Life Sciences" seeks to aid students and researchers in the life sciences who wish to get a condensed introduction into the vital world of biological databases and their many applications.
Journal ArticleDOI

Understanding Long-range Correlations in DNA Sequences

TL;DR: A review of the literature on statistical long-range correlation in DNA sequences can be found in this paper, where the authors conclude that a mixture of many length scales (including some relatively long ones) is responsible for the observed 1/f-like spectral component.

Measure representation and multifractal analysis of complete genomes

TL;DR: Spectral analyses performed indicate that these measure representations, considered as time series, exhibit strong long-range correlation and the multifractal property of the measure representation and the classification of bacteria.
Journal ArticleDOI

Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm

TL;DR: This work presents several case studies of GRM use, and presents the use of complete set of a K-string ensemble which enables a new method of direct mapping of symbolic DNA sequence into frequency domain, with straightforward identification of repeats as peaks in GRM diagram.
Journal ArticleDOI

Coexistence of different base periodicities in prokaryotic genomes as related to DNA curvature, supercoiling, and transcription.

TL;DR: The comparison with available experimental data indicates that promoters with the most pronounced periodicities may be related to the supercoiling-sensitive genes.
References
More filters
Journal ArticleDOI

Fourier and Wavelet Transform Analysis, a Tool for Visualizing Regular Patterns in DNA Sequences

TL;DR: A correlation function that compares each base in a DNA sequence to its various neighbours and which is subsequently processed by Fourier and wavelet transforms has been developed and permits to readily visualize regular features in DNA which are related to the stability of heteroduplexes formed upon strand slippage.
Journal ArticleDOI

Chromosome-specific subfamilies within human alphoid repetitive DNA.

TL;DR: The expected presence of only one or a few distinct subfamilies on individual chromosomes is supported by the study of the nucleotide sequence of 17 cloned fragments of alphoid repetitive DNA from chromosome 7, which all contain the characteristic pattern of 36 common nucleotide changes that defines one of the subfam families described.
Journal ArticleDOI

What Is the Centromere

Journal ArticleDOI

Measure representation and multifractal analysis of complete genomes

TL;DR: In this paper, a measure representation of DNA sequences is proposed and spectral analysis and multifractal analysis are performed on the measure representations of a large number of complete genomes, and it is concluded that these complete genomes are not random sequences.
Journal ArticleDOI

Finite sample effects in sequence analysis

TL;DR: In this paper, it is shown that entropy calculations are seriously affected by systematic errors due to the finite size of the samples, and that these difficulties can be dealt with by assuming simple probability distributions underlying the generating process (e.g. equidistribution, power-law distribution, exponential distribution).
Related Papers (5)