Prediction of protein coding regions by the 3-base periodicity analysis of a DNA sequence.
Citations
75 citations
Cites background or methods from "Prediction of protein coding region..."
...Yin and Yau (2007) also used the SNR....
[...]
...Other techniques used other tools, but the goal is the same: analyzing the 3-base periodicity of DNA sequences to differentiate between coding and non-coding regions (Yin and Yau, 2007; Mena-Chalco et al., 2008; Ma and Zhu, 2007; Kahumani et al., 2008)....
[...]
...In general, compared with EPND (Yin and Yau, 2007), the threshold in this technique is more accurate since it is calculated based on the sequence to be predicted....
[...]
...Yin and Yau (2007) used the nucleotide distributions to compute PS(N/3) of a DNA sequence accumulatively....
[...]
...Other DSP-based methods that measure the 3-base periodicity without computing the DFT sometimes do not use a sliding window in the analysis of DNA sequences (Yin and Yau, 2007; Mena-Chalco et al., 2008)....
[...]
72 citations
71 citations
Cites methods from "Prediction of protein coding region..."
...patterns of that sequence, and it has been applied to identify protein coding regions in genomic sequences (Fukushima et al., 2002; Yin and Yau, 2005, 2007)....
[...]
65 citations
Cites background from "Prediction of protein coding region..."
...[39] where the authors also study the relationship between the relative abundance of the nucleotides and the period-3 property....
[...]
61 citations
Cites methods from "Prediction of protein coding region..."
...With this characteristic, DFT has been used in numerous areas of DNA research, such as gene prediction [22], protein coding region identification [23], and periodicity analysis [24]....
[...]
References
3,709 citations
"Prediction of protein coding region..." refers methods in this paper
...The method computes the 3-base periodicity and the background noise of the stepwise DNA segments of the target DNA sequences using nucleotide distributions in the three codon positions of the DNA sequences....
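A minimal sketch of how such a stepwise computation could look, assuming the segments are prefixes of the sequence whose lengths grow in multiples of 3, and assuming the background level is taken as the segment length (the mean of the power spectrum over all frequencies when every position is A, T, C, or G). The names `ps_third` and `stepwise_snr` and the `step` parameter are hypothetical and are not taken from the cited paper.

```python
from typing import Dict, List, Tuple


def ps_third(counts: Dict[str, List[int]]) -> int:
    """Power spectrum at frequency N/3 from codon-position counts.

    counts[x] holds the number of occurrences of nucleotide x at
    codon positions 1, 2, and 3 of the segment.
    """
    total = 0
    for f1, f2, f3 in counts.values():
        total += f1 * f1 + f2 * f2 + f3 * f3 - f1 * f2 - f1 * f3 - f2 * f3
    return total


def stepwise_snr(seq: str, step: int = 3) -> List[Tuple[int, float]]:
    """Return (prefix length, PS(N/3) / N) for growing prefixes of seq.

    Assumption: the background level is the prefix length N itself.
    The closed form in ps_third equals the DFT value only when the
    prefix length is a multiple of 3, so step must be a multiple of 3.
    """
    if step <= 0 or step % 3 != 0:
        raise ValueError("step must be a positive multiple of 3")
    counts = {base: [0, 0, 0] for base in "ATCG"}
    results = []
    for n, base in enumerate(seq.upper()):
        if base in counts:  # characters other than A, T, C, G are skipped
            counts[base][n % 3] += 1
        length = n + 1
        if length % step == 0:
            results.append((length, ps_third(counts) / length))
    return results
```

Because the counts are updated one nucleotide at a time, each new prefix costs constant extra work and no DFT is recomputed. A perfectly periodic toy input such as `"ATG" * 4` gives a ratio that grows with the prefix length, while a homopolymer such as `"AAAAAA"` gives zero.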
[...]
...For example, the GenScan algorithm (Burge and Karlin, 1997) measured distinct statistical features of exons and introns within genomes and used them for prediction via a hidden Markov model (HMM); the MZFF method (Zhang, 1997) was developed for predicting protein coding regions using quadratic discriminant…...
[...]
875 citations
"Prediction of protein coding region..." refers background in this paper
...Keywords: Exon; Intron; 3-Base periodicity; Fourier transform...
[...]
.../3 at coding regions were addressed by Fickett (Fickett, 1982; Fickett and Tung, 1992)....
[...]
...It was demonstrated that the 3-base periodicity in a DNA sequence is partly caused by the unbalanced nucleotide distributions in the three coding positions in the sequence (Fickett, 1982; Fickett and Tung, 1992; Tiwari et al., 1997; Yin and Yau, 2005)....
[...]
...The 3-base periodicity magnitude and background noise can be directly computed from the nucleotide distributions (Fickett and Tung, 1992; Yin and Yau, 2005)....
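The closed form behind this statement can be sketched as follows, assuming N is a multiple of 3, positions are numbered from 0, every position holds one of A, T, C, or G, and F_{x,j} denotes the count of nucleotide x at codon position j. These symbols are introduced here for illustration and are not taken from the excerpt. The last line treats the mean of the spectrum over all N frequencies as the background level, which is one possible reading of "background noise" and not necessarily the cited authors' definition.

```latex
\begin{align*}
U_x\!\left(\tfrac{N}{3}\right)
  &= \sum_{n=0}^{N-1} u_x(n)\, e^{-2\pi i n/3}
   = F_{x,1} + F_{x,2}\,\omega + F_{x,3}\,\omega^{2},
   \qquad \omega = e^{-2\pi i/3},\\
\left|U_x\!\left(\tfrac{N}{3}\right)\right|^{2}
  &= F_{x,1}^{2} + F_{x,2}^{2} + F_{x,3}^{2}
   - F_{x,1}F_{x,2} - F_{x,1}F_{x,3} - F_{x,2}F_{x,3},\\
PS\!\left(\tfrac{N}{3}\right)
  &= \sum_{x \in \{A,T,C,G\}} \left|U_x\!\left(\tfrac{N}{3}\right)\right|^{2},\\
\frac{1}{N}\sum_{k=0}^{N-1} PS(k)
  &= \frac{1}{N}\sum_{x} N \sum_{n=0}^{N-1} u_x(n)^{2}
   = \frac{1}{N}\cdot N \cdot N = N
   \quad \text{(Parseval)}.
\end{align*}
```

The cross terms in the second line carry a coefficient of -1 because 2cos(2π/3) = -1. When the three counts for every nucleotide are equal, the expression vanishes, which is why balanced nucleotide distributions across codon positions produce no peak at N/3.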
[...]
...During the last two decades, a variety of computational algorithms have been developed to predict exons (for reviews, Fickett and Tung, 1992; Fickett, 1996; Zhang, 2002; Mathé et al., 2002)....
[...]
848 citations
"Prediction of protein coding region..." refers background in this paper
...A symbolic DNA sequence, denoted as x(0), x(1), ..., x(N − 1), is first converted to four binary indicator sequences, u_A(n), u_T(n), u_C(n), and u_G(n), which indicate the presence or absence of the four nucleotides A, T, C, and G at the nth position, respectively (Voss, 1992; Tiwari et al., 1997; Anastassiou, 2000)....
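The conversion described in this excerpt can be sketched in a few lines of Python. The function name `indicator_sequences` is hypothetical, and treating any character other than A, T, C, or G as absent from all four sequences is an assumption made here, not something the excerpt specifies.

```python
from typing import Dict, List


def indicator_sequences(seq: str) -> Dict[str, List[int]]:
    """Map a DNA string to four binary indicator sequences.

    The list for nucleotide x has a 1 at position n when seq[n] == x
    and a 0 otherwise. Characters other than A, T, C, G yield 0 in
    all four lists (an assumption of this sketch).
    """
    seq = seq.upper()
    return {base: [1 if s == base else 0 for s in seq] for base in "ATCG"}
```

For the toy input `"ATCGA"`, the A sequence has ones at positions 0 and 4, and at every position exactly one of the four sequences is 1.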
[...]
...Tiwari et al. (1997) explored the measure of spectral content (SC) in DNA sequences based on the fact that the 3-base periodicity, identified as a pronounced peak at the frequency N/3 of the Fourier power spectrum of the DNA sequences (N is the length of the DNA sequence), is prevalent in most protein coding regions, but does not exist in noncoding regions (Tsonis et al., 1991; Voss, 1992; Chechetkin and Turygin, 1995; Dodin et al., 2000)....
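To illustrate what a peak at frequency N/3 means, the sketch below evaluates the Fourier power spectrum of a sequence directly from its definition, summing the squared DFT magnitudes of the four indicator sequences. The function name `power_spectrum` is hypothetical, the direct O(N) sum per frequency is chosen for readability rather than speed, and the repeated "ATG" string mentioned afterwards is a toy input, not real genomic data.

```python
import cmath


def power_spectrum(seq: str, k: int) -> float:
    """Total power at frequency index k, summed over A, T, C, and G.

    For each nucleotide x this evaluates
        U_x(k) = sum over n of u_x(n) * exp(-2*pi*i*k*n / N)
    where u_x(n) is 1 if seq[n] == x and 0 otherwise, then adds up
    |U_x(k)|**2 over the four nucleotides.
    """
    seq = seq.upper()
    n_total = len(seq)
    total = 0.0
    for base in "ATCG":
        u_k = sum(
            cmath.exp(-2j * cmath.pi * k * n / n_total)
            for n, s in enumerate(seq)
            if s == base
        )
        total += abs(u_k) ** 2
    return total
```

For `"ATG" * 10` (N = 30), the power at k = N/3 = 10 is 300 up to rounding error, while the power at k = 1 is essentially zero, which is the kind of isolated peak the excerpt describes for coding regions.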
[...]
749 citations
"Prediction of protein coding region..." refers background in this paper
.../3 of the Fourier power spectrum of the DNA sequences (N is the length of the DNA sequence), is prevalent in most protein coding regions, but does not exist in noncoding regions (Tsonis et al., 1991; Voss, 1992; Chechetkin and Turygin, 1995; Dodin et al., 2000)....
[...]
478 citations
"Prediction of protein coding region..." refers background in this paper
...During the last two decades, a variety of computational algorithms have been developed to predict exons (for reviews, Fickett and Tung, 1992; Fickett, 1996; Zhang, 2002; Mathé et al., 2002)....
[...]