Pseudogene

About: Pseudogene is a research topic. To date, 5,528 publications have appeared within this topic, receiving 336,634 citations. The topic is also known as Ψ and pseudogenes.


Papers
Journal Article
01 Jul 1992 - Genomics
TL;DR: The complex patterns of expression of individual cyclin D genes and their evolutionary conservation across species suggest that each family member may play a distinct role in cell cycle progression.

255 citations

Journal Article
TL;DR: Members of phylogenetic subgroups of the class 2 NBS–LRR genes mapped to as many as ten different chromosomes indicate that they were duplicated by many independent genetic events that have occurred continuously through the expansion of the NBS-LRR superfamily and the evolution of the modern rice genome.
Abstract: The availability of the rice genome sequence enabled the global characterization of nucleotide-binding site (NBS)-leucine-rich repeat (LRR) genes, the largest class of plant disease resistance genes. The rice genome carries approximately 500 NBS-LRR genes that are very similar to the non-Toll/interleukin-1 receptor homology region (TIR) class (class 2) genes of Arabidopsis but none that are homologous to the TIR class genes. Over 100 of these genes were predicted to be pseudogenes in the rice cultivar Nipponbare, but some of these are functional in other rice lines. Over 80 other NBS-encoding genes were identified that belonged to four different classes, only two of which are present in dicotyledonous plant sequences present in databases. Map positions of the identified genes show that these genes occur in clusters, many of which included members from distantly related groups. Members of phylogenetic subgroups of the class 2 NBS-LRR genes mapped to as many as ten different chromosomes. The patterns of duplication of the NBS-LRR genes indicate that they were duplicated by many independent genetic events that have occurred continuously through the expansion of the NBS-LRR superfamily and the evolution of the modern rice genome. Genetic events, such as inversions, that inhibit the ability of recently duplicated genes to recombine promote the divergence of their sequences by inhibiting concerted evolution.

255 citations

Journal Article
TL;DR: It is found that deletions are about three times more common than insertions, and the frequencies of both these events follow characteristic power-law behavior associated with the size of the indel, but unexpectedly, the frequency of 3 bp deletions violates this trend.
Abstract: Nucleotide substitution, insertion and deletion (indel) events are the major driving forces that have shaped genomes. Using the recently identified human ribosomal protein (RP) pseudogene sequences, we have thoroughly studied DNA mutation patterns in the human genome. We analyzed a total of 1726 processed RP pseudogene sequences, comprising more than 700 000 bases. To be sure to differentiate the sequence changes occurring in the functional genes during evolution from those occurring in pseudogenes after they were fixed in the genome, we used only pseudogene sequences originating from parts of RP genes that are identical in human and mouse. Overall, we found that nucleotide transitions are more common than transversions, by roughly a factor of two. Moreover, the substitution rates amongst the 12 possible nucleotide pairs are not homogeneous as they are affected by the type of immediately neighboring nucleotides and the overall local G+C content. Finally, our dataset is large enough that it has many indels, thus allowing for the first time statistically robust analysis of these events. Overall, we found that deletions are about three times more common than insertions (3740 versus 1291). The frequencies of both these events follow characteristic power-law behavior associated with the size of the indel. However, unexpectedly, the frequency of 3 bp deletions (in contrast to 3 bp insertions) violates this trend, being considerably higher than that of 2 bp deletions. The possible biological implications of such a 3 bp bias are discussed.

254 citations
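
The substitution and indel comparison in the ribosomal protein pseudogene abstract above can be illustrated with a minimal sketch. It classifies each mismatch between two aligned sequences as a transition (purine to purine, or pyrimidine to pyrimidine) or a transversion, and recomputes the deletion-to-insertion ratio from the two counts the abstract reports. The function names and the toy sequences are hypothetical and are not taken from the paper; the authors' actual pipeline is not described here.

```python
from collections import Counter

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitution(ref, alt):
    """Return 'transition' or 'transversion' for a single-base change."""
    if ref == alt:
        raise ValueError("ref and alt are identical; not a substitution")
    pair = {ref, alt}
    same_class = pair <= PURINES or pair <= PYRIMIDINES
    return "transition" if same_class else "transversion"


def count_substitutions(gene, pseudogene):
    """Count transitions and transversions between two aligned sequences.

    Assumes the sequences are already aligned and of equal length.
    Columns containing gaps or ambiguous bases are skipped.
    """
    if len(gene) != len(pseudogene):
        raise ValueError("aligned sequences must have equal length")
    counts = Counter()
    for g, p in zip(gene.upper(), pseudogene.upper()):
        if g in "ACGT" and p in "ACGT" and g != p:
            counts[classify_substitution(g, p)] += 1
    return counts


# Toy aligned pair (hypothetical data, for illustration only).
toy_gene = "ATGGCCAAGT"
toy_pseudogene = "ATAGCTACGC"
toy_counts = count_substitutions(toy_gene, toy_pseudogene)

# Indel counts quoted in the abstract: 3740 deletions versus 1291 insertions.
deletion_to_insertion = 3740 / 1291
```

On real data the same counting would be run over every gene and pseudogene alignment, and the pooled counts compared; the ratio of the two reported indel counts comes out just under three, consistent with the abstract's "about three times".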

Journal Article
TL;DR: The genome sequence of A. salmonicida was determined to provide a better understanding of the virulence factors used by this pathogen to infect fish and provide insights into the mechanisms used by the bacterium for infection and avoidance of host defence systems.
Abstract: Aeromonas salmonicida subsp. salmonicida is a Gram-negative bacterium that is the causative agent of furunculosis, a bacterial septicaemia of salmonid fish. While other species of Aeromonas are opportunistic pathogens or are found in commensal or symbiotic relationships with animal hosts, A. salmonicida subsp. salmonicida causes disease in healthy fish. The genome sequence of A. salmonicida was determined to provide a better understanding of the virulence factors used by this pathogen to infect fish. The nucleotide sequences of the A. salmonicida subsp. salmonicida A449 chromosome and two large plasmids are characterized. The chromosome is 4,702,402 bp and encodes 4388 genes, while the two large plasmids are 166,749 and 155,098 bp with 178 and 164 genes, respectively. Notable features are a large inversion in the chromosome and, in one of the large plasmids, the presence of a Tn21 composite transposon containing mercury resistance genes and an In2 integron encoding genes for resistance to streptomycin/spectinomycin, quaternary ammonia compounds, sulphonamides and chloramphenicol. A large number of genes encoding potential virulence factors were identified; however, many appear to be pseudogenes since they contain insertion sequences, frameshifts or in-frame stop codons. A total of 170 pseudogenes and 88 insertion sequences (of ten different types) are found in the A. salmonicida genome. Comparison with the A. hydrophila ATCC 7966T genome reveals multiple large inversions in the chromosome as well as an approximately 9% difference in gene content indicating instances of single gene or operon loss or gain. A limited number of the pseudogenes found in A. salmonicida A449 were investigated in other Aeromonas strains and species. While nearly all the pseudogenes tested are present in A. salmonicida subsp. salmonicida strains, only about 25% were found in other A. salmonicida subspecies and none were detected in other Aeromonas species. Relative to the A. hydrophila ATCC 7966T genome, the A. salmonicida subsp. salmonicida genome has acquired multiple mobile genetic elements, undergone substantial rearrangement and developed a significant number of pseudogenes. These changes appear to be a consequence of adaptation to a specific host, salmonid fish, and provide insights into the mechanisms used by the bacterium for infection and avoidance of host defence systems.

253 citations

Journal Article
TL;DR: All intergenic regions in the human genome are screened with a combination of homology searches and a functionality test using the ratio of silent to replacement nucleotide substitutions (KA/KS), and nonprocessed pseudogenes appear to be enriched in regions with high gene density.
Abstract: We screened all intergenic regions in the human genome to identify pseudogenes with a combination of homology searches and a functionality test using the ratio of silent to replacement nucleotide substitutions (KA/KS). We identified 19,724 regions of which 95% ± 3% are estimated to evolve neutrally and thus are likely to encode pseudogenes. Half of these have no detectable truncation in their pseudocoding regions and therefore are not identifiable by methods that require the presence of truncations to prove nonfunctionality. A comparative analysis with the mouse genome showed that 70% of these pseudogenes have a retrotranspositional origin (processed), and the rest arose by segmental duplication (nonprocessed). Although the spread of both types of pseudogenes correlates with chromosome size, nonprocessed pseudogenes appear to be enriched in regions with high gene density. It is likely that the human pseudogenes identified here represent only a small fraction of the total, which probably exceeds the number of genes.

253 citations
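
The KA/KS functionality test mentioned in the abstract above can be sketched in a few lines. By the usual convention, KA is the rate of replacement (nonsynonymous) substitutions per nonsynonymous site and KS is the rate of silent (synonymous) substitutions per synonymous site; a ratio near 1 is what neutral evolution predicts, while functional coding sequences under purifying selection typically show a ratio well below 1. The sketch below is a simplified, uncorrected counting scheme in the spirit of the Nei-Gojobori method, not the procedure the authors used: it assumes a codon-aligned, in-frame pair of sequences, applies no multiple-hit correction, and ignores codons that differ at more than one position. All function names and the toy sequences are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): AMINO[i] for i, c in enumerate(product(BASES, repeat=3))}


def synonymous_sites(codon):
    """Number of synonymous sites in a codon (between 0 and 3).

    Each position contributes the fraction of its three possible
    single-base changes that leave the encoded amino acid unchanged.
    """
    total = 0.0
    for pos in range(3):
        syn = 0
        for base in BASES:
            if base == codon[pos]:
                continue
            mutant = codon[:pos] + base + codon[pos + 1:]
            if CODON_TABLE[mutant] == CODON_TABLE[codon]:
                syn += 1
        total += syn / 3
    return total


def uncorrected_rates(seq_a, seq_b):
    """Return (p_n, p_s): uncorrected nonsynonymous and synonymous
    differences per site for two codon-aligned sequences.

    Simplifications (assumptions of this sketch): no multiple-hit
    correction, and codons differing at two or three positions are
    counted toward the sites but not toward the differences.
    """
    syn_sites = nonsyn_sites = 0.0
    syn_diffs = nonsyn_diffs = 0
    usable = min(len(seq_a), len(seq_b))
    for i in range(0, usable - 2, 3):
        ca, cb = seq_a[i:i + 3].upper(), seq_b[i:i + 3].upper()
        if ca not in CODON_TABLE or cb not in CODON_TABLE:
            continue  # skip codons with gaps or ambiguous bases
        s = (synonymous_sites(ca) + synonymous_sites(cb)) / 2
        syn_sites += s
        nonsyn_sites += 3 - s
        if sum(x != y for x, y in zip(ca, cb)) == 1:
            if CODON_TABLE[ca] == CODON_TABLE[cb]:
                syn_diffs += 1
            else:
                nonsyn_diffs += 1
    if syn_sites == 0 or nonsyn_sites == 0:
        raise ValueError("not enough usable codons")
    return nonsyn_diffs / nonsyn_sites, syn_diffs / syn_sites


# Toy codon-aligned pair (hypothetical data, for illustration only).
p_n, p_s = uncorrected_rates("ATGGCTAAA", "ATGGCCAGA")
ratio = p_n / p_s if p_s > 0 else None
```

A production analysis would use an established implementation with a proper substitution model and a statistical test against the neutral expectation; this sketch only shows what the ratio is measuring.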


Network Information
Related Topics (5)
Gene: 211.7K papers, 10.3M citations, 95% related
Genome: 74.2K papers, 3.8M citations, 93% related
Regulation of gene expression: 85.4K papers, 5.8M citations, 91% related
Gene expression: 113.3K papers, 5.5M citations, 90% related
Transcription factor: 82.8K papers, 5.4M citations, 89% related
Performance Metrics
No. of papers in the topic in previous years
Year    Papers
2023    120
2022    250
2021    123
2020    160
2019    119
2018    127