scispace - formally typeset
Search or ask a question
Topic

Genome

About: Genome is a research topic. Over the lifetime, 74231 publications have been published within this topic receiving 3819713 citations.


Papers
More filters
Journal ArticleDOI
27 Nov 1997-Nature
TL;DR: The A. fulgidus genome encodes functionally uncharacterized yet conserved proteins, two-thirds of which are shared with M. jannaschii (428 ORFs), indicating substantial archaeal gene diversity.
Abstract: Archaeoglobus fulgidus is the first sulphur-metabolizing organism to have its genome sequence determined. Its genome of 2,178,400 base pairs contains 2,436 open reading frames (ORFs). The information processing systems and the biosynthetic pathways for essential components (nucleotides, amino acids and cofactors) have extensive correlation with their counterparts in the archaeon Methanococcus jannaschii. The genomes of these two Archaea indicate dramatic differences in the way these organisms sense their environment, perform regulatory and transport functions, and gain energy. In contrast to M. jannaschii, A. fulgidus has fewer restriction-modification systems, and none of its genes appears to contain inteins. A quarter (651 ORFs) of the A. fulgidus genome encodes functionally uncharacterized yet conserved proteins, two-thirds of which are shared with M. jannaschii (428 ORFs). Another quarter of the genome encodes new proteins indicating substantial archaeal gene diversity.

1,394 citations

Journal ArticleDOI
TL;DR: The 3,308,274-bp sequence of the chromosome of Lactobacillus plantarum strain WCFS1, a single colony isolate of strain NCIMB8826 that was originally isolated from human saliva, has been determined, and contains 3,052 predicted protein-encoding genes, suggesting that these genes form a lifestyle adaptation region in the chromosome.
Abstract: The 3,308,274-bp sequence of the chromosome of Lactobacillus plantarum strain WCFS1, a single colony isolate of strain NCIMB8826 that was originally isolated from human saliva, has been determined, and contains 3,052 predicted protein-encoding genes. Putative biological functions could be assigned to 2,120 (70%) of the predicted proteins. Consistent with the classification of L. plantarum as a facultative heterofermentative lactic acid bacterium, the genome encodes all enzymes required for the glycolysis and phosphoketolase pathways, all of which appear to belong to the class of potentially highly expressed genes in this organism, as was evident from the codon-adaptation index of individual genes. Moreover, L. plantarum encodes a large pyruvate-dissipating potential, leading to various end-products of fermentation. L. plantarum is a species that is encountered in many different environmental niches, and this flexible and adaptive behavior is reflected by the relatively large number of regulatory and transport functions, including 25 complete PTS sugar transport systems. Moreover, the chromosome encodes >200 extracellular proteins, many of which are predicted to be bound to the cell envelope. A large proportion of the genes encoding sugar transport and utilization, as well as genes encoding extracellular functions, appear to be clustered in a 600-kb region near the origin of replication. Many of these genes display deviation of nucleotide composition, consistent with a foreign origin. These findings suggest that these genes, which provide an important part of the interaction of L. plantarum with its environment, form a lifestyle adaptation region in the chromosome.

1,392 citations

Journal ArticleDOI
01 Nov 1996-Science
TL;DR: Diagnostic sequencing indicated that a 280-kilobase region containing the maize Adh1-F and u22 genes is composed primarily of retrotransposons inserted within each other, and ten retroelement families were discovered.
Abstract: The relative organization of genes and repetitive DNAs in complex eukaryotic genomes is not well understood. Diagnostic sequencing indicated that a 280-kilobase region containing the maize Adh1-F and u22 genes is composed primarily of retrotransposons inserted within each other. Ten retroelement families were discovered, with reiteration frequencies ranging from 10 to 30,000 copies per haploid genome. These retrotransposons accounted for more than 60 percent of the Adh1-F region and at least 50 percent of the nuclear DNA of maize. These elements were largely intact and are dispersed throughout the gene-containing regions of the maize genome.

1,391 citations

Journal ArticleDOI
TL;DR: The results suggest that strand-slippage theories alone are insufficient to explain microsatellite distribution in the genome as a whole and that taxon-specific variation could also be detected in the frequency distributions of simple sequence motifs.
Abstract: We examined the abundance of microsatellites with repeated unit lengths of 1-6 base pairs in several eukaryotic taxonomic groups: primates, rodents, other mammals, nonmammalian vertebrates, arthropods, Caenorhabditis elegans, plants, yeast, and other fungi. Distribution of simple sequence repeats was compared between exons, introns, and intergenic regions. Tri- and hexanucleotide repeats prevail in protein-coding exons of all taxa, whereas the dependence of repeat abundance on the length of the repeated unit shows a very different pattern as well as taxon-specific variation in intergenic regions and introns. Although it is known that coding and noncoding regions differ significantly in their microsatellite distribution, in addition we could demonstrate characteristic differences between intergenic regions and introns. We observed striking relative abundance of (CCG)(n)*(CGG)(n) trinucleotide repeats in intergenic regions of all vertebrates, in contrast to the almost complete lack of this motif from introns. Taxon-specific variation could also be detected in the frequency distributions of simple sequence motifs. Our results suggest that strand-slippage theories alone are insufficient to explain microsatellite distribution in the genome as a whole. Other possible factors contributing to the observed divergence are discussed.

1,391 citations

Journal ArticleDOI
TL;DR: Simple sequence repeats are a group of repetitive DNA sequences that represent a significant portion of higher eukaryote genomes and can serve as highly informative genetic markers, and in conjunction with the use of polymerase chain reaction technology enable the detection of length variation.

1,388 citations


Network Information
Related Topics (5)
Gene
211.7K papers, 10.3M citations
96% related
Transcription (biology)
56.5K papers, 2.9M citations
92% related
RNA
111.6K papers, 5.4M citations
91% related
Regulation of gene expression
85.4K papers, 5.8M citations
91% related
Gene expression
113.3K papers, 5.5M citations
90% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20242
20237,313
202214,209
20214,955
20205,080
20194,839