Topic
Genome
About: Genome is a research topic. Over the lifetime, 74231 publications have been published within this topic receiving 3819713 citations.
Papers published on a yearly basis
Papers
More filters
••
University of Washington1, Institute for Systems Biology2, University of Massachusetts Amherst3, University of Bath4, Northern Illinois University5, University of Wisconsin-Madison6, University of Pennsylvania7, University of Texas Health Science Center at Houston8, Ohio State University9, University of British Columbia10, Stanford University11, Marine Biological Laboratory12
TL;DR: Analysis of the genome sequence shows the presence of pathways for uptake and utilization of amino acids, active sodium-proton antiporter and potassium uptake systems, sophisticated photosensory and signal transduction pathways, and DNA replication, transcription, and translation systems resembling more complex eukaryotic organisms.
Abstract: We report the complete sequence of an extreme halophile, Halobacterium sp. NRC-1, harboring a dynamic 2,571,010-bp genome containing 91 insertion sequences representing 12 families and organized into a large chromosome and 2 related minichromosomes. The Halobacterium NRC-1 genome codes for 2,630 predicted proteins, 36% of which are unrelated to any previously reported. Analysis of the genome sequence shows the presence of pathways for uptake and utilization of amino acids, active sodium-proton antiporter and potassium uptake systems, sophisticated photosensory and signal transduction pathways, and DNA replication, transcription, and translation systems resembling more complex eukaryotic organisms. Whole proteome comparisons show the definite archaeal nature of this halophile with additional similarities to the Gram-positive Bacillus subtilis and other bacteria. The ease of culturing Halobacterium and the availability of methods for its genetic manipulation in the laboratory, including construction of gene knockouts and replacements, indicate this halophile can serve as an excellent model system among the archaea.
690 citations
••
University of California, Davis1, Yale University2, Laval University3, Joint Genome Institute4, Centre national de la recherche scientifique5, École normale supérieure de Cachan6, Wayne State University7, University of Georgia8, University of Udine9, Wellcome Trust Sanger Institute10, University of California, Santa Cruz11, University of Notre Dame12, European Bioinformatics Institute13, Duke University14, Baylor College of Medicine15, Broad Institute16, University of Washington17, University of Maryland, College Park18, University of California, Berkeley19, University of Lisbon20, Howard Hughes Medical Institute21, University of California, San Francisco22, Cold Spring Harbor Laboratory23, Royal Institute of Technology24
TL;DR: The Assemblathon 2 as mentioned in this paper presented a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and a snake) from 21 participating teams.
Abstract: Background - The process of generating raw genome sequence data continues to become cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished genome sequences remains challenging. Many genome assembly tools are available, but they differ greatly in terms of their performance (speed, scalability, hardware requirements, acceptance of newer read technologies) and in their final output (composition of assembled sequence). More importantly, it remains largely unclear how to best assess the quality of assembled genome sequences. The Assemblathon competitions are intended to assess current state-of-the-art methods in genome assembly. Results - In Assemblathon 2, we provided a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and snake). This resulted in a total of 43 submitted assemblies from 21 participating teams. We evaluated these assemblies using a combination of optical map data, Fosmid sequences, and several statistical methods. From over 100 different metrics, we chose ten key measures by which to assess the overall quality of the assemblies. Conclusions - Many current genome assemblers produced useful assemblies, containing a significant representation of their genes, regulatory sequences, and overall genome structure. However, the high degree of variability between the entries suggests that there is still much room for improvement in the field of genome assembly and that approaches which work well in assembling the genome of one species may not necessarily work well for another.
690 citations
••
TL;DR: This draft genome sequence provides insight into the environmental adaptation of bread wheat and can aid in defining the large and complicated genomes of wheat species.
Abstract: About 8,000 years ago in the Fertile Crescent, a spontaneous hybridization of the wild diploid grass Aegilops tauschii (2n = 14; DD) with the cultivated tetraploid wheat Triticum turgidum (2n = 4x = 28; AABB) resulted in hexaploid wheat (T. aestivum; 2n = 6x = 42; AABBDD). Wheat has since become a primary staple crop worldwide as a result of its enhanced adaptability to a wide range of climates and improved grain quality for the production of baker's flour. Here we describe sequencing the Ae. tauschii genome and obtaining a roughly 90-fold depth of short reads from libraries with various insert sizes, to gain a better understanding of this genetically complex plant. The assembled scaffolds represented 83.4% of the genome, of which 65.9% comprised transposable elements. We generated comprehensive RNA-Seq data and used it to identify 43,150 protein-coding genes, of which 30,697 (71.1%) were uniquely anchored to chromosomes with an integrated high-density genetic map. Whole-genome analysis revealed gene family expansion in Ae. tauschii of agronomically relevant gene families that were associated with disease resistance, abiotic stress tolerance and grain quality. This draft genome sequence provides insight into the environmental adaptation of bread wheat and can aid in defining the large and complicated genomes of wheat species.
689 citations
••
TL;DR: The complete sequence of the genome of a hyper-thermophilic archaebacterium, Pyrococcus horikoshii OT3, has been determined by assembling the sequences of the physical map-based contigs of fosmid clones and of long polymerase chain reaction (PCR) products which were used for gap-filling.
Abstract: The complete sequence of the genome of a hyper-thermophilic archaebacterium, Pyrococcus horikoshii OT3, has been determined by assembling the sequences of the physical map-based contigs of fosmid clones and of long polymerase chain reaction (PCR) products which were used for gap-filling. The entire length of the genome was 1,738,505 bp. The authenticity of the entire genome sequence was supported by restriction analysis of long PCR products, which were directly amplified from the genomic DNA. As the potential protein-coding regions, a total of 2061 open reading frames (ORFs) were assigned, and by similarity search against public databases, 406 (19.7%) were related to genes with putative function and 453 (22.0%) to the sequences registered but with unknown function. The remaining 1202 ORFs (58.3%) did not show any significant similarity to the sequences in the databases. Sequence comparison among the assigned ORFs in the genome provided evidence that a considerable number of ORFs were generated by sequence duplication. By similarity search, 11 ORFs were assumed to contain the intein elements. The RNA genes identified were a single 16S-23S rRNA operon, two 5S rRNA genes and 46 tRNA genes including two with the intron structure. All the assigned ORFs and RNA coding regions occupied 91.25% of the whole genome. The data presented in this paper are available on the internet at http://www.nite.go.jp.
689 citations
••
TL;DR: Although the DNA sequence recognized by Rap1 is found in both coding and intergenic sequences, the binding of Rap1 to the genome was highly specific to intergenic regions with the potential to act as promoters, indicating the existence of a genome-wide molecular mechanism for marking promoter regions.
Abstract: We determined the distribution of repressor-activator protein 1 (Rap1) and the accessory silencing proteins Sir2, Sir3 and Sir4 in vivo on the entire yeast genome, at a resolution of 2 kb. Rap1 is central to the cellular economy during rapid growth, targeting 294 loci, about 5% of yeast genes, and participating in the activation of 37% of all RNA polymerase II initiation events in exponentially growing cells. Although the DNA sequence recognized by Rap1 is found in both coding and intergenic sequences, the binding of Rap1 to the genome was highly specific to intergenic regions with the potential to act as promoters. This global phenomenon, which may be a general characteristic of sequence-specific transcriptional factors, indicates the existence of a genome-wide molecular mechanism for marking promoter regions.
689 citations