scispace - formally typeset
Search or ask a question
Topic

Genome

About: Genome is a research topic. Over the lifetime, 74231 publications have been published within this topic receiving 3819713 citations.


Papers
More filters
Journal Article•DOI•
Gerald A. Tuskan1, Gerald A. Tuskan2, Stephen P. DiFazio2, Stephen P. DiFazio3, Stefan Jansson4, Joerg Bohlmann5, Igor V. Grigoriev6, Uffe Hellsten6, Nicholas H. Putnam6, Steven G. Ralph5, Stephane Rombauts7, Asaf Salamov6, Jacquie Schein, Lieven Sterck7, Andrea Aerts6, Rishikeshi Bhalerao4, Rishikesh P. Bhalerao8, Damien Blaudez9, Wout Boerjan7, Annick Brun9, Amy M. Brunner10, Victor Busov11, Malcolm M. Campbell12, John E. Carlson13, Michel Chalot9, Jarrod Chapman6, G.-L. Chen2, Dawn Cooper5, Pedro M. Coutinho14, Jérémy Couturier9, Sarah F. Covert15, Quentin C. B. Cronk5, R. Cunningham2, John M. Davis16, Sven Degroeve7, Annabelle Déjardin9, Claude W. dePamphilis13, John C. Detter6, Bill Dirks17, Inna Dubchak18, Inna Dubchak6, Sébastien Duplessis9, Jürgen Ehlting5, Brian E. Ellis5, Karla C Gendler19, David Goodstein6, Michael Gribskov20, Jane Grimwood21, Andrew Groover22, Lee E. Gunter2, Björn Hamberger5, Berthold Heinze, Yrjö Helariutta23, Yrjö Helariutta8, Yrjö Helariutta24, Bernard Henrissat14, D. Holligan15, Robert A. Holt, Wenyu Huang6, N. Islam-Faridi22, Steven J.M. Jones, M. Jones-Rhoades25, Richard A. Jorgensen19, Chandrashekhar P. Joshi11, Jaakko Kangasjärvi23, Jan Karlsson4, Colin T. Kelleher5, Robert Kirkpatrick, Matias Kirst16, Annegret Kohler9, Udaya C. Kalluri2, Frank W. Larimer2, Jim Leebens-Mack15, Jean-Charles Leplé9, Philip F. LoCascio2, Y. Lou6, Susan Lucas6, Francis Martin9, Barbara Montanini9, Carolyn A. Napoli19, David R. Nelson26, C D Nelson22, Kaisa Nieminen23, Ove Nilsson8, V. Pereda9, Gary F. Peter16, Ryan N. Philippe5, Gilles Pilate9, Alexander Poliakov18, J. Razumovskaya2, Paul G. Richardson6, Cécile Rinaldi9, Kermit Ritland5, Pierre Rouzé7, D. Ryaboy18, Jeremy Schmutz21, J. Schrader27, Bo Segerman4, H. Shin, Asim Siddiqui, Fredrik Sterky, Astrid Terry6, Chung-Jui Tsai11, Edward C. Uberbacher2, Per Unneberg, Jorma Vahala23, Kerr Wall13, Susan R. Wessler15, Guojun Yang15, T. Yin2, Carl J. Douglas5, Marco A. Marra, Göran Sandberg8, Y. Van de Peer7, Daniel S. Rokhsar17, Daniel S. Rokhsar6 •
15 Sep 2006-Science
TL;DR: The draft genome of the black cottonwood tree, Populus trichocarpa, has been reported in this paper, with more than 45,000 putative protein-coding genes identified.
Abstract: We report the draft genome of the black cottonwood tree, Populus trichocarpa. Integration of shotgun sequence assembly with genetic mapping enabled chromosome-scale reconstruction of the genome. More than 45,000 putative protein-coding genes were identified. Analysis of the assembled genome revealed a whole-genome duplication event; about 8000 pairs of duplicated genes from that event survived in the Populus genome. A second, older duplication event is indistinguishably coincident with the divergence of the Populus and Arabidopsis lineages. Nucleotide substitution, tandem gene duplication, and gross chromosomal rearrangement appear to proceed substantially more slowly in Populus than in Arabidopsis. Populus has more protein-coding genes than Arabidopsis, ranging on average from 1.4 to 1.6 putative Populus homologs for each Arabidopsis gene. However, the relative frequency of protein domains in the two genomes is similar. Overrepresented exceptions in Populus include genes associated with lignocellulosic wall biosynthesis, meristem development, disease resistance, and metabolite transport.

4,025 citations

Journal Article•DOI•
21 Oct 2004-Nature
TL;DR: The current human genome sequence (Build 35) as discussed by the authors contains 2.85 billion nucleotides interrupted by only 341 gaps and is accurate to an error rate of approximately 1 event per 100,000 bases.
Abstract: The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers approximately 99% of the euchromatic genome and is accurate to an error rate of approximately 1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human genome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead.

3,989 citations

Journal Article•DOI•
TL;DR: The new NCBI's Prokaryotic Genome Annotation Pipeline (PGAP) relies less on sequence similarity when confident comparative data are available, while it relies more on statistical predictions in the absence of external evidence.
Abstract: Recent technological advances have opened unprecedented opportunities for large-scale sequencing and analysis of populations of pathogenic species in disease outbreaks, as well as for large-scale diversity studies aimed at expanding our knowledge across the whole domain of prokaryotes. To meet the challenge of timely interpretation of structure, function and meaning of this vast genetic information, a comprehensive approach to automatic genome annotation is critically needed. In collaboration with Georgia Tech, NCBI has developed a new approach to genome annotation that combines alignment based methods with methods of predicting protein-coding and RNA genes and other functional elements directly from sequence. A new gene finding tool, GeneMarkS+, uses the combined evidence of protein and RNA placement by homology as an initial map of annotation to generate and modify ab initio gene predictions across the whole genome. Thus, the new NCBI's Prokaryotic Genome Annotation Pipeline (PGAP) relies more on sequence similarity when confident comparative data are available, while it relies more on statistical predictions in the absence of external evidence. The pipeline provides a framework for generation and analysis of annotation on the full breadth of prokaryotic taxonomy. For additional information on PGAP see https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ and the NCBI Handbook, https://www.ncbi.nlm.nih.gov/books/NBK174280/.

3,902 citations

Journal Article•DOI•
17 May 2001-Nature
TL;DR: This review summarizes the main DNA caretaking systems and their impact on genome stability and carcinogenesis.
Abstract: The early notion that cancer is caused by mutations in genes critical for the control of cell growth implied that genome stability is important for preventing oncogenesis. During the past decade, knowledge about the mechanisms by which genes erode and the molecular machinery designed to counteract this time-dependent genetic degeneration has increased markedly. At the same time, it has become apparent that inherited or acquired deficiencies in genome maintenance systems contribute significantly to the onset of cancer. This review summarizes the main DNA caretaking systems and their impact on genome stability and carcinogenesis.

3,898 citations

Journal Article•DOI•
Patrick S. Schnable1, Doreen Ware2, Robert S. Fulton3, Joshua C. Stein2  +156 more•Institutions (18)
20 Nov 2009-Science
TL;DR: The sequence of the maize genome reveals it to be the most complex genome known to date and the correlation of methylation-poor regions with Mu transposon insertions and recombination and how uneven gene losses between duplicated regions were involved in returning an ancient allotetraploid to a genetically diploid state is reported.
Abstract: We report an improved draft nucleotide sequence of the 2.3-gigabase genome of maize, an important crop plant and model for biological research. Over 32,000 genes were predicted, of which 99.8% were placed on reference chromosomes. Nearly 85% of the genome is composed of hundreds of families of transposable elements, dispersed nonuniformly across the genome. These were responsible for the capture and amplification of numerous gene fragments and affect the composition, sizes, and positions of centromeres. We also report on the correlation of methylation-poor regions with Mu transposon insertions and recombination, and copy number variants with insertions and/or deletions, as well as how uneven gene losses between duplicated regions were involved in returning an ancient allotetraploid to a genetically diploid state. These analyses inform and set the stage for further investigations to improve our understanding of the domestication and agricultural improvements of maize.

3,761 citations


Network Information
Related Topics (5)
Gene
211.7K papers, 10.3M citations
96% related
Transcription (biology)
56.5K papers, 2.9M citations
92% related
RNA
111.6K papers, 5.4M citations
91% related
Regulation of gene expression
85.4K papers, 5.8M citations
91% related
Gene expression
113.3K papers, 5.5M citations
90% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20242
20237,313
202214,209
20214,955
20205,080
20194,839