Home
/
Authors
/
Sam Rash

Author

Sam Rash

Other affiliations: Agency for Science, Technology and Research, United States Department of Energy, Agricultural Research Service

Bio: Sam Rash is an academic researcher from Joint Genome Institute. The author has contributed to research in topics: Gene & Genome. The author has an hindex of 9, co-authored 10 publications receiving 4756 citations. Previous affiliations of Sam Rash include Agency for Science, Technology and Research & United States Department of Energy.

Topics: Gene, Genome, Chromosome 19, Chromosome 16, Autosome ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

The draft genome of Ciona intestinalis : insights into chordate and vertebrate origins

[...]

Paramvir S. Dehal¹, Yutaka Satou², Robert K. Campbell³, Jarrod Chapman¹, Bernard M. Degnan⁴, Anthony W. De Tomaso⁵, Brad Davidson⁶, Anna Di Gregorio⁶, Maarten D. Sollewijn Gelpke¹, David Goodstein¹, Naoe Harafuji⁶, Kenneth E. M. Hastings⁷, Isaac Ho¹, Kohji Hotta⁸, Wayne Huang¹, Takeshi Kawashima², Patrick Lemaire⁹, Diego Martinez¹, Ian A. Meinertzhagen¹⁰, Simona Necula¹, Masaru Nonaka¹¹, Nik Putnam¹, Sam Rash¹, Hidetoshi Saiga¹², Masanobu Satake¹³, Astrid Terry¹, Lixy Yamada², Hong Gang Wang¹⁴, Satoko Awazu², Kaoru Azumi¹⁵, Jeffrey L. Boore¹, Margherita Branno¹⁶, Stephen T. Chin-Bow¹⁷, Rosaria DeSantis¹⁶, Sharon A. Doyle¹, Pilar Francino¹, David N. Keys¹, David N. Keys⁶, Shinobu Haga⁸, Hiroko Hayashi⁸, Kyosuke Hino², Kaoru S. Imai², Kazuo Inaba¹³, Shungo Kano², Shungo Kano¹⁶, Kenji Kobayashi², Mari Kobayashi², Byung In Lee¹, Kazuhiro W. Makabe², Chitra Manohar¹, Giorgio Matassi¹⁶, Mónica Medina¹, Yasuaki Mochizuki², Steve Mount¹⁸, Tomomi Morishita⁸, Sachiko Miura⁸, Akie Nakayama², Satoko Nishizaka⁸, Hisayo Nomoto⁸, Fumiko Ohta⁸, Kazuko Oishi⁸, Isidore Rigoutsos¹⁷, Masako Sano⁸, Akane Sasaki², Yasunori Sasakura², Eiichi Shoguchi², Tadasu Shin-I⁸, Antoinetta Spagnuolo¹⁶, Didier Y.R. Stainier¹⁹, Miho Suzuki²⁰, Olivier Tassy⁹, Naohito Takatori², Miki Tokuoka², Kasumi Yagi², Fumiko Yoshizaki¹¹, Shuichi Wada², Cindy Zhang¹, P. Douglas Hyatt²¹, Frank W. Larimer²¹, Chris Detter¹, Norman A. Doggett²², Tijana Glavina¹, Trevor Hawkins¹, Paul G. Richardson¹, Susan Lucas¹, Yuji Kohara⁸, Michael Levine⁶, Nori Satoh², Daniel S. Rokhsar¹, Daniel S. Rokhsar⁶ - Show less +86 more•Institutions (22)

United States Department of Energy¹, Kyoto University², Marine Biological Laboratory³, University of Queensland⁴, Stanford University⁵, University of California, Berkeley⁶, McGill University⁷, National Institute of Genetics⁸, Aix-Marseille University⁹, Dalhousie University¹⁰, University of Tokyo¹¹, Tokyo Metropolitan University¹², Tohoku University¹³, University of South Florida¹⁴, Hokkaido University¹⁵, Stazione Zoologica Anton Dohrn¹⁶, IBM¹⁷, University of Maryland, College Park¹⁸, University of California, San Francisco¹⁹, University of Edinburgh²⁰, Oak Ridge National Laboratory²¹, Los Alamos National Laboratory²²

13 Dec 2002-Science

TL;DR: A draft of the protein-coding portion of the genome of the most studied ascidian, Ciona intestinalis, is generated, suggesting that ascidians contain the basic ancestral complement of genes involved in cell signaling and development.

...read moreread less

Abstract: The first chordates appear in the fossil record at the time of the Cambrian explosion, nearly 550 million years ago. The modern ascidian tadpole represents a plausible approximation to these ancestral chordates. To illuminate the origins of chordate and vertebrates, we generated a draft of the protein-coding portion of the genome of the most studied ascidian, Ciona intestinalis. The Ciona genome contains approximately 16,000 protein-coding genes, similar to the number in other invertebrates, but only half that found in vertebrates. Vertebrate gene families are typically found in simplified form in Ciona, suggesting that ascidians contain the basic ancestral complement of genes involved in cell signaling and development. The ascidian genome has also acquired a number of lineage-specific innovations, including a group of genes engaged in cellulose metabolism that are related to those in bacteria and fungi.

...read moreread less

1,582 citations

Journal Article•DOI•

Whole-Genome Shotgun Assembly and Analysis of the Genome of Fugu rubripes

[...]

Samuel Aparicio¹, Jarrod Chapman¹, Elia Stupka¹, Nik Putnam¹, Jer Ming Chia¹, Paramvir S. Dehal¹, Alan Christoffels¹, Sam Rash¹, Shawn Hoon¹, Arian F.A. Smit¹, Maarten D. Sollewijn Gelpke¹, Jared C. Roach¹, Tania Oh¹, Isaac Ho¹, Marie Wong¹, Chris Detter¹, Frans Verhoef¹, Paul Predki¹, Alice Tay¹, Susan Lucas¹, Paul G. Richardson¹, Sarah Smith¹, Melody S. Clark¹, Yvonne J. K. Edwards¹, Norman A. Doggett¹, Andrey Zharkikh¹, Sean V. Tavtigian¹, Dmitry Pruss¹, Mary Barnstead¹, Cheryl Evans¹, Holly Baden¹, Justin Powell¹, Gustavo Glusman¹, Lee Rowen¹, Leroy Hood¹, Y. H. Tan¹, Greg Elgar¹, Trevor Hawkins¹, Byrappa Venkatesh¹, Daniel S. Rokhsar¹, Sydney Brenner¹ - Show less +37 more•Institutions (1)

Agency for Science, Technology and Research¹

23 Aug 2002-Science

TL;DR: The Fugu rubripes genome has been sequenced to over 95% coverage, and more than 80% of the assembly is in multigene-sized scaffolds as discussed by the authors.

...read moreread less

Abstract: The compact genome of Fugu rubripes has been sequenced to over 95% coverage, and more than 80% of the assembly is in multigene-sized scaffolds. In this 365-megabase vertebrate genome, repetitive DNA accounts for less than one-sixth of the sequence, and gene loci occupy about one-third of the genome. As with the human genome, gene loci are not evenly distributed, but are clustered into sparse and dense regions. Some “giant” genes were observed that had average coding sequence sizes but were spread over genomic lengths significantly larger than those of their human orthologs. Although three-quarters of predicted human proteins have a strong match toFugu, approximately a quarter of the human proteins had highly diverged from or had no pufferfish homologs, highlighting the extent of protein evolution in the 450 million years since teleosts and mammals diverged. Conserved linkages between Fugu and human genes indicate the preservation of chromosomal segments from the common vertebrate ancestor, but with considerable scrambling of gene order.

...read moreread less

1,446 citations

Journal Article•DOI•

Phytophthora Genome Sequences Uncover Evolutionary Origins and Mechanisms of Pathogenesis

[...]

Brett M. Tyler¹, Sucheta Tripathy¹, Xuemin Zhang¹, Paramvir S. Dehal², Paramvir S. Dehal³, Rays H. Y. Jiang¹, Rays H. Y. Jiang⁴, Andrea Aerts³, Andrea Aerts², Felipe D. Arredondo¹, Laura Baxter⁵, Douda Bensasson⁶, Douda Bensasson³, Douda Bensasson², Jim Beynon⁵, Jarrod Chapman², Jarrod Chapman³, Jarrod Chapman⁷, C. M. B. Damasceno⁸, Anne E. Dorrance⁹, Daolong Dou¹, Allan W. Dickerman¹, Inna Dubchak², Inna Dubchak³, Matteo Garbelotto⁷, Mark Gijzen¹⁰, Stuart G. Gordon⁹, Francine Govers⁴, Niklaus J. Grünwald¹¹, Wayne Huang³, Wayne Huang¹², Kelly Ivors⁷, Kelly Ivors¹³, Richard W. Jones¹¹, Sophien Kamoun⁹, Konstantinos Krampis¹, Kurt Lamour¹⁴, Mi-Kyung Lee, W. Hayes McDonald¹⁵, MoÌnica Medina¹⁶, Harold J. G. Meijer⁴, Eric K. Nordberg¹, Donald J. Maclean¹⁷, Manuel D. Ospina-Giraldo¹⁸, Paul Morris¹⁹, Vipaporn Phuntumart¹⁹, Nicholas H. Putnam², Nicholas H. Putnam³, Sam Rash¹¹, Sam Rash³, Jocelyn K. C. Rose⁸, Yasuko Sakihama²⁰, Asaf Salamov³, Asaf Salamov², Alon Savidor¹⁴, Chantel F. Scheuring, Brian M. Smith¹, Bruno W. S. Sobral¹, Astrid Terry¹¹, Astrid Terry³, Trudy Torto-Alalibo¹, Joe Win⁹, Zhanyou Xu, Hong-Bin Zhang, Igor V. Grigoriev³, Igor V. Grigoriev², Daniel S. Rokhsar⁷, Daniel S. Rokhsar³, Jeffrey L. Boore - Show less +65 more•Institutions (20)

Virginia Tech¹, Lawrence Berkeley National Laboratory², Joint Genome Institute³, Wageningen University and Research Centre⁴, University of Warwick⁵, Imperial College London⁶, University of California, Berkeley⁷, Cornell University⁸, Ohio Agricultural Research and Development Center⁹, Agriculture and Agri-Food Canada¹⁰, Agricultural Research Service¹¹, Lawrence Livermore National Laboratory¹², North Carolina State University¹³, University of Tennessee¹⁴, Oak Ridge National Laboratory¹⁵, University of California, Merced¹⁶, University of Queensland¹⁷, Wilkes University¹⁸, Bowling Green State University¹⁹, Hokkaido University²⁰

01 Sep 2006-Science

TL;DR: Comparison of the two species' genomes reveals a rapid expansion and diversification of many protein families associated with plant infection such as hydrolases, ABC transporters, protein toxins, proteinase inhibitors, and, in particular, a superfamily of 700 proteins with similarity to known oömycete avirulence genes.

...read moreread less

Abstract: Draft genome sequences have been determined for the soybean pathogen Phytophthora sojae and the sudden oak death pathogen Phytophthora ramorum. Oomycetes such as these Phytophthora species share the kingdom Stramenopila with photosynthetic algae such as diatoms, and the presence of many Phytophthora genes of probable phototroph origin supports a photosynthetic ancestry for the stramenopiles. Comparison of the two species' genomes reveals a rapid expansion and diversification of many protein families associated with plant infection such as hydrolases, ABC transporters, protein toxins, proteinase inhibitors, and, in particular, a superfamily of 700 proteins with similarity to known oomycete avirulence genes.

...read moreread less

1,016 citations

Journal Article•DOI•

The DNA sequence and biology of human chromosome 19

[...]

Jane Grimwood¹, Laurie Gordon², Laurie Gordon³, Anne S. Olsen³, Anne S. Olsen², Astrid Terry², Jeremy Schmutz¹, Jane Lamerdin³, Jane Lamerdin², Uffe Hellsten², David Goodstein², Olivier Couronne², Mary Bao Tran-Gyamfi³, Mary Bao Tran-Gyamfi², Andrea Aerts², Michael R. Altherr⁴, Michael R. Altherr², Linda K. Ashworth³, Linda K. Ashworth², Eva Bajorek¹, Stacey Black¹, Elbert Branscomb², Elbert Branscomb³, Sean Caenepeel², Anthony V. Carrano³, Anthony V. Carrano², Chenier Caoile¹, Yee Man Chan¹, Mari Christensen³, Mari Christensen², Catherine A. Cleland², Catherine A. Cleland⁴, Alex Copeland², Eileen Dalin², Paramvir S. Dehal², Mirian Denys¹, John C. Detter², Julio Escobar¹, Dave Flowers¹, Dea Fotopulos¹, Carmen Rosa Albacete García¹, Anca M. Georgescu³, Anca M. Georgescu², Tijana Glavina², Maria Gomez¹, Eidelyn Gonzales¹, Matthew Groza², Matthew Groza³, Nancy Hammon², Trevor Hawkins², Lauren Haydu¹, Isaac Ho², Wayne Huang², Sanjay Israni², Jamie Jett², Kristen Kadner², Heather Kimball², Arthur Kobayashi³, Arthur Kobayashi², Vladimer Larionov, Sun-Hee Leem, Frederick Lopez¹, Yunian Lou², Steve Lowry², Stephanie Malfatti², Stephanie Malfatti³, Diego Martinez², Paula McCready², Paula McCready³, Catherine Medina¹, Jenna Morgan², Kathryn Nelson², Kathryn Nelson⁴, Matt Nolan², Ivan Ovcharenko², Ivan Ovcharenko³, Sam Pitluck², Martin Pollard², Anthony P. Popkie⁵, Paul Predki², Glenda Quan³, Glenda Quan², Lucía Ramírez¹, Sam Rash², James Retterer¹, Alex Rodriguez¹, Stephanine Rogers¹, Asaf Salamov², Angelica Salazar¹, Xinwei She⁵, Doug Smith², Tom Slezak³, Tom Slezak², Victor V. Solovyev², Nina Thayer⁴, Nina Thayer², Hope Tice², Ming Tsai¹, Anna Ustaszewska², Nu Vo¹, Mark C. Wagner³, Mark C. Wagner², Jeremy Wheeler¹, Kevin Wu¹, Gary Xie², Gary Xie⁴, Joan Yang¹, Inna Dubchak², Terrence S. Furey⁶, Pieter J. deJong⁷, Mark Dickson¹, David Gordon⁸, Evan E. Eichler⁵, Len A. Pennacchio², Paul G. Richardson², Lisa Stubbs³, Lisa Stubbs², Daniel S. Rokhsar², Richard M. Myers¹, Edward M. Rubin², Susan Lucas² - Show less +117 more•Institutions (8)

Stanford University¹, Joint Genome Institute², Lawrence Livermore National Laboratory³, Los Alamos National Laboratory⁴, Case Western Reserve University⁵, University of California, Santa Cruz⁶, Children's Hospital Oakland⁷, University of Washington⁸

01 Apr 2004-Nature

TL;DR: Comparative analyses show a fascinating picture of conservation and divergence, revealing large blocks of gene orthology with rodents, scattered regions with more recent gene family expansions and deletions, and segments of coding and non-coding conservation with the distant fish species Takifugu.

...read moreread less

Abstract: Chromosome 19 has the highest gene density of all human chromosomes, more than double the genome-wide average. The large clustered gene families, corresponding high G + C content, CpG islands and density of repetitive DNA indicate a chromosome rich in biological and evolutionary significance. Here we describe 55.8 million base pairs of highly accurate finished sequence representing 99.9% of the euchromatin portion of the chromosome. Manual curation of gene loci reveals 1,461 protein-coding genes and 321 pseudogenes. Among these are genes directly implicated in mendelian disorders, including familial hypercholesterolaemia and insulin-resistant diabetes. Nearly one-quarter of these genes belong to tandemly arranged families, encompassing more than 25% of the chromosome. Comparative analyses show a fascinating picture of conservation and divergence, revealing large blocks of gene orthology with rodents, scattered regions with more recent gene family expansions and deletions, and segments of coding and non-coding conservation with the distant fish species Takifugu.

...read moreread less

307 citations

Journal Article•DOI•

Human chromosome 19 and related regions in mouse: conservative and lineage-specific evolution.

[...]

Paramvir S. Dehal¹, Paramvir S. Dehal², Paul Predki², Anne S. Olsen², Art Kobayashi², Peg Folta², Susan Lucas², Miriam Land³, Miriam Land², Astrid Terry², Carol L. Ecale Zhou², Sam Rash², Qing Zhang², Laurie Gordon², Joomyeong Kim², Christopher J. Elkin⁴, Christopher J. Elkin², Martin Pollard², Martin Pollard⁵, Paul G. Richardson², Daniel S. Rokhsar², Daniel S. Rokhsar⁶, Ed Uberbacher³, Ed Uberbacher², Trevor Hawkins², Elbert Branscomb², Lisa Stubbs² - Show less +23 more•Institutions (6)

University of California, Davis¹, Joint Genome Institute², Oak Ridge National Laboratory³, Lawrence Livermore National Laboratory⁴, Lawrence Berkeley National Laboratory⁵, University of California, Berkeley⁶

06 Jul 2001-Science

TL;DR: To illuminate the function and evolutionary history of both genomes, mouse DNA related to human chromosome 19 is sequenced and breakpoints of all 15 evolutionary rearrangements are sequenced, providing a view of the forces that drive chromosome evolution in mammals.

...read moreread less

Abstract: To illuminate the function and evolutionary history of both genomes, we sequenced mouse DNA related to human chromosome 19. Comparative sequence alignments yielded confirmatory evidence for hypothetical genes and identified exons, regulatory elements, and candidate genes that were missed by other predictive methods. Chromosome-wide comparisons revealed a difference between single-copy HSA19 genes, which are overwhelmingly conserved in mouse, and genes residing in tandem familial clusters, which differ extensively in number, coding capacity, and organization between the two species. Finally, we sequenced breakpoints of all 15 evolutionary rearrangements, providing a view of the forces that drive chromosome evolution in mammals.

...read moreread less

218 citations

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Initial sequencing and comparative analysis of the mouse genome.

[...]

Robert H. Waterston¹, Kerstin Lindblad-Toh², Ewan Birney, Jane Rogers³ +219 more•Institutions (26)

05 Dec 2002-Nature

TL;DR: The results of an international collaboration to produce a high-quality draft sequence of the mouse genome are reported and an initial comparative analysis of the Mouse and human genomes is presented, describing some of the insights that can be gleaned from the two sequences.

...read moreread less

Abstract: The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.

...read moreread less

6,643 citations

Journal Article•DOI•

Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project

[...]

Ewan Birney, John A. Stamatoyannopoulos¹, Anindya Dutta², Roderic Guigó³ +317 more•Institutions (44)

14 Jun 2007-Nature

TL;DR: Functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project are reported, providing convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts.

...read moreread less

Abstract: We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.

...read moreread less

5,091 citations

“Bioinformatics” 특집을 내면서

[...]

장병탁, 김삼묘, 허철구

01 Aug 2000

TL;DR: Assessment of medical technology in the context of commercialization with Bioentrepreneur course, which addresses many issues unique to biomedical products.

...read moreread less

Abstract: BIOE 402. Medical Technology Assessment. 2 or 3 hours. Bioentrepreneur course. Assessment of medical technology in the context of commercialization. Objectives, competition, market share, funding, pricing, manufacturing, growth, and intellectual property; many issues unique to biomedical products. Course Information: 2 undergraduate hours. 3 graduate hours. Prerequisite(s): Junior standing or above and consent of the instructor.

...read moreread less

4,833 citations

Journal Article•DOI•

The COG database: an updated version includes eukaryotes

[...]

Roman L. Tatusov¹, Natalie D. Fedorova¹, John D. Jackson¹, Aviva R. Jacobs¹, Boris Kiryutin¹, Eugene V. Koonin¹, Dmitri M. Krylov¹, Raja Mazumder², Sergei L. Mekhedov¹, Anastasia N. Nikolskaya², B Sridhar Rao¹, Sergei Smirnov¹, Alexander V. Sverdlov¹, Sona Vasudevan¹, Yuri I. Wolf¹, Jodie J. Yin¹, Darren A. Natale² - Show less +13 more•Institutions (2)

National Institutes of Health¹, Georgetown University Medical Center²

11 Sep 2003-BMC Bioinformatics

TL;DR: A major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes is described and is expected to be a useful platform for functional annotation of newlysequenced genomes, including those of complex eukARYotes, and genome-wide evolutionary studies.

...read moreread less

Abstract: The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after euk aryotic o rthologous g roups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The euk aryotic o rthologous g roups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or ~54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of ~20% of the KOG set. This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (~1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes. The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies.

...read moreread less

4,167 citations

Journal Article•DOI•

Finishing the euchromatic sequence of the human genome

[...]

Chris P. Ponting, Daniel Barker

21 Oct 2004-Nature

TL;DR: The current human genome sequence (Build 35) as discussed by the authors contains 2.85 billion nucleotides interrupted by only 341 gaps and is accurate to an error rate of approximately 1 event per 100,000 bases.

...read moreread less

Abstract: The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers approximately 99% of the euchromatic genome and is accurate to an error rate of approximately 1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human genome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead.

...read moreread less

3,989 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse