Author

Christopher D. Town

Other affiliations: TigerLogic

Bio: Christopher D. Town is an academic researcher at the J. Craig Venter Institute. The author has contributed to research on the topics Genome and Gene, has an h-index of 61, and has co-authored 121 publications that have received 16,588 citations. Previous affiliations of Christopher D. Town include TigerLogic.

Topics: Genome, Gene, Expressed sequence tag, Arabidopsis, Genomics

Papers published on a yearly basis (chart not shown; years listed: 1979, 1991, 1992, and 1999 to 2021)

Papers

Journal Article

Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome

Boulos Chalhoub¹, Shengyi Liu², Isobel A. P. Parkin³, Haibao Tang⁴, Haibao Tang⁵, Xiyin Wang⁶, Julien Chiquet¹, Harry Belcram¹, Chaobo Tong², Birgit Samans⁷, Margot Correa⁸, Corinne Da Silva⁸, Jérémy Just¹, Cyril Falentin⁹, Chu Shin Koh¹⁰, Isabelle Le Clainche¹, Maria Bernard⁸, Pascal Bento⁸, Benjamin Noel⁸, Karine Labadie⁸, Adriana Alberti⁸, Mathieu Charles⁹, Dominique Arnaud¹, Hui Guo⁶, Christian Daviaud, Salman Alamery¹¹, Kamel Jabbari¹², Kamel Jabbari¹, Meixia Zhao¹³, Patrick P. Edger¹⁴, Houda Chelaifa¹, David C. Tack¹⁵, Gilles Lassalle⁹, Imen Mestiri¹, Nicolas Schnel⁹, Marie-Christine Le Paslier⁹, Guangyi Fan, Victor Renault¹⁶, Philippe E. Bayer¹¹, Agnieszka A. Golicz¹¹, Sahana Manoli¹¹, Tae-Ho Lee⁶, Vinh Ha Dinh Thi¹, Smahane Chalabi¹, Qiong Hu², Chuchuan Fan¹⁷, Reece Tollenaere¹¹, Yunhai Lu¹, Christophe Battail⁸, Jinxiong Shen¹⁷, Christine Sidebottom¹⁰, Xinfa Wang², Aurélie Canaguier¹, Aurélie Chauveau⁹, Aurélie Bérard⁹, G. Deniot⁹, Mei Guan¹⁸, Zhongsong Liu¹⁸, Fengming Sun, Yong Pyo Lim¹⁹, Eric Lyons²⁰, Christopher D. Town⁴, Ian Bancroft²¹, Xiaowu Wang, Jinling Meng¹⁷, Jianxin Ma¹³, J. Chris Pires²², Graham J.W. King²³, Dominique Brunel⁹, Régine Delourme⁹, Michel Renard⁹, Jean-Marc Aury⁸, Keith L. Adams¹⁵, Jacqueline Batley¹¹, Jacqueline Batley²⁴, Rod J. Snowdon⁷, Jörg Tost, David Edwards²⁴, David Edwards¹¹, Yongming Zhou¹⁷, Wei Hua², Andrew G. Sharpe¹⁰, Andrew H. Paterson⁶, Chunyun Guan¹⁸, Patrick Wincker²⁵, Patrick Wincker⁸, Patrick Wincker¹

Institutions (25): University of Évry Val d'Essonne¹, Crops Research Institute², Agriculture and Agri-Food Canada³, J. Craig Venter Institute⁴, Fujian Agriculture and Forestry University⁵, Plant Genome Mapping Laboratory⁶, University of Giessen⁷, French Alternative Energies and Atomic Energy Commission⁸, Institut national de la recherche agronomique⁹, National Research Council¹⁰, Australian Centre for Plant Functional Genomics¹¹, University of Cologne¹², Purdue University¹³, University of California, Berkeley¹⁴, University of British Columbia¹⁵, Fondation Jean Dausset Centre d'Etude du Polymorphisme Humain¹⁶, Huazhong Agricultural University¹⁷, Hunan Agricultural University¹⁸, Chungnam National University¹⁹, University of Arizona²⁰, University of York²¹, University of Missouri²², Southern Cross University²³, University of Western Australia²⁴, Centre national de la recherche scientifique²⁵

22 Aug 2014-Science

TL;DR: The polyploid genome of Brassica napus (oilseed rape), which originated approximately 7500 years ago from the combination of two distinct genomes, is sequenced.

Abstract: Oilseed rape (Brassica napus L.) was formed ~7500 years ago by hybridization between B. rapa and B. oleracea, followed by chromosome doubling, a process known as allopolyploidy. Together with more ancient polyploidizations, this conferred an aggregate 72× genome multiplication since the origin of angiosperms and high gene content. We examined the B. napus genome and the consequences of its recent duplication. The constituent An and Cn subgenomes are engaged in subtle structural, functional, and epigenetic cross-talk, with abundant homeologous exchanges. Incipient gene loss and expression divergence have begun. Selection in B. napus oilseed types has accelerated the loss of glucosinolate genes, while preserving expansion of oil biosynthesis genes. These processes provide insights into allopolyploid evolution and its relationship with crop domestication and improvement.

1,743 citations

Journal Article

Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies

Brian J. Haas¹, Arthur L. Delcher², Stephen M. Mount³, Jennifer R. Wortman², Roger Smith², Linda Hannick², Rama Maiti², Catherine M. Ronning², Douglas B. Rusch, Christopher D. Town², Steven L. Salzberg², Owen White²

Institutions (3): TigerLogic¹, J. Craig Venter Institute², University of Maryland, College Park³

01 Oct 2003-Nucleic Acids Research

TL;DR: The algorithm of the Program to Assemble Spliced Alignments (PASA) tool is described, as well as the results of automated updates to Arabidopsis gene annotations.

Abstract: The spliced alignment of expressed sequence data to genomic sequence has proven a key tool in the comprehensive annotation of genes in eukaryotic genomes. A novel algorithm was developed to assemble clusters of overlapping transcript alignments (ESTs and full-length cDNAs) into maximal alignment assemblies, thereby comprehensively incorporating all available transcript data and capturing subtle splicing variations. Complete and partial gene structures identified by this method were used to improve The Institute for Genomic Research Arabidopsis genome annotation (TIGR release v.4.0). The alignment assemblies permitted the automated modeling of several novel genes and >1000 alternative splicing variations as well as updates (including UTR annotations) to nearly half of the ~27 000 annotated protein coding genes. The algorithm of the Program to Assemble Spliced Alignments (PASA) tool is described, as well as the results of automated updates to Arabidopsis gene annotations.

1,441 citations

Journal Article

The Medicago genome provides insight into the evolution of rhizobial symbioses

Nevin D. Young¹, Frédéric Debellé², Frédéric Debellé³, Giles E. D. Oldroyd⁴, René Geurts⁵, Steven B. Cannon⁶, Steven B. Cannon⁷, Michael K. Udvardi, Vagner A. Benedito⁸, Klaus F. X. Mayer, Jérôme Gouzy³, Jérôme Gouzy², Heiko Schoof⁹, Yves Van de Peer¹⁰, Sebastian Proost¹⁰, Douglas R. Cook¹¹, Blake C. Meyers¹², Manuel Spannagl, Foo Cheung¹³, Stéphane De Mita⁵, Vivek Krishnakumar¹³, Heidrun Gundlach, Shiguo Zhou¹⁴, Joann Mudge¹⁵, Arvind K. Bharti¹⁵, Jeremy D. Murray⁴, Marina Naoumkina, Benjamin D. Rosen¹¹, Kevin A. T. Silverstein¹, Haibao Tang¹³, Stephane Rombauts¹⁰, Patrick X. Zhao, Peng Zhou¹, Valérie Barbe, Philippe Bardou³, Philippe Bardou², Michael Bechner¹⁴, Arnaud Bellec², Anne Berger, Hélène Bergès², Shelby L. Bidwell¹³, Ton Bisseling⁵, Ton Bisseling¹⁶, Nathalie Choisne, Arnaud Couloux, Roxanne Denny¹, Shweta Deshpande¹⁷, Xinbin Dai, Jeff J. Doyle¹⁸, Anne Marie Dudez², Anne Marie Dudez³, Andrew Farmer¹⁵, Stéphanie Fouteau, Carolien Franken⁵, Chrystel Gibelin³, Chrystel Gibelin², John Gish¹¹, Steven A. Goldstein¹⁴, Alvaro J. González¹², Pamela J. Green¹², Asis Hallab¹⁹, Marijke Hartog⁵, Axin Hua¹⁷, Sean Humphray²⁰, Dong-Hoon Jeong¹², Yi Jing¹⁷, Anika Jöcker¹⁹, Steve Kenton¹⁷, Dong-Jin Kim¹¹, Dong-Jin Kim²¹, Kathrin Klee¹⁹, Hongshing Lai¹⁷, Chunting Lang⁵, Shaoping Lin¹⁷, Simone L. Macmil¹⁷, Ghislaine Magdelenat, Lucy Matthews²⁰, Jamison McCorrison¹³, Erin L. Monaghan¹³, Jeong Hwan Mun¹¹, Jeong Hwan Mun²², Fares Z. Najar¹⁷, Christine Nicholson²⁰, Céline Noirot², Majesta O'Bleness¹⁷, Charles Paule¹, Julie Poulain, Florent Prion³, Florent Prion², Baifang Qin¹⁷, Chunmei Qu¹⁷, Ernest F. Retzel¹⁵, Claire Riddle²⁰, Erika Sallet², Erika Sallet³, Sylvie Samain, Nicolas Samson³, Nicolas Samson², Iryna Sanders¹⁷, Olivier Saurat², Olivier Saurat³, Claude Scarpelli, Thomas Schiex², Béatrice Segurens, Andrew J. Severin⁶, D. Janine Sherrier¹², Ruihua Shi¹⁷, Sarah Sims²⁰, Susan R. Singer²³, Senjuti Sinharoy, Lieven Sterck¹⁰, Agnès Viollet, Bing Bing Wang¹, Keqin Wang¹⁷, Mingyi Wang, Xiaohong Wang¹, Jens Warfsmann¹⁹, Jean Weissenbach, Doug White¹⁷, James D. White¹⁷, Graham B. Wiley¹⁷, Patrick Wincker, Yanbo Xing¹⁷, Limei Yang¹⁷, Ziyun Yao¹⁷, Fu Ying¹⁷, Jixian Zhai¹², Liping Zhou¹⁷, Antoine Zuber², Antoine Zuber³, Jean Dénarié², Jean Dénarié³, Richard A. Dixon, Gregory D. May¹⁵, David C. Schwartz¹⁴, Jane Rogers²⁴, Francis Quetier, Christopher D. Town¹³, Bruce A. Roe¹⁷

Institutions (24): University of Minnesota¹, Institut national de la recherche agronomique², Centre national de la recherche scientifique³, John Innes Centre⁴, Laboratory of Molecular Biology⁵, Iowa State University⁶, Agricultural Research Service⁷, West Virginia University⁸, University of Bonn⁹, Ghent University¹⁰, University of California, Davis¹¹, Delaware Biotechnology Institute¹², J. Craig Venter Institute¹³, University of Wisconsin-Madison¹⁴, National Center for Genome Resources¹⁵, King Saud University¹⁶, University of Oklahoma¹⁷, Cornell University¹⁸, Max Planck Society¹⁹, Wellcome Trust²⁰, International Institute of Minnesota²¹, Rural Development Administration²², Carleton College²³, Norwich Research Park²⁴

22 Dec 2011-Nature

TL;DR: The draft sequence of the genome of M. truncatula, a close relative of alfalfa (Medicago sativa), is described. Alfalfa is a widely cultivated crop with limited genomics tools and complex autotetraploid genetics, so the M. truncatula genome sequence provides significant opportunities to expand alfalfa's genomic toolbox.

Abstract: Legumes (Fabaceae or Leguminosae) are unique among cultivated plants for their ability to carry out endosymbiotic nitrogen fixation with rhizobial bacteria, a process that takes place in a specialized structure known as the nodule. Legumes belong to one of the two main groups of eurosids, the Fabidae, which includes most species capable of endosymbiotic nitrogen fixation. Legumes comprise several evolutionary lineages derived from a common ancestor 60 million years ago (Myr ago). Papilionoids are the largest clade, dating nearly to the origin of legumes and containing most cultivated species. Medicago truncatula is a long-established model for the study of legume biology. Here we describe the draft sequence of the M. truncatula euchromatin based on a recently completed BAC assembly supplemented with Illumina shotgun sequence, together capturing ∼94% of all M. truncatula genes. A whole-genome duplication (WGD) approximately 58 Myr ago had a major role in shaping the M. truncatula genome and thereby contributed to the evolution of endosymbiotic nitrogen fixation. Subsequent to the WGD, the M. truncatula genome experienced higher levels of rearrangement than two other sequenced legumes, Glycine max and Lotus japonicus. M. truncatula is a close relative of alfalfa (Medicago sativa), a widely cultivated crop with limited genomics tools and complex autotetraploid genetics. As such, the M. truncatula genome sequence provides significant opportunities to expand alfalfa's genomic toolbox.

1,153 citations

Journal Article

Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana

Xiaoying Lin, Samir Kaul, Steve Rounsley, Terrance Shea, Maria-Ines Benito, Christopher D. Town, Claire Fujii, Tanya Mason, Cheryl Bowman, Mary Barnstead, Tamara Feldblyum, C. Robin Buell, Karen A. Ketchum, John Lee, Catherine M. Ronning, Hean L. Koo, Kelly Moffat, Lisa A. Cronin, Mian Shen, Grace Pai, Susan Van Aken, Lowell Umayam, Luke J. Tallon, John Gill, Mark Raymond Adams¹, Ana J. Carrera, Todd Creasy, Howard M. Goodman², Chris Somerville³, Gregory P. Copenhaver⁴, Daphne Preuss⁴, William C. Nierman, Owen White, Jonathan A. Eisen, Steven L. Salzberg, Claire M. Fraser, J. Craig Venter¹

Institutions (4): Celera Corporation¹, Harvard University², Carnegie Institution for Science³, University of Chicago⁴

16 Dec 1999-Nature

TL;DR: The sequence of chromosome 2 from the Columbia ecotype is reported in two gap-free assemblies (contigs) of 3.6 and 16 megabases, the latter of which represents the longest published stretch of uninterrupted DNA sequence assembled from any organism to date.

Abstract: Arabidopsis thaliana (Arabidopsis) is unique among plant model organisms in having a small genome (130-140 Mb), excellent physical and genetic maps, and little repetitive DNA. Here we report the sequence of chromosome 2 from the Columbia ecotype in two gap-free assemblies (contigs) of 3.6 and 16 megabases (Mb). The latter represents the longest published stretch of uninterrupted DNA sequence assembled from any organism to date. Chromosome 2 represents 15% of the genome and encodes 4,037 genes, 49% of which have no predicted function. Roughly 250 tandem gene duplications were found in addition to large-scale duplications of about 0.5 and 4.5 Mb between chromosomes 2 and 1 and between chromosomes 2 and 4, respectively. Sequencing of nearly 2 Mb within the genetically defined centromere revealed a low density of recognizable genes, and a high density and diverse range of vestigial and presumably inactive mobile elements. More unexpected is what appears to be a recent insertion of a continuous stretch of 75% of the mitochondrial genome into chromosome 2.

792 citations

Journal Article

Araport11: a complete reannotation of the Arabidopsis thaliana reference genome

Chia Yi Cheng¹, Vivek Krishnakumar¹, Agnes P. Chan¹, Françoise Thibaud-Nissen², Seth Schobel¹, Christopher D. Town¹

Institutions (2): J. Craig Venter Institute¹, National Institutes of Health²

01 Feb 2017-Plant Journal

TL;DR: This updated Arabidopsis genome annotation with a substantially increased resolution of gene models will not only further the understanding of the biological processes of this plant model but also of other species.

Abstract: The flowering plant Arabidopsis thaliana is a dicot model organism for research in many aspects of plant biology. A comprehensive annotation of its genome paves the way for understanding the functions and activities of all types of transcripts, including mRNA, the various classes of non-coding RNA, and small RNA. The TAIR10 annotation update had a profound impact on Arabidopsis research but was released more than 5 years ago. Maintaining the accuracy of the annotation continues to be a prerequisite for future progress. Using an integrative annotation pipeline, we assembled tissue-specific RNA-Seq libraries from 113 datasets and constructed 48 359 transcript models of protein-coding genes in eleven tissues. In addition, we annotated various classes of non-coding RNA including microRNA, long intergenic RNA, small nucleolar RNA, natural antisense transcript, small nuclear RNA, and small RNA using published datasets and in-house analytic results. Altogether, we identified 635 novel protein-coding genes, 508 novel transcribed regions, 5178 non-coding RNAs, and 35 846 small RNA loci that were formerly unannotated. Analysis of the splicing events and RNA-Seq based expression profiles revealed the landscapes of gene structures, untranslated regions, and splicing activities to be more intricate than previously appreciated. Furthermore, we present 692 uniformly expressed housekeeping genes, 43% of whose human orthologs are also housekeeping genes. This updated Arabidopsis genome annotation with a substantially increased resolution of gene models will not only further our understanding of the biological processes of this plant model but also of other species.

769 citations

Cited by

Journal Article

Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation

Cole Trapnell¹, Cole Trapnell², Brian A. Williams³, Geo Pertea², Ali Mortazavi³, Gordon Kwan³, Marijke J. van Baren⁴, Steven L. Salzberg², Barbara J. Wold³, Lior Pachter¹

Institutions (4): University of California, Berkeley¹, University of Maryland, College Park², California Institute of Technology³, Washington University in St. Louis⁴

01 May 2010-Nature Biotechnology

TL;DR: The results suggest that Cufflinks can illuminate the substantial regulatory flexibility and complexity in even this well-studied model of muscle development and that it can improve transcriptome-based genome annotation.

Abstract: High-throughput mRNA sequencing (RNA-Seq) promises simultaneous transcript discovery and abundance estimation. However, this would require algorithms that are not restricted by prior gene annotations and that account for alternative transcription and splicing. Here we introduce such algorithms in an open-source software program called Cufflinks. To test Cufflinks, we sequenced and analyzed >430 million paired 75-bp RNA-Seq reads from a mouse myoblast cell line over a differentiation time series. We detected 13,692 known transcripts and 3,724 previously unannotated ones, 62% of which are supported by independent expression data or by homologous genes in other species. Over the time series, 330 genes showed complete switches in the dominant transcription start site (TSS) or splice isoform, and we observed more subtle shifts in 1,304 other genes. These results suggest that Cufflinks can illuminate the substantial regulatory flexibility and complexity in even this well-studied model of muscle development and that it can improve transcriptome-based genome annotation.

13,337 citations

Journal Article

The sequence of the human genome.

J. Craig Venter¹, Mark Raymond Adams¹, Eugene W. Myers¹, Peter W. Li¹, and 269 more authors (12 institutions)

16 Feb 2001-Science

TL;DR: Comparative genomic analysis indicates vertebrate expansions of genes associated with neuronal function, with tissue-specific developmental regulation, and with the hemostasis and immune systems.

Abstract: A 2.91-billion base pair (bp) consensus sequence of the euchromatic portion of the human genome was generated by the whole-genome shotgun sequencing method. The 14.8-billion bp DNA sequence was generated over 9 months from 27,271,853 high-quality sequence reads (5.11-fold coverage of the genome) from both ends of plasmid clones made from the DNA of five individuals. Two assembly strategies-a whole-genome assembly and a regional chromosome assembly-were used, each combining sequence data from Celera and the publicly funded genome effort. The public data were shredded into 550-bp segments to create a 2.9-fold coverage of those genome regions that had been sequenced, without including biases inherent in the cloning and assembly procedure used by the publicly funded group. This brought the effective coverage in the assemblies to eightfold, reducing the number and size of gaps in the final assembly over what would be obtained with 5.11-fold coverage. The two assembly strategies yielded very similar results that largely agree with independent mapping data. The assemblies effectively cover the euchromatic regions of the human chromosomes. More than 90% of the genome is in scaffold assemblies of 100,000 bp or more, and 25% of the genome is in scaffolds of 10 million bp or larger. Analysis of the genome sequence revealed 26,588 protein-encoding transcripts for which there was strong corroborating evidence and an additional approximately 12,000 computationally derived genes with mouse matches or other weak supporting evidence. Although gene-dense clusters are obvious, almost half the genes are dispersed in low G+C sequence separated by large tracts of apparently noncoding sequence. Only 1.1% of the genome is spanned by exons, whereas 24% is in introns, with 75% of the genome being intergenic DNA. Duplications of segmental blocks, ranging in size up to chromosomal lengths, are abundant throughout the genome and reveal a complex evolutionary history. 
Comparative genomic analysis indicates vertebrate expansions of genes associated with neuronal function, with tissue-specific developmental regulation, and with the hemostasis and immune systems. DNA sequence comparisons between the consensus sequence and publicly funded genome data provided locations of 2.1 million single-nucleotide polymorphisms (SNPs). A random pair of human haploid genomes differed at a rate of 1 bp per 1250 on average, but there was marked heterogeneity in the level of polymorphism across the genome. Less than 1% of all SNPs resulted in variation in proteins, but the task of determining which SNPs have functional consequences remains an open challenge.

12,098 citations

SPAdes, a new genome assembly algorithm and its applications to single-cell sequencing (7th Annual SFAF Meeting, 2012)

Glenn Tesler

01 Jun 2012

TL;DR: SPAdes is a new assembler for both single-cell and standard (multicell) assembly; the authors demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on the popular assemblers Velvet and SoapDeNovo (for multicell data).

Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online (http://bioinf.spbau.ru/spades). It is distributed as open source software.

10,124 citations

Journal Article

Analysis of the genome sequence of the flowering plant Arabidopsis thaliana.

Arabidopsis Genome Initiative¹

Institutions (1): J. Craig Venter Institute¹

14 Dec 2000-Nature

TL;DR: This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.

Abstract: The flowering plant Arabidopsis thaliana is an important model system for identifying genes and determining their functions. Here we report the analysis of the genomic sequence of Arabidopsis. The sequenced regions cover 115.4 megabases of the 125-megabase genome and extend into centromeric regions. The evolution of Arabidopsis involved a whole-genome duplication, followed by subsequent gene loss and extensive local gene duplications, giving rise to a dynamic genome enriched by lateral gene transfer from a cyanobacterial-like ancestor of the plastid. The genome contains 25,498 genes encoding proteins from 11,000 families, similar to the functional diversity of Drosophila and Caenorhabditis elegans--the other sequenced multicellular eukaryotes. Arabidopsis has many families of new proteins but also lacks several common protein families, indicating that the sets of common proteins have undergone differential expansion and contraction in the three multicellular eukaryotes. This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.

8,742 citations

Journal Article

The genome sequence of Drosophila melanogaster

Mark Raymond Adams¹, Susan E. Celniker², Robert A. Holt¹, Cheryl A. Evans¹, and 191 more authors (23 institutions)

24 Mar 2000-Science

TL;DR: The nucleotide sequence of nearly all of the approximately 120-megabase euchromatic portion of the Drosophila genome is determined using a whole-genome shotgun sequencing strategy supported by extensive clone-based sequence and a high-quality bacterial artificial chromosome physical map.

Abstract: The fly Drosophila melanogaster is one of the most intensively studied organisms in biology and serves as a model system for the investigation of many developmental and cellular processes common to higher eukaryotes, including humans. We have determined the nucleotide sequence of nearly all of the approximately 120-megabase euchromatic portion of the Drosophila genome using a whole-genome shotgun sequencing strategy supported by extensive clone-based sequence and a high-quality bacterial artificial chromosome physical map. Efforts are under way to close the remaining gaps; however, the sequence is of sufficient accuracy and contiguity to be declared substantially complete and to support an initial analysis of genome structure and preliminary gene annotation and interpretation. The genome encodes approximately 13,600 genes, somewhat fewer than the smaller Caenorhabditis elegans genome, but with comparable functional diversity.

6,180 citations
