Author

Maarten D. Sollewijn Gelpke

Other affiliations: United States Department of Energy, Agency for Science, Technology and Research

Bio: Maarten D. Sollewijn Gelpke is an academic researcher from the Joint Genome Institute. The author has contributed to research on the topics of Genome and Shotgun sequencing. The author has an h-index of 3 and has co-authored 3 publications receiving 3773 citations. Previous affiliations of Maarten D. Sollewijn Gelpke include the United States Department of Energy and the Agency for Science, Technology and Research.

Topics: Genome, Shotgun sequencing, Gene, Repeated sequence, Human genome ...

Papers

Journal Article•DOI•

The draft genome of Ciona intestinalis : insights into chordate and vertebrate origins

Paramvir S. Dehal¹, Yutaka Satou², Robert K. Campbell³, Jarrod Chapman¹, Bernard M. Degnan⁴, Anthony W. De Tomaso⁵, Brad Davidson⁶, Anna Di Gregorio⁶, Maarten D. Sollewijn Gelpke¹, David Goodstein¹, Naoe Harafuji⁶, Kenneth E. M. Hastings⁷, Isaac Ho¹, Kohji Hotta⁸, Wayne Huang¹, Takeshi Kawashima², Patrick Lemaire⁹, Diego Martinez¹, Ian A. Meinertzhagen¹⁰, Simona Necula¹, Masaru Nonaka¹¹, Nik Putnam¹, Sam Rash¹, Hidetoshi Saiga¹², Masanobu Satake¹³, Astrid Terry¹, Lixy Yamada², Hong Gang Wang¹⁴, Satoko Awazu², Kaoru Azumi¹⁵, Jeffrey L. Boore¹, Margherita Branno¹⁶, Stephen T. Chin-Bow¹⁷, Rosaria DeSantis¹⁶, Sharon A. Doyle¹, Pilar Francino¹, David N. Keys¹,⁶, Shinobu Haga⁸, Hiroko Hayashi⁸, Kyosuke Hino², Kaoru S. Imai², Kazuo Inaba¹³, Shungo Kano²,¹⁶, Kenji Kobayashi², Mari Kobayashi², Byung In Lee¹, Kazuhiro W. Makabe², Chitra Manohar¹, Giorgio Matassi¹⁶, Mónica Medina¹, Yasuaki Mochizuki², Steve Mount¹⁸, Tomomi Morishita⁸, Sachiko Miura⁸, Akie Nakayama², Satoko Nishizaka⁸, Hisayo Nomoto⁸, Fumiko Ohta⁸, Kazuko Oishi⁸, Isidore Rigoutsos¹⁷, Masako Sano⁸, Akane Sasaki², Yasunori Sasakura², Eiichi Shoguchi², Tadasu Shin-I⁸, Antoinetta Spagnuolo¹⁶, Didier Y.R. Stainier¹⁹, Miho Suzuki²⁰, Olivier Tassy⁹, Naohito Takatori², Miki Tokuoka², Kasumi Yagi², Fumiko Yoshizaki¹¹, Shuichi Wada², Cindy Zhang¹, P. Douglas Hyatt²¹, Frank W. Larimer²¹, Chris Detter¹, Norman A. Doggett²², Tijana Glavina¹, Trevor Hawkins¹, Paul G. Richardson¹, Susan Lucas¹, Yuji Kohara⁸, Michael Levine⁶, Nori Satoh², Daniel S. Rokhsar¹,⁶

United States Department of Energy¹, Kyoto University², Marine Biological Laboratory³, University of Queensland⁴, Stanford University⁵, University of California, Berkeley⁶, McGill University⁷, National Institute of Genetics⁸, Aix-Marseille University⁹, Dalhousie University¹⁰, University of Tokyo¹¹, Tokyo Metropolitan University¹², Tohoku University¹³, University of South Florida¹⁴, Hokkaido University¹⁵, Stazione Zoologica Anton Dohrn¹⁶, IBM¹⁷, University of Maryland, College Park¹⁸, University of California, San Francisco¹⁹, University of Edinburgh²⁰, Oak Ridge National Laboratory²¹, Los Alamos National Laboratory²²

13 Dec 2002-Science

TL;DR: A draft of the protein-coding portion of the genome of the most studied ascidian, Ciona intestinalis, is generated, suggesting that ascidians contain the basic ancestral complement of genes involved in cell signaling and development.

Abstract: The first chordates appear in the fossil record at the time of the Cambrian explosion, nearly 550 million years ago. The modern ascidian tadpole represents a plausible approximation to these ancestral chordates. To illuminate the origins of chordates and vertebrates, we generated a draft of the protein-coding portion of the genome of the most studied ascidian, Ciona intestinalis. The Ciona genome contains approximately 16,000 protein-coding genes, similar to the number in other invertebrates, but only half that found in vertebrates. Vertebrate gene families are typically found in simplified form in Ciona, suggesting that ascidians contain the basic ancestral complement of genes involved in cell signaling and development. The ascidian genome has also acquired a number of lineage-specific innovations, including a group of genes engaged in cellulose metabolism that are related to those in bacteria and fungi.

1,582 citations

Journal Article•DOI•

Whole-Genome Shotgun Assembly and Analysis of the Genome of Fugu rubripes

Samuel Aparicio¹, Jarrod Chapman¹, Elia Stupka¹, Nik Putnam¹, Jer Ming Chia¹, Paramvir S. Dehal¹, Alan Christoffels¹, Sam Rash¹, Shawn Hoon¹, Arian F.A. Smit¹, Maarten D. Sollewijn Gelpke¹, Jared C. Roach¹, Tania Oh¹, Isaac Ho¹, Marie Wong¹, Chris Detter¹, Frans Verhoef¹, Paul Predki¹, Alice Tay¹, Susan Lucas¹, Paul G. Richardson¹, Sarah Smith¹, Melody S. Clark¹, Yvonne J. K. Edwards¹, Norman A. Doggett¹, Andrey Zharkikh¹, Sean V. Tavtigian¹, Dmitry Pruss¹, Mary Barnstead¹, Cheryl Evans¹, Holly Baden¹, Justin Powell¹, Gustavo Glusman¹, Lee Rowen¹, Leroy Hood¹, Y. H. Tan¹, Greg Elgar¹, Trevor Hawkins¹, Byrappa Venkatesh¹, Daniel S. Rokhsar¹, Sydney Brenner¹

Agency for Science, Technology and Research¹

23 Aug 2002-Science

TL;DR: The Fugu rubripes genome has been sequenced to over 95% coverage, and more than 80% of the assembly is in multigene-sized scaffolds.

Abstract: The compact genome of Fugu rubripes has been sequenced to over 95% coverage, and more than 80% of the assembly is in multigene-sized scaffolds. In this 365-megabase vertebrate genome, repetitive DNA accounts for less than one-sixth of the sequence, and gene loci occupy about one-third of the genome. As with the human genome, gene loci are not evenly distributed, but are clustered into sparse and dense regions. Some “giant” genes were observed that had average coding sequence sizes but were spread over genomic lengths significantly larger than those of their human orthologs. Although three-quarters of predicted human proteins have a strong match to Fugu, approximately a quarter of the human proteins had highly diverged from or had no pufferfish homologs, highlighting the extent of protein evolution in the 450 million years since teleosts and mammals diverged. Conserved linkages between Fugu and human genes indicate the preservation of chromosomal segments from the common vertebrate ancestor, but with considerable scrambling of gene order.

1,446 citations

Journal Article•DOI•

Genome sequence of the lignocellulose degrading fungus Phanerochaete chrysosporium strain RP78

Diego Martinez¹, Luis F. Larrondo², Nik Putnam¹,³, Maarten D. Sollewijn Gelpke¹, Katherine H. Huang¹, Jarrod Chapman¹,³, Kevin G. Helfenbein⁴, Preethi Ramaiya⁵, J. Chris Detter¹, Frank W. Larimer⁶, Pedro M. Coutinho⁷, Bernard Henrissat⁷, Randy M. Berka⁵, Dan Cullen⁸, Daniel S. Rokhsar¹

Joint Genome Institute¹, Pontifical Catholic University of Chile², University of California, Berkeley³, American Museum of Natural History⁴, Novozymes⁵, Oak Ridge National Laboratory⁶, University of Provence⁷, United States Forest Service⁸

01 Jun 2004-Nature Biotechnology

TL;DR: The sequenced genome of Phanerochaete chrysosporium strain RP78 reveals an impressive array of genes encoding secreted oxidases, peroxidases and hydrolytic enzymes that cooperate in wood decay, and provides a framework for further development of bioprocesses for biomass utilization, organopollutant degradation and fiber bleaching.

Abstract: White rot fungi efficiently degrade lignin, a complex aromatic polymer in wood that is among the most abundant natural materials on earth. These fungi use extracellular oxidative enzymes that are also able to transform related aromatic compounds found in explosive contaminants, pesticides and toxic waste. We have sequenced the 30-million base-pair genome of Phanerochaete chrysosporium strain RP78 using a whole genome shotgun approach. The P. chrysosporium genome reveals an impressive array of genes encoding secreted oxidases, peroxidases and hydrolytic enzymes that cooperate in wood decay. Analysis of the genome data will enhance our understanding of lignocellulose degradation, a pivotal process in the global carbon cycle, and provide a framework for further development of bioprocesses for biomass utilization, organopollutant degradation and fiber bleaching. This genome provides a high quality draft sequence of a basidiomycete, a major fungal phylum that includes important plant and animal pathogens.

883 citations

Cited by

Journal Article•DOI•

Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project

Ewan Birney, John A. Stamatoyannopoulos¹, Anindya Dutta², Roderic Guigó³ +317 more•Institutions (44)

14 Jun 2007-Nature

TL;DR: Functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project are reported, providing convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts.

Abstract: We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.

5,091 citations

Introducing the “Bioinformatics” Special Issue

장병탁, 김삼묘, 허철구

01 Aug 2000

TL;DR: Assessment of medical technology in the context of commercialization with Bioentrepreneur course, which addresses many issues unique to biomedical products.

Abstract: BIOE 402. Medical Technology Assessment. 2 or 3 hours. Bioentrepreneur course. Assessment of medical technology in the context of commercialization. Objectives, competition, market share, funding, pricing, manufacturing, growth, and intellectual property; many issues unique to biomedical products. Course Information: 2 undergraduate hours. 3 graduate hours. Prerequisite(s): Junior standing or above and consent of the instructor.

4,833 citations

Journal Article•DOI•

The COG database: an updated version includes eukaryotes

Roman L. Tatusov¹, Natalie D. Fedorova¹, John D. Jackson¹, Aviva R. Jacobs¹, Boris Kiryutin¹, Eugene V. Koonin¹, Dmitri M. Krylov¹, Raja Mazumder², Sergei L. Mekhedov¹, Anastasia N. Nikolskaya², B Sridhar Rao¹, Sergei Smirnov¹, Alexander V. Sverdlov¹, Sona Vasudevan¹, Yuri I. Wolf¹, Jodie J. Yin¹, Darren A. Natale²

National Institutes of Health¹, Georgetown University Medical Center²

11 Sep 2003-BMC Bioinformatics

TL;DR: A major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes is described and is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies.

Abstract: The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or ~54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of ~20% of the KOG set. This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (~1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes. The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies.

4,167 citations

Journal Article•DOI•

Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution

LaDeana W. Hillier¹, Webb Miller², Ewan Birney, Wesley C. Warren¹ +171 more•Institutions (39)

09 Dec 2004-Nature

TL;DR: A draft genome sequence of the red jungle fowl, Gallus gallus, provides a new perspective on vertebrate genome evolution, while also improving the annotation of mammalian genomes.

Abstract: We present here a draft genome sequence of the red jungle fowl, Gallus gallus. Because the chicken is a modern descendant of the dinosaurs and the first non-mammalian amniote to have its genome sequenced, the draft sequence of its genome--composed of approximately one billion base pairs of sequence and an estimated 20,000-23,000 genes--provides a new perspective on vertebrate genome evolution, while also improving the annotation of mammalian genomes. For example, the evolutionary distance between chicken and human provides high specificity in detecting functional elements, both non-coding and coding. Notably, many conserved non-coding sequences are far from genes and cannot be assigned to defined functional classes. In coding regions the evolutionary dynamics of protein domains and orthologous groups illustrate processes that distinguish the lineages leading to birds and mammals. The distinctive properties of avian microchromosomes, together with the inferred patterns of conserved synteny, provide additional insights into vertebrate chromosome architecture.

2,579 citations

Journal Article•DOI•

MetaSPAdes: A new versatile metagenomic assembler

Sergey Nurk¹, Dmitry Meleshko¹, Anton Korobeynikov¹, Pavel A. Pevzner¹,²

Saint Petersburg State University¹, University of California, San Diego²

01 May 2017-Genome Research

TL;DR: metaSPAdes addresses various challenges of metagenomic assembly by capitalizing on computational ideas that proved to be useful in assemblies of single cells and highly polymorphic diploid genomes.

Abstract: While metagenomics has emerged as a technology of choice for analyzing bacterial populations, the assembly of metagenomic data remains challenging, thus stifling biological discoveries. Moreover, recent studies revealed that complex bacterial populations may be composed from dozens of related strains, thus further amplifying the challenge of metagenomic assembly. metaSPAdes addresses various challenges of metagenomic assembly by capitalizing on computational ideas that proved to be useful in assemblies of single cells and highly polymorphic diploid genomes. We benchmark metaSPAdes against other state-of-the-art metagenome assemblers and demonstrate that it results in high-quality assemblies across diverse data sets.

2,295 citations