Home
/
Authors
/
Stefan Engelen

Author

Stefan Engelen

French Alternative Energies and Atomic Energy Commission

Other affiliations: Commissariat à l'énergie atomique et aux énergies alternatives, University of Évry Val d'Essonne, Pierre-and-Marie-Curie University ...read more

Bio: Stefan Engelen is an academic researcher from French Alternative Energies and Atomic Energy Commission. The author has contributed to research in topics: Nanopore sequencing & Genome. The author has an hindex of 19, co-authored 40 publications receiving 5061 citations. Previous affiliations of Stefan Engelen include Commissariat à l'énergie atomique et aux énergies alternatives & University of Évry Val d'Essonne.

Topics: Nanopore sequencing, Genome, Population, Domestication, Sequence assembly ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Structure and function of the global ocean microbiome

[...]

Shinichi Sunagawa, Luis Pedro Coelho, Samuel Chaffron¹, Jens Roat Kultima, Karine Labadie, Guillem Salazar², Bardya Djahanschiri, Georg Zeller, Daniel R. Mende, Adriana Alberti, Francisco M. Cornejo-Castillo², Paul I. Costea, Corinne Cruaud, Francesco d'Ovidio³, Stefan Engelen, Isabel Ferrera², Josep M. Gasol², Lionel Guidi, Falk Hildebrand, Florian Kokoszka⁴, Cyrille Lepoivre⁵, Gipsi Lima-Mendez¹, Julie Poulain, Bonnie T. Poulos⁶, Marta Royo-Llonch², Hugo Sarmento⁷, Sara Vieira-Silva¹, Céline Dimier⁴, Marc Picheral, Sarah Searson³, Stefanie Kandels-Lewis, Tara Oceans Coordinators⁴, Chris Bowler⁴, Colomban de Vargas, Gabriel Gorsky, Nigel Grimsley, Pascal Hingamp⁵, Daniele Iudicone⁸, Olivier Jaillon, Fabrice Not, Hiroyuki Ogata⁹, Stephane Pesant, Sabrina Speich⁴, Lars Stemmann, Matthew B. Sullivan⁶, Jean Weissenbach, Patrick Wincker, Eric Karsenti⁴, Jeroen Raes¹⁰, Silvia G. Acinas², Peer Bork - Show less +47 more•Institutions (10)

Vrije Universiteit Brussel¹, Spanish National Research Council², University of Paris³, École Normale Supérieure⁴, Aix-Marseille University⁵, University of Arizona⁶, Federal University of São Carlos⁷, Stazione Zoologica Anton Dohrn⁸, Kyoto University⁹, Université catholique de Louvain¹⁰

22 May 2015-Science

TL;DR: This work identifies ocean microbial core functionality and reveals that >73% of its abundance is shared with the human gut microbiome despite the physicochemical differences between these two ecosystems.

...read moreread less

Abstract: Microbes are dominant drivers of biogeochemical processes, yet drawing a global picture of functional diversity, microbial community structure, and their ecological determinants remains a grand challenge. We analyzed 7.2 terabases of metagenomic data from 243 Tara Oceans samples from 68 locations in epipelagic and mesopelagic waters across the globe to generate an ocean microbial reference gene catalog with >40 million nonredundant, mostly novel sequences from viruses, prokaryotes, and picoeukaryotes. Using 139 prokaryote-enriched samples, containing >35,000 species, we show vertical stratification with epipelagic community composition mostly driven by temperature rather than other environmental factors or geography. We identify ocean microbial core functionality and reveal that >73% of its abundance is shared with the human gut microbiome despite the physicochemical differences between these two ecosystems.

...read moreread less

1,934 citations

Journal Article•DOI•

Eukaryotic plankton diversity in the sunlit ocean

[...]

Colomban de Vargas¹, Colomban de Vargas², Stéphane Audic², Stéphane Audic¹, Nicolas Henry¹, Nicolas Henry², Johan Decelle¹, Johan Decelle², Frédéric Mahé³, Frédéric Mahé¹, Frédéric Mahé², Ramiro Logares⁴, Enrique Lara, Cédric Berney², Cédric Berney¹, Noan Le Bescot², Noan Le Bescot¹, Ian Probert¹, Ian Probert², Margaux Carmichael⁵, Margaux Carmichael², Margaux Carmichael¹, Julie Poulain⁶, Sarah Romac¹, Sarah Romac², Sébastien Colin⁵, Sébastien Colin¹, Sébastien Colin², Jean-Marc Aury⁶, Lucie Bittner, Samuel Chaffron⁷, Samuel Chaffron⁸, Micah Dunthorn³, Stefan Engelen⁶, Olga Flegontova⁹, Olga Flegontova¹⁰, Lionel Guidi², Lionel Guidi¹, Aleš Horák¹⁰, Aleš Horák⁹, Olivier Jaillon⁶, Olivier Jaillon¹¹, Olivier Jaillon¹, Gipsi Lima-Mendez⁷, Gipsi Lima-Mendez⁸, Julius Lukeš¹⁰, Julius Lukeš⁹, Julius Lukeš¹², Shruti Malviya⁵, Raphael Morard¹, Raphael Morard², Raphael Morard¹³, Matthieu Mulot, Eleonora Scalco¹⁴, Raffaele Siano¹⁵, Flora Vincent⁸, Flora Vincent⁵, Adriana Zingone¹⁴, Céline Dimier², Céline Dimier¹, Céline Dimier⁵, Marc Picheral¹, Marc Picheral², Sarah Searson², Sarah Searson¹, Stefanie Kandels-Lewis¹⁶, Tara Oceans Coordinators¹⁷, Silvia G. Acinas⁴, Peer Bork¹⁶, Peer Bork¹⁸, Chris Bowler⁵, Gabriel Gorsky², Gabriel Gorsky¹, Nigel Grimsley¹, Nigel Grimsley¹⁹, Pascal Hingamp²⁰, Daniele Iudicone¹⁴, Fabrice Not¹, Fabrice Not², Hiroyuki Ogata¹⁷, Stephane Pesant¹³, Jeroen Raes⁷, Jeroen Raes⁸, Michael E. Sieracki²¹, Michael E. Sieracki²², Sabrina Speich²³, Sabrina Speich⁵, Lars Stemmann¹, Lars Stemmann², Shinichi Sunagawa¹⁶, Jean Weissenbach¹¹, Jean Weissenbach¹, Jean Weissenbach⁶, Patrick Wincker¹¹, Patrick Wincker¹, Patrick Wincker⁶, Eric Karsenti⁵, Eric Karsenti¹⁶ - Show less +94 more•Institutions (23)

Centre national de la recherche scientifique¹, Pierre-and-Marie-Curie University², Kaiserslautern University of Technology³, Spanish National Research Council⁴, École Normale Supérieure⁵, Commissariat à l'énergie atomique et aux énergies alternatives⁶, Vrije Universiteit Brussel⁷, Katholieke Universiteit Leuven⁸, Sewanee: The University of the South⁹, Academy of Sciences of the Czech Republic¹⁰, University of Évry Val d'Essonne¹¹, Canadian Institute for Advanced Research¹², University of Bremen¹³, Stazione Zoologica Anton Dohrn¹⁴, IFREMER¹⁵, European Bioinformatics Institute¹⁶, Kyoto University¹⁷, Max Delbrück Center for Molecular Medicine¹⁸, University of Paris¹⁹, Aix-Marseille University²⁰, National Science Foundation²¹, Bigelow Laboratory For Ocean Sciences²², University of Western Brittany²³

22 May 2015-Science

TL;DR: Diversity emerged at all taxonomic levels, both within the groups comprising the ~11,200 cataloged morphospecies of eukaryotic plankton and among twice as many other deep-branching lineages of unappreciated importance in plankton ecology studies.

...read moreread less

Abstract: Marine plankton support global biological and geochemical processes. Surveys of their biodiversity have hitherto been geographically restricted and have not accounted for the full range of plankton size. We assessed eukaryotic diversity from 334 size-fractionated photic-zone plankton communities collected across tropical and temperate oceans during the circumglobal Tara Oceans expedition. We analyzed 18S ribosomal DNA sequences across the intermediate plankton-size spectrum from the smallest unicellular eukaryotes (protists, >0.8 micrometers) to small animals of a few millimeters. Eukaryotic ribosomal diversity saturated at ~150,000 operational taxonomic units, about one-third of which could not be assigned to known eukaryotic groups. Diversity emerged at all taxonomic levels, both within the groups comprising the ~11,200 cataloged morphospecies of eukaryotic plankton and among twice as many other deep-branching lineages of unappreciated importance in plankton ecology studies. Most eukaryotic plankton biodiversity belonged to heterotrophic protistan groups, particularly those known to be parasites or symbiotic hosts.

...read moreread less

1,378 citations

Journal Article•DOI•

Genome evolution across 1,011 Saccharomyces cerevisiae isolates

[...]

Jackson Peter¹, Matteo De Chiara², Anne Friedrich¹, Jia-Xing Yue², David Pflieger¹, Anders Bergström², Anastasie Sigwalt¹, Benjamin Barré², Kelle C. Freel¹, Agnès Llored², Corinne Cruaud³, Karine Labadie³, Jean-Marc Aury³, Benjamin Istace³, Kevin Lebrigand⁴, Pascal Barbry⁴, Stefan Engelen³, Arnaud Lemainque³, Patrick Wincker³, Patrick Wincker⁵, Gianni Liti², Joseph Schacherer¹ - Show less +18 more•Institutions (5)

University of Strasbourg¹, French Institute of Health and Medical Research², French Alternative Energies and Atomic Energy Commission³, Centre national de la recherche scientifique⁴, University of Évry Val d'Essonne⁵

11 Apr 2018-Nature

TL;DR: Whole-genome sequencing and phenotyping of 1,011 natural isolates of the yeast Saccharomyces cerevisiae reveal its evolutionary history, including a single out-of-China origin and multiple domestication events, and provides a framework for genotype–phenotype studies in this model organism.

...read moreread less

Abstract: Large-scale population genomic surveys are essential to explore the phenotypic diversity of natural populations. Here we report the whole-genome sequencing and phenotyping of 1,011 Saccharomyces cerevisiae isolates, which together provide an accurate evolutionary picture of the genomic variants that shape the species-wide phenotypic landscape of this yeast. Genomic analyses support a single ‘out-of-China’ origin for this species, followed by several independent domestication events. Although domesticated isolates exhibit high variation in ploidy, aneuploidy and genome content, genome evolution in wild isolates is mainly driven by the accumulation of single nucleotide polymorphisms. A common feature is the extensive loss of heterozygosity, which represents an essential source of inter-individual variation in this mainly asexual species. Most of the single nucleotide polymorphisms, including experimentally identified functional polymorphisms, are present at very low frequencies. The largest numbers of variants identified by genome-wide association are copy-number changes, which have a greater phenotypic effect than do single nucleotide polymorphisms. This resource will guide future population genomics and genotype–phenotype studies in this classic model system. Whole-genome sequencing of 1,011 natural isolates of the yeast Saccharomyces cerevisiae reveals its evolutionary history, including a single out-of-China origin and multiple domestication events, and provides a framework for genotype–phenotype studies in this model organism.

...read moreread less

727 citations

Journal Article•DOI•

MicroScope—an integrated microbial resource for the curation and comparative analysis of genomic and metabolic data

[...]

David Vallenet¹, Eugeni Belda¹, Alexandra Calteau¹, Stéphane Cruveiller¹, Stefan Engelen¹, Aurélie Lajus¹, François Le Fèvre¹, Cyrille Longin¹, Damien Mornico¹, David Roche¹, Zoé Rouy¹, Gregory Salvignol¹, Claude Scarpelli¹, Adam Alexander T. Smith¹, Marion Weiman¹, Claudine Médigue¹ - Show less +12 more•Institutions (1)

University of Évry Val d'Essonne¹

01 Jan 2013-Nucleic Acids Research

TL;DR: Improved data browsing and searching tools have been added, original tools useful in the context of expert annotation have been developed and integrated and the website has been significantly redesigned to be more user-friendly.

...read moreread less

Abstract: MicroScope is an integrated platform dedicated to both the methodical updating of microbial genome annotation and to comparative analysis. The resource provides data from completed and ongoing genome projects (automatic and expert annotations), together with data sources from post-genomic experiments (i.e. transcriptomics, mutant collections) allowing users to perfect and improve the understanding of gene functions. MicroScope (http://www.genoscope.cns.fr/agc/microscope) combines tools and graphical interfaces to analyse genomes and to perform the manual curation of gene annotations in a comparative context. Since its first publication in January 2006, the system (previously named MaGe for Magnifying Genomes) has been continuously extended both in terms of data content and analysis tools. The last update of MicroScope was published in 2009 in the Database journal. Today, the resource contains data for >1600 microbial genomes, of which ∼300 are manually curated and maintained by biologists (1200 personal accounts today). Expert annotations are continuously gathered in the MicroScope database (∼50 000 a year), contributing to the improvement of the quality of microbial genomes annotations. Improved data browsing and searching tools have been added, original tools useful in the context of expert annotation have been developed and integrated and the website has been significantly redesigned to be more user-friendly. Furthermore, in the context of the European project Microme (Framework Program 7 Collaborative Project), MicroScope is becoming a resource providing for the curation and analysis of both genomic and metabolic data. An increasing number of projects are related to the study of environmental bacterial (meta)genomes that are able to metabolize a large variety of chemical compounds that may be of high industrial interest.

...read moreread less

376 citations

Journal Article•DOI•

MicroScope: a platform for microbial genome annotation and comparative genomics

[...]

David Vallenet¹, Stefan Engelen¹, Damien Mornico¹, Stéphane Cruveiller¹, Ludovic Fleury¹, Aurélie Lajus¹, Zoé Rouy¹, David Roche¹, Gregory Salvignol¹, Claude Scarpelli¹, Claudine Médigue¹ - Show less +7 more•Institutions (1)

Centre national de la recherche scientifique¹

01 Jan 2009-Database

TL;DR: This article emphasizes the essential role of expert annotation as a complement of automatic annotation in microbial genome annotation, especially for genomes initially analyzed by automatic procedures alone.

...read moreread less

Abstract: The initial outcome of genome sequencing is the creation of long text strings written in a four letter alphabet. The role of in silico sequence analysis is to assist biologists in the act of associating biological knowledge with these sequences, allowing investigators to make inferences and predictions that can be tested experimentally. A wide variety of software is available to the scientific community, and can be used to identify genomic objects, before predicting their biological functions. However, only a limited number of biologically interesting features can be revealed from an isolated sequence. Comparative genomics tools, on the other hand, by bringing together the information contained in numerous genomes simultaneously, allow annotators to make inferences based on the idea that evolution and natural selection are central to the definition of all biological processes. We have developed the MicroScope platform in order to offer a web-based framework for the systematic and efficient revision of microbial genome annotation and comparative analysis (http:// www.genoscope.cns.fr/agc/microscope). Starting with the description of the flow chart of the annotation processes implemented in the MicroScope pipeline, and the development of traditional and novel microbial annotation and comparative analysis tools, this article emphasizes the essential role of expert annotation as a complement of automatic annotation. Several examples illustrate the use of implemented tools for the review and curation of annotations of both new and publicly available microbial genomes within MicroScope’s rich integrated genome framework. The platform is used as a viewer in order to browse updated annotation information of available microbial genomes (more than 440 organisms to date), and in the context of new annotation projects (117 bacterial genomes). The human expertise gathered in the MicroScope database (about 280,000 independent annotations) contributes to improve the quality of microbial genome annotation, especially for genomes initially analyzed by automatic procedures alone. Database URLs: http://www.genoscope.cns.fr/agc/mage and http://www.genoscope.cns.fr/agc/microcyc

...read moreread less

284 citations

1
2
3
4
…
5
6
7
8
9

Collapse

Cited by

PDF

Open Access

More filters

SPAdes, a new genome assembly algorithm and its applications to single-cell sequencing ( 7th Annual SFAF Meeting, 2012)

[...]

Glenn Tesler

01 Jun 2012

TL;DR: SPAdes as mentioned in this paper is a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler and on popular assemblers Velvet and SoapDeNovo (for multicell data).

...read moreread less

Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.

...read moreread less

10,124 citations

Journal Article•DOI•

Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.

[...]

Sergey Koren¹, Brian P. Walenz¹, Konstantin Berlin², Jason R. Miller³, Nicholas H. Bergman, Adam M. Phillippy¹ - Show less +2 more•Institutions (3)

National Institutes of Health¹, Invincea², J. Craig Venter Institute³

15 Mar 2017-Genome Research

TL;DR: Canu, a successor of Celera Assembler that is specifically designed for noisy single-molecule sequences, is presented, demonstrating that Canu can reliably assemble complete microbial genomes and near-complete eukaryotic chromosomes using either Pacific Biosciences or Oxford Nanopore technologies.

...read moreread less

Abstract: Long-read single-molecule sequencing has revolutionized de novo genome assembly and enabled the automated reconstruction of reference-quality genomes. However, given the relatively high error rates of such technologies, efficient and accurate assembly of large repeats and closely related haplotypes remains challenging. We address these issues with Canu, a successor of Celera Assembler that is specifically designed for noisy single-molecule sequences. Canu introduces support for nanopore sequencing, halves depth-of-coverage requirements, and improves assembly continuity while simultaneously reducing runtime by an order of magnitude on large genomes versus Celera Assembler 8.2. These advances result from new overlapping and assembly algorithms, including an adaptive overlapping strategy based on tf-idf weighted MinHash and a sparse assembly graph construction that avoids collapsing diverged repeats and haplotypes. We demonstrate that Canu can reliably assemble complete microbial genomes and near-complete eukaryotic chromosomes using either Pacific Biosciences (PacBio) or Oxford Nanopore technologies and achieves a contig NG50 of >21 Mbp on both human and Drosophila melanogaster PacBio data sets. For assembly structures that cannot be linearly represented, Canu provides graph-based assembly outputs in graphical fragment assembly (GFA) format for analysis or integration with complementary phasing and scaffolding techniques. The combination of such highly resolved assembly graphs with long-range scaffolding information promises the complete and automated assembly of complex genomes.

...read moreread less

4,806 citations

Journal Article•DOI•

Evolution of Protein Molecules

[...]

S. Jeffery

01 Apr 1979-Biochemical Society Transactions

3,734 citations

Journal Article•DOI•

Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads.

[...]

Ryan R. Wick¹, Louise M. Judd¹, Claire L. Gorrie¹, Kathryn E. Holt¹•Institutions (1)

University of Melbourne¹

08 Jun 2017-PLOS Computational Biology

TL;DR: Tests on both synthetic and real reads show Unicycler can assemble larger contigs with fewer misassemblies than other hybrid assemblers, even when long-read depth and accuracy are low.

...read moreread less

Abstract: The Illumina DNA sequencing platform generates accurate but short reads, which can be used to produce accurate but fragmented genome assemblies. Pacific Biosciences and Oxford Nanopore Technologies DNA sequencing platforms generate long reads that can produce complete genome assemblies, but the sequencing is more expensive and error-prone. There is significant interest in combining data from these complementary sequencing technologies to generate more accurate "hybrid" assemblies. However, few tools exist that truly leverage the benefits of both types of data, namely the accuracy of short reads and the structural resolving power of long reads. Here we present Unicycler, a new tool for assembling bacterial genomes from a combination of short and long reads, which produces assemblies that are accurate, complete and cost-effective. Unicycler builds an initial assembly graph from short reads using the de novo assembler SPAdes and then simplifies the graph using information from short and long reads. Unicycler uses a novel semi-global aligner to align long reads to the assembly graph. Tests on both synthetic and real reads show Unicycler can assemble larger contigs with fewer misassemblies than other hybrid assemblers, even when long-read depth and accuracy are low. Unicycler is open source (GPLv3) and available at github.com/rrwick/Unicycler.

...read moreread less

2,245 citations

Integrative Genomics Viewer

[...]

James T. Robinson¹, Helga Thorvaldsdottir¹, Wendy Winckler¹, Mitchell Guttman¹, Eric S. Lander¹, Eric S. Lander², Gad Getz¹, Jill P. Mesirov¹ - Show less +4 more•Institutions (2)

Massachusetts Institute of Technology¹, Harvard University²

01 Jan 2011

TL;DR: The sheer volume and scope of data posed by this flood of data pose a significant challenge to the development of efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data.

...read moreread less

Abstract: Rapid improvements in sequencing and array-based platforms are resulting in a flood of diverse genome-wide data, including data from exome and whole-genome sequencing, epigenetic surveys, expression profiling of coding and noncoding RNAs, single nucleotide polymorphism (SNP) and copy number profiling, and functional assays. Analysis of these large, diverse data sets holds the promise of a more comprehensive understanding of the genome and its relation to human disease. Experienced and knowledgeable human review is an essential component of this process, complementing computational approaches. This calls for efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data. However, the sheer volume and scope of data pose a significant challenge to the development of such tools.

...read moreread less

2,187 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse