Home
/
Authors
/
Christophe Dessimoz

Author

Christophe Dessimoz

Other affiliations: ETH Zurich, University College London, École Polytechnique Fédérale de Lausanne ...read more

Bio: Christophe Dessimoz is an academic researcher from University of Lausanne. The author has contributed to research in topics: Phylogenetic tree & Genome. The author has an hindex of 49, co-authored 138 publications receiving 9296 citations. Previous affiliations of Christophe Dessimoz include ETH Zurich & University College London.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Towards practical, high-capacity, low-maintenance information storage in synthesized DNA

[...]

Nick Goldman¹, Paul Bertone¹, Siyuan Chen², Christophe Dessimoz¹, Emily M LeProust², Botond Sipos¹, Ewan Birney¹ - Show less +3 more•Institutions (2)

European Bioinformatics Institute¹, Agilent Technologies²

07 Feb 2013-Nature

TL;DR: Theoretical analysis indicates that the DNA-based storage scheme could be scaled far beyond current global information volumes and offers a realistic technology for large-scale, long-term and infrequently accessed digital archiving.

...read moreread less

Abstract: Digital production, transmission and storage have revolutionized how we access and use information but have also made archiving an increasingly complex task that requires active, continuing maintenance of digital media. This challenge has focused some interest on DNA as an attractive target for information storage because of its capacity for high-density information encoding, longevity under easily achieved conditions and proven track record as an information bearer. Previous DNA-based information storage approaches have encoded only trivial amounts of information or were not amenable to scaling-up, and used no robust error-correction and lacked examination of their cost-efficiency for large-scale information archival. Here we describe a scalable method that can reliably store more information than has been handled before. We encoded computer files totalling 739 kilobytes of hard-disk storage and with an estimated Shannon information of 5.2 × 10(6) bits into a DNA code, synthesized this DNA, sequenced it and reconstructed the original files with 100% accuracy. Theoretical analysis indicates that our DNA-based storage scheme could be scaled far beyond current global information volumes and offers a realistic technology for large-scale, long-term and infrequently accessed digital archiving. In fact, current trends in technological advances are reducing DNA synthesis costs at a pace that should make our scheme cost-effective for sub-50-year archiving within a decade.

...read moreread less

900 citations

Journal Article•DOI•

Allele-Specific HLA Loss and Immune Escape in Lung Cancer Evolution

[...]

Nicholas McGranahan¹, Rachel Rosenthal¹, Crispin T. Hiley¹, Crispin T. Hiley² +216 more•Institutions (4)

30 Nov 2017-Cell

TL;DR: It is found that HLA LOH occurs in 40% of non-small-cell lung cancers (NSCLCs) and is associated with a high subclonal neoantigen burden, APOBEC-mediated mutagenesis, upregulation of cytolytic activity, and PD-L1 positivity.

...read moreread less

850 citations

Journal Article•DOI•

Survey of Branch Support Methods Demonstrates Accuracy, Power, and Robustness of Fast Likelihood-based Approximation Schemes

[...]

Maria Anisimova¹, Manuel Gil¹, Manuel Gil², Jean-François Dufayard, Christophe Dessimoz¹, Christophe Dessimoz², Olivier Gascuel - Show less +3 more•Institutions (2)

École Polytechnique Fédérale de Lausanne¹, Swiss Institute of Bioinformatics²

01 Oct 2011-Systematic Biology

TL;DR: This work compares the performance of the three fast likelihood-based methods with the standard bootstrap (SBS), the Bayesian approach, and the recently introduced rapid bootstrap, and proposes an additional method: a Bayesian-like transformation of aLRT (aBayes).

...read moreread less

Abstract: Phylogenetic inference and evaluating support for inferred relationships is at the core of many studies testing evolutionary hypotheses. Despite the popularity of nonparametric bootstrap frequencies and Bayesian posterior probabil- ities, the interpretation of these measures of tree branch support remains a source of discussion. Furthermore, both meth- ods are computationally expensive and become prohibitive for large data sets. Recent fast approximate likelihood-based measures of branch supports (approximate likelihood ratio test (aLRT) and Shimodaira-Hasegawa (SH)-aLRT) provide a compelling alternative to these slower conventional methods, offering not only speed advantages but also excellent levels of accuracy and power. Here we propose an additional method: a Bayesian-like transformation of aLRT (aBayes). Consider- ing both probabilistic and frequentist frameworks, we compare the performance of the three fast likelihood-based methods with the standard bootstrap (SBS), the Bayesian approach, and the recently introduced rapid bootstrap. Our simulations and real data analyses show that with moderate model violations, all tests are sufficiently accurate, but aLRT and aBayes offer the highest statistical power and are very fast. With severe model violations aLRT, aBayes and Bayesian posteriors can produce elevated false-positive rates. With data sets for which such violation can be detected, we recommend using SH-aLRT, the nonparametric version of aLRT based on a procedure similar to the Shimodaira-Hasegawa tree selection. In general, the SBS seems to be excessively conservative and is much slower than our approximate likelihood-based methods. (Accuracy; aLRT; branch support methods; evolution; model violation; phylogenetic inference; power; SH-aLRT.)

...read moreread less

799 citations

Journal Article•DOI•

GOATOOLS: A Python library for Gene Ontology analyses.

[...]

D. V. Klopfenstein¹, Liangsheng Zhang², Brent S. Pedersen³, Fidel Ramírez⁴, Alex Warwick Vesztrocy⁵, Aurélien Naldi⁶, Christopher J. Mungall⁷, Jeffrey M. Yunes⁸, Olga Botvinnik⁹, Mark Weigel, Will Dampier¹, Christophe Dessimoz⁵, Patrick Flick¹⁰, Haibao Tang² - Show less +10 more•Institutions (10)

Drexel University¹, Fujian Agriculture and Forestry University², University of Utah³, Max Planck Society⁴, University College London⁵, University of Lausanne⁶, Lawrence Berkeley National Laboratory⁷, University of California, San Francisco⁸, University of California, San Diego⁹, Georgia Institute of Technology¹⁰

18 Jul 2018-Scientific Reports

TL;DR: GOATOOLS, a Python-based library, makes it more efficient to stay current with the latest ontologies and annotations, perform gene ontology enrichment analyses to determine over- and under-represented terms, and organize results for greater clarity and easier interpretation using a novel GOATOOLs GO grouping method.

...read moreread less

Abstract: The biological interpretation of gene lists with interesting shared properties, such as up- or down-regulation in a particular experiment, is typically accomplished using gene ontology enrichment analysis tools. Given a list of genes, a gene ontology (GO) enrichment analysis may return hundreds of statistically significant GO results in a "flat" list, which can be challenging to summarize. It can also be difficult to keep pace with rapidly expanding biological knowledge, which often results in daily changes to any of the over 47,000 gene ontologies that describe biological knowledge. GOATOOLS, a Python-based library, makes it more efficient to stay current with the latest ontologies and annotations, perform gene ontology enrichment analyses to determine over- and under-represented terms, and organize results for greater clarity and easier interpretation using a novel GOATOOLS GO grouping method. We performed functional analyses on both stochastic simulation data and real data from a published RNA-seq study to compare the enrichment results from GOATOOLS to two other popular tools: DAVID and GOstats. GOATOOLS is freely available through GitHub: https://github.com/tanghaibao/goatools .

...read moreread less

603 citations

Journal Article•DOI•

Approximate Bayesian computation

[...]

Mikael Sunnåker¹, Alberto Giovanni Busetto¹, Elina Numminen², Jukka Corander², Matthieu Foll³, Christophe Dessimoz¹, Christophe Dessimoz⁴, Christophe Dessimoz⁵ - Show less +4 more•Institutions (5)

ETH Zurich¹, University of Helsinki², University of Bern³, European Bioinformatics Institute⁴, Swiss Institute of Bioinformatics⁵

10 Jan 2013-PLOS Computational Biology

TL;DR: Approximate Bayesian computation (ABC) constitutes a class of computational methods rooted in Bayesian statistics that widen the realm of models for which statistical inference can be considered and exacerbates the challenges of parameter estimation and model selection.

...read moreread less

Abstract: Approximate Bayesian computation (ABC) constitutes a class of computational methods rooted in Bayesian statistics. In all model-based statistical inference, the likelihood function is of central importance, since it expresses the probability of the observed data under a particular statistical model, and thus quantifies the support data lend to particular values of parameters and to choices among different models. For simple models, an analytical formula for the likelihood function can typically be derived. However, for more complex models, an analytical formula might be elusive or the likelihood function might be computationally very costly to evaluate. ABC methods bypass the evaluation of the likelihood function. In this way, ABC methods widen the realm of models for which statistical inference can be considered. ABC methods are mathematically well-founded, but they inevitably make assumptions and approximations whose impact needs to be carefully assessed. Furthermore, the wider application domain of ABC exacerbates the challenges of parameter estimation and model selection. ABC has rapidly gained popularity over the last years and in particular for the analysis of complex problems arising in biological sciences (e.g., in population genetics, ecology, epidemiology, and systems biology).

...read moreread less

531 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability

[...]

Kazutaka Katoh¹, Daron M. Standley¹•Institutions (1)

Osaka University¹

01 Apr 2013-Molecular Biology and Evolution

TL;DR: This version of MAFFT has several new features, including options for adding unaligned sequences into an existing alignment, adjustment of direction in nucleotide alignment, constrained alignment and parallel processing, which were implemented after the previous major update.

...read moreread less

Abstract: We report a major update of the MAFFT multiple sequence alignment program. This version has several new features, including options for adding unaligned sequences into an existing alignment, adjustment of direction in nucleotide alignment, constrained alignment and parallel processing, which were implemented after the previous major update. This report shows actual examples to explain how these features work, alone and in combination. Some examples incorrectly aligned by MAFFT are also shown to clarify its limitations. We discuss how to avoid misalignments, and our ongoing efforts to overcome such limitations.

...read moreread less

27,771 citations

SPAdes, a new genome assembly algorithm and its applications to single-cell sequencing ( 7th Annual SFAF Meeting, 2012)

[...]

Glenn Tesler

01 Jun 2012

TL;DR: SPAdes as mentioned in this paper is a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler and on popular assemblers Velvet and SoapDeNovo (for multicell data).

...read moreread less

Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.

...read moreread less

10,124 citations

Journal Article•DOI•

Prodigal: prokaryotic gene recognition and translation initiation site identification

[...]

Doug Hyatt¹, Doug Hyatt², Gwo Liang Chen², Philip F. LoCascio², Miriam Land², Frank W. Larimer¹, Frank W. Larimer², Loren Hauser² - Show less +4 more•Institutions (2)

University of Tennessee¹, Oak Ridge National Laboratory²

08 Mar 2010-BMC Bioinformatics

TL;DR: This work developed a new gene prediction algorithm called Prodigal (PROkaryotic DYnamic programming Gene-finding ALgorithm), which achieved good results compared to existing methods, and it is believed it will be a valuable asset to automated microbial annotation pipelines.

...read moreread less

Abstract: The quality of automated gene prediction in microbial organisms has improved steadily over the past decade, but there is still room for improvement. Increasing the number of correct identifications, both of genes and of the translation initiation sites for each gene, and reducing the overall number of false positives, are all desirable goals. With our years of experience in manually curating genomes for the Joint Genome Institute, we developed a new gene prediction algorithm called Prodigal (PROkaryotic DYnamic programming Gene-finding ALgorithm). With Prodigal, we focused specifically on the three goals of improved gene structure prediction, improved translation initiation site recognition, and reduced false positives. We compared the results of Prodigal to existing gene-finding methods to demonstrate that it met each of these objectives. We built a fast, lightweight, open source gene prediction program called Prodigal http://compbio.ornl.gov/prodigal/ . Prodigal achieved good results compared to existing methods, and we believe it will be a valuable asset to automated microbial annotation pipelines.

...read moreread less

7,157 citations

Journal Article•DOI•

UniProt: A worldwide hub of protein knowledge

[...]

Alex Bateman

01 Jan 2019-Nucleic Acids Research

5,284 citations

Journal Article•DOI•

IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era.

[...]

Bui Quang Minh¹, Heiko A. Schmidt², Olga Chernomor², Dominik Schrempf², Dominik Schrempf³, Michael D. Woodhams⁴, Arndt von Haeseler², Arndt von Haeseler⁵, Robert Lanfear¹ - Show less +5 more•Institutions (5)

Australian National University¹, Medical University of Vienna², Eötvös Loránd University³, University of Tasmania⁴, University of Vienna⁵

01 May 2020-Molecular Biology and Evolution

TL;DR: Some notable features of IQ-TREE version 2 are described and the key advantages over other software are highlighted.

...read moreread less

Abstract: IQ-TREE (http://www.iqtree.org, last accessed February 6, 2020) is a user-friendly and widely used software package for phylogenetic inference using maximum likelihood. Since the release of version 1 in 2014, we have continuously expanded IQ-TREE to integrate a plethora of new models of sequence evolution and efficient computational approaches of phylogenetic inference to deal with genomic data. Here, we describe notable features of IQ-TREE version 2 and highlight the key advantages over other software.

...read moreread less

4,337 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse