Home
/
Authors
/
Osvaldo Zagordi

Author

Osvaldo Zagordi

Other affiliations: ETH Zurich, Swiss Institute of Bioinformatics, International School for Advanced Studies

Bio: Osvaldo Zagordi is an academic researcher from University of Zurich. The author has contributed to research in topics: Population & Deep sequencing. The author has an hindex of 20, co-authored 34 publications receiving 1736 citations. Previous affiliations of Osvaldo Zagordi include ETH Zurich & Swiss Institute of Bioinformatics.

Topics: Population, Deep sequencing, Hypervariable region, Viral quasispecies, Antibody ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

ShoRAH: estimating the genetic diversity of a mixed sample from next-generation sequencing data

[...]

Osvaldo Zagordi¹, Osvaldo Zagordi², Arnab Bhattacharya¹, Nicholas Eriksson, Niko Beerenwinkel², Niko Beerenwinkel¹ - Show less +2 more•Institutions (2)

ETH Zurich¹, Swiss Institute of Bioinformatics²

26 Apr 2011-BMC Bioinformatics

TL;DR: ShoRAH, a computational method for quantifying genetic diversity in a mixed sample and for identifying the individual clones in the population, while accounting for sequencing errors, is developed.

...read moreread less

Abstract: Background: With next-generation sequencing technologies, experiments that were considered prohibitive only a few years ago are now possible. However, while these technologies have the ability to produce enormous volumes of data, the sequence reads are prone to error. This poses fundamental hurdles when genetic diversity is investigated. Results: We developed ShoRAH, a computational method for quantifying genetic diversity in a mixed sample and for identifying the individual clones in the population, while accounting for sequencing errors. The software was run on simulated data and on real data obtained in wet lab experiments to assess its reliability. Conclusions: ShoRAH is implemented in C++, Python, and Perl and has been tested under Linux and Mac OS X. Source code is available under the GNU General Public License at http://www.cbg.ethz.ch/software/shorah.

...read moreread less

301 citations

Journal Article•DOI•

Error correction of next-generation sequencing data and reliable estimation of HIV quasispecies

[...]

Osvaldo Zagordi¹, Rolf Klein², Martin Däumer², Niko Beerenwinkel²•Institutions (2)

ETH Zurich¹, Swiss Institute of Bioinformatics²

01 Nov 2010-Nucleic Acids Research

TL;DR: It is concluded that pyrosequencing can be used to investigate genetically diverse samples with high accuracy if technical errors are properly treated and probabilistic haplotype inference outperforms the counting-based calling method in both precision and recall.

...read moreread less

Abstract: Next-generation sequencing technologies can be used to analyse genetically heterogeneous samples at unprecedented detail. The high coverage achievable with these methods enables the detection of many low-frequency variants. However, sequencing errors complicate the analysis of mixed populations and result in inflated estimates of genetic diversity. We developed a probabilistic Bayesian approach to minimize the effect of errors on the detection of minority variants. We applied it to pyrosequencing data obtained from a 1.5-kb-fragment of the HIV-1 gag/pol gene in two control and two clinical samples. The effect of PCR amplification was analysed. Error correction resulted in a two- and five-fold decrease of the pyrosequencing base substitution rate, from 0.05% to 0.03% and from 0.25% to 0.05% in the non-PCR and PCR-amplified samples, respectively. We were able to detect viral clones as rare as 0.1% with perfect sequence reconstruction. Probabilistic haplotype inference outperforms the counting-based calling method in both precision and recall. Genetic diversity observed within and between two clinical samples resulted in various patterns of phenotypic drug resistance and suggests a close epidemiological link. We conclude that pyrosequencing can be used to investigate genetically diverse samples with high accuracy if technical errors are properly treated.

...read moreread less

229 citations

Journal Article•DOI•

Ultra-deep sequencing for the analysis of viral populations

[...]

Niko Beerenwinkel¹, Osvaldo Zagordi², Osvaldo Zagordi¹•Institutions (2)

ETH Zurich¹, Swiss Institute of Bioinformatics²

01 Nov 2011-Current Opinion in Virology

TL;DR: Analysis of ultra-deep sequencing data obtained from diverse virus populations is challenging because of PCR and sequencing errors and short read lengths, such that the experiment provides only indirect evidence of the underlying viral population structure.

...read moreread less

179 citations

Journal Article•DOI•

Probabilistic inference of viral quasispecies subject to recombination

[...]

Armin Töpfer¹, Osvaldo Zagordi, Sandhya Prabhakaran, Volker Roth, Eran Halperin, Niko Beerenwinkel - Show less +2 more•Institutions (1)

ETH Zurich¹

01 Feb 2013-Journal of Computational Biology

TL;DR: A jumping hidden Markov model is presented that describes the generation of viral quasispecies and a method to infer its parameters from next-generation sequencing data and introduces position-specific probability tables over the sequence alphabet to explain the diversity that can be found in the population at each site.

...read moreread less

Abstract: RNA viruses exist in their hosts as populations of different but related strains. The virus population, often called quasispecies, is shaped by a combination of genetic change and natural selection. Genetic change is due to both point mutations and recombination events. We present a jumping hidden Markov model that describes the generation of viral quasispecies and a method to infer its parameters from next-generation sequencing data. The model introduces position-specific probability tables over the sequence alphabet to explain the diversity that can be found in the population at each site. Recombination events are indicated by a change of state, allowing a single observed read to originate from multiple sequences. We present a specific implementation of the expectation maximization (EM) algorithm to find maximum a posteriori estimates of the model parameters and a method to estimate the distribution of viral strains in the quasispecies. The model is validated on simulated data, showing the advantage of explicitly taking the recombination process into account, and applied to reads obtained from a clinical HIV sample.

...read moreread less

142 citations

Journal Article•DOI•

Glycosylations in the Globular Head of the Hemagglutinin Protein Modulate the Virulence and Antigenic Properties of the H1N1 Influenza Viruses

[...]

Rafael A. Medina¹, Rafael A. Medina², Silke Stertz¹, Silke Stertz³, Balaji Manicassamy⁴, Balaji Manicassamy¹, Petra Zimmermann³, Xiangjie Sun⁵, Randy A. Albrecht¹, Hanni Uusi-Kerttula², Osvaldo Zagordi³, Robert B. Belshe⁶, Sharon E. Frey⁶, Terrence M. Tumpey⁵, Adolfo García-Sastre¹ - Show less +11 more•Institutions (6)

Icahn School of Medicine at Mount Sinai¹, Pontifical Catholic University of Chile², University of Zurich³, University of Chicago⁴, Centers for Disease Control and Prevention⁵, Saint Louis University⁶

29 May 2013-Science Translational Medicine

TL;DR: This article found that the polyclonal antibody response elicited by wild-type rpH1N1 HA was likely directed against an immunodominant region, which could be shielded by glycosylation at position 144.

...read moreread less

Abstract: With the global spread of the 2009 pandemic H1N1 (pH1N1) influenza virus, there are increasing worries about evolution through antigenic drift. One way previous seasonal H1N1 and H3N2 influenza strains have evolved over time is by acquiring additional glycosylations in the globular head of their hemagglutinin (HA) proteins; these glycosylations have been believed to shield antigenically relevant regions from antibody immune responses. We added additional HA glycosylation sites to influenza A/Netherlands/602/2009 recombinant (rpH1N1) viruses, reflecting their temporal appearance in previous seasonal H1N1 viruses. Additional glycosylations resulted in substantially attenuated infection in mice and ferrets, whereas deleting HA glycosylation sites from a pre-pandemic virus resulted in increased pathogenicity in mice. We then more directly investigated the interactions of HA glycosylations and antibody responses through mutational analysis. We found that the polyclonal antibody response elicited by wild-type rpH1N1 HA was likely directed against an immunodominant region, which could be shielded by glycosylation at position 144. However, rpH1N1 HA glycosylated at position 144 elicited a broader polyclonal response able to cross-neutralize all wild-type and glycosylation mutant pH1N1 viruses. Moreover, mice infected with a recent seasonal virus in which glycosylation sites were removed elicited antibodies that protected against challenge with the antigenically distant pH1N1 virus. Thus, acquisition of glycosylation sites in the HA of H1N1 human influenza viruses affected not only their pathogenicity and ability to escape from polyclonal antibodies elicited by previous influenza virus strains but also their ability to induce cross-reactive antibodies against drifted antigenic variants.

...read moreread less

110 citations

1
2
3
4
…
5
6
7

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

DADA2: High-resolution sample inference from Illumina amplicon data

[...]

Benjamin J. Callahan¹, Paul J. McMurdie, Michael J. Rosen¹, Andrew W. Han, Amy Jo A. Johnson, Susan Holmes¹ - Show less +2 more•Institutions (1)

Stanford University¹

01 Jul 2016-Nature Methods

TL;DR: The open-source software package DADA2 for modeling and correcting Illumina-sequenced amplicon errors is presented, revealing a diversity of previously undetected Lactobacillus crispatus variants.

...read moreread less

Abstract: We present the open-source software package DADA2 for modeling and correcting Illumina-sequenced amplicon errors (https://github.com/benjjneb/dada2). DADA2 infers sample sequences exactly and resolves differences of as little as 1 nucleotide. In several mock communities, DADA2 identified more real variants and output fewer spurious sequences than other methods. We applied DADA2 to vaginal samples from a cohort of pregnant women, revealing a diversity of previously undetected Lactobacillus crispatus variants.

...read moreread less

14,505 citations

Journal Article•DOI•

LoFreq: a sequence-quality aware, ultra-sensitive variant caller for uncovering cell-population heterogeneity from high-throughput sequencing datasets

[...]

Andreas Wilm¹, Pauline Poh Kim Aw¹, Denis Bertrand¹, Grace Hui Ting Yeo¹, Swee Hoe Ong¹, Chang Hua Wong¹, Chiea Chuen Khor¹, Rosemary Petric¹, Martin L. Hibberd¹, Niranjan Nagarajan¹ - Show less +6 more•Institutions (1)

Genome Institute of Singapore¹

01 Dec 2012-Nucleic Acids Research

TL;DR: It is shown that LoFreq has near-perfect specificity, with significantly improved sensitivity compared with existing methods and can efficiently analyze deep Illumina sequencing datasets without resorting to approximations or heuristics.

...read moreread less

Abstract: The study of cell-population heterogeneity in a range of biological systems, from viruses to bacterial isolates to tumor samples, has been transformed by recent advances in sequencing throughput. While the high-coverage afforded can be used, in principle, to identify very rare variants in a population, existing ad hoc approaches frequently fail to distinguish true variants from sequencing errors. We report a method (LoFreq) that models sequencing run-specific error rates to accurately call variants occurring in <0.05% of a population. Using simulated and real datasets (viral, bacterial and human), we show that LoFreq has near-perfect specificity, with significantly improved sensitivity compared with existing methods and can efficiently analyze deep Illumina sequencing datasets without resorting to approximations or heuristics. We also present experimental validation for LoFreq on two different platforms (Fluidigm and Sequenom) and its application to call rare somatic variants from exome sequencing datasets for gastric cancer. Source code and executables for LoFreq are freely available at http://sourceforge.net/projects/lofreq/.

...read moreread less

1,018 citations

Journal Article•DOI•

Detection of ultra-rare mutations by next-generation sequencing

[...]

Michael W. Schmitt¹, Scott R. Kennedy, Jesse J. Salk, Edward J. Fox, Joseph Hiatt, Lawrence A. Loeb¹ - Show less +2 more•Institutions (1)

University of Washington¹

04 Sep 2012-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: It is determined that Duplex Sequencing has a theoretical background error rate of less than one artifactual mutation per billion nucleotides sequenced and that detection of mutations present in only one of the two strands of duplex DNA can be used to identify sites of DNA damage.

...read moreread less

Abstract: Next-generation DNA sequencing promises to revolutionize clinical medicine and basic research. However, while this technology has the capacity to generate hundreds of billions of nucleotides of DNA sequence in a single experiment, the error rate of ∼1% results in hundreds of millions of sequencing mistakes. These scattered errors can be tolerated in some applications but become extremely problematic when “deep sequencing” genetically heterogeneous mixtures, such as tumors or mixed microbial populations. To overcome limitations in sequencing accuracy, we have developed a method termed Duplex Sequencing. This approach greatly reduces errors by independently tagging and sequencing each of the two strands of a DNA duplex. As the two strands are complementary, true mutations are found at the same position in both strands. In contrast, PCR or sequencing errors result in mutations in only one strand and can thus be discounted as technical error. We determine that Duplex Sequencing has a theoretical background error rate of less than one artifactual mutation per billion nucleotides sequenced. In addition, we establish that detection of mutations present in only one of the two strands of duplex DNA can be used to identify sites of DNA damage. We apply the method to directly assess the frequency and pattern of random mutations in mitochondrial DNA from human cells.

...read moreread less

944 citations

Journal Article•DOI•

Viral Quasispecies Evolution

[...]

Esteban Domingo¹, Julie Sheldon¹, Celia Perales¹•Institutions (1)

Spanish National Research Council¹

01 Jun 2012-Microbiology and Molecular Biology Reviews

TL;DR: The understanding of viruses as quasispecies has led to new antiviral designs, such as lethal mutagenesis, whose aim is to drive viruses toward low fitness values with limited chances of fitness recovery.

...read moreread less

Abstract: Summary: Evolution of RNA viruses occurs through disequilibria of collections of closely related mutant spectra or mutant clouds termed viral quasispecies. Here we review the origin of the quasispecies concept and some biological implications of quasispecies dynamics. Two main aspects are addressed: (i) mutant clouds as reservoirs of phenotypic variants for virus adaptability and (ii) the internal interactions that are established within mutant spectra that render a virus ensemble the unit of selection. The understanding of viruses as quasispecies has led to new antiviral designs, such as lethal mutagenesis, whose aim is to drive viruses toward low fitness values with limited chances of fitness recovery. The impact of quasispecies for three salient human pathogens, human immunodeficiency virus and the hepatitis B and C viruses, is reviewed, with emphasis on antiviral treatment strategies. Finally, extensions of quasispecies to nonviral systems are briefly mentioned to emphasize the broad applicability of quasispecies theory.

...read moreread less

852 citations

Journal Article•DOI•

Sequencing pools of individuals — mining genome-wide polymorphism data without big funding

[...]

Christian Schlötterer, Raymond Tobler, Robert Kofler, Viola Nolte

01 Nov 2014-Nature Reviews Genetics

TL;DR: This Review demonstrates the breadth of questions that are being addressed by Pool-seq but also discusses its limitations and provides guidelines for users.

...read moreread less

Abstract: The analysis of polymorphism data is becoming increasingly important as a complementary tool to classical genetic analyses. Nevertheless, despite plunging sequencing costs, genomic sequencing of individuals at the population scale is still restricted to a few model species. Whole-genome sequencing of pools of individuals (Pool-seq) provides a cost-effective alternative to sequencing individuals separately. With the availability of custom-tailored software tools, Pool-seq is being increasingly used for population genomic research on both model and non-model organisms. In this Review, we not only demonstrate the breadth of questions that are being addressed by Pool-seq but also discuss its limitations and provide guidelines for users.

...read moreread less

642 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse