Home
/
Authors
/
Vladislav Saveliev

Author

Vladislav Saveliev

Other affiliations: University of California, San Diego

Bio: Vladislav Saveliev is an academic researcher from Saint Petersburg State University. The author has contributed to research in topics: Sequence assembly & Genome. The author has an hindex of 5, co-authored 7 publications receiving 4432 citations. Previous affiliations of Vladislav Saveliev include University of California, San Diego.

Topics: Sequence assembly, Genome, Genomics, Reference genome, Key (lock) ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

QUAST: quality assessment tool for genome assemblies

[...]

Alexey Gurevich¹, Vladislav Saveliev¹, Nikolay Vyahhi¹, Glenn Tesler¹•Institutions (1)

University of California, San Diego¹

15 Apr 2013-Bioinformatics

TL;DR: This tool improves on leading assembly comparison software with new ideas and quality metrics, and can evaluate assemblies both with a reference genome, as well as without a reference.

...read moreread less

Abstract: Summary: Limitations of genome sequencing techniques have led to dozens of assembly algorithms, none of which is perfect. A number of methods for comparing assemblers have been developed, but none is yet a recognized benchmark. Further, most existing methods for comparing assemblies are only applicable to new assemblies of finished genomes; the problem of evaluating assemblies of previously unsequenced species has not been adequately considered. Here, we present QUAST—a quality assessment tool for evaluating and comparing genome assemblies. This tool improves on leading assembly comparison software with new ideas and quality metrics. QUAST can evaluate assemblies both with a reference genome, as well as without a reference. QUAST produces many reports, summary tables and plots to help scientists in their research and in their publications. In this study, we used QUAST to compare several genome assemblers on three datasets. QUAST tables and plots for all of them are available in the Supplementary Material, and interactive versions of these reports are on the QUAST website.

...read moreread less

5,757 citations

Journal Article•DOI•

Versatile genome assembly evaluation with QUAST-LG.

[...]

Alla Mikheenko¹, Andrey D. Prjibelski¹, Vladislav Saveliev¹, Dmitry Antipov¹, Alexey Gurevich¹ - Show less +1 more•Institutions (1)

Saint Petersburg State University¹

01 Jul 2018-Bioinformatics

TL;DR: This manuscript demonstrates performance of the state‐of‐the‐art genome assembly software on six eukaryotic datasets sequenced using different technologies and introduces a concept of upper bound assembly for a given genome and set of reads, and compute theoretical limits on assembly correctness and completeness.

...read moreread less

Abstract: Motivation The emergence of high-throughput sequencing technologies revolutionized genomics in early 2000s. The next revolution came with the era of long-read sequencing. These technological advances along with novel computational approaches became the next step towards the automatic pipelines capable to assemble nearly complete mammalian-size genomes. Results In this manuscript, we demonstrate performance of the state-of-the-art genome assembly software on six eukaryotic datasets sequenced using different technologies. To evaluate the results, we developed QUAST-LG-a tool that compares large genomic de novo assemblies against reference sequences and computes relevant quality metrics. Since genomes generally cannot be reconstructed completely due to complex repeat patterns and low coverage regions, we introduce a concept of upper bound assembly for a given genome and set of reads, and compute theoretical limits on assembly correctness and completeness. Using QUAST-LG, we show how close the assemblies are to the theoretical optimum, and how far this optimum is from the finished reference. Availability and implementation http://cab.spbu.ru/software/quast-lg. Supplementary information Supplementary data are available at Bioinformatics online.

...read moreread less

562 citations

Journal Article•DOI•

MetaQUAST: evaluation of metagenome assemblies

[...]

Alla Mikheenko¹, Vladislav Saveliev¹, Alexey Gurevich¹•Institutions (1)

Saint Petersburg State University¹

01 Apr 2016-Bioinformatics

TL;DR: MetaQUAST is presented, a modification of QUAST, the state-of-the-art tool for genome assembly evaluation based on alignment of contigs to a reference, and addresses such metagenome datasets features as unknown species content by detecting and downloading reference sequences.

...read moreread less

Abstract: Summary: During the past years we have witnessed the rapid development of new metagenome assembly methods. Although there are many benchmark utilities designed for single-genome assemblies, there is no well-recognized evaluation and comparison tool for metagenomic-specific analogues. In this article, we present MetaQUAST, a modification of QUAST, the state-of-the-art tool for genome assembly evaluation based on alignment of contigs to a reference. MetaQUAST addresses such metagenome datasets features as (i) unknown species content by detecting and downloading reference sequences, (ii) huge diversity by giving comprehensive reports for multiple genomes and (iii) presence of highly relative species by detecting chimeric contigs. We demonstrate MetaQUAST performance by comparing several leading assemblers on one simulated and two real datasets. Availability and implementation: http://bioinf.spbau.ru/metaquast. Contact: aleksey.gurevich@spbu.ru Supplementary information: Supplementary data are available at Bioinformatics online.

...read moreread less

408 citations

Journal Article•DOI•

Icarus: visualizer for de novo assembly evaluation

[...]

Alla Mikheenko¹, Gleb Valin², Andrey D. Prjibelski¹, Vladislav Saveliev¹, Alexey Gurevich¹ - Show less +1 more•Institutions (2)

Saint Petersburg State University¹, Saint Petersburg Academic University²

01 Nov 2016-Bioinformatics

TL;DR: Icarus is a novel genome visualizer for accurate assessment and analysis of genomic draft assemblies, which is based on the tool QUAST, and can be used in studies where a related reference genome is available, as well as for non-model organisms.

...read moreread less

Abstract: Summary : Data visualization plays an increasingly important role in NGS data analysis. With advances in both sequencing and computational technologies, it has become a new bottleneck in genomics studies. Indeed, evaluation of de novo genome assemblies is one of the areas that can benefit from the visualization. However, even though multiple quality assessment methods are now available, existing visualization tools are hardly suitable for this purpose. Here, we present Icarus—a novel genome visualizer for accurate assessment and analysis of genomic draft assemblies, which is based on the tool QUAST. Icarus can be used in studies where a related reference genome is available, as well as for non-model organisms. The tool is available online and as a standalone application. Availability and Implementation: http://cab.spbu.ru/software/icarus Contact: aleksey.gurevich@spbu.ru Supplementary information: Supplementary data are available at Bioinformatics online.

...read moreread less

88 citations

Journal Article•DOI•

Icarus: visualizer for de novo assembly evaluation

[...]

Alla Mikheenko, Vladislav Saveliev, Alexey Gurevich

02 Sep 2016-F1000Research

23 citations

Cited by

PDF

Open Access

More filters

SPAdes, a new genome assembly algorithm and its applications to single-cell sequencing ( 7th Annual SFAF Meeting, 2012)

[...]

Glenn Tesler

01 Jun 2012

TL;DR: SPAdes as mentioned in this paper is a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler and on popular assemblers Velvet and SoapDeNovo (for multicell data).

...read moreread less

Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.

...read moreread less

10,124 citations

Journal Article•DOI•

BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs

[...]

Felipe A. Simão¹, Robert M. Waterhouse¹, Panagiotis Ioannidis¹, Evgenia V. Kriventseva¹, Evgeny M. Zdobnov¹ - Show less +1 more•Institutions (1)

Swiss Institute of Bioinformatics¹

01 Oct 2015-Bioinformatics

TL;DR: Zdobnov et al. as discussed by the authors proposed a measure for quantitative assessment of genome assembly and annotation completeness based on evolutionarily informed expectations of gene content, and implemented the assessment procedure in open-source software, with sets of Benchmarking Universal Single-Copy Orthologs.

...read moreread less

Abstract: Motivation Genomics has revolutionized biological research, but quality assessment of the resulting assembled sequences is complicated and remains mostly limited to technical measures like N50. Results We propose a measure for quantitative assessment of genome assembly and annotation completeness based on evolutionarily informed expectations of gene content. We implemented the assessment procedure in open-source software, with sets of Benchmarking Universal Single-Copy Orthologs, named BUSCO. Availability and implementation Software implemented in Python and datasets available for download from http://busco.ezlab.org. Contact evgeny.zdobnov@unige.ch Supplementary information Supplementary data are available at Bioinformatics online.

...read moreread less

7,747 citations

Journal Article•DOI•

CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes

[...]

Donovan H. Parks¹, Michael Imelfort¹, Connor T. Skennerton¹, Philip Hugenholtz¹, Gene W. Tyson¹ - Show less +1 more•Institutions (1)

University of Queensland¹

01 Jul 2015-Genome Research

TL;DR: An objective measure of genome quality is proposed that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities and is shown to provide accurate estimates of genome completeness and contamination and to outperform existing approaches.

...read moreread less

Abstract: Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. Although this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of “marker” genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree and information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate-, single-cell-, and metagenome-derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities.

...read moreread less

5,788 citations

BUSCO: Assessing Genome Assembly and Annotation Completeness with Single-Copy Orthologs

[...]

Felipe A. Simão

09 Jan 2016

TL;DR: This work proposes a measure for quantitative assessment of genome assembly and annotation completeness based on evolutionarily informed expectations of gene content, implemented in open-source software, with sets of Benchmarking Universal Single-Copy Orthologs, named BUSCO.

...read moreread less

Abstract: MOTIVATION Genomics has revolutionized biological research, but quality assessment of the resulting assembled sequences is complicated and remains mostly limited to technical measures like N50. RESULTS We propose a measure for quantitative assessment of genome assembly and annotation completeness based on evolutionarily informed expectations of gene content. We implemented the assessment procedure in open-source software, with sets of Benchmarking Universal Single-Copy Orthologs, named BUSCO. AVAILABILITY AND IMPLEMENTATION Software implemented in Python and datasets available for download from http://busco.ezlab.org. CONTACT evgeny.zdobnov@unige.ch SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

...read moreread less

4,036 citations

Journal Article•DOI•

MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph

[...]

Dinghua Li¹, Chi-Man Liu¹, Ruibang Luo¹, Kunihiko Sadakane¹, Tak-Wah Lam¹ - Show less +1 more•Institutions (1)

National Institute of Informatics¹

15 May 2015-Bioinformatics

TL;DR: MEGAHIT is a NGS de novo assembler for assembling large and complex metagenomics data in a time- and cost-efficient manner and generated a three-time larger assembly, with longer contig N50 and average contig length.

...read moreread less

Abstract: Summary: MEGAHIT is a NGS de novo assembler for assembling large and complex metagenomics data in a time- and cost-efficient manner. It finished assembling a soil metagenomics dataset with 252Gbps in 44.1 hours and 99.6 hours on a single computing node with and without a GPU, respectively. MEGAHIT assembles the data as a whole, i.e., no pre-processing like partitioning and normalization was needed. When compared with previous methods (Chikhi and Rizk, 2012; Howe, et al., 2014) on assembling the soil data, MEGAHIT generated a 3-time larger assembly, with longer contig N50 and average contig length; furthermore, 55.8% of the reads were aligned to the assembly, giving a 4-fold improvement . Availability: The source code of MEGAHIT is freely available at https://github.com/voutcn/megahit under GPLv3 license. Contact: rb@l3-bioinfo.com, twlam@cs.hku.hk

...read moreread less

3,634 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse