SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing
Anton Bankevich,Sergey Nurk,Dmitry Antipov,Alexey Gurevich,Mikhail Dvorkin,Alexander S. Kulikov,Valery M. Lesin,Sergey I. Nikolenko,Son Pham,Andrey D. Prjibelski,Alexey V. Pyshkin,Alexander Sirotkin,Nikolay Vyahhi,Glenn Tesler,Max A. Alekseyev,Pavel A. Pevzner +15 more
TLDR
SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies.Abstract:
The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V−SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online (http://bioinf.spbau.ru/spades). It is distributed as open source software.read more
Citations
More filters
Journal ArticleDOI
QUAST: quality assessment tool for genome assemblies
TL;DR: This tool improves on leading assembly comparison software with new ideas and quality metrics, and can evaluate assemblies both with a reference genome, as well as without a reference.
Journal ArticleDOI
MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph
TL;DR: MEGAHIT is a NGS de novo assembler for assembling large and complex metagenomics data in a time- and cost-efficient manner and generated a three-time larger assembly, with longer contig N50 and average contig length.
Posted Content
MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph
TL;DR: MEGAHIT as mentioned in this paper is a NGS de novo assembler for assembling large and complex metagenomics data in a time and cost-efficient manner, which avoids preprocessing like partitioning and normalization, which might compromise on result integrity.
Journal ArticleDOI
MetaSPAdes: A new versatile metagenomic assembler
TL;DR: MetaSPAdes as mentioned in this paper addresses various challenges of metagenomic assembly by capitalizing on computational ideas that proved to be useful in assemblies of single cells and highly polymorphic diploid genomes.
Journal ArticleDOI
Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads.
TL;DR: Tests on both synthetic and real reads show Unicycler can assemble larger contigs with fewer misassemblies than other hybrid assemblers, even when long-read depth and accuracy are low.
References
More filters
Journal ArticleDOI
Velvet: Algorithms for de novo short read assembly using de Bruijn graphs
Daniel R. Zerbino,Ewan Birney +1 more
TL;DR: Velvet represents a new approach to assembly that can leverage very short reads in combination with read pairs to produce useful assemblies and is in close agreement with simulated results without read-pair information.
Journal ArticleDOI
Base-calling of automated sequencer traces using Phred. I. accuracy assessment
TL;DR: In this article, a base-calling program for automated sequencer traces, phred, with improved accuracy was proposed. But it was not shown to achieve a lower error rate than the ABI software, averaging 40%-50% fewer errors in the data sets examined independent of position in read, machine running conditions, or sequencing chemistry.
Journal ArticleDOI
Metagenomic Analysis of the Human Distal Gut Microbiome
Steven R. Gill,Mihai Pop,Robert T. DeBoy,Paul B. Eckburg,Paul B. Eckburg,Peter J. Turnbaugh,Buck S. Samuel,Jeffrey I. Gordon,David A. Relman,David A. Relman,Claire M. Fraser-Liggett,Karen E. Nelson +11 more
TL;DR: Using metabolic function analyses of identified genes, the human genome is compared with the average content of previously sequenced microbial genomes and humans are superorganisms whose metabolism represents an amalgamation of microbial and human attributes.
Journal ArticleDOI
ABySS: A parallel assembler for short read sequence data
Jared T. Simpson,Kim Wong,Shaun D. Jackman,Jacqueline E. Schein,Steven J.M. Jones,Inanc Birol +5 more
TL;DR: ABySS (Assembly By Short Sequences), a parallelized sequence assembler, was developed and assembled 3.5 billion paired-end reads from the genome of an African male publicly released by Illumina, Inc, representing 68% of the reference human genome.
Journal ArticleDOI
De novo assembly of human genomes with massively parallel short read sequencing
Ruiqiang Li,Hongmei Zhu,Jue Ruan,Wubin Qian,Xiaodong Fang,Zhongbin Shi,Yingrui Li,Shengting Li,Gao Shan,Karsten Kristiansen,Songgang Li,Huanming Yang,Jing Wang,Jun Wang +13 more
TL;DR: The development of this de novo short read assembly method creates new opportunities for building reference sequences and carrying out accurate analyses of unexplored genomes in a cost-effective way.