scispace - formally typeset
Open AccessJournal ArticleDOI

Velvet: Algorithms for de novo short read assembly using de Bruijn graphs

Daniel R. Zerbino, +1 more
- 01 May 2008 - 
- Vol. 18, Iss: 5, pp 821-829
TLDR
Velvet represents a new approach to assembly that can leverage very short reads in combination with read pairs to produce useful assemblies and is in close agreement with simulated results without read-pair information.
Abstract
We have developed a new set of algorithms, collectively called "Velvet," to manipulate de Bruijn graphs for genomic sequence assembly. A de Bruijn graph is a compact representation based on short words (k-mers) that is ideal for high coverage, very short read (25-50 bp) data sets. Applying Velvet to very short reads and paired-ends information only, one can produce contigs of significant length, up to 50-kb N50 length in simulations of prokaryotic data and 3-kb N50 on simulated mammalian BACs. When applied to real Solexa data sets without read pairs, Velvet generated contigs of approximately 8 kb in a prokaryote and 2 kb in a mammalian BAC, in close agreement with our simulated results without read-pair information. Velvet represents a new approach to assembly that can leverage very short reads in combination with read pairs to produce useful assemblies.

read more

Citations
More filters
Journal ArticleDOI

Trimmomatic: a flexible trimmer for Illumina sequence data

TL;DR: Timmomatic is developed as a more flexible and efficient preprocessing tool, which could correctly handle paired-end data and is shown to produce output that is at least competitive with, and in many cases superior to, that produced by other tools, in all scenarios tested.
Journal ArticleDOI

TopHat: discovering splice junctions with RNA-Seq

TL;DR: The TopHat pipeline is much faster than previous systems, mapping nearly 2.2 million reads per CPU hour, which is sufficient to process an entire RNA-Seq experiment in less than a day on a standard desktop computer.

SPAdes, a new genome assembly algorithm and its applications to single-cell sequencing ( 7th Annual SFAF Meeting, 2012)

Glenn Tesler
TL;DR: SPAdes as mentioned in this paper is a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler and on popular assemblers Velvet and SoapDeNovo (for multicell data).
References
More filters
Journal ArticleDOI

Initial sequencing and analysis of the human genome.

Eric S. Lander, +248 more
- 15 Feb 2001 - 
TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.
Journal ArticleDOI

Initial sequencing and comparative analysis of the mouse genome.

Robert H. Waterston, +222 more
- 05 Dec 2002 - 
TL;DR: The results of an international collaboration to produce a high-quality draft sequence of the mouse genome are reported and an initial comparative analysis of the Mouse and human genomes is presented, describing some of the insights that can be gleaned from the two sequences.
Journal ArticleDOI

Sequence the Human Genome

TL;DR: This book aims to provide a history of Chinese modern art from 17th Century to the present day through the lens of 20th Century critics, practitioners, journalists, and mediaeval and modern-day critics.
Journal ArticleDOI

Genome-Wide Mapping of in Vivo Protein-DNA Interactions

TL;DR: A large-scale chromatin immunoprecipitation assay based on direct ultrahigh-throughput DNA sequencing was developed, which was then used to map in vivo binding of the neuron-restrictive silencer factor (NRSF; also known as REST) to 1946 locations in the human genome.
Related Papers (5)