Velvet: Algorithms for de novo short read assembly using de Bruijn graphs
Daniel R. Zerbino, Ewan Birney
TLDR
Velvet represents a new approach to assembly that can leverage very short reads in combination with read pairs to produce useful assemblies; on real data without read pairs, its contig lengths are in close agreement with simulated results.
Abstract:
We have developed a new set of algorithms, collectively called "Velvet," to manipulate de Bruijn graphs for genomic sequence assembly. A de Bruijn graph is a compact representation based on short words (k-mers) that is ideal for high coverage, very short read (25-50 bp) data sets. Applying Velvet to very short reads and paired-ends information only, one can produce contigs of significant length, up to 50-kb N50 length in simulations of prokaryotic data and 3-kb N50 on simulated mammalian BACs. When applied to real Solexa data sets without read pairs, Velvet generated contigs of approximately 8 kb in a prokaryote and 2 kb in a mammalian BAC, in close agreement with our simulated results without read-pair information. Velvet represents a new approach to assembly that can leverage very short reads in combination with read pairs to produce useful assemblies.
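The two technical notions in the abstract, a de Bruijn graph built from k-mers and the N50 contig length, can be illustrated with a minimal Python sketch. This is a textbook formulation and not Velvet's actual data structure or code: the function names `build_de_bruijn` and `n50` are hypothetical, nodes here are (k-1)-mers with one edge per k-mer occurrence, and reverse complements, sequencing errors, and read pairs are all ignored.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Textbook de Bruijn graph: each k-mer in a read adds one edge
    from its (k-1)-mer prefix to its (k-1)-mer suffix.
    Illustrative only; not how Velvet stores its graph."""
    graph = defaultdict(list)
    for read in reads:
        # Slide a window of width k along the read.
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].append(kmer[1:])
    return dict(graph)


def n50(contig_lengths):
    """Length L such that contigs of length >= L together cover
    at least half of the total assembled length (0 if no contigs)."""
    total = sum(contig_lengths)
    running = 0
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    return 0
```

For example, `build_de_bruijn(["ACGTC"], 3)` links `AC` to `CG`, `CG` to `GT`, and `GT` to `TC`, and `n50([2, 3, 4, 5, 6])` returns 5, because the two longest contigs (6 + 5 = 11) already cover more than half of the total length of 20.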
Citations
Journal Article
Trimmomatic: a flexible trimmer for Illumina sequence data
TL;DR: Trimmomatic is developed as a more flexible and efficient preprocessing tool that correctly handles paired-end data and is shown to produce output that is at least competitive with, and in many cases superior to, that produced by other tools, in all scenarios tested.
Journal Article
SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing
Anton Bankevich, Sergey Nurk, Dmitry Antipov, Alexey Gurevich, Mikhail Dvorkin, Alexander S. Kulikov, Valery M. Lesin, Sergey I. Nikolenko, Son Pham, Andrey D. Prjibelski, Alexey V. Pyshkin, Alexander Sirotkin, Nikolay Vyahhi, Glenn Tesler, Max A. Alekseyev, Pavel A. Pevzner
TL;DR: SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies.
Journal Article
Full-length transcriptome assembly from RNA-Seq data without a reference genome.
Manfred Grabherr, Brian J. Haas, Moran Yassour, Joshua Z. Levin, Dawn Thompson, Ido Amit, Xian Adiconis, Lin Fan, Raktima Raychowdhury, Qiandong Zeng, Zehua Chen, Evan Mauceli, Nir Hacohen, Andreas Gnirke, Nicholas Rhind, Federica Di Palma, Bruce W. Birren, Chad Nusbaum, Kerstin Lindblad-Toh, Nir Friedman, Aviv Regev
TL;DR: The Trinity method for de novo assembly of full-length transcripts is presented and evaluated on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available, providing a unified solution for transcriptome reconstruction in any sample.
Journal Article
TopHat: discovering splice junctions with RNA-Seq
TL;DR: The TopHat pipeline is much faster than previous systems, mapping nearly 2.2 million reads per CPU hour, which is sufficient to process an entire RNA-Seq experiment in less than a day on a standard desktop computer.
SPAdes, a new genome assembly algorithm and its applications to single-cell sequencing (7th Annual SFAF Meeting, 2012)
TL;DR: SPAdes is a new assembler for both single-cell and standard (multicell) assembly; it is shown to improve on the recently released E+V-SC assembler and on the popular assemblers Velvet and SoapDeNovo (for multicell data).