Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species
Keith Bradnam,Joseph Fass,Anton Alexandrov,Paul Baranay,Michael Bechner,Inanc Birol,Sébastien Boisvert,Jarrod Chapman,Guillaume Chapuis,Guillaume Chapuis,Rayan Chikhi,Rayan Chikhi,Hamidreza Chitsaz,Wen-Chi Chou,Jacques Corbeil,Cristian Del Fabbro,T. Roderick Docking,Richard Durbin,Dent Earl,Scott J. Emrich,Pavel Fedotov,Nuno A. Fonseca,Ganeshkumar Ganapathy,Richard A. Gibbs,Sante Gnerre,Elenie Godzaridis,Steve Goldstein,Matthias Haimel,Giles Hall,David Haussler,Joseph B. Hiatt,Isaac Ho,Jason T. Howard,Martin Hunt,Shaun D. Jackman,David B. Jaffe,Erich D. Jarvis,Huaiyang Jiang,Sergey Kazakov,Paul J. Kersey,Jacob O. Kitzman,James R. Knight,Sergey Koren,Tak-Wah Lam,Dominique Lavenier,Dominique Lavenier,François Laviolette,Yingrui Li,Zhenyu Li,Binghang Liu,Yue Liu,Ruibang Luo,Iain MacCallum,Matthew D. MacManes,Nicolas Maillet,Sergey Melnikov,Bruno Vieira,Delphine Naquin,Zemin Ning,Thomas D. Otto,Benedict Paten,Octávio S. Paulo,Adam M. Phillippy,Francisco Pina-Martins,Michael Place,Dariusz Przybylski,Xiang Qin,Carson Qu,Filipe J. Ribeiro,Stephen Richards,Daniel S. Rokhsar,Daniel S. Rokhsar,J. Graham Ruby,J. Graham Ruby,Simone Scalabrin,Michael C. Schatz,David C. Schwartz,Alexey Sergushichev,Ted Sharpe,Timothy I. Shaw,Jay Shendure,Yujian Shi,Jared T. Simpson,Henry Song,Fedor Tsarev,Francesco Vezzi,Riccardo Vicedomini,Jun Wang,Kim C. Worley,Shuangye Yin,Siu-Ming Yiu,Jianying Yuan,Guojie Zhang,Hao Zhang,Shiguo Zhou,Ian F Korf +95 more
TLDR
The Assemblathon 2 as mentioned in this paper presented a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and a snake) from 21 participating teams.Abstract:
Background - The process of generating raw genome sequence data continues to become cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished genome sequences remains challenging. Many genome assembly tools are available, but they differ greatly in terms of their performance (speed, scalability, hardware requirements, acceptance of newer read technologies) and in their final output (composition of assembled sequence). More importantly, it remains largely unclear how to best assess the quality of assembled genome sequences. The Assemblathon competitions are intended to assess current state-of-the-art methods in genome assembly. Results - In Assemblathon 2, we provided a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and snake). This resulted in a total of 43 submitted assemblies from 21 participating teams. We evaluated these assemblies using a combination of optical map data, Fosmid sequences, and several statistical methods. From over 100 different metrics, we chose ten key measures by which to assess the overall quality of the assemblies. Conclusions - Many current genome assemblers produced useful assemblies, containing a significant representation of their genes, regulatory sequences, and overall genome structure. However, the high degree of variability between the entries suggests that there is still much room for improvement in the field of genome assembly and that approaches which work well in assembling the genome of one species may not necessarily work well for another.read more
Citations
More filters
Journal ArticleDOI
featureCounts: an efficient general-purpose program for assigning sequence reads to genomic features
TL;DR: FeatureCounts as discussed by the authors is a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments, which implements highly efficient chromosome hashing and feature blocking techniques.
Journal ArticleDOI
Sequencing depth and coverage: key considerations in genomic analyses
TL;DR: The issue of sequencing depth in the design of next-generation sequencing experiments is discussed and current guidelines and precedents on the issue of coverage are reviewed for four major study designs, including de novo genome sequencing, genome resequencing, transcriptome sequencing and genomic location analyses.
Journal ArticleDOI
Shotgun metagenomics, from sampling to analysis
Christopher Quince,Alan W. Walker,Jared T. Simpson,Jared T. Simpson,Nicholas J. Loman,Nicola Segata +5 more
TL;DR: Computational approaches to overcome the challenges that affect both assembly-based and mapping-based metagenomic profiling, particularly of high-complexity samples or environments containing organisms with limited similarity to sequenced genomes, are needed.
Journal ArticleDOI
Efficient de novo assembly of highly heterozygous genomes from whole-genome shotgun short reads
Rei Kajitani,Kouta Toshimoto,Hideki Noguchi,Atsushi Toyoda,Yoshitoshi Ogura,Miki Okuno,Mitsuru Yabana,Masayuki Harada,Eiji Nagayasu,Haruhiko Maruyama,Yuji Kohara,Asao Fujiyama,Tetsuya Hayashi,Takehiko Itoh +13 more
TL;DR: Platanus provides a novel and efficient approach for the assembly of gigabase-sized highly heterozygous genomes and is an attractive alternative to the existing assemblers designed for genomes of lower heterozygosity.
Journal ArticleDOI
Comparative genomics reveals insights into avian genome evolution and adaptation.
Guojie Zhang,Guojie Zhang,Cai Li,Qiye Li,Bo Li,Denis M. Larkin,Chul Hee Lee,Jay F. Storz,Agostinho Antunes,Matthew J. Greenwold,Robert W. Meredith,Anders Ödeen,Jie Cui,Qi Zhou,Luohao Xu,Hailin Pan,Zongji Wang,Lijun Jin,Pei Zhang,Haofu Hu,Wei Yang,Jiang Hu,Jin Xiao,Zhikai Yang,Yang Liu,Qiaolin Xie,Hao Yu,Jinmin Lian,Ping Wen,Fang Zhang,Hui Li,Yongli Zeng,Zijun Xiong,Shiping Liu,Long Zhou,Zhiyong Huang,Na An,Jie Wang,Qiumei Zheng,Yingqi Xiong,Guangbiao Wang,Bo Wang,Jingjing Wang,Yu Fan,Rute R. da Fonseca,Alonzo Alfaro-Núñez,Mikkel Schubert,Ludovic Orlando,Tobias Mourier,Jason T. Howard,Ganeshkumar Ganapathy,Andreas R. Pfenning,Osceola Whitney,Miriam V. Rivas,Erina Hara,Julia Smith,Marta Farré,Jitendra Narayan,Gancho T. Slavov,Michael N Romanov,Rui Borges,João Paulo Machado,Imran Khan,Mark S. Springer,John Gatesy,Federico G. Hoffmann,Juan C. Opazo,Olle Håstad,Roger H. Sawyer,Heebal Kim,Kyu-Won Kim,Hyeon Jeong Kim,Seoae Cho,Ning Li,Yinhua Huang,Michael William Bruford,Xiangjiang Zhan,Andrew Dixon,Mads F. Bertelsen,Elizabeth P. Derryberry,Wesley C. Warren,Richard K. Wilson,Shengbin Li,David A. Ray,Richard E. Green,Stephen J. O'Brien,Darren K. Griffin,Warren E. Johnson,David Haussler,Oliver A. Ryder,Eske Willerslev,Gary R. Graves,Per Alström,Jon Fjeldså,David P. Mindell,Scott V. Edwards,Edward L. Braun,Carsten Rahbek,David W. Burt,Peter Houde,Yong Zhang,Huanming Yang,Jian Wang,Erich D. Jarvis,M. Thomas P. Gilbert,M. Thomas P. Gilbert,Jun Wang +106 more
TL;DR: This work explored bird macroevolution using full genomes from 48 avian species representing all major extant clades to reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits.
References
More filters
Journal ArticleDOI
Basic Local Alignment Search Tool
TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.
Journal ArticleDOI
The Sequence Alignment/Map format and SAMtools
Heng Li,Bob Handsaker,Alec Wysoker,T. J. Fennell,Jue Ruan,Nils Homer,Gabor T. Marth,Gonçalo R. Abecasis,Richard Durbin +8 more
TL;DR: SAMtools as discussed by the authors implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments.
Journal ArticleDOI
Fast and accurate short read alignment with Burrows–Wheeler transform
Heng Li,Richard Durbin +1 more
TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.
Journal ArticleDOI
Initial sequencing and analysis of the human genome.
Eric S. Lander,Lauren Linton,Bruce W. Birren,Chad Nusbaum,Michael C. Zody,Jennifer Baldwin,Keri Devon,Ken Dewar,Michael Doyle,William Fitzhugh,Roel Funke,Diane Gage,Katrina Harris,Andrew Heaford,John Howland,Lisa Kann,Jessica A. Lehoczky,Rosie Levine,Paul A. McEwan,Kevin McKernan,James Meldrim,Jill P. Mesirov,Cher Miranda,William Morris,Jerome Naylor,Christina Raymond,Mark Rosetti,Ralph Santos,Andrew Sheridan,Carrie Sougnez,Nicole Stange-Thomann,Nikola Stojanovic,Aravind Subramanian,Dudley Wyman,Jane Rogers,John Sulston,R Ainscough,Stephan Beck,David Bentley,John Burton,C M Clee,Nigel P. Carter,Alan Coulson,Rebecca Deadman,Panos Deloukas,Andrew Dunham,Ian Dunham,Richard Durbin,Lisa French,Darren Grafham,Simon G. Gregory,Tim Hubbard,Sean Humphray,Adrienne Hunt,Matthew Jones,Christine Lloyd,Amanda McMurray,Lucy Matthews,Simon Mercer,Sarah Milne,James C. Mullikin,Andrew J. Mungall,Robert W. Plumb,Mark T. Ross,Ratna Shownkeen,Sarah Sims,Robert H. Waterston,Richard K. Wilson,LaDeana W. Hillier,John Douglas Mcpherson,Marco A. Marra,Elaine R. Mardis,Lucinda Fulton,Asif T. Chinwalla,Kymberlie H. Pepin,Warren Gish,Stephanie L. Chissoe,Michael C. Wendl,Kim D. Delehaunty,Tracie L. Miner,Andrew Delehaunty,Jason B. Kramer,Lisa Cook,Robert S. Fulton,Douglas L. Johnson,Patrick Minx,Sandra W. Clifton,Trevor Hawkins,Elbert Branscomb,Paul Predki,Paul G. Richardson,Sarah Wenning,Tom Slezak,Norman A. Doggett,Jan Fang Cheng,Anne S. Olsen,Susan Lucas,Christopher J. Elkin,Edward Uberbacher,Marvin Frazier,Richard A. Gibbs,Donna M. Muzny,Steven E. Scherer,John Bouck,Erica Sodergren,Kim C. Worley,Catherine M. Rives,James H. Gorrell,Michael L. Metzker,Susan L. Naylor,Raju Kucherlapati,David L. Nelson,George M. Weinstock,Yoshiyuki Sakaki,Asao Fujiyama,Masahira Hattori,Tetsushi Yada,Atsushi Toyoda,Takehiko Itoh,Chiharu Kawagoe,Hidemi Watanabe,Yasushi Totoki,Todd D. Taylor,Jean Weissenbach,Roland Heilig,William Saurin,François Artiguenave,Philippe Brottier,Thomas Brüls,Eric Pelletier,Catherine Robert,Patrick Wincker,André Rosenthal,Matthias Platzer,Gerald Nyakatura,Stefan Taudien,Andreas Rump,Douglas R. Smith,Lynn Doucette-Stamm,Marc Rubenfield,Keith Weinstock,Mei Lee Hong,Joann Dubois,Huanming Yang,Jun Yu,Jian Wang,Guyang Huang,Jun Gu,Leroy Hood,Lee Rowen,Anup Madan,Shizen Qin,Ronald W. Davis,Nancy A. Federspiel,A. Pia Abola,Michael Proctor,Bruce A. Roe,Feng Chen,Huaqin Pan,Juliane Ramser,Hans Lehrach,Richard Reinhardt,W. Richard McCombie,Melissa De La Bastide,Neilay Dedhia,H. Blöcker,K. Hornischer,Gabriele Nordsiek,Richa Agarwala,L. Aravind,Jeffrey A. Bailey,Alex Bateman,Serafim Batzoglou,Ewan Birney,Peer Bork,Daniel G. Brown,Christopher B. Burge,Lorenzo Cerutti,Hsiu Chuan Chen,Deanna M. Church,Michele Clamp,Richard R. Copley,Tobias Doerks,Sean R. Eddy,Evan E. Eichler,Terrence S. Furey,James E. Galagan,James G. R. Gilbert,Cyrus L. Harmon,Yoshihide Hayashizaki,David Haussler,Henning Hermjakob,Karsten Hokamp,Wonhee Jang,L. Steven Johnson,Thomas A. Jones,Simon Kasif,Arek Kaspryzk,Scot Kennedy,W. James Kent,Paul Kitts,Eugene V. Koonin,Ian F Korf,David Kulp,Doron Lancet,Todd M. Lowe,Aoife McLysaght,Tarjei S. Mikkelsen,John V. Moran,Nicola Mulder,Victor J. Pollara,Chris P. Ponting,Greg Schuler,Jörg Schultz,Guy Slater,Arian F.A. Smit,Elia Stupka,Joseph Szustakowki,Danielle Thierry-Mieg,Jean Thierry-Mieg,Lukas Wagner,John W. Wallis,Raymond Wheeler,Alan Williams,Yuri I. Wolf,Kenneth H. Wolfe,Shiaw Pyng Yang,Ru Fang Yeh,Francis S. Collins,Mark S. Guyer,Jane Peterson,Adam Felsenfeld,Kris A. Wetterstrand,Richard M. Myers,Jeremy Schmutz,Mark Dickson,Jane Grimwood,David R. Cox,Maynard V. Olson,Rajinder Kaul,Christopher K. Raymond,Nobuyoshi Shimizu,Kazuhiko Kawasaki,Shinsei Minoshima,Glen A. Evans,Maria Athanasiou,Roger A. Schultz,Aristides Patrinos,Michael J. Morgan +248 more
TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.
Journal ArticleDOI
Velvet: Algorithms for de novo short read assembly using de Bruijn graphs
Daniel R. Zerbino,Ewan Birney +1 more
TL;DR: Velvet represents a new approach to assembly that can leverage very short reads in combination with read pairs to produce useful assemblies and is in close agreement with simulated results without read-pair information.
Related Papers (5)
Velvet: Algorithms for de novo short read assembly using de Bruijn graphs
Daniel R. Zerbino,Ewan Birney +1 more