PEMer: a computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data
Jan O. Korbel,Jan O. Korbel,Alexej Abyzov,Xinmeng Jasmine Mu,Nicholas Carriero,Philip Cayting,Zhengdong D. Zhang,Michael Snyder,Mark Gerstein +8 more
TLDR
Paired-End Mapper demonstrated high structural variant reconstruction efficiency for PEMer's coverage-adjusted multi-cutoff scoring-strategy and showed its relative insensitivity to base-calling errors.Abstract:
Personal-genomics endeavors, such as the 1000 Genomes project, are generating maps of genomic structural variants by analyzing ends of massively sequenced genome fragments. To process these we developed Paired-End Mapper (PEMer; http://sv.gersteinlab.org/pemer). This comprises an analysis pipeline, compatible with several next-generation sequencing platforms; simulation-based error models, yielding confidence-values for each structural variant; and a back-end database. The simulations demonstrated high structural variant reconstruction efficiency for PEMer's coverage-adjusted multi-cutoff scoring-strategy and showed its relative insensitivity to base-calling errors.read more
Citations
More filters
Journal ArticleDOI
DELLY: structural variant discovery by integrated paired-end and split-read analysis
TL;DR: An SV discovery method that integrates short insert paired-ends, long-range mate-pairs and split-read alignments to accurately delineate genomic rearrangements at single-nucleotide resolution, called DELLY, which enables to ascertain the full spectrum of genomic rearrANGements, including complex events.
Journal ArticleDOI
Genome structural variation discovery and genotyping
TL;DR: It is argued that the long-term goal should be routine, cost-effective and high quality de novo assembly of human genomes to comprehensively assess all classes of structural variation.
Journal ArticleDOI
CNVnator: An approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing
TL;DR: By genotyping CNVs in the CEPH, Yoruba, and Chinese-Japanese populations, it is estimated that at least 11% of all CNV loci involve complex, multi-allelic events, a considerably higher estimate than reported earlier.
Journal ArticleDOI
Genome Sequencing of Pediatric Medulloblastoma Links Catastrophic DNA Rearrangements with TP53 Mutations
Tobias Rausch,David T.W. Jones,Marc Zapatka,Adrian M. Stütz,Thomas Zichner,Joachim Weischenfeldt,Natalie Jäger,Marc Remke,David Shih,Paul A. Northcott,Elke Pfaff,Jelena Tica,Qi Wang,Luca Massimi,Hendrik Witt,Sebastian Bender,Sabrina Pleier,Huriye Cin,Cynthia Hawkins,Christian Beck,Andreas von Deimling,Volkmar Hans,Benedikt Brors,Roland Eils,Wolfram Scheurlen,Jonathon Blake,Vladimir Benes,Andreas E. Kulozik,Olaf Witt,Olaf Witt,Dianna Martin,Cindy Zhang,Rinnat Porat,Diana M. Merino,Jonathan D. Wasserman,Nada Jabado,Adam M. Fontebasso,Lars Bullinger,Frank G. Rücker,Konstanze Döhner,Hartmut Döhner,Jan Koster,Jan J. Molenaar,Rogier Versteeg,Marcel Kool,Uri Tabori,David Malkin,Andrey Korshunov,Michael D. Taylor,Peter Lichter,Stefan M. Pfister,Jan O. Korbel +51 more
TL;DR: The whole-genome sequencing-based analysis of a Sonic-Hedgehog medulloblastoma brain tumor from a patient with a germline TP53 mutation is reported, uncovering massive, complex chromosome rearrangements and connecting p53 status and chromothripsis in specific tumor types.
Journal ArticleDOI
De novo assembly and genotyping of variants using colored de Bruijn graphs
TL;DR: An efficient software implementation, Cortex, the first de novo assembler capable of assembling multiple eukaryotic genomes simultaneously is provided, and how population information from ten chimpanzees enables accurate variant calls without a reference sequence is shown.
References
More filters
Journal ArticleDOI
Basic Local Alignment Search Tool
TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.
Journal ArticleDOI
Identification of common molecular subsequences.
TL;DR: This letter extends the heuristic homology algorithm of Needleman & Wunsch (1970) to find a pair of segments, one from each of two long sequences, such that there is no other Pair of segments with greater similarity (homology).
Journal ArticleDOI
Genome sequencing in microfabricated high-density picolitre reactors
Marcel Margulies,Michael Egholm,William E. Altman,Said Attiya,Joel S. Bader,Lisa A. Bemben,Jan Berka,Michael S. Braverman,Yi-Ju Chen,Zhoutao Chen,Scott Dewell,Lei Du,J. M. Fierro,Xavier V. Gomes,Brian C. Godwin,Wen He,Scott Edward Helgesen,Chun Heen Ho,Gerard P. Irzyk,Szilveszter C. Jando,Maria L. I. Alenquer,Thomas P. Jarvie,Kshama B. Jirage,Jong-Bum Kim,James R. Knight,Janna R. Lanza,John H. Leamon,Steven Lefkowitz,Ming Lei,Jing Li,Kenton Lohman,Hong Lu,Vinod Makhijani,Keith Mcdade,Michael P. McKenna,Eugene W. Myers,Elizabeth Nickerson,John Nobile,Ramona Plant,Bernard P. Puc,Michael T. Ronan,George T. Roth,Gary J. Sarkis,Jan Fredrik Simons,John Simpson,Maithreyan Srinivasan,Karrie R. Tartaro,Alexander Tomasz,Kari A. Vogt,Greg A. Volkmer,Shally H. Wang,Yong Wang,Michael P. Weiner,Pengguang Yu,Richard F. Begley,Jonathan M. Rothberg +55 more
TL;DR: A scalable, highly parallel sequencing system with raw throughput significantly greater than that of state-of-the-art capillary electrophoresis instruments with 96% coverage at 99.96% accuracy in one run of the machine is described.
Journal ArticleDOI
BLAT—The BLAST-Like Alignment Tool
TL;DR: How BLAT was optimized is described, which is more accurate and 500 times faster than popular existing tools for mRNA/DNA alignments and 50 times faster for protein alignments at sensitivity settings typically used when comparing vertebrate sequences.
Journal ArticleDOI
Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project
Ewan Birney,John A. Stamatoyannopoulos,Anindya Dutta,Roderic Guigó,Thomas R. Gingeras,Elliott H. Margulies,Zhiping Weng,Michael Snyder,Emmanouil T. Dermitzakis,Robert E. Thurman,Michael S. Kuehn,Christopher M. Taylor,Shane Neph,Christoph M. Koch,Saurabh Asthana,Ankit Malhotra,Ivan Adzhubei,Jason A. Greenbaum,Robert M. Andrews,Paul Flicek,Patrick J. Boyle,Hua Cao,Nigel P. Carter,Gayle K. Clelland,Sean Davis,Nathan Day,Pawandeep Dhami,Shane C. Dillon,Michael O. Dorschner,Heike Fiegler,Paul G. Giresi,Jeff Goldy,Michael Hawrylycz,Andrew Haydock,Richard Humbert,Keith D. James,Brett E. Johnson,Ericka M. Johnson,Tristan Frum,Elizabeth Rosenzweig,Neerja Karnani,Kirsten Lee,Gregory Lefebvre,Patrick A. Navas,Fidencio Neri,Stephen C. J. Parker,Peter J. Sabo,Richard Sandstrom,Anthony Shafer,David Vetrie,Molly Weaver,Sarah Wilcox,Man Yu,Francis S. Collins,Job Dekker,Jason D. Lieb,Thomas D. Tullius,Gregory E. Crawford,Shamil R. Sunyaev,William Stafford Noble,Ian Dunham,Alexandre Reymond,Alexandre Reymond,Philipp Kapranov,Joel Rozowsky,Deyou Zheng,Robert Castelo,Adam Frankish,Jennifer Harrow,Srinka Ghosh,Albin Sandelin,Ivo L. Hofacker,Robert Baertsch,Damian Keefe,Sujit Dike,Jill Cheng,Heather A. Hirsch,Edward A. Sekinger,Julien Lagarde,Josep F. Abril,Josep F. Abril,Atif Shahab,Christoph Flamm,Christoph Flamm,Claudia Fried,Jörg Hackermüller,Jana Hertel,Manja Lindemeyer,Kristin Missal,Andrea Tanzer,Andrea Tanzer,Stefan Washietl,Jan O. Korbel,Olof Emanuelsson,Jakob Skou Pedersen,Nancy Holroyd,Ruth Taylor,David Swarbreck,Nicholas Matthews,Mark Dickson,Daryl J. Thomas,Matthew T. Weirauch,James G. R. Gilbert,Jorg Drenkow,Ian Bell,Xiaodong Zhao,Kandhadayar G. Srinivasan,Wing-Kin Sung,Hong Sain Ooi,Kuo Ping Chiu,Sylvain Foissac,Tyler Alioto,Michael R. Brent,Lior Pachter,Michael L. Tress,Alfonso Valencia,Siew Woh Choo,Chiou Yu Choo,Catherine Ucla,Caroline Manzano,Carine Wyss,Evelyn Cheung,Taane G. Clark,James B. Brown,Madhavan Ganesh,Sandeep Patel,Hari Tammana,Jacqueline Chrast,Charlotte N. Henrichsen,Chikatoshi Kai,Jun Kawai,Ugrappa Nagalakshmi,Jia Qian Wu,Zheng Lian,Jin Lian,Peter E. Newburger,Xueqing Zhang,Peter J. Bickel,John S. Mattick,Piero Carninci,Yoshihide Hayashizaki,Sherman M. Weissman,Tim Hubbard,Richard M. Myers,Jane Rogers,Peter F. Stadler,Peter F. Stadler,Peter F. Stadler,Todd M. Lowe,Chia-Lin Wei,Yijun Ruan,Kevin Struhl,Mark Gerstein,Stylianos E. Antonarakis,Yutao Fu,Eric D. Green,Ulas Karaoz,Adam Siepel,Adam Siepel,James Taylor,Laura A. Liefer,Kris A. Wetterstrand,Peter J. Good,Elise A. Feingold,Mark S. Guyer,Gregory M. Cooper,Gregory M. Cooper,George Asimenos,Colin N. Dewey,Minmei Hou,Sergey Nikolaev,Juan I. Montoya-Burgos,Ari Löytynoja,Simon Whelan,Fabio Pardi,Tim Massingham,Haiyan Huang,Nan Zhang,Nan Zhang,Ian Holmes,James C. Mullikin,Abel Ureta-Vidal,Benedict Paten,Michael Seringhaus,Deanna M. Church,Kate R. Rosenbloom,W. James Kent,Eric A. Stone,Serafim Batzoglou,Nick Goldman,Ross C. Hardison,David Haussler,Webb Miller,Arend Sidow,Nathan D. Trinklein,Zhengdong D. Zhang,Leah O. Barrera,Rhona K. Stuart,David C. King,Adam Ameur,Stefan Enroth,Mark Bieda,Jonghwan Kim,Akshay Bhinge,Nan Jiang,Jun Liu,Fei Yao,Vinsensius B. Vega,Charlie W.H. Lee,Patrick Ng,Annie Yang,Zarmik Moqtaderi,Zhou Zhu,Xiaoqin Xu,Sharon L. Squazzo,Matthew J. Oberley,David R. Inman,Michael A. Singer,Todd Richmond,Kyle J. Munn,Kyle J. Munn,Alvaro Rada-Iglesias,Ola Wallerman,Jan Komorowski,Joanna C. Fowler,Phillippe Couttet,Alexander W. Bruce,Oliver M. Dovey,Peter D. Ellis,Cordelia Langford,David A. Nix,Ghia Euskirchen,Stephen Hartman,Alexander E. Urban,Peter Kraus,Sara Van Calcar,Nate Heintzman,Tae Hoon Kim,Kun Wang,Chunxu Qu,Gary C. Hon,Rosa Luna,Christopher K. Glass,M. Geoff Rosenfeld,Shelley Force Aldred,Sara J. Cooper,Anason S. Halees,Jane M. Lin,Hennady P. Shulha,Xiaoling Zhang,Mousheng Xu,Jaafar N. Haidar,Yong Yu,Vishwanath R. Iyer,Roland Green,Claes Wadelius,Peggy J. Farnham,Bing Ren,Rachel A. Harte,Angie S. Hinrichs,Heather Trumbower,Hiram Clawson,Jennifer Hillman-Jackson,Ann S. Zweig,Kayla E. Smith,Archana Thakkapallayil,Galt P. Barber,Robert M. Kuhn,Donna Karolchik,Lluís Armengol,Christine P. Bird,Paul I.W. de Bakker,Andrew D. Kern,Nuria Lopez-Bigas,Joel D. Martin,Barbara E. Stranger,Abigail Woodroffe,Eugene Davydov,Antigone S. Dimas,Eduardo Eyras,Ingileif B. Hallgrímsdóttir,Julian L. Huppert,Michael C. Zody,Gonçalo R. Abecasis,Xavier Estivill,Gerard G. Bouffard,Xiaobin Guan,Nancy F. Hansen,Jacquelyn R. Idol,Valerie Maduro,Baishali Maskeri,Jennifer C. McDowell,Morgan Park,Pamela J. Thomas,Alice C. Young,Robert W. Blakesley,Donna M. Muzny,Erica Sodergren,David A. Wheeler,Kim C. Worley,Huaiyang Jiang,George M. Weinstock,Richard A. Gibbs,Tina Graves,Robert S. Fulton,Elaine R. Mardis,Richard K. Wilson,Michele Clamp,James Cuff,Sante Gnerre,David B. Jaffe,Jean L. Chang,Kerstin Lindblad-Toh,Eric S. Lander,Eric S. Lander,Maxim Koriabine,Mikhail Nefedov,Kazutoyo Osoegawa,Yuko Yoshinaga,Baoli Zhu,Pieter J. de Jong +320 more
TL;DR: Functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project are reported, providing convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts.
Related Papers (5)
Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads
CNVnator: An approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing
Paired-end mapping reveals extensive structural variation in the human genome.
Jan O. Korbel,Alexander E. Urban,Jason P. Affourtit,Brian C. Godwin,Fabian Grubert,Jan Fredrik Simons,Philip M. Kim,Dean Palejev,Nicholas Carriero,Lei Du,Bruce E. Taillon,Zhoutao Chen,Andrea Tanzer,A. C. Eugenia Saunders,Jianxiang Chi,Fengtang Yang,Nigel P. Carter,Matthew E. Hurles,Sherman M. Weissman,Timothy T. Harkins,Mark Gerstein,Michael Egholm,Michael Snyder +22 more