The 1000 Genomes Project: data management and community access
Laura Clarke,Xiangqun Zheng-Bradley,Richard J.H. Smith,Eugene Kulesha,Chunlin Xiao,Iliana Toneva,Brendan Vaughan,Don Preuss,Rasko Leinonen,Martin Shumway,Stephen T. Sherry,Paul Flicek +11 more
TLDR
Members of the project data coordination center have developed and deployed several tools to enable widespread data access and to create a deep catalog of human genetic variation.Abstract:
The 1000 Genomes Project was launched as one of the largest distributed data collection and analysis projects ever undertaken in biology. In addition to the primary scientific goals of creating both a deep catalog of human genetic variation and extensive methods to accurately discover and characterize variation using new sequencing technologies, the project makes all of its data publicly available. Members of the project data coordination center have developed and deployed several tools to enable widespread data access.read more
Citations
More filters
Journal ArticleDOI
A global reference for human genetic variation.
Adam Auton,Gonçalo R. Abecasis,David Altshuler,Richard Durbin,David R. Bentley,Aravinda Chakravarti,Andrew G. Clark,Peter Donnelly,Evan E. Eichler,Paul Flicek,Stacey Gabriel,Richard A. Gibbs,Eric D. Green,Matthew E. Hurles,Bartha Maria Knoppers,Jan O. Korbel,Eric S. Lander,Charles Lee,Hans Lehrach,Elaine R. Mardis,Gabor T. Marth,Gil McVean,Deborah A. Nickerson,Jeanette Schmidt,Stephen T. Sherry,Jun Wang,Richard K. Wilson,Eric Boerwinkle,Harsha Doddapaneni,Yi Han,Viktoriya Korchina,Christie Kovar,Sandra L. Lee,Donna M. Muzny,Jeffrey G. Reid,Yiming Zhu,Yuqi Chang,Qiang Feng,Qiang Feng,Xiaodong Fang,Xiaodong Fang,Xiaosen Guo,Xiaosen Guo,Min Jian,Min Jian,Hui Jiang,Hui Jiang,Xin Jin,Tianming Lan,Guoqing Li,Jingxiang Li,Yingrui Li,Shengmao Liu,Xiao Liu,Xiao Liu,Yao Lu,Xuedi Ma,Meifang Tang,Bo Wang,Guangbiao Wang,Honglong Wu,Renhua Wu,Xun Xu,Ye Yin,Dandan Zhang,Wenwei Zhang,Jiao Zhao,Meiru Zhao,Xiaole Zheng,Namrata Gupta,Neda Gharani,Lorraine Toji,Norman P. Gerry,Alissa M. Resch,Jonathan Barker,Laura Clarke,Laurent Gil,Sarah E. Hunt,Gavin Kelman,Eugene Kulesha,Rasko Leinonen,William M. McLaren,Rajesh Radhakrishnan,Asier Roa,Dmitriy Smirnov,Richard Smith,Ian Streeter,Anja Thormann,Iliana Toneva,Brendan Vaughan,Xiangqun Zheng-Bradley,Russell J. Grocock,Sean Humphray,Terena James,Zoya Kingsbury,Ralf Sudbrak,M. Albrecht,Vyacheslav Amstislavskiy,Tatiana A. Borodina,Matthias Lienhard,Florian Mertes,Marc Sultan,Bernd Timmermann,Marie-Laure Yaspo,Lucinda Fulton,Victor Ananiev,Zinaida Belaia,Dimitriy Beloslyudtsev,Nathan Bouk,Chao Chen,Deanna M. Church,Robert M. Cohen,Charles Cook,John Garner,Timothy Hefferon,Mikhail Kimelman,Chunlei Liu,John Lopez,Peter Meric,Chris O’Sullivan,Yuri Ostapchuk,Lon Phan,Sergiy Ponomarov,Valerie A. Schneider,Eugene Shekhtman,Karl Sirotkin,Douglas J. Slotta,Hua Zhang,Senduran Balasubramaniam,John Burton,Petr Danecek,Thomas M. Keane,Anja Kolb-Kokocinski,Shane A. McCarthy,James Stalker,Michael A. Quail,Christopher Davies,Jeremy Gollub,Teresa Webster,Brant Wong,Yiping Zhan,Christopher L. Campbell,Yu Kong,Anthony Marcketta,Fuli Yu,Lilian Antunes,Matthew N. Bainbridge,Aniko Sabo,Zhuoyi Huang,Lachlan J. M. Coin,Lin Fang,Lin Fang,Qibin Li,Zhenyu Li,Haoxiang Lin,Binghang Liu,Ruibang Luo,Haojing Shao,Haojing Shao,Yinlong Xie,Chen Ye,Chang Yu,Fan Zhang,Hancheng Zheng,Zhu Hongmei,Can Alkan,Elif Dal,Fatma Kahveci,Erik Garrison,Deniz Kural,Wan-Ping Lee,Wen Fung Leong,Michael Strömberg,Alistair Ward,Jiantao Wu,Mengyao Zhang,Mark J. Daly,Mark A. DePristo,Robert E. Handsaker,Robert E. Handsaker,Eric Banks,Gaurav Bhatia,Guillermo del Angel,Giulio Genovese,Heng Li,Seva Kashin,Seva Kashin,Steven A. McCarroll,Steven A. McCarroll,James Nemesh,Ryan Poplin,Seungtai Yoon,Jayon Lihm,Vladimir Makarov,Srikanth Gottipati,Alon Keinan,Juan L. Rodriguez-Flores,Tobias Rausch,Markus Hsi-Yang Fritz,Adrian M. Stütz,Kathryn Beal,Avik Datta,Javier Herrero,Graham R. S. Ritchie,Daniel R. Zerbino,Pardis C. Sabeti,Pardis C. Sabeti,Ilya Shlyakhter,Ilya Shlyakhter,Stephen F. Schaffner,Stephen F. Schaffner,Joseph J. Vitti,Joseph J. Vitti,David Neil Cooper,Edward V. Ball,Peter D. Stenson,Bret Barnes,Markus J. Bauer,R. Keira Cheetham,Anthony J. Cox,Michael A. Eberle,Scott Kahn,Lisa Murray,John F. Peden,Richard Shaw,Eimear E. Kenny,Mark A. Batzer,Miriam K. Konkel,Jerilyn A. Walker,Daniel G. MacArthur,Monkol Lek,Ralf Herwig,Li Ding,Daniel C. Koboldt,David E. Larson,Kai Ye,Simon Gravel,Anand Swaroop,Emily Y. Chew,Tuuli Lappalainen,Yaniv Erlich,Melissa Gymrek,Melissa Gymrek,Thomas Willems,Jared T. Simpson,Mark D. Shriver,Jeffrey A. Rosenfeld,Carlos Bustamante,Stephen B. Montgomery,Francisco M. De La Vega,Jake K. Byrnes,Andrew Carroll,Marianne K. DeGorter,Phil Lacroute,Brian K. Maples,Alicia R. Martin,Andrés Moreno-Estrada,Andrés Moreno-Estrada,Suyash Shringarpure,Fouad Zakharia,Eran Halperin,Eran Halperin,Yael Baran,Eliza Cerveira,Jaeho Hwang,Ankit Malhotra,Dariusz Plewczynski,Kamen Radew,Mallory Romanovitch,Chengsheng Zhang,Fiona Hyland,David Craig,Alexis Christoforides,Nils Homer,Tyler Izatt,Ahmet Kurdoglu,Shripad Sinari,Kevin Squire,Chunlin Xiao,Jonathan Sebat,Danny Antaki,Madhusudan Gujral,Amina Noor,Kenny Ye,Esteban G. Burchard,Ryan D. Hernandez,Christopher R. Gignoux,David Haussler,David Haussler,Sol Katzman,W. James Kent,Bryan Howie,Andres Ruiz-Linares,Emmanouil T. Dermitzakis,Emmanouil T. Dermitzakis,Scott E. Devine,Hyun Min Kang,Jeffrey M. Kidd,Thomas W. Blackwell,Sean Caron,Wei Chen,S. Emery,Lars G. Fritsche,Christian Fuchsberger,Goo Jun,Goo Jun,Bingshan Li,Robert H. Lyons,Chris Scheller,Carlo Sidore,Carlo Sidore,Carlo Sidore,Shiya Song,Elzbieta Sliwerska,Daniel Taliun,Adrian Tan,Ryan P. Welch,Mary Kate Wing,Xiaowei Zhan,Philip Awadalla,Philip Awadalla,Alan Hodgkinson,Yun Li,Xinghua Shi,Andrew Quitadamo,Gerton Lunter,Jonathan Marchini,Simon Myers,Claire Churchhouse,Olivier Delaneau,Olivier Delaneau,Anjali Gupta-Hinch,Warren W. Kretzschmar,Zamin Iqbal,Iain Mathieson,Androniki Menelaou,Androniki Menelaou,Andy Rimmer,Dionysia Kiara Xifara,Taras K. Oleksyk,Yunxin Fu,Xiaoming Liu,Momiao Xiong,Lynn B. Jorde,David J. Witherspoon,Jinchuan Xing,Brian L. Browning,Sharon R. Browning,Fereydoun Hormozdiari,Peter H. Sudmant,Ekta Khurana,Chris Tyler-Smith,Cornelis A. Albers,Qasim Ayub,Yuan Chen,Vincenza Colonna,Vincenza Colonna,Luke Jostins,Klaudia Walter,Yali Xue,Mark Gerstein,Alexej Abyzov,Suganthi Balasubramanian,Jieming Chen,Declan Clarke,Yao Fu,Arif Harmanci,Mike Jin,Dong-Hoon Lee,Jeremy Liu,Xinmeng Jasmine Mu,Xinmeng Jasmine Mu,Jing Zhang,Yan Zhang,Christopher Hartl,Khalid Shakir,Jeremiah D. Degenhardt,Sascha Meiers,Benjamin Raeder,Francesco Paolo Casale,Oliver Stegle,Eric-Wubbo Lameijer,Ira M. Hall,Vineet Bafna,Jacob J. Michaelson,Eugene J. Gardner,Ryan E. Mills,Gargi Dayama,Ken Chen,Xian Fan,Zechen Chong,Tenghui Chen,Mark Chaisson,John Huddleston,Maika Malig,Bradley J. Nelson,Nicholas F. Parrish,Ben Blackburne,Sarah J. Lindsay,Zemin Ning,Yujun Zhang,Hugo Y. K. Lam,Cristina Sisu,Danny Challis,Uday S. Evani,James T. Lu,Uma Nagaswamy,Jin Yu,Wangshen Li,Lukas Habegger,Haiyuan Yu,Fiona Cunningham,Ian Dunham,Kasper Lage,Kasper Lage,Jakob Berg Jespersen,Jakob Berg Jespersen,Jakob Berg Jespersen,Heiko Horn,Heiko Horn,Donghoon Kim,Rob DeSalle,Apurva Narechania,Melissa A. Wilson Sayres,Fernando L. Mendez,G. David Poznik,Peter A. Underhill,David Mittelman,Ruby Banerjee,Maria Cerezo,Thomas W. Fitzgerald,Sandra Louzada,Andrea Massaia,Fengtang Yang,Divya Kalra,Walker Hale,Xu Dan,Kathleen C. Barnes,Christine Beiswanger,Hongyu Cai,Hongzhi Cao,Hongzhi Cao,Brenna M. Henn,Danielle Jones,Jane Kaye,Alastair Kent,Angeliki Kerasidou,Rasika A. Mathias,Pilar N. Ossorio,Michael Parker,Charles N. Rotimi,Charmaine D.M. Royal,Karla Sandoval,Yeyang Su,Zhongming Tian,Sarah A. Tishkoff,Marc Via,Yuhong Wang,Huanming Yang,Ling Yang,Jiayong Zhu,Walter F. Bodmer,Gabriel Bedoya,Zhiming Cai,Yang Gao,Jiayou Chu,Leena Peltonen,Andrés C. García-Montero,Alberto Orfao,Julie Dutil,Juan Carlos Martínez-Cruzado,R. Mathias,Anselm Hennis,Harold Watson,Colin A. McKenzie,Firdausi Qadri,Regina C. LaRocque,Xiaoyan Deng,Danny Asogun,Onikepe A. Folarin,Christian T. Happi,Omonwunmi Omoniwa,Matt Stremlau,Matt Stremlau,Ridhi Tariyal,Ridhi Tariyal,M Jallow,M Jallow,Fatoumatta Sisay Joof,Fatoumatta Sisay Joof,Tumani Corrah,Tumani Corrah,Kirk A. Rockett,Kirk A. Rockett,Dominic P. Kwiatkowski,Dominic P. Kwiatkowski,Jaspal S. Kooner,Tran Tinh Hien,Sarah J. Dunstan,Sarah J. Dunstan,Nguyen ThuyHang,Richard Fonnie,Robert F. Garry,Lansana Kanneh,Lina M. Moses,John S. Schieffelin,Donald S. Grant,Carla Gallo,Giovanni Poletti,Danish Saleheen,Asif Rasheed,Lisa D. Brooks,Adam Felsenfeld,Jean E. McEwen,Yekaterina Vaydylevich,Audrey Duncanson,Michael Dunn,Jeffery A. Schloss +517 more
TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.
Journal ArticleDOI
The Ensembl Variant Effect Predictor.
William M. McLaren,Laurent Gil,Sarah E. Hunt,Harpreet Singh Riat,Graham R. S. Ritchie,Anja Thormann,Paul Flicek,Fiona Cunningham +7 more
TL;DR: The Ensembl Variant Effect Predictor can simplify and accelerate variant interpretation in a wide range of study designs.
Journal ArticleDOI
Common genetic variants, acting additively, are a major source of risk for autism
Lambertus Klei,Stephen Sanders,Michael T. Murtha,Vanessa Hus,Jennifer K. Lowe,A. Jeremy Willsey,Daniel Moreno-De-Luca,Timothy W. Yu,Eric Fombonne,Daniel H. Geschwind,Dorothy E. Grice,David H. Ledbetter,Catherine Lord,Shrikant Mane,Christa Lese Martin,Donna M. Martin,Eric M. Morrow,Christopher A. Walsh,Nadine M. Melhem,Pauline Chaste,James S. Sutcliffe,Matthew W. State,Edwin H. Cook,Kathryn Roeder,Bernie Devlin +24 more
TL;DR: It is shown that common genetic polymorphism exerts substantial additive genetic effects on ASD liability and that simplex/multiplex family status has an impact on the identified composition of that risk.
Journal ArticleDOI
Tracking the origins and drivers of subclonal metastatic expansion in prostate cancer
Matthew K.H. Hong,Geoff Macintyre,David C. Wedge,Peter Van Loo,Keval M. Patel,Sebastian Lunke,Ludmil B. Alexandrov,Clare Sloggett,Marek Cmero,Francesco Marass,Dana W.Y. Tsui,Stefano Mangiola,Andrew Lonie,Haroon Naeem,Nikhil Sapre,Pramit M. Phal,Natalie Kurganovs,Xiaowen Chin,Michael Kerger,Anne Y. Warren,David E. Neal,Vincent Gnanapragasam,Nitzan Rosenfeld,John Pedersen,Andrew Ryan,Izhak Haviv,Anthony J. Costello,Niall M. Corcoran,Christopher M. Hovens +28 more
TL;DR: The precise direction of metastatic spread is revealed across four lethal prostate cancer patients using whole-genome and ultra-deep targeted sequencing of longitudinally collected primary and metastatic tumours and analysis of mutations associated with metastasis reveals an enrichment of TP53 mutations.
Journal ArticleDOI
Next-generation sequencing data interpretation: enhancing reproducibility and accessibility
Anton Nekrutenko,James Taylor +1 more
TL;DR: Currently pressing issues with analysis, interpretation, reproducibility and accessibility of next-generation sequencing data are discussed, and promising solutions are presented and potential future developments are explored.
References
More filters
Journal ArticleDOI
The Sequence Alignment/Map format and SAMtools
Heng Li,Bob Handsaker,Alec Wysoker,T. J. Fennell,Jue Ruan,Nils Homer,Gabor T. Marth,Gonçalo R. Abecasis,Richard Durbin +8 more
TL;DR: SAMtools as discussed by the authors implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments.
Journal ArticleDOI
A method and server for predicting damaging missense mutations.
Ivan Adzhubei,Steffen Schmidt,Leonid Peshkin,Vasily Ramensky,Anna Gerasimova,Peer Bork,Alexey S. Kondrashov,Shamil R. Sunyaev +7 more
TL;DR: A new method and the corresponding software tool, PolyPhen-2, which is different from the early tool polyPhen1 in the set of predictive features, alignment pipeline, and the method of classification is presented and performance, as presented by its receiver operating characteristic curves, was consistently superior.
Journal ArticleDOI
The variant call format and VCFtools
Petr Danecek,Adam Auton,Gonçalo R. Abecasis,Cornelis A. Albers,Eric Banks,Mark A. DePristo,Robert E. Handsaker,Gerton Lunter,Gabor T. Marth,Stephen T. Sherry,Gilean McVean,Richard Durbin +11 more
TL;DR: VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API.
Journal ArticleDOI
A Map of Human Genome Variation From Population-Scale Sequencing
Gonçalo R. Abecasis,David Altshuler,David Altshuler,Adam Auton,Lisa D Brooks,Richard Durbin,Richard A. Gibbs,Matthew E. Hurles,Gil McVean +8 more
TL;DR: The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype as mentioned in this paper, and the results of the pilot phase of the project, designed to develop and compare different strategies for genomewide sequencing with high-throughput platforms.
Journal ArticleDOI
Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm.
TL;DR: This protocol describes the use of the 'Sorting Tolerant From Intolerant' (SIFT) algorithm in predicting whether an AAS affects protein function.
Related Papers (5)
A framework for variation discovery and genotyping using next-generation DNA sequencing data
Mark A. DePristo,Eric Banks,Ryan Poplin,Kiran V. Garimella,Jared Maguire,Christopher Hartl,Anthony A. Philippakis,Anthony A. Philippakis,Anthony A. Philippakis,Guillermo del Angel,Manuel A. Rivas,Manuel A. Rivas,Matt Hanna,Aaron McKenna,Timothy Fennell,Andrew Kernytsky,Andrey Sivachenko,Kristian Cibulskis,Stacey Gabriel,David Altshuler,David Altshuler,Mark J. Daly,Mark J. Daly +22 more