Second-generation PLINK: rising to the challenge of larger and richer datasets
Christopher C. Chang, Carson C. Chow, Laurent C. A. M. Tellier, Shashaank Vattikuti, Shaun Purcell, James J. Lee
Abstract:
PLINK 1 is a widely used open-source C/C++ toolset for genome-wide association studies (GWAS) and research in population genetics. However, the steady accumulation of data from imputation and whole-genome sequencing studies has exposed a strong need for even faster and more scalable implementations of key functions. In addition, GWAS and population-genetic data now frequently contain probabilistic calls, phase information, and/or multiallelic variants, none of which can be represented by PLINK 1's primary data format.
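To make the format limitation concrete, here is a minimal sketch of a 2-bit-per-genotype packing of the kind used by PLINK 1's binary genotype files. The helper names are hypothetical, and the code table (00 homozygous first allele, 01 missing, 10 heterozygous, 11 homozygous second allele) is stated here as an assumption about that encoding; file headers and per-variant padding are left out. With only four states per call, there is no room for a probability, a phase bit, or a third allele.

```python
# Hypothetical helper names; the 2-bit code table is an assumption about the
# PLINK 1 binary genotype encoding, not taken from the abstract itself.
HOM_A1, MISSING, HET, HOM_A2 = 0b00, 0b01, 0b10, 0b11


def pack_genotypes(codes):
    """Pack 2-bit genotype codes into bytes, four per byte, low-order bits first."""
    out = bytearray((len(codes) + 3) // 4)
    for i, code in enumerate(codes):
        if code not in (HOM_A1, MISSING, HET, HOM_A2):
            # A dosage such as 0.93, a phased call, or a third allele
            # simply has no slot in this representation.
            raise ValueError("only four genotype states are representable")
        out[i // 4] |= code << (2 * (i % 4))
    return bytes(out)


def unpack_genotypes(packed, n):
    """Recover the first n 2-bit codes from the packed bytes."""
    return [(packed[i // 4] >> (2 * (i % 4))) & 0b11 for i in range(n)]


calls = [HOM_A1, HET, HOM_A2, MISSING, HET]
packed = pack_genotypes(calls)        # five calls fit in two bytes
restored = unpack_genotypes(packed, len(calls))
```

The compactness is what makes the format fast to scan, and it is also why richer call types need a different container.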
To address these issues, we are developing a second-generation codebase for PLINK. The first major release from this codebase, PLINK 1.9, introduces extensive use of bit-level parallelism, O(sqrt(n))-time/constant-space Hardy-Weinberg equilibrium and Fisher's exact tests, and many other algorithmic improvements. In combination, these changes accelerate most operations by 1-4 orders of magnitude, and allow the program to handle datasets too large to fit in RAM. This will be followed by PLINK 2.0, which will introduce (a) a new data format capable of efficiently representing probabilities, phase, and multiallelic variants, and (b) extensions of many functions to account for the new types of information.
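The idea behind bit-level parallelism can be sketched as follows: when genotypes are stored as 2-bit fields, a few mask-and-popcount operations classify all 32 genotypes in a 64-bit word at once, instead of looping over them one by one. This is an illustrative sketch only, not PLINK 1.9's actual code; the function names are hypothetical, the 2-bit code table (00, 01, 10, 11 for homozygous first allele, missing, heterozygous, homozygous second allele) is an assumption, and a production implementation would use hardware popcount or vector instructions rather than Python integers.

```python
MASK_LO = 0x5555555555555555  # low bit of each of the 32 two-bit fields


def popcount(x):
    """Number of set bits in a non-negative integer."""
    return bin(x).count("1")


def count_genotypes(word, n=32):
    """Count genotype classes among the first n 2-bit fields of a 64-bit word.

    Returns (hom_a1, het, hom_a2, missing) under the assumed code table.
    """
    word &= (1 << (2 * n)) - 1          # drop any padding fields beyond n
    lo = word & MASK_LO                 # low bit of every field
    hi = (word >> 1) & MASK_LO          # high bit of every field, shifted down
    hom_a2 = popcount(lo & hi)          # fields equal to 11
    het = popcount(hi & ~lo)            # fields equal to 10
    missing = popcount(lo & ~hi)        # fields equal to 01
    hom_a1 = n - hom_a2 - het - missing # whatever remains is 00
    return hom_a1, het, hom_a2, missing


def count_genotypes_naive(word, n=32):
    """Reference implementation: inspect one field at a time."""
    tally = [0, 0, 0, 0]
    for i in range(n):
        tally[(word >> (2 * i)) & 0b11] += 1
    return tally[0b00], tally[0b10], tally[0b11], tally[0b01]


codes = [0b00, 0b10, 0b11, 0b01, 0b10]
word = sum(code << (2 * i) for i, code in enumerate(codes))
counts = count_genotypes(word, n=len(codes))
```

Both functions return the same tallies; the masked version does a fixed amount of work per word regardless of how many genotypes the word holds.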
The second-generation versions of PLINK will offer dramatic improvements in performance and compatibility. For the first time, users without access to high-end computing resources can perform several essential analyses of the feature-rich and very large genetic datasets coming into use.