PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses
Shaun Purcell,Shaun Purcell,Benjamin M. Neale,Benjamin M. Neale,Kathe Todd-Brown,Lori Thomas,Manuel A. R. Ferreira,David Bender,David Bender,Julian Maller,Julian Maller,Pamela Sklar,Pamela Sklar,Paul I.W. de Bakker,Paul I.W. de Bakker,Mark J. Daly,Mark J. Daly,Pak C. Sham +17 more
Reads0
Chats0
TLDR
This work introduces PLINK, an open-source C/C++ WGAS tool set, and describes the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation, which focuses on the estimation and use of identity- by-state and identity/descent information in the context of population-based whole-genome studies.Abstract:
Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.read more
Citations
More filters
Journal ArticleDOI
Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows
TL;DR: The main innovations of the new version of the Arlequin program include enhanced outputs in XML format, the possibility to embed graphics displaying computation results directly into output files, and the implementation of a new method to detect loci under selection from genome scans.
Journal ArticleDOI
Second-generation PLINK: rising to the challenge of larger and richer datasets
Christopher C. Chang,Carson C. Chow,Laurent C. A. M. Tellier,Shashaank Vattikuti,Shaun Purcell,James J. Lee +5 more
TL;DR: The second-generation versions of PLINK will offer dramatic improvements in performance and compatibility, and for the first time, users without access to high-end computing resources can perform several essential analyses of the feature-rich and very large genetic datasets coming into use.
Journal ArticleDOI
GCTA: a tool for genome-wide complex trait analysis.
TL;DR: The GCTA software is a versatile tool to estimate and partition complex trait variation with large GWAS data sets and focuses on the function of estimating the variance explained by all the SNPs on the X chromosome and testing the hypotheses of dosage compensation.
Journal ArticleDOI
Common polygenic variation contributes to risk of schizophrenia and bipolar disorder
Shaun Purcell,Shaun Purcell,Naomi R. Wray,Jennifer Stone,Jennifer Stone,Peter M. Visscher,Michael Conlon O'Donovan,Patrick F. Sullivan,Pamela Sklar,Pamela Sklar,Douglas M. Ruderfer,Andrew McQuillin,Derek W. Morris,Colm O'Dushlaine,Aiden Corvin,Peter Holmans,Stuart MacGregor,Hugh Gurling,Douglas Blackwood,Nicholas John Craddock,Michael Gill,Christina M. Hultman,Christina M. Hultman,George Kirov,Paul Lichtenstein,Walter J. Muir,Michael John Owen,Carlos N. Pato,Edward M. Scolnick,Edward M. Scolnick,David St Clair,Nigel Williams,Lyudmila Georgieva,Ivan Nikolov,Nadine Norton,Hywel Williams,Draga Toncheva,Vihra Milanova,Emma Flordal Thelander,Patrick Sullivan,Elaine Kenny,Emma M. Quinn,Khalid Choudhury,Susmita Datta,Jonathan Pimm,Srinivasa Thirumalai,Vinay Puri,Robert Krasucki,Jacob Lawrence,Digby Quested,Nicholas Bass,Caroline Crombie,Gillian Fraser,Soh Leh Kuan,Nicholas Walker,Kevin A. McGhee,Ben S. Pickard,P. Malloy,Alan W Maclean,Margaret Van Beck,Michele T. Pato,Helena Medeiros,Frank A. Middleton,Célia Barreto Carvalho,Christopher P. Morley,Ayman H. Fanous,David V. Conti,James A. Knowles,Carlos Ferreira,António Macedo,M. Helena Azevedo,Andrew Kirby,Andrew Kirby,Manuel A. R. Ferreira,Manuel A. R. Ferreira,Mark J. Daly,Mark J. Daly,Kimberly Chambert,Finny G Kuruvilla,Stacey Gabriel,Kristin G. Ardlie,Jennifer L. Moran +81 more
TL;DR: The extent to which common genetic variation underlies the risk of schizophrenia is shown, using two analytic approaches, and the major histocompatibility complex is implicate, which is shown to involve thousands of common alleles of very small effect.
Journal ArticleDOI
A second generation human haplotype map of over 3.1 million SNPs
Kelly A. Frazer,Dennis G. Ballinger,David R. Cox,David A. Hinds,Laura L. Stuve,Richard A. Gibbs,John W. Belmont,Andrew Boudreau,Paul Hardenbol,Suzanne M. Leal,Shiran Pasternak,David A. Wheeler,Thomas D. Willis,Fuli Yu,Huanming Yang,Changqing Zeng,Gao Yang,H. B. Hu,Weitao Hu,Chaohua Li,Wei Lin,Siqi Liu,Hao Pan,Xiaoli Tang,Jian Wang,Wei Wang,Jun Yu,Bo Zhang,Qingrun Zhang,Hongbin Zhao,Hui Zhao,Jun Zhou,Stacey Gabriel,Rachel Barry,Brendan Blumenstiel,Amy L. Camargo,Matthew Defelice,Maura Faggart,Mary Goyette,Supriya Gupta,Jamie Moore,Huy Nguyen,Robert C. Onofrio,Melissa Parkin,Jessica Roy,Erich Stahl,Ellen Winchester,Liuda Ziaugra,David Altshuler,Yan Shen,Zhijian Yao,Wei Huang,Xun Chu,Yungang He,Li Jin,Yangfan Liu,Yayun Shen,Weiwei Sun,Haifeng Wang,Yi Wang,Ying Wang,Xiaoyan Xiong,Liang Xu,Mary M.Y. Waye,Stephen Kwok-Wing Tsui,Hong Xue,J. Tze Fei Wong,Luana Galver,Jian-Bing Fan,Kevin L. Gunderson,Sarah S. Murray,Arnold Oliphant,Mark S. Chee,Alexandre Montpetit,Fanny Chagnon,Vincent Ferretti,Martin Leboeuf,Jean François Olivier,Michael S. Phillips,Stéphanie Roumy,Clémentine Sallée,Andrei Verner,Thomas J. Hudson,Pui-Yan Kwok,Dongmei Cai,Daniel C. Koboldt,Raymond D. Miller,Ludmila Pawlikowska,Patricia Taillon-Miller,Ming Xiao,Lap-Chee Tsui,William Mak,Qiang Song You,Paul K.H. Tam,Yusuke Nakamura,Takahisa Kawaguchi,Takuya Kitamoto,Takashi Morizono,Atsushi Nagashima,Yozo Ohnishi,Akihiro Sekine,Toshihiro Tanaka,Tatsuhiko Tsunoda,Panos Deloukas,Christine P. Bird,Marcos Delgado,Emmanouil T. Dermitzakis,Rhian Gwilliam,Sarah E. Hunt,Jonathan J. Morrison,Don Powell,Barbara E. Stranger,Pamela Whittaker,David R. Bentley,Mark J. Daly,Paul I.W. de Bakker,Jeffrey C. Barrett,Yves Chretien,Julian Maller,Steve McCarroll,Nick Patterson,Itsik Pe'er,Alkes L. Price,Shaun Purcell,Daniel J. Richter,Pardis C. Sabeti,Richa Saxena,Stephen F. Schaffner,Pak C. Sham,Patrick Varilly,Lincoln Stein,Lalitha Krishnan,Albert V. Smith,Marcela K. Tello-Ruiz,Gudmundur A. Thorisson,Aravinda Chakravarti,Peter E. Chen,David J. Cutler,Carl S. Kashuk,Shin Lin,Gonçalo R. Abecasis,Weihua Guan,Yun Li,Heather M. Munro,Zhaohui S. Qin,Daryl J. Thomas,Gilean McVean,Adam Auton,Leonardo Bottolo,Niall Cardin,Susana Eyheramendy,Colin Freeman,Jonathan Marchini,Simon Myers,Chris C. A. Spencer,Matthew Stephens,Peter Donnelly,Lon R. Cardon,Geraldine M. Clarke,David M. Evans,Andrew P. Morris,Bruce S. Weir,Todd A. Johnson,James C. Mullikin,Stephen T. Sherry,Michael Feolo,Andrew D. Skol,Houcan Zhang,Ichiro Matsuda,Yoshimitsu Fukushima,Darryl Macer,Eiko Suda,Charles N. Rotimi,Clement Adebamowo,Ike Ajayi,Toyin Aniagwu,Patricia A. Marshall,Chibuzor Nkwodimmah,Charmaine D.M. Royal,Mark Leppert,Missy Dixon,Andy Peiffer,Renzong Qiu,Alastair Kent,Kazuto Kato,Norio Niikawa,Isaac F. Adewole,Bartha Maria Knoppers,Morris W. Foster,Ellen Wright Clayton,Jessica Watkin,Donna M. Muzny,Lynne V. Nazareth,Erica Sodergren,George M. Weinstock,Imtaz Yakub,Bruce W. Birren,Richard K. Wilson,Lucinda Fulton,Jane Rogers,John Burton,Nigel P. Carter,C M Clee,Mark Griffiths,Matthew C. Jones,Kirsten McLay,Robert W. Plumb,Mark T. Ross,Sarah Sims,David Willey,Zhu Chen,Hua Han,Le Kang,Martin Godbout,John C. Wallenburg,Paul L'Archevêque,Guy Bellemare,Koji Saeki,Hongguang Wang,Daochang An,Hongbo Fu,Qing Li,Zhen Wang,Renwu Wang,Arthur L. Holden,Lisa D. Brooks,Jean E. McEwen,Mark S. Guyer,Vivian Ota Wang,Jane Peterson,Michael Shi,Jack Spiegel,Lawrence M. Sung,Lynn F. Zacharia,Francis S. Collins,Karen Kennedy,Ruth Jamieson,John Stewart +237 more
TL;DR: The Phase II HapMap is described, which characterizes over 3.1 million human single nucleotide polymorphisms genotyped in 270 individuals from four geographically diverse populations and includes 25–35% of common SNP variation in the populations surveyed, and increased differentiation at non-synonymous, compared to synonymous, SNPs is demonstrated.
References
More filters
Journal ArticleDOI
Assessing the impact of population stratification on genetic association studies.
Matthew L. Freedman,Matthew L. Freedman,David Reich,David Reich,Kathryn L. Penney,Kathryn L. Penney,Gavin J. McDonald,Gavin J. McDonald,Andre A. Mignault,Nick Patterson,Stacey Gabriel,Eric J. Topol,Jordan W. Smoller,Carlos N. Pato,Carlos N. Pato,Michele T. Pato,Michele T. Pato,Tracey L. Petryshen,Laurence N. Kolonel,Eric S. Lander,Eric S. Lander,Pamela Sklar,Pamela Sklar,Brian E. Henderson,Joel N. Hirschhorn,Joel N. Hirschhorn,David Altshuler +26 more
TL;DR: The results suggest that modest amounts of stratification can exist even in well designed studies and that assessments based on a few dozen markers lack power to rule out moderate levels of stratisation that could cause false positive associations in studies designed to detect modest genetic risk factors.
Journal ArticleDOI
A sibship test for linkage in the presence of association: the sib transmission/disequilibrium test.
TL;DR: This article describes a method, called the "sib TDT" (or "S-TDT"), that overcomes this problem by use of marker data from unaffected sibs instead of from parents, thus allowing application of the principle of the TDT to sibships without parental data.
Journal Article
The transmission/disequilibrium test: history, subdivision, and admixture.
TL;DR: The effects of subdivision and admixture, viewed as processes in population history, are described, and the properties of the TDT as a test for linkage (and association) between marker and disease and the conventional contingency statistic used with family data to test for population association are analyzed.
Journal ArticleDOI
Population structure, differential bias and genomic control in a large-scale, case-control association study.
David Clayton,Neil Walker,Deborah J. Smyth,Rebecca Pask,Jason D. Cooper,Lisa M. Maier,Luc J. Smink,Alex C. Lam,Nigel R. Ovington,Helen Stevens,Sarah Nutland,Joanna M. M. Howson,Malek Faham,Martin Moorhead,Hywel B. Jones,Matthew Falkowski,Paul Hardenbol,Thomas D. Willis,John A. Todd +18 more
TL;DR: To avoid excluding SNPs and losing valuable information, the genomic control method is extended by applying a variable downweighting to each SNP, which explains part of the significant +11.2% inflation of test statistics the authors observed in an analysis of 6,322 nonsynonymous SNPs.
Journal ArticleDOI
Combined Linkage and Association Sib-Pair Analysis for Quantitative Traits
TL;DR: An extension to current maximum-likelihood variance-components procedures for mapping quantitative-trait loci in sib pairs that allows a simultaneous test of allelic association is proposed, which partitions the mean effect of a locus into between- and within-sibship components.